Other component-wise evaluation
Component-wise evaluation involves evaluating individual components of the pipeline, such as the retrieval and generation stages, to gain insights into their effectiveness and identify areas for improvement. We already shared two metrics for each of these stages, but here are a couple more that are available in the ragas platform:
- Context relevancy: This metric gauges the relevancy of the retrieved context, calculated based on both the question and the contexts. The values fall within the range (0-1), with higher values indicating better relevancy.
- Context entity recall: This metric gives the measure of recall of the retrieved context, based on the number of entities present in both the `ground_truth` data and the `contexts` data relative to the number of entities present in the `ground_truth` data alone. Simply put, it is a measure of what fraction of entities are recalled from the `ground_truth` data. This metric is particularly useful in fact-based use cases...
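
To make the context entity recall definition concrete, here is a minimal sketch of the ratio it describes. It is not the ragas implementation: the function name `context_entity_recall` and the sample entity lists are hypothetical, and the entities are assumed to have been extracted already (by an LLM or an NER model, for example), so only the set arithmetic is shown.

```python
def context_entity_recall(ground_truth_entities, context_entities):
    """Fraction of ground-truth entities that also appear in the retrieved contexts.

    Hypothetical helper for illustration only; entity extraction is assumed
    to have happened upstream.
    """
    # Normalize casing and whitespace so "Agra" and " agra" count as one entity.
    gt = {e.strip().lower() for e in ground_truth_entities}
    ctx = {e.strip().lower() for e in context_entities}
    if not gt:
        # No ground-truth entities: recall is undefined, so return 0.0 by convention.
        return 0.0
    # Entities present in both sets, relative to the ground-truth entities alone.
    return len(gt & ctx) / len(gt)


# Made-up example data: two of the four ground-truth entities are retrieved.
gt_entities = ["Taj Mahal", "Agra", "Shah Jahan", "1631"]
ctx_entities = ["Taj Mahal", "Agra", "India", "Yamuna"]

score = context_entity_recall(gt_entities, ctx_entities)
print(score)  # 0.5
```

A score near 1 means most of the entities in the reference answer were present in the retrieved context, which is why the metric suits fact-heavy questions.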