Evaluating RAG Quantitatively and with Visualizations
Evaluation plays a crucial role in building and maintaining retrieval-augmented generation (RAG) pipelines. While you build the pipeline, evaluation lets you identify areas for improvement, optimize the system’s performance, and systematically measure the impact of each change. Once your RAG system is deployed, ongoing evaluation helps you confirm that it remains effective, reliable, and performant.
In this chapter, we will cover the following topics:
- Evaluating when building a RAG application
- Evaluating a RAG application after deployment
- Standardized evaluation frameworks
- Ground truth
- Code lab 9.1 – ragas
- Additional evaluation techniques for RAG systems
Let’s start by talking about how evaluation can help during the initial stages of building your RAG system.