Naïve RAG and its limitations
So far, we have worked with three types of RAG approaches, naïve RAG, hybrid RAG, and re-ranking. Initially, we were working with what is called naïve RAG. This is the basic RAG approach that we had in our starter code in Chapter 2 and multiple code labs after. Naive RAG models, the initial iterations of RAG technology, provide a foundational framework for integrating retrieval mechanisms with generative models, albeit with limitations in flexibility and scalability.
Naïve RAG retrieves numerous fragmented context chunks, the chunks of text that we vectorize, to put into the LLM context window. If you do not use large enough chunks of text, your context will experience higher levels of fragmentation. This fragmentation leads to decreased understanding and capture of the context and semantics within your chunks, reducing the effectiveness of the retrieval mechanism of your RAG application. In the typical naïve RAG application...