Estimating the potential cost of using metadata extractors
A key consideration when using the various metadata extractors in LlamaIndex is the associated LLM compute cost. As mentioned earlier, most of these extractors rely on LLMs under the hood to analyze text and generate descriptive metadata.
Repeatedly calling LLMs to process large volumes of text can quickly add up in charges. For example, if you are extracting summaries and keywords from thousands of document nodes using SummaryExtractor and KeywordExtractor, those repeated LLM invocations will carry a significant cost.
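Before running an extraction pipeline at scale, it helps to do a rough back-of-the-envelope estimate. The sketch below is a hypothetical calculation, not a LlamaIndex API; the token counts and per-token price are illustrative assumptions you would replace with your own model's pricing:

```python
# Hypothetical back-of-the-envelope estimate of metadata-extraction cost.
# All numbers (tokens per call, price) are illustrative assumptions.

def estimate_extraction_cost(
    num_nodes: int,
    tokens_per_call: int,
    calls_per_node: int,
    price_per_1k_tokens: float,
) -> float:
    """Return the approximate LLM cost in dollars for an extraction run."""
    total_tokens = num_nodes * calls_per_node * tokens_per_call
    return total_tokens / 1000 * price_per_1k_tokens

# Example: 5,000 nodes, two extractors (summary + keywords), roughly
# 600 tokens per call, at an assumed $0.002 per 1K tokens.
cost = estimate_extraction_cost(
    num_nodes=5000, tokens_per_call=600, calls_per_node=2,
    price_per_1k_tokens=0.002,
)
print(f"${cost:.2f}")  # → $12.00
```

Even with conservative assumptions, the estimate makes it obvious how quickly per-node LLM calls multiply across a large corpus.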
Follow these simple best practices to minimize your costs
Let's look at some common best practices for keeping LLM costs down:
- Batch content into fewer LLM calls instead of making an individual call per node. This amortizes the prompt overhead, since you consume fewer total tokens than you would with multiple separate calls. The Pydantic extractor is particularly useful for this purpose since it can generate several metadata fields from a single structured LLM call.
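The savings from batching come from not resending the prompt template and node text once per field. The sketch below illustrates that arithmetic with assumed token counts (the overhead, node size, and field count are all hypothetical):

```python
# Illustrative per-node token comparison: one combined call extracting
# several metadata fields versus one separate call per field.
# All token counts below are assumptions for the sake of the example.

PROMPT_OVERHEAD = 200   # instruction/template tokens repeated in every call
NODE_TEXT = 500         # tokens of node content sent along with each call
NUM_FIELDS = 3          # e.g. summary, keywords, and title

# One call per field: the node text and prompt overhead are resent each time.
separate_calls_tokens = NUM_FIELDS * (PROMPT_OVERHEAD + NODE_TEXT)

# One combined call: the node text and overhead are sent once, and the
# model returns all fields together (as a Pydantic-style structured output).
combined_call_tokens = PROMPT_OVERHEAD + NODE_TEXT

print(separate_calls_tokens, combined_call_tokens)  # → 2100 700
```

Under these assumptions the combined call uses a third of the input tokens, and the gap widens as you add more metadata fields.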