Search icon CANCEL
Subscription
0
Cart icon
Cart
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Enterprise GENERATIVE AI Well-Architected Framework & Patterns

You're reading from  Enterprise GENERATIVE AI Well-Architected Framework & Patterns

Product type Book
Published in Apr 2024
Publisher Packt
ISBN-13 9781836202912
Pages 113 pages
Edition 1st Edition
Languages
Author (1):
Suvoraj Biswas Suvoraj Biswas
Profile icon Suvoraj Biswas
Toc

Table of Contents (43) Chapters close

1. What is Generative AI?
2. What is the background of GenAI - Well Architected Framework?
3. Why do we need a Well Architected Framework for GenAI based Enterprise Use Case?
4. What are the pillars of the Well Architected Framework?
5. What are the building blocks for each pillar? 6. What is a Large Language Model?
7. What is a Foundation Model?
8. When should we be using the Foundation Model?
9. Introduction to Content Moderation building block:
10. What is AI Guardrails and How to integrate it?
11. Introduction to the Compliance Pillar:
12. Benefits of having an Archival for Compliance building block:
13. What are some solutions for Archiving AI & Human interactions for Regulatory Compliance?
14. The FM or LLM observability Building Block:
15. What are some LLM Observability Solutions?
16. Introduction
17. What are some best practices to optimize Cost during Architecting?
18. Prompt Engineering
19. Why Prompt Engineering plays a crucial role in the GenAI apps?
20. Some Best Practices for Effective Prompts:
21. Embedding & Vector Database
22. What are Embeddings and why are they so important?
23. What are OpenAI's Offerings on Embeddings?
24. What is a Vector Database & How do they really work?
25. Options of Vector Databases for Solutioning
26. The Orchestrator Building Block
27. Some Orchestrator Application framework for AI development:
28. Introduction
29. What is Retrieval-augmented generation (RAG) Pattern for GenAI?
30. Demo Time - a GenAI based QnA python app using RAG pattern (LangChain, PineCone & Open AI api)
31. What is Fine tuning?
32. Why do we need fine tuning? Can’t we use RAG for domain specific use cases?
33. Are Fine Tuning & Pre-Training referring to the similar process?
34. Foundation Model or Large Language Model fine-tuning techniques:
35. Parameter Efficient Fine Tuning:
36. What are some Enterprise Use cases where SFT/RLHF or PEFT can be used?
37. What is LoRA (Low Rank Adaptation Model) technique?
38. What are the advantages of the LoRA method in fine tuning LLM?
39. What is QLoRA (Quantized Low Rank Adaptation Model) ?
40. Vector Database & Semantic Search capabilities
41. How to run PostGreSQL with pgVector extension in EC2 ?
42. Overview of AWS Sagemaker and AWS Bedrock
43. Example Generative AI based Enterprise Product Search Architecture in AWS

What is Fine tuning?

Fine tuning refers to the process of  selecting an appropriate pre-trained model or foundation model as the starting point. Then a domain specific high quality  dataset or target task specific dataset is used to fine-tune the model via additional training rounds where the model makes predictions on this curated dataset, errors are calculated, and weights are updated to minimize the prediction error. This way the LLMs are fine tuned for that domain specific tasks.

Why do we need fine tuning? Can’t we use RAG for domain specific use cases?

Foundation Models or Large Language Models perform great when we have regular use cases like summarizing content or question answering from existing articles or data as well as regular coding. However these LLMs do not perform well when domain specific...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}