Handling imbalanced data
In most real-world problems, our data is imbalanced, which means that records are not evenly distributed across classes (such as patients with and without cancer). Handling imbalanced datasets is an important task in machine learning because uneven class distributions are so common. In such cases, the minority class contributes only a small fraction of the examples, which can lead to poor model performance on that class and biased predictions. The reason is that machine learning methods optimize an objective (loss) function that minimizes the overall error on the training set. Now, let’s say that 99% of the records belong to the positive class and only 1% to the negative class. A model that predicts every record as positive then achieves an error of just 1%, yet it is useless to us, as the short baseline sketch below illustrates. That’s why, when we have an imbalanced dataset, we need dedicated methods to handle it. In general, we can have three categories of methods...
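To make the accuracy trap concrete, the following minimal sketch (assuming scikit-learn is available; the synthetic dataset and the 99:1 class ratio are illustrative choices, not taken from any particular project) trains a "predict the majority class" baseline on a 99%/1% dataset. Its accuracy is close to 1, yet it never detects a single minority-class record:

```python
from sklearn.datasets import make_classification
from sklearn.dummy import DummyClassifier
from sklearn.metrics import accuracy_score, recall_score
from sklearn.model_selection import train_test_split

# Synthetic binary data: roughly 1% negative (class 0), 99% positive (class 1).
X, y = make_classification(
    n_samples=10_000,
    n_features=20,
    weights=[0.01, 0.99],
    random_state=42,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, random_state=42
)

# Baseline that always predicts the majority (positive) class.
baseline = DummyClassifier(strategy="most_frequent")
baseline.fit(X_train, y_train)
y_pred = baseline.predict(X_test)

# Accuracy looks excellent, but the minority class is never found.
print(f"Accuracy: {accuracy_score(y_test, y_pred):.3f}")  # close to 1.0
print(f"Minority-class recall: {recall_score(y_test, y_pred, pos_label=0):.3f}")  # 0.000
```

Because plain accuracy hides this failure, metrics that focus on the minority class, such as recall, precision, or the F1 score, give a more honest picture when the classes are imbalanced.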