📌 Usage of techniques – In production tips
Throughout this book, you will come across “In production” tip boxes like the following one, highlighting real-world applications of the techniques discussed:
🚀 Class reweighting in production at OpenAI
OpenAI faced the problem of bias in the training data of its image generation model, DALL-E 2 [1]. DALL-E 2 is trained on a massive dataset of images collected from the internet, and such a dataset can carry biases: for example, it may contain more images of men than of women, or more images of people from certain racial or ethnic groups than from others.
These snippets show how well-known companies have grappled with data imbalance and which strategies they adopted to navigate these challenges. For instance, the tip on OpenAI’s approach with DALL-E 2 highlights the trade-off between filtering training data and inadvertently amplifying the very biases you are trying to remove. Such examples underscore the need to be both strategic and cautious when dealing with imbalanced data. To understand the specifics of these implementations, follow the company blog posts or papers linked in each tip; they illustrate how to adapt and apply the techniques effectively in varied real-world scenarios.
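To make the idea behind the tip above concrete, here is a minimal sketch of one common form of class reweighting: each class receives a weight inversely proportional to its frequency, so under-represented classes contribute more to the training loss. This is a generic illustration only, not OpenAI’s implementation; the function name `compute_class_weights` and the toy labels are purely illustrative.

```python
from collections import Counter

def compute_class_weights(labels):
    """Assign each class a weight inversely proportional to its frequency,
    so rare classes contribute more to the training loss."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    # A common inverse-frequency scheme: weight_c = N / (K * count_c)
    return {c: n_samples / (n_classes * count) for c, count in counts.items()}

# Toy example with a 90/10 imbalance between two classes
labels = ["cat"] * 90 + ["dog"] * 10
weights = compute_class_weights(labels)
print(weights)  # {'cat': 0.56, 'dog': 5.0} — the rare class is up-weighted
```

The resulting per-class (or per-sample) weights can then be plugged into a weighted loss, for example via the `weight` argument of `torch.nn.CrossEntropyLoss` or the `sample_weight` parameter accepted by many scikit-learn estimators.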