You're reading from Machine Learning for Imbalanced Data Tackle imbalanced datasets using machine learning and deep learning techniques

Product type Paperback

Published in Nov 2023

Publisher Packt

ISBN-13 9781801070836

Length 344 pages

Edition 1st Edition

Languages

Rust

Tools

TensorFlow Lite

Concepts

Data Science

Authors (2):

Dr. Mounir Abdelaziz

Kumar Abhishek

View More author details

Table of Contents (15) Chapters

Preface

1. Chapter 1: Introduction to Data Imbalance in Machine Learning FREE CHAPTER

2. Chapter 2: Oversampling Methods

3. Chapter 3: Undersampling Methods

4. Chapter 4: Ensemble Methods

5. Chapter 5: Cost-Sensitive Learning

6. Chapter 6: Data Imbalance in Deep Learning

7. Chapter 7: Data-Level Deep Learning Methods

8. Chapter 8: Algorithm-Level Deep Learning Techniques

9. Chapter 9: Hybrid Deep Learning Methods

10. Chapter 10: Model Calibration

11. Assessments

12. Index

Why subscribe?

13. Other Books You May Enjoy

Appendix: Machine Learning Pipeline in Production

Preface

Hello and welcome! Machine Learning (ML) enables computers to learn from data using algorithms to make informed decisions, automate tasks, and extract valuable insights. One particular aspect that often garners attention is imbalanced data, where certain classes may have considerably fewer samples than others.

This book provides an in-depth guide to understanding and navigating the intricacies of skewed data. You will gain insights into best practices for managing imbalanced datasets in ML contexts.

While imbalanced data can present challenges, it’s important to understand that the techniques to address this imbalance are not universally applicable. Their relevance and necessity depend on various factors such as the domain, the data distribution, the performance metrics you’re optimizing, and the business objectives. Before adopting any techniques, it’s essential to establish a baseline. Even if you don’t currently face issues with imbalanced data, it can be beneficial to be aware of the challenges and solutions discussed in this book. Familiarizing yourself with these techniques will provide you with a comprehensive toolkit, preparing you for scenarios that you may not yet know you’ll encounter. If you do find that model performance is lacking, especially for underrepresented (minority) classes, the insights and strategies covered in the book can be instrumental in guiding effective improvements.

As the domains of ML and artificial intelligence continue to grow, there will be an increasing demand for professionals who can adeptly handle various data challenges, including imbalance. This book aims to equip you with the knowledge and tools to be one of those sought-after experts.