You're reading from Hands-On Data Analysis with Pandas: A Python data science handbook for data collection, wrangling, analysis, and visualization

Product type Paperback

Published in Apr 2021

Publisher Packt

ISBN-13 9781800563452

Length 788 pages

Edition 2nd Edition

Languages

Python

Tools

Pandas

Concepts

Data Analysis

Author (1):

Stefanie Molin


Table of Contents (21) Chapters

Preface

1. Section 1: Getting Started with Pandas

2. Chapter 1: Introduction to Data Analysis

3. Chapter 2: Working with Pandas DataFrames

4. Section 2: Using Pandas for Data Analysis

5. Chapter 3: Data Wrangling with Pandas

6. Chapter 4: Aggregating Pandas DataFrames

7. Chapter 5: Visualizing Data with Pandas and Matplotlib

8. Chapter 6: Plotting with Seaborn and Customization Techniques

9. Section 3: Applications – Real-World Analyses Using Pandas

10. Chapter 7: Financial Analysis – Bitcoin and the Stock Market

11. Chapter 8: Rule-Based Anomaly Detection

12. Section 4: Introduction to Machine Learning with Scikit-Learn

13. Chapter 9: Getting Started with Machine Learning in Python

14. Chapter 10: Making Better Predictions – Optimizing Models

15. Chapter 11: Machine Learning Anomaly Detection

16. Section 5: Additional Resources

17. Chapter 12: The Road Ahead

18. Solutions

19. Other Books You May Enjoy

Appendix

Inspecting classification prediction confidence

As we saw with ensemble methods, knowing the strengths and weaknesses of our model lets us employ strategies to improve its performance. Suppose we have two models that classify the same thing; they most likely won't agree on every observation. If we know that one does better on edge cases while the other is better on the more common ones, we would likely want to investigate a voting classifier to improve our performance. How can we know how the models perform in different situations, though?

By looking at the probabilities the model predicts for an observation belonging to each class, we can gain insight into how confident the model is when it is correct and when it errs. Our pandas data wrangling skills make quick work of this. Let's see how confident our original white_or_red model from Chapter 9, Getting Started with Machine Learning in Python, was in its predictions:
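
As a minimal sketch of this kind of inspection, assume a synthetic dataset from `make_classification` and a plain logistic regression as hypothetical stand-ins for the wine data and the white_or_red model (neither is taken from Chapter 9). With those in place, we could collect the predicted probabilities in a dataframe and compare the model's confidence on correct and incorrect predictions:

```python
import numpy as np
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Assumption: synthetic binary data standing in for the wine dataset
X, y = make_classification(
    n_samples=1000, n_features=8, n_informative=4, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Assumption: hypothetical stand-in for the white_or_red model
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# One row per test observation: probability of the positive class,
# the predicted label, the true label, and whether they match
prediction_probabilities = pd.DataFrame({
    'prob_positive': model.predict_proba(X_test)[:, 1],
    'predicted': model.predict(X_test),
    'actual': y_test,
}).assign(is_correct=lambda df: df.predicted == df.actual)

# Confidence is the probability assigned to whichever class was predicted
prediction_probabilities['confidence'] = np.where(
    prediction_probabilities.predicted == 1,
    prediction_probabilities.prob_positive,
    1 - prediction_probabilities.prob_positive,
)

# Summary statistics of confidence for incorrect vs. correct predictions
confidence_summary = (
    prediction_probabilities.groupby('is_correct').confidence.describe()
)
print(confidence_summary)
```

If the incorrect predictions cluster at lower confidence values than the correct ones, that suggests the model's errors fall mostly on observations it was unsure about, which is the kind of pattern this inspection is meant to surface.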