You're reading from Interactive Dashboards and Data Apps with Plotly and Dash Harness the power of a fully fledged frontend web framework in Python – no JavaScript required

Product type Paperback

Published in May 2021

Publisher Packt

ISBN-13 9781800568914

Length 364 pages

Edition 1st Edition

Languages

JavaScript

Tools

Dash

Concepts

Data Visualization

Author (1):

Elias Dabbas

View More author details

Table of Contents (18) Chapters

Preface

1. Section 1: Building a Dash App

2. Chapter 1: Overview of the Dash Ecosystem FREE CHAPTER

3. Chapter 2: Exploring the Structure of a Dash App

4. Chapter 3: Working with Plotly's Figure Objects

5. Chapter 4: Data Manipulation and Preparation, Paving the Way to Plotly Express

6. Section 2: Adding Functionality to Your App with Real Data

7. Chapter 5: Interactively Comparing Values with Bar Charts and Dropdown Menus

8. Chapter 6: Exploring Variables with Scatter Plots and Filtering Subsets with Sliders

9. Chapter 7: Exploring Map Plots and Enriching Your Dashboards with Markdown

10. Chapter 8: Calculating the Frequency of Your Data with Histograms and Building Interactive Tables

11. Section 3: Taking Your App to the Next Level

12. Chapter 9: Letting Your Data Speak for Itself with Machine Learning

13. Chapter 10: Turbo-charge Your Apps with Advanced Callbacks

14. Chapter 11: URLs and Multi-Page Apps

15. Chapter 12: Deploying Your App

16. Chapter 13: Next Steps

17. Other Books You May Enjoy

Preparing data with scikit-learn

scikit-learn is one of the most widely used and comprehensive machine learning libraries in Python. It plays very well with the rest of the data-science ecosystem libraries, such as NumPy, pandas, and matplotlib. We will be using it for modeling our data and for some preprocessing as well.

We now have two issues that we need to tackle first: missing values and scaling data. Let's see two simple examples for each, and then tackle them in our dataset. Let's start with missing values.

Handling missing values

Models need data, and they can't know what to do with a set of numbers containing missing values. In such cases (and there are many in our dataset), we need to make a decision on what to do with those missing values.

There are several options, and the right choice depends on the application as well as the nature of the data, but we won't get into those details. For simplicity, we will make a generic choice of replacing...