Visualizing and exploring data using JupyterHub
Recall from Chapter 9, Building Your Data Pipeline, that the data engineer has worked with the SME of the business and prepared the flight data that can be used to predict the flights' on-time performance.
In this section, you will understand the data produced by the data engineering team. This is the role of the data scientist who is responsible for building the model. You will see how the platform enables your data science and data engineering teams to collaborate and how the data scientist can use the platform to build a model for the given problem.
Let's do some base data exploring using the platform. Keep in mind that the focus of this book is to enable your team to work efficiently. The focus is not on data science or data engineering but on building and using the platform:
- Launch JupyterHub, but this time select the image that is relative to the data science life cycle. SciKit is one such image available...