12.2 Ingesting data into a notebook
As a sample data analytics problem, we’ll look at a data collection containing four closely related series of samples. The file is named anscombe.json. Each series of data is a sequence of (x,y) data pairs and a name for the series, represented as a Python dictionary. The series key has the name of the series. The data key is the list of data pairs. The four series are sometimes called Anscombe’s Quartet.
We’ll create a notebook to ingest the data. To begin the work, this initial recipe will focus on ordinary Python expressions to confirm the data was loaded properly. In later recipes, we’ll use visualization and statistical methods to see if there are correlations between the two variables.
12.2.1 Getting ready
There are a few preliminary steps:
Make sure the Jupyter Lab server is running. If it isn’t, see the Starting a Notebook and...