To build an effective, representative dashboard with different visualizations, we first need to download, ingest, and analyze a dataset that is in the public domain.
We've chosen a very old dataset from NASA that you can obtain at http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html. The dataset contains HTTP access logs for the NASA website from July 1, 1995 to August 31, 1995. At the time of writing this book, it's almost a fossil: nearly 24 years old!
However, the information in the logs is timeless and is perfect for what we want to achieve here. First, let's get an understanding of the dataset.