Exploring public datasets
Public datasets are collections of data that are made available to the public by various sources, such as governments, organizations, and researchers. Public datasets can be useful for analyzing trends and patterns in different domains, such as health, education, environment, or social impact.
DuckDB is a powerful tool for exploring, understanding, and gaining insights from public datasets. In this section, we’ll work with a public dataset that has been made available in CSV format. We’ll use DuckDB to load it in an appropriate form, summarize it, and export it to another format. This worked example will allow us to showcase some of DuckDB’s versatile features that make it well suited for analytical workflows.
Bike-share station readings
We are going to be exploring the Melbourne Bike Share dataset, which provides historical data from the Melbourne Bike Share service, which operated from 2010 to 2019, and is made available by...