For our real-world dataset example, we are going to use two different sources and blend them together using the techniques we've learned throughout this book. Since Know Your Data (KYD) still applies, let's walk through the sources.
KYD sources
The first source is from the World Bank and is a list of green bonds, which are used to fund the reduction of carbon emissions and climate-related projects. It was downloaded from the website, so it's a snapshot based on a point in time stored as a CSV file with 115 rows and 10 columns, including a header.
A visual preview of the data in Microsoft Excel can be seen in the following screenshot:
The source data has some insights that we can mine through as is, such as the following:
- How many bonds are issued by Currency?
- What is the total distribution of the bonds by Currency?
- Which bonds are maturing in...