Test your knowledge
You've started a new data science position at a solar cell installation company. They have some solar cell and solar irradiation data in Excel files they want you to load, clean, and analyze, and then deliver your results to the executive team and president. You should deliver a small summary of your EDA work from pandas and save your cleaned and prepared data as a new Excel file. The data files are solar_data_1.xlsx
and solar_data_2.xlsx
on the GitHub repository for this book. The metadata.csv
file describes the different columns.
You can read more about this data and what the different fields mean here: https://www.kaggle.com/jboysen/google-project-sunroof
You can also look at the notebooks of existing and aspiring data scientists linked on the Kaggle dataset page for more inspiration.