Defining the problem and Python data selection
Before we look at the pandas library, let's define what data analysis is. When we talk about data analysis, we are talking about the process of inspecting, cleansing, transforming, and modeling data with the intent of discovering useful data, notifying conclusions, and supporting decision-making. Decision-making is critical. We don't just want to see what the data says has happened in the past. We want to use data in order to make informed decisions for the future.
Take a look at some of the uses of data analysis:
- Business: It helps when making decisions based on customer trends and behavior prediction, increasing business productivity, and driving effective decision-making.
- Weather forecasting: Data about the atmosphere (temperature, humidity, wind, and more) is collected and analyzed to understand atmospheric processes (meteorology) to determine how the atmosphere will evolve in the future.
- Transportation...