Reviewing the requirements for the solution
Now that you’ve explored the FAA Wildlife Strike data, you have a better understanding of what data is available for your solution. The original assignment that you were given by your stakeholders was as follows:
- Provide leadership with tools to do interactive analysis of the FAA Wildlife Strike data
- Find insights about factors that influence the incidents
- Make predictions about future wildlife strike incidents
Those requirements sound pretty vague! Now that you have a better understanding of the available data, it’s a good time to circle back with the stakeholders and clarify those requirements. You ask them questions such as the following:
- What types of interactive analysis do you want to do?
- Are you interested in the impact on endangered bird species?
- Maybe you’d like to view trends due to wildlife strikes and compare airports and regions?
- Perhaps you’d like to see the height and frequency of wildlife strikes at different times of the year?
- Would understanding correlations to factors such as aircraft size, time of day, season, geography, and height be useful?
- Are you interested in predicting specific risks or outcomes?
Entire books have been written about gathering requirements for data and analytics projects, and discussions on this particular topic could also be extensive. For the sake of keeping this book consumable, let’s assume that your discussions with the stakeholders led to prioritizing the following deliverables for the project:
- Analytic report: Viewing trends over time such as number of incidents, location of incidents, height, and details such as types of aircraft and wildlife species
- Predict damage: When a strike is reported, make a prediction as to whether there will be a cost associated with any damage
- Predict size: When a strike is reported, make a prediction about the size of the wildlife that struck the aircraft
- Predict height: For wildlife strikes, predict the height of the incidents
Now, you review the notes you took about the FAA Wildlife Strike data during your data exploration efforts. In doing so, you can think about how the data might match up to the use cases. Based on the requirements and your initial exploration of the data, you decide that the FAA Wildlife Strike data from the wildlife.accdb
file and the tables from the read_me.xls
file (Engine Codes, Aircraft Type, and Engine Position) are appropriate content to include during the initial phases of your project.