Summary
In this chapter, we prepared the data in a Star Schema, which has been optimized for data analysis and reporting purposes on top of a flat data structure. We identified potential dimensions and discussed the reasons for creating or not creating separate dimension tables. We then went through the transformation steps to create the justified dimension tables. Finally, we added all the dimension key columns to the fact table and removed all the unnecessary columns, which gave us a tidy fact table that only contained all the necessary columns.
The next chapter covers an exciting and rather important topic: Data preparation common best practices. By following these best practices, we can avoid a lot of reworks and maintenance costs.
Join us on Discord!
Join The Big Data and Analytics Community on the Packt Discord Server!
Hang out with 558 other members and enjoy free voice and text chat.