Summary
In this chapter, we discussed how to manage data integrity issues with BI tools. We began by making sure that data types are formatted consistently across our working files. Then, we covered the data profiling features: column quality, column distribution, and column profiling. After that, we worked through how to cleanse the data, how to identify outliers, and how to manage relationships in data models. Lastly, we looked at how to handle large financial datasets using data validation. You should now feel confident applying these techniques whenever you need to clean data prior to analysis.
In the next chapter, we will continue our journey with these BI tools and cover how to implement best practices.