In this chapter, we learned about the different data life cycle stages, including when data is created, shared, maintained, archived, retained, and deleted.
This chapter gave you a detailed understanding of how big data is managed, considering the fact that it is either unstructured or semi-structured and it has a fast arrival rate and large volume.
As the complexity of the infrastructure that generates and uses data in business organizations has increased drastically, it has become imperative to secure your data properly. This chapter further covered data security tools, such as Apache Ranger, and patterns to help us learn how to have control over the access patterns of data.
In the next chapter, we will take a look at Hadoop installation, its architecture and key components.