Working with AWS Glue
In the preceding chapter, we discussed various data storage types, including data warehouses, data lakes, data lakehouses, and data meshes, along with their key differences.
This chapter will explore the distinct components of AWS Glue, providing insight into how they can aid in data wrangling tasks.
After completing this chapter, you will be able to comprehend and define how AWS Glue can be utilized for data wrangling. You will also be capable of explaining the fundamental concepts associated with various AWS Glue features, such as AWS Glue Data Catalog, AWS Glue connections, AWS Glue crawlers, AWS Glue Schema Registry, AWS Glue jobs, AWS Glue development endpoints, AWS Glue interactive sessions, and AWS Glue triggers.
The following topics will be covered in this chapter:
- Spark basics
- AWS Glue features
- Data discovery using AWS Glue
- Data ingestion using AWS Glue