Monitoring the components of a data platform
Data platforms can consist of multiple components: data ingestion jobs, ETL jobs, data crawlers, data catalogs, ad hoc query engines, BI dashboards, and more. To detect issues that could affect the end-user experience, monitor each component of your data platform individually. The key topics for monitoring AWS Glue and its related components are as follows:
- Monitoring overall statistics
- Monitoring state changes
- Monitoring delay
- Monitoring performance
- Monitoring common failures
- Monitoring log messages
In the following sub-sections, we will look at each of these key topics in detail.
Monitoring overall statistics
For AWS Glue jobs, AWS Glue Studio gives you an aggregated view of the overall job run statistics, as shown in the following screenshot. This view is useful for monitoring trends across all the jobs in an AWS account and Region:
Figure 11.3 –...
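If you want similar aggregate numbers outside the console, for example to feed your own dashboard, one option is to collect job runs through the AWS Glue API and summarize them yourself. The following is a minimal sketch, not a reproduction of what Glue Studio computes: the function names `summarize_job_runs` and `fetch_all_job_runs` are hypothetical, the choice of which states count as finished is an assumption, and the code relies only on the `get_jobs` and `get_job_runs` operations of the boto3 Glue client and the `JobRunState` and `ExecutionTime` fields they return:

```python
from collections import Counter

# Assumption: these states are treated as "finished" when computing the success rate.
FINISHED_STATES = ("SUCCEEDED", "FAILED", "TIMEOUT", "ERROR", "STOPPED")


def summarize_job_runs(job_runs):
    """Aggregate a list of Glue JobRun dictionaries into overall statistics."""
    states = Counter(run.get("JobRunState", "UNKNOWN") for run in job_runs)
    finished = sum(states.get(state, 0) for state in FINISHED_STATES)
    return {
        "total_runs": len(job_runs),
        "runs_by_state": dict(states),
        # None when no run has finished yet, to avoid dividing by zero.
        "success_rate": states.get("SUCCEEDED", 0) / finished if finished else None,
        "total_execution_seconds": sum(run.get("ExecutionTime", 0) for run in job_runs),
    }


def fetch_all_job_runs(region_name=None):
    """Collect the job runs of every Glue job in one account and Region."""
    import boto3  # imported lazily so summarize_job_runs can be used without AWS access

    glue = boto3.client("glue", region_name=region_name)
    runs = []
    for job_page in glue.get_paginator("get_jobs").paginate():
        for job in job_page["Jobs"]:
            run_pages = glue.get_paginator("get_job_runs").paginate(JobName=job["Name"])
            for run_page in run_pages:
                runs.extend(run_page["JobRuns"])
    return runs
```

Calling `summarize_job_runs(fetch_all_job_runs())` with valid AWS credentials returns a dictionary with the total number of runs, a count per state, the success rate among finished runs, and the total execution time in seconds. Keeping the aggregation separate from the API calls means the summary logic can be checked against sample data without touching AWS.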