The AWS Glue/Lake Formation data catalog
As discussed previously, the AWS Glue catalog is a technical data catalog that can capture some business attributes using key/value tags. For example, you can have a key called data_owner
and an associated value as a tag on each table in the catalog.
Within AWS, there are two services for interacting with the data catalog. So far, we have only discussed the AWS Glue service, but the AWS Lake Formation service also provides an interface for the same catalog.
It is important to understand that there is only a single data catalog, but that both Glue and Lake Formation provide an interface to the catalog. For example, if you set zone:curated
as a table property on the film_category
table in curatedzonedb
using the Glue console, you will see that same property set when viewing the table using Lake Formation.
Here, we can see the table details for the film_category
table in the AWS Glue console, and we can see that one of the tags on this...