Populating metadata
The first step of any Data Catalog is to populate it with databases and tables. AWS Glue provides both manual and automatic options for doing so, the latter being particularly useful to avoid the cumbersomeness of defining datasets from scratch. This section will explain how the Data Catalog works and will demonstrate how to interact with it in different ways.
Glue Data Catalog API
Just as in other AWS services, AWS Glue offers a fully fledged application programming interface (API; https://docs.aws.amazon.com/glue/latest/dg/aws-glue-api.html) to interact with it, which includes the Data Catalog. Thus, operations such as creating a database or a table can be done through said API or any of its containers, such as the AWS CLI or any of the software development kits (SDKs).
For instance, let’s start populating our catalog manually. The first step is to create a database, which we can do using the AWS CLI. The CLI Command Reference page (https://docs...