Hands-on – ingesting data with AWS DMS
As we discussed earlier in this chapter, AWS DMS can be used to replicate a database into an Amazon S3-based data lake (among other uses). Follow the steps in this section to do the following:
- Deploy a CloudFormation template that configures a MySQL RDS instance and then deploys an EC2 instance to load a demo database into MySQL.
- Set up a DMS replication instance and configure endpoints and tasks.
- Run the DMS instance in full-load mode.
- Run a Glue Crawler to add the tables that were newly loaded into S3 into the AWS Glue Data Catalog.
- Query the data with Amazon Athena.
- Delete the CloudFormation template in order to remove the resources that have been deployed.
NOTE
The following steps assume the use of your AWS account’s default VPC and security group. You will need to modify the steps as needed if you’re not using the default.