Chapter 4: Data Pipelines
Companies build modern cloud-based data warehouses to either migrate from their on-premises data warehouses or to build new workloads. To hydrate data in these modern data warehouses, users can build data pipelines based on the source data. In this chapter, we will cover the different types of data pipelines that we can design on Amazon Web Services (AWS) with Amazon Redshift as a destination data warehouse.
The following recipes are discussed in this chapter:
- Ingesting data from transactional sources using AWS Database Migration Service (AWS DMS)
- Streaming data to Amazon Redshift via Amazon Kinesis Firehose
- Cataloging and ingesting data using AWS Glue