Updating and inserting data
An Extract Transform Load (ETL) process is a common technique for refreshing the data warehouse of the source system. The ETL process can be executed as a batch/near-real-time process that allows us to stage the data from the source system and perform bulk refreshes of the Amazon Redshift data warehouse. Amazon Redshift, being an RDBMS-based system, allows data refreshes to occur in the form of UPDATE
/INSERT
/DELETE
operations, broadly known as Data Manipulation Language (DML).
In this recipe, we will delve into some of the common ETL strategies for refreshing a dimensional model.
Getting ready
To complete this recipe, you will need to do the following:
- Gain access to the AWS Console.
- Deploy an Amazon Redshift cluster in AWS region eu-west-1.
- Create Amazon Redshift cluster master user credentials.
- Gain access to any SQL interface, such as a SQL client or the Amazon Redshift Query Editor.
- Set up a sample dimensional model. ...