Hands-on – automated deployment of data platform components and data transformation code
While we do not have space to cover all aspects of building a modern data platform, in this section we will cover how to use various AWS services to deploy some components of a data platform. We start by setting up an AWS CodeCommit repository that will contain all the resources for our data repository (such as Glue ETL scripts and CloudFormation templates). We then use AWS CodePipeline to configure pipeline jobs that push any code or infrastructure changes into our target account.
Setting up a Cloud9 IDE environment
Our first step is to create a Cloud9 IDE environment, which we can use for writing our code and committing code to a CodeCommit repository. Cloud9 is an AWS service that can be used to provision a managed EC2 instance to provide us with a browser-based Integrated Development Environment (IDE) that we can use to write, run, and debug code from within our web browser...