Designing a high-performance data warehouse
Designing a high-performing data warehouse means weighing multiple factors. These include cluster type and sizing, compression types, distribution keys, sort keys, data types, and table constraints.
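To make the table-level factors listed above more concrete, here is a minimal sketch of where each one shows up in a Redshift-style CREATE TABLE statement. The helper build_create_table, the Column class, and the sales table with its columns are all hypothetical names invented for illustration, and the chosen encodings and keys are assumptions rather than recommendations.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Column:
    """Hypothetical column description used only for this illustration."""
    name: str
    data_type: str                   # data type, e.g. BIGINT, DATE, VARCHAR(32)
    encoding: Optional[str] = None   # compression type, e.g. az64 or zstd
    not_null: bool = False           # a simple table constraint


def build_create_table(table: str, columns: List[Column],
                       dist_key: str, sort_keys: List[str]) -> str:
    """Render a Redshift-style CREATE TABLE statement as a string."""
    col_lines = []
    for col in columns:
        parts = [col.name, col.data_type]
        if col.encoding:
            parts.append(f"ENCODE {col.encoding}")
        if col.not_null:
            parts.append("NOT NULL")
        col_lines.append("    " + " ".join(parts))
    body = ",\n".join(col_lines)
    return (
        f"CREATE TABLE {table} (\n{body}\n)\n"
        f"DISTSTYLE KEY\n"
        f"DISTKEY ({dist_key})\n"
        f"SORTKEY ({', '.join(sort_keys)});"
    )


# Assumed example table: distribute rows by customer, sort by date.
ddl = build_create_table(
    "sales",
    [
        Column("sale_id", "BIGINT", encoding="az64", not_null=True),
        Column("customer_id", "BIGINT", encoding="az64", not_null=True),
        Column("sale_date", "DATE", not_null=True),
        Column("region", "VARCHAR(32)", encoding="zstd"),
    ],
    dist_key="customer_id",
    sort_keys=["sale_date"],
)
print(ddl)
```

The sketch only builds the statement text and does not connect to a cluster. Its purpose is to show that data types, compression, and constraints are set per column, while the distribution key and sort key are set for the table as a whole.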
As part of the design process, you will need to consider several trade-offs, such as cost versus performance. Business requirements and the available budget will often drive these decisions.
Beyond decisions about infrastructure and storage, the logical schema design also plays a big part in the performance of the data warehouse. This is often an iterative process: you start with an initial schema design and refine it over time to improve performance.
Provisioned versus Redshift Serverless clusters
When creating an Amazon Redshift cluster, you can either use a Redshift Serverless configuration or provision specific resources. With...