Designing a high-performance data warehouse
When you're looking to design a high-performing data warehouse, multiple factors need to be considered. These include items such as cluster type and sizing, compression types, distribution keys, sort keys, data types, and table constraints.
As part of the design process, you will need to consider several trade-offs, such as cost verse performance or the size of storage verse performance. Business requirements and the available budget will often drive these decisions.
Beyond decisions about infrastructure and storage, the logical schema design also plays a big part in optimizing the performance of the data warehouse. Often, this will be an iterative process, where you start with an initial schema design that you refine over time to optimize for increased performance.
Selecting the optimal Redshift node type
There are different types of nodes available, each with different combinations of CPU, memory, storage capacity, and...