Managing data distribution
Distribution style is a table property that dictates how that table's data is distributed throughout the compute nodes. The goal of data distribution is to leverage the massively parallel processing of Amazon Redshift and reduce the I/O during query processing to improve performance. Amazon Redshift Advisor provides actionable recommendations on distribution style for the table via the alter
statement. Using automatic table optimization allows you to self-manage the table distribution style based on workload patterns:
- KEY: The value is hashed. The same value goes to the same location (slice).
- ALL: The entirety of the table data goes to the first slice of every compute node.
- EVEN: Round robin data distribution is performed across the compute nodes and slices.
- AUTO: Combines the EVEN, ALL, and KEY distributions:
In this recipe, you will learn how Amazon Redshift...