Introducing Amazon Redshift
Amazon Redshift is one of the database as a service (DBaaS) offerings from AWS. It provides a massively scalable data warehouse as a managed service, at a significantly lower cost than traditional on-premises data warehouses. The data warehouse is based on the open source PostgreSQL database technology; however, not all features offered in PostgreSQL are present in Amazon Redshift. Here's a look at some of the essential concepts and terms that you ought to keep in mind when working with Amazon Redshift:
- Clusters: Just like Amazon EMR, Amazon Redshift relies on the concept of clusters. A cluster here is a logical container that holds one or more compute nodes (instances) and one leader node that is responsible for the cluster's overall management. Here's a brief look at what each node provides:
- Leader node: The leader node is a single node present in a cluster that is responsible for orchestrating and executing various database operations, as well as facilitating communication between the database...