Understanding the basics of partitioning
In the previous chapter, we briefly introduced the concept of partitioning as part of the Designing storage for efficient querying section. We explored storage-side partitioning concepts such as replicating data, reducing cross-partition operations such as joins, and eventual consistency to improve query performance. In this chapter, we will deep dive more systematically into both storage and analytical partitioning techniques. Let's start with the benefits of partitioning.
Benefits of partitioning
Partitioning has several benefits apart from just query performance. Let's take a look at a few important ones.
Improving performance
As we discussed in the previous chapter, partitioning helps improve the parallelization of queries by splitting massive monolithic data into smaller, easily consumable chunks.
Apart from parallelization, partitioning also improves performance via data pruning, another concept that we already...