Summary
In this chapter, we covered how to aggregate data using basic aggregation functions. Then, we introduced the advanced aggregations with GROUPING SETS
, ROLLUP
, and CUBE
, as well as aggregation conditions using HAVING
. We also covered the various analytic functions and windowing clauses. At the end of the chapter, we introduced three ways of sampling data in Hive. After going through this chapter, you should be able to do basic and advanced aggregations and data sampling in Hive.
In the next chapter, we'll talk about performance considerations in Hive.