Further reading
To learn more about what we’ve touched on in this chapter, please refer to the following resources:
- Apache Parquet: https://parquet.apache.org/docs/
- Apache ORC: https://orc.apache.org/specification/ORCv0/
- Apache Avro: https://avro.apache.org
- TPC-DS and its specification: http://www.tpc.org/tpcds/ and http://tpc.org/tpc_documents_current_versions/pdf/tpc-ds_v3.2.0.pdf
- Improve query performance using AWS Glue partition indexes: https://aws.amazon.com/blogs/big-data/improve-query-performance-using-aws-glue-partition-indexes/
- Video recording on YouTube: https://youtu.be/jyfJ1X_RaCs
- Effective data lakes using AWS Lake Formation, Part 1: Getting started with governed tables: https://aws.amazon.com/blogs/big-data/part-1-effective-data-lakes-using-aws-lake-formation-part-1-getting-started-with-governed-tables/
- Transitioning objects using Amazon S3 Lifecycle: https://docs.aws.amazon.com/AmazonS3/latest/userguide/lifecycle-transition-general...