Data Consumption Layer
In today’s data-driven world, organizations are dealing with an ever-increasing volume of data, and the ability to effectively consume and analyze this data is crucial for making informed business decisions. As we delve into the realm of big data on Kubernetes, we must address the critical component of the data consumption layer. This layer serves as the bridge between the vast repositories of data and the business analysts who need to extract valuable insights and make decisions that have an impact on the business.
In this chapter, we will explore two powerful tools that will enable you to unlock the potential of your Kubernetes-based data architecture: Trino and Elasticsearch. Trino, a distributed SQL query engine, will empower you to directly query your data lake, eliminating the need for a traditional data warehouse. You will learn how to deploy Trino on Kubernetes, monitor its performance, and execute SQL queries against your data stored in Amazon...