Business Intelligence with Databricks SQL: Concepts, tools, and techniques for scaling business intelligence on the data lakehouse

Product type Paperback

Published in Sep 2022

Publisher Packt

ISBN-13 9781803235332

Length 348 pages

Edition 1st Edition

Languages

SQL

Concepts

Business Intelligence

Author (1):

Vihag Gupta


Table of Contents (21) Chapters

Preface

1. Part 1: Databricks SQL on the Lakehouse

2. Chapter 1: Introduction to Databricks

3. Chapter 2: The Databricks Product Suite – A Visual Tour

4. Chapter 3: The Data Catalog

5. Chapter 4: The Security Model

6. Chapter 5: The Workbench

7. Chapter 6: The SQL Warehouses

8. Chapter 7: Using Business Intelligence Tools with Databricks SQL

9. Part 2: Internals of Databricks SQL

10. Chapter 8: The Delta Lake

11. Chapter 9: The Photon Engine

12. Chapter 10: Warehouse on the Lakehouse

13. Part 3: Databricks SQL Commands

14. Chapter 11: SQL Commands – Part 1

15. Chapter 12: SQL Commands – Part 2

16. Part 4: TPC-DS, Experiments, and Frequently Asked Questions

17. Chapter 13: Playing with the TPC-DS Dataset

18. Chapter 14: Ask Me Anything

19. Index

Why subscribe?

20. Other Books You May Enjoy

Summary

In this chapter, we dove headfirst into the Photon engine. We discussed the standard Apache Spark execution model and what has made Apache Spark so fast. Then, we discussed the prevalent query engine design models and why the vectorization model was chosen to replace the code generation design of Apache Spark. We learned about the core concept of vectorization and how it enables Photon to be as fast as it is. Finally, we discussed what Photon can and cannot do today and what its known feature roadmap looks like.
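As a quick refresher on the vectorization idea recapped above, the following is a minimal Python sketch. It is not Photon's implementation (Photon is a native engine, and this toy cannot reproduce its performance). The schema, the function names, and the batch size are all hypothetical and chosen only for illustration. The sketch contrasts processing one row at a time with processing small column batches, where each operator runs as a tight loop over a single column.

```python
from typing import Iterator, List, Tuple

# Hypothetical schema for illustration only: (quantity, price).
Row = Tuple[int, float]


def row_at_a_time(rows: List[Row], min_qty: int) -> float:
    """Evaluate filter, multiply, and sum for one row before moving to the next."""
    total = 0.0
    for qty, price in rows:
        if qty >= min_qty:
            total += qty * price
    return total


def to_batches(rows: List[Row], batch_size: int) -> Iterator[Tuple[List[int], List[float]]]:
    """Split rows into column batches: one list per column, batch_size values each."""
    for start in range(0, len(rows), batch_size):
        chunk = rows[start:start + batch_size]
        yield [q for q, _ in chunk], [p for _, p in chunk]


def vectorized(rows: List[Row], min_qty: int, batch_size: int = 4) -> float:
    """Run each operator over a whole column batch before the next operator starts."""
    total = 0.0
    for qtys, prices in to_batches(rows, batch_size):
        # Filter kernel: scan one column and record the positions that pass.
        selected = [i for i, q in enumerate(qtys) if q >= min_qty]
        # Multiply kernel: compute quantity * price only at the selected positions.
        products = [qtys[i] * prices[i] for i in selected]
        # Aggregate kernel: fold the batch result into the running total.
        total += sum(products)
    return total


# Hypothetical sample data.
rows = [(1, 2.0), (5, 1.5), (3, 4.0), (7, 0.5), (2, 10.0), (6, 1.0)]
```

Both functions compute the same answer for the same input. The difference is in the shape of the work: the batch version applies one simple operation to many values at a time, which is the pattern that a native vectorized engine can map onto CPU-friendly loops.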

Before we end this chapter, here is one final reminder: the aim of this chapter is only to give you a conceptual idea of how Photon works and why it is so fast. All the concepts have been simplified for better understanding. To dive deeper into the nuances, see the Further reading section.

With that, we have a complete understanding of the Databricks SQL toolset and its storage and computation technologies. In the next chapter, we will...