Understanding the data organization model in Databricks SQL
In this section, we will learn how data assets are organized in Databricks SQL. We call this the data organization model.
The open data lake, which is the foundation of the Databricks Lakehouse platform, relies on cloud object storage for storing data. This data is stored in human-readable formats such as CSV, TSV, and JSON, or in big-data-optimized formats such as Apache Parquet, Apache ORC, or Delta Lake.
A Note on Data Engineering
The data in the data lake is ingested by data engineering processes. Data engineers create data pipelines that bring data from source systems, clean and transform it, and write it to designated destinations in the data lake. These destinations are directories in the data lake. The data within a directory can be further arranged in some fashion – for example, by date.
These file formats are structured and have a defined schema. Having a schema...