Search icon CANCEL
Subscription
0
Cart icon
Cart
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Data Exploration and Preparation with BigQuery

You're reading from  Data Exploration and Preparation with BigQuery

Product type Book
Published in Nov 2023
Publisher Packt
ISBN-13 9781805125266
Pages 264 pages
Edition 1st Edition
Languages
Author (1):
Mike Kahn Mike Kahn
Profile icon Mike Kahn
Toc

Table of Contents (21) Chapters close

Preface 1. Part 1: Introduction to BigQuery
2. Chapter 1: Introducing BigQuery and Its Components 3. Chapter 2: BigQuery Organization and Design 4. Part 2: Data Exploration with BigQuery
5. Chapter 3: Exploring Data in BigQuery 6. Chapter 4: Loading and Transforming Data 7. Chapter 5: Querying BigQuery Data 8. Chapter 6: Exploring Data with Notebooks 9. Chapter 7: Further Exploring and Visualizing Data 10. Part 3: Data Preparation with BigQuery
11. Chapter 8: An Overview of Data Preparation Tools 12. Chapter 9: Cleansing and Transforming Data 13. Chapter 10: Best Practices for Data Preparation, Optimization, and Cost Control 14. Part 4: Hands-On and Conclusion
15. Chapter 11: Hands-On Exercise – Analyzing Advertising Data 16. Chapter 12: Hands-On Exercise – Analyzing Transportation Data 17. Chapter 13: Hands-On Exercise – Analyzing Customer Support Data 18. Chapter 14: Summary and Future Directions 19. Index 20. Other Books You May Enjoy

Exploring data loading techniques

There are various ways to ingest data into BigQuery. Data loading can be broken up into a few approaches: batch, streaming, and scheduled. Data load jobs can be executed via the console, the bq command-line interface, the API, other Google Cloud service integrations, and scheduled jobs. In this section, we will discuss each approach to help you understand the best data loading technique to get started and establish a data loading strategy for one-time or regular ingestion and analysis.

Batch loading data

Batch loading or bounded data is a process of loading data into BigQuery in a single operation. This can be done from a variety of sources, including files located in Google Cloud Storage or Google Drive, and local files. Acceptable file types include CSV, JSON, Apache Avro, ORC, and Apache Parquet formats. Batch loading is typically driven through the console UI, automated via API, or via scheduled jobs.

When loading data from a source, you...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €14.99/month. Cancel anytime