Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases now! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Data Engineering with Google Cloud Platform
Data Engineering with Google Cloud Platform

Data Engineering with Google Cloud Platform: A guide to leveling up as a data engineer by building a scalable data platform with Google Cloud , Second Edition

eBook
€17.99 €25.99
Paperback
€31.99
Subscription
Free Trial
Renews at €18.99p/m

What do you get with eBook?

Product feature icon Instant access to your Digital eBook purchase
Product feature icon Download this book in EPUB and PDF formats
Product feature icon Access this title in our online reader with advanced features
Product feature icon DRM FREE - Read whenever, wherever and however you want
Product feature icon AI Assistant (beta) to help accelerate your learning
Table of content icon View table of contents Preview book icon Preview Book

Data Engineering with Google Cloud Platform

Part 1: Getting Started with Data Engineering with GCP

This part will talk about the purpose, value, and concepts of big data and cloud computing and how Google Cloud platform (GCP) products are relevant for data engineering. You will learn about a data engineer’s core responsibilities, how they differ from data scientists, and how to facilitate the flow of data through an organization to derive insights.

This part has the following chapters:

Left arrow icon Right arrow icon
Download code icon Download Code

Key benefits

  • Get up to speed with data governance on Google Cloud
  • Learn how to use various Google Cloud products like Dataform, DLP, Dataplex, Dataproc Serverless, and Datastream
  • Boost your confidence by getting Google Cloud data engineering certification guidance from real exam experiences
  • Purchase of the print or Kindle book includes a free PDF eBook

Description

The second edition of Data Engineering with Google Cloud builds upon the success of the first edition by offering enhanced clarity and depth to data professionals navigating the intricate landscape of data engineering. Beyond its foundational lessons, this new edition delves into the essential realm of data governance within Google Cloud, providing you with invaluable insights into managing and optimizing data resources effectively. Written by a Data Strategic Cloud Engineer at Google, this book helps you stay ahead of the curve by guiding you through the latest technological advancements in the Google Cloud ecosystem. You’ll cover essential aspects, from exploring Cloud Composer 2 to the evolution of Airflow 2.5. Additionally, you’ll explore how to work with cutting-edge tools like Dataform, DLP, Dataplex, Dataproc Serverless, and Datastream to perform data governance on datasets. By the end of this book, you'll be equipped to navigate the ever-evolving world of data engineering on Google Cloud, from foundational principles to cutting-edge practices.

Who is this book for?

Data analysts, IT practitioners, software engineers, or any data enthusiasts looking to have a successful data engineering career will find this book invaluable. Additionally, experienced data professionals who want to start using Google Cloud to build data platforms will get clear insights on how to navigate the path. Whether you're a beginner who wants to explore the fundamentals or a seasoned professional seeking to learn the latest data engineering concepts, this book is for you.

What you will learn

  • Load data into BigQuery and materialize its output
  • Focus on data pipeline orchestration using Cloud Composer
  • Formulate Airflow jobs to orchestrate and automate a data warehouse
  • Establish a Hadoop data lake, generate ephemeral clusters, and execute jobs on the Dataproc cluster
  • Harness Pub/Sub for messaging and ingestion for event-driven systems
  • Apply Dataflow to conduct ETL on streaming data
  • Implement data governance services on Google Cloud

Product Details

Country selected
Publication date, Length, Edition, Language, ISBN-13
Publication date : Apr 30, 2024
Length: 476 pages
Edition : 2nd
Language : English
ISBN-13 : 9781835085363
Vendor :
Google
Category :
Languages :
Tools :

What do you get with eBook?

Product feature icon Instant access to your Digital eBook purchase
Product feature icon Download this book in EPUB and PDF formats
Product feature icon Access this title in our online reader with advanced features
Product feature icon DRM FREE - Read whenever, wherever and however you want
Product feature icon AI Assistant (beta) to help accelerate your learning

Product Details

Publication date : Apr 30, 2024
Length: 476 pages
Edition : 2nd
Language : English
ISBN-13 : 9781835085363
Vendor :
Google
Category :
Languages :
Tools :

Packt Subscriptions

See our plans and pricing
Modal Close icon
€18.99 billed monthly
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Simple pricing, no contract
€189.99 billed annually
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just €5 each
Feature tick icon Exclusive print discounts
€264.99 billed in 18 months
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just €5 each
Feature tick icon Exclusive print discounts

Frequently bought together


Stars icon
Total 96.97
Data Engineering with Google Cloud Platform
€31.99
Google Machine Learning and Generative AI for Solutions Architects
€37.99
Database Design and Modeling with Google Cloud
€26.99
Total 96.97 Stars icon

Table of Contents

18 Chapters
Part 1: Getting Started with Data Engineering with GCP Chevron down icon Chevron up icon
Chapter 1: Fundamentals of Data Engineering Chevron down icon Chevron up icon
Chapter 2: Big Data Capabilities on GCP Chevron down icon Chevron up icon
Part 2: Build Solutions with GCP Components Chevron down icon Chevron up icon
Chapter 3: Building a Data Warehouse in BigQuery Chevron down icon Chevron up icon
Chapter 4: Building Workflows for Batch Data Loading Using Cloud Composer Chevron down icon Chevron up icon
Chapter 5: Building a Data Lake Using Dataproc Chevron down icon Chevron up icon
Chapter 6: Processing Streaming Data with Pub/Sub and Dataflow Chevron down icon Chevron up icon
Chapter 7: Visualizing Data to Make Data-Driven Decisions with Looker Studio Chevron down icon Chevron up icon
Chapter 8: Building Machine Learning Solutions on GCP Chevron down icon Chevron up icon
Part 3: Key Strategies for Architecting Top-Notch Solutions Chevron down icon Chevron up icon
Chapter 9: User and Project Management in GCP Chevron down icon Chevron up icon
Chapter 10: Data Governance in GCP Chevron down icon Chevron up icon
Chapter 11: Cost Strategy in GCP Chevron down icon Chevron up icon
Chapter 12: CI/CD on GCP for Data Engineers Chevron down icon Chevron up icon
Chapter 13: Boosting Your Confidence as a Data Engineer Chevron down icon Chevron up icon
Index Chevron down icon Chevron up icon
Other Books You May Enjoy Chevron down icon Chevron up icon

Customer reviews

Most Recent
Rating distribution
Full star icon Full star icon Full star icon Full star icon Half star icon 4.5
(6 Ratings)
5 star 66.7%
4 star 16.7%
3 star 16.7%
2 star 0%
1 star 0%
Filter icon Filter
Most Recent

Filter reviews by




mayanktripathi4u Sep 27, 2024
Full star icon Full star icon Full star icon Full star icon Empty star icon 4
This book offers a comprehensive exploration of data engineering principles, specifically in the context of Google Cloud Platform (GCP). Aimed at both beginners and intermediate data engineers, it serves as an excellent resource for those looking to understand the fundamentals of building scalable data pipelines using GCP services. The book is particularly well-suited for data engineers, cloud architects, and IT professionals seeking to build robust, scalable data pipelines using Google Cloud’s services.What I liked:One of the most valuable aspects of this book is its structured approach. Adi Wijaya begins by laying a solid foundation, introducing readers to essential tools such as BigQuery, Cloud Storage, and Cloud Dataflow. From there, he builds upon that knowledge with more advanced topics like real-time data processing and machine learning integration, making it accessible for readers with varying levels of experience.The hands-on tutorials are another highlight, offering step-by-step instructions that allow readers to practice and implement what they've learned. This practical emphasis makes complex topics easier to grasp, particularly for those who prefer learning by doing. The author also includes command-line tools like gcloud and gsutil for interacting with Google Cloud services, providing readers with real-world experience in managing cloud resources. Additionally, the author does an excellent job showcasing real-world use cases, allowing readers to understand how these tools are applied in professional data engineering settings.Things which are missing as per my opinion:Although the book is packed with useful information, it may feel fast-paced for absolute beginners to cloud computing. Some prior understanding of cloud concepts would be beneficial to fully grasp the more advanced sections. Additionally, while the book provides a detailed look into GCP, readers looking for cross-platform comparisons (e.g., AWS or Azure) won’t find such insights here.Final Thoughts:Overall, "Data Engineering with Google Cloud Platform" is a highly valuable resource for anyone looking to master data engineering within GCP. Adi Wijaya delivers a balanced mix of theory and practical application, making it an ideal read for aspiring and practicing data engineers. Whether you're developing pipelines, optimizing workflows, or integrating machine learning, this book provides the knowledge you need to excel in GCP’s data ecosystem.
Amazon Verified review Amazon
Johnnie Sep 15, 2024
Full star icon Full star icon Full star icon Full star icon Full star icon 5
Many concepts are covered (batch and streaming data pipeline creation, job orchestration, data governance and cost strategies) - as well as GCP cloud data storage options (with discussions in data warehouse design using BigQuery).The book went into more complex data engineering concepts in GCP such as ephemeral clusters, Dataproc (examining Hadoop, Spark and Dataframe concepts) and CI/CD practices.Note:For the curious minded engineer who ask, “dude … you mentioned ephemeral clusters. What’s the difference in ephemeral and persistent clusters???”Good question! With Persistent clusters there always is some infrastructure running. But with ephemeral clusters the clusters are created, exist for the time it takes for jobs to complete, and then cease to exist when they are brought downHow about transient clusters? I’ll leave that research up to you!“Lots of examples and exercises” are provided that enable a “hands on experience” for the reader to engage for greater understanding.This book provides data engineers with the concepts, hands on activities and guidance necessary to navigate the Google Cloud Platform (GCP).
Amazon Verified review Amazon
Zeynep Jul 10, 2024
Full star icon Full star icon Full star icon Empty star icon Empty star icon 3
Topics are good but needs more explanations and better screenshots. Some screenshots are not readable and partial. There are grammatical and spelling errors in the sentences and some codes.
Subscriber review Packt
Steve Young Jun 21, 2024
Full star icon Full star icon Full star icon Full star icon Full star icon 5
Google Cloud Platform can be a very broad topic and contains many different products and services, yet this author was able to articulate on data engineering within the platform in a way that was much less dense than other books I’ve read. I am a data engineer and work with GCP on a daily basis. It was a pretty easy read while containing a lot of insightful and useful information about building data pipelines and other necessary activities of data engineering. The author also provided good color on how the platform is often used in particular industries which I found both useful and interesting. This book is a must-read if you have an interest in becoming a more functional and knowledgeable data engineer using GCP.
Amazon Verified review Amazon
Daniel J. Hampton III Jun 17, 2024
Full star icon Full star icon Full star icon Full star icon Full star icon 5
I have been struggling to find a book that covered comprehensive big picture concepts as well as technical details and I think this book balances it quite well.
Amazon Verified review Amazon
Get free access to Packt library with over 7500+ books and video courses for 7 days!
Start Free Trial

FAQs

How do I buy and download an eBook? Chevron down icon Chevron up icon

Where there is an eBook version of a title available, you can buy it from the book details for that title. Add either the standalone eBook or the eBook and print book bundle to your shopping cart. Your eBook will show in your cart as a product on its own. After completing checkout and payment in the normal way, you will receive your receipt on the screen containing a link to a personalised PDF download file. This link will remain active for 30 days. You can download backup copies of the file by logging in to your account at any time.

If you already have Adobe reader installed, then clicking on the link will download and open the PDF file directly. If you don't, then save the PDF file on your machine and download the Reader to view it.

Please Note: Packt eBooks are non-returnable and non-refundable.

Packt eBook and Licensing When you buy an eBook from Packt Publishing, completing your purchase means you accept the terms of our licence agreement. Please read the full text of the agreement. In it we have tried to balance the need for the ebook to be usable for you the reader with our needs to protect the rights of us as Publishers and of our authors. In summary, the agreement says:

  • You may make copies of your eBook for your own use onto any machine
  • You may not pass copies of the eBook on to anyone else
How can I make a purchase on your website? Chevron down icon Chevron up icon

If you want to purchase a video course, eBook or Bundle (Print+eBook) please follow below steps:

  1. Register on our website using your email address and the password.
  2. Search for the title by name or ISBN using the search option.
  3. Select the title you want to purchase.
  4. Choose the format you wish to purchase the title in; if you order the Print Book, you get a free eBook copy of the same title. 
  5. Proceed with the checkout process (payment to be made using Credit Card, Debit Cart, or PayPal)
Where can I access support around an eBook? Chevron down icon Chevron up icon
  • If you experience a problem with using or installing Adobe Reader, the contact Adobe directly.
  • To view the errata for the book, see www.packtpub.com/support and view the pages for the title you have.
  • To view your account details or to download a new copy of the book go to www.packtpub.com/account
  • To contact us directly if a problem is not resolved, use www.packtpub.com/contact-us
What eBook formats do Packt support? Chevron down icon Chevron up icon

Our eBooks are currently available in a variety of formats such as PDF and ePubs. In the future, this may well change with trends and development in technology, but please note that our PDFs are not Adobe eBook Reader format, which has greater restrictions on security.

You will need to use Adobe Reader v9 or later in order to read Packt's PDF eBooks.

What are the benefits of eBooks? Chevron down icon Chevron up icon
  • You can get the information you need immediately
  • You can easily take them with you on a laptop
  • You can download them an unlimited number of times
  • You can print them out
  • They are copy-paste enabled
  • They are searchable
  • There is no password protection
  • They are lower price than print
  • They save resources and space
What is an eBook? Chevron down icon Chevron up icon

Packt eBooks are a complete electronic version of the print edition, available in PDF and ePub formats. Every piece of content down to the page numbering is the same. Because we save the costs of printing and shipping the book to you, we are able to offer eBooks at a lower cost than print editions.

When you have purchased an eBook, simply login to your account and click on the link in Your Download Area. We recommend you saving the file to your hard drive before opening it.

For optimal viewing of our eBooks, we recommend you download and install the free Adobe Reader version 9.