Professional Data Engineer Certification
Up next is the list of services that are typically a must-know for a Professional Data Engineer.
Topics for Cloud Data Engineer Certification
The following are the list of important topics required for the certification:
- BigQuery
- Dataflow
- Dataproc
- Machine Learning API
- TensorFlow
- Stream Pipeline
- Streaming Analytics and Dashboards
Let’s discuss all of them in more detail.
BigQuery
BigQuery, as we know by now, is a serverless SQL data analysis tool on petabyte-scale data. Have some very good hands-on experience on the service and study different use cases. Learn how BigQuery works and the features it supports: serverlessness, SQL-like queries, wildcards, loading data (using a CLI, web UI, or API).
You can also have a user-defined function and the different constraints it has. Learn a few best practices such as stopping projecting unnecessary columns, filtering often with the where cause, and many others.
Dataflow
As you already know by now, Dataflow is about autoscaling...