Practice questions
Questions 1 to 4 are based on the data pipeline shown in Figure 9.22. The pipeline has the default configurations and the following resources:
- Pub topic =
t1
, and subscription =s1
- Dataflow job =
df1
, with a GCS bucket calledb1
- BigQuery dataset =
ds1
, and table =ds1-table
Figure 9.22 – GCP data pipeline
1. Which of the following is not part of df1
’s metrics?
A. Latency
B. CPU
C. Memory
D. Storage
2. What machine types will be used by df1
’s workers?
A. n1-standard
B. f1-micro
C. e2-medium
D. g1-small
3. When defining BigQuery table names, what’s your recommendation?
A. Use delimited identifiers
B. Use different versions of SQL
C. It doesn’t matter since you can change the table name on the fly
D. Use something related to the pipeline
4. We need to update df1
without losing any existing data. What’s your recommendation?
A. Update...