- Given a data processing workload that extracts data from an on-premise ERP (source) and loads into S3 (target), what is the LEAST cost-effective type of data processing—assuming business analysts don't require real-time analysis since they access the financial reports once a day?
A) Batch
B) Micro-batch
C) Streaming
- One of the situations where Glue custom classifiers are required is when data sources contain nested data structures. When data structures are flat on the source, the default classifiers are sufficient for standard data ingestion processing.
A) True
B) False
- When is it necessary to run a crawler?
A) When the schema of the crawled dataset has changed
B) When data was added to a previously crawled dataset but the schema hasn't changed
C) When the dataset was moved to a different S3 bucket
D) Both A and B
- Which file format is best to...