Summary
In this chapter, we looked at how organizations can share data that’s internal to the organization as well as externally for monetization. Internal data sharing can be as easy as sharing the data in the S3 data lake by providing cross-account access to Amazon Athena. Athena can read data from a shared Glue Data Catalog, making it easy to share different objects from the catalog. We also looked at how Redshift’s data sharing feature helps in sharing data that’s stored in one Redshift cluster with many other clusters in the organization. By creating a producer cluster and providing grants, the consumer cluster can easily access the objects shared with it.
Finally, we looked at patterns for sharing data external to the organization by leveraging AWS Data Exchange. Data Exchange helps us share datasets via various modes, such as files, S3, Redshift, Lake Formation, and APIs. Without data sharing features, complex ETL pipelines would have to be built to move...