Atlas Data Lake
MongoDB Atlas Data Lake is an analytics-optimized object storage service designed for extracted data. It provides an analytic storage service optimized for both flat and nested data, ensuring low-latency query performance.
Essentially, the data lake capability enables you to run a single query that will route to either object storage or a database. This allows for more advantageous data storage use cases, including the ability to handle data stored in various formats outside of JSON and BSON, such as CSV, TSV, Parquet files, and the like.
Atlas Data Lake requires a paid tier cluster usage with backup enabled. It supports collection snapshots from Atlas clusters as a data source for extracted data. The service automatically ingests data from the snapshots, partitions it, and stores it in an analytics-optimized format.
Data storage and optimization
Atlas Data Lake stores data in Parquet files, an analytic-oriented format based on open source standards, with...