Determining how the data will be used
The aforementioned data types are stored in either a data lake or a database. How the data will be used determines which of these services it should be stored in.
As described in the previous chapters, a data lake is a centralized repository that allows data to be stored in its raw format without the need for predefined schemas. Data lakes are often used for big data and analytics workloads, as they enable storing and processing large amounts of data from various sources in a flexible way.
A database, on the other hand, stores structured (and, in some cases, semi-structured) data that is organized in a specific way, typically with a defined schema and defined relationships between the data. This organization makes it easy to search, sort, and manipulate the data, which is why databases are often used for transactional workloads.
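The contrast between the two storage styles can be illustrated with a minimal sketch. It assumes a couple of hypothetical order records and uses Python's built-in json and sqlite3 modules as stand-ins for a data lake and a database; the record fields, the orders table, and the function names are illustrative only and do not refer to any particular service.

```python
import json
import sqlite3

# Hypothetical raw records, as they might land in a data lake. No schema is
# enforced, so the second record can carry an extra "coupon" field.
raw_events = [
    '{"order_id": 1, "customer": "Ana", "total": 19.99}',
    '{"order_id": 2, "customer": "Ben", "total": 5.00, "coupon": "SPRING"}',
]


def total_revenue_from_raw(lines):
    """Data lake style: keep the raw text and apply structure only on read."""
    return sum(json.loads(line)["total"] for line in lines)


def load_into_database(lines):
    """Database style: declare the schema first, then write rows that fit it."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders ("
        "order_id INTEGER PRIMARY KEY, "
        "customer TEXT NOT NULL, "
        "total REAL NOT NULL)"
    )
    for line in lines:
        record = json.loads(line)
        # Only the columns defined in the schema are stored; "coupon" is dropped.
        conn.execute(
            "INSERT INTO orders (order_id, customer, total) VALUES (?, ?, ?)",
            (record["order_id"], record["customer"], record["total"]),
        )
    conn.commit()
    return conn


revenue = total_revenue_from_raw(raw_events)
conn = load_into_database(raw_events)
rows = conn.execute(
    "SELECT customer, total FROM orders ORDER BY total DESC"
).fetchall()
```

In this sketch the raw records keep every field they arrived with, while the table accepts only what its schema defines. In return, the table can be sorted and filtered with a single query.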
Relational databases
Structured data is often stored and queried using relational databases. These databases utilize...