Ingesting data from data sources
In this section, we will learn about various data sources that a big data analytics system uses as a source of data. Typical data sources include transactional systems such as RDBMSes, file-based data sources such as data lakes, and message queues such as Apache Kafka. Additionally, you will learn about Apache Spark's built-in connectors to ingest data from these data sources and also write code so that you can view these connectors in action.
Ingesting from relational data sources
A Transactional System, or an Operational System, is a data processing system that helps an organization carry out its day-to-day business functions. These transactional systems deal with individual business transactions, such as a point-of-service transaction at a retail kiosk, an order placed on an online retail portal, an airline ticket booked, or a banking transaction. A historical aggregate of these transactions forms the basis of data analytics, and analytics...