Chapter 6. Getting Acquainted with Spark
It is all about data! Isn't it?
One of the most critical objectives of most enterprises is to churn/analyze and mashup the variety of data received from different channels, such as CRM, portals, and so on, and uncover the truth that can help them formulate business/marketing strategies, informed decisions, predictions, recommendations, and so on. Now what matters is how efficiently, effectively, and quickly you can uncover the hidden patterns in the data.
The sooner you can, the better it will be!
Distributed computing (https://en.wikipedia.org/wiki/Distributed_computing) or the paradigm of parallel computing/processing played a pivotal role in achieving the key objectives of enterprises. Distributed computing helped enterprises to process large datasets on multiple nodes that were connected to each other, which may be geographically distributed. All these nodes interact with each other and work toward achieving the common goal.
One of...