Data analytics is the process of ingesting, transforming, and visualizing data to discover useful insights for business decision-making. Over the past decade, more data has been collected, and customers have sought greater insight into that data. They have also wanted this insight in the least amount of time, sometimes even in real time, and they have wanted to run more ad hoc queries to answer more business questions. Answering those questions has required more powerful and efficient systems.
Batch processing typically involves querying large amounts of cold data, and it may take hours to get answers to business questions. For example, you might use batch processing to generate a billing report at the end of the month. Real-time stream processing typically involves querying small amounts of hot data, and it takes only a short amount of time to get answers. MapReduce-based systems such as Hadoop are examples of platforms that...