In this chapter, we will design a high-level architecture for an intelligent, distributed machine learning system that uses Spark as its core computation engine. We will focus on taking the existing architecture of a web-based business and redesigning it so that automated machine learning systems power key areas of the business.
Before we dig deeper into our scenario, we will spend some time understanding what machine learning is.
Then we will:
- Introduce a hypothetical business scenario
- Provide an overview of the current architecture
- Explore various ways in which machine learning systems can enhance or replace certain business functions
- Provide a new architecture based on these ideas
In a modern large-scale data environment, a machine learning system must meet the following requirements:
- It must integrate with the other components of the system, especially with data collection and...