Spark for fraud detection
In this section, we will start with a real business case of fraud detection to further illustrate our step-by-step machine learning process and then describe how to prepare Spark for this fraud detection project.
The use case
The ABC Corporation is a billion-dollar company that processes payments for thousands of clients in many industries, including real estate and vacation travel. Many kinds of frauds happened to this company and cost a lot. Most of the frauds happened online.
In order to prevent frauds from happening, the company collected a lot of data on its clients relating to payment processing transactions and also about past online activities for each client. Also, the company purchased a lot of data from third parties about the computer devices and bank accounts their clients use.
As for this project, our unit of analysis can be an individual company or person (ABC's client). Our unit of analysis can also be a payment transaction. In real practice, we performed...