In this chapter, we created a spam classifier. We started with two datasets, one representing ham and the other representing spam. We merged them into a single corpus, which we then put through the preprocessing steps described in the Implementation steps section.
In the next chapter, we will build on some of the techniques learned so far to create a fraud detection ML application.