In this chapter, we will cover the following recipes:
- Preprocessing data using tokenization
- Stemming text data
- Converting text to its base form using lemmatization
- Dividing text using chunking
- Building a bag-of-words model
- Building a text classifier
- Identifying the gender of a name
- Analyzing the sentiment of a sentence
- Identifying patterns in text using topic modeling
- Part-of-speech tagging with spaCy
- Word2Vec using gensim
- Shallow learning for spam detection