Technical requirements
The code for this chapter is located at https://github.com/PacktPublishing/Python-Natural-Language-Processing-Cookbook/tree/master/Chapter03. In this chapter, we will need additional packages. The installation instructions for Anaconda are as follows:
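pip install scikit-learn
pip install gensim
pip install langdetect
conda install pytorch torchvision cudatoolkit=10.2 -c pytorch
pip install transformers
pip install -U sentence-transformers
pip install whoosh
Note that scikit-learn is installed under the name scikit-learn but imported as sklearn. The pickle module is part of the Python standard library and does not need to be installed separately.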
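After installation, it can be useful to confirm that the packages are visible to the interpreter you will run the recipes with. The following is a minimal sketch, not part of the chapter's code: the helper name missing_packages is hypothetical, and the import names in REQUIRED_MODULES are the usual ones for these packages (for example, sentence-transformers is imported as sentence_transformers and pytorch as torch).

```python
from importlib.util import find_spec

# Import names (not pip/conda package names) for the chapter's dependencies.
# This list is an assumption based on each package's usual import name.
REQUIRED_MODULES = [
    "sklearn",
    "gensim",
    "pickle",  # standard library, listed only for completeness
    "langdetect",
    "torch",
    "transformers",
    "sentence_transformers",
    "whoosh",
]


def missing_packages(module_names):
    """Return the names from module_names that cannot be found for import."""
    # find_spec returns None for a top-level module that is not installed;
    # it does not import the module itself.
    return [name for name in module_names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

If any module is reported as missing, rerun the corresponding install command in the same Anaconda environment that you use to run the check.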
In addition, we will use the models and datasets located at the following URLs:
- http://vectors.nlpl.eu/repository/20/40.zip
- https://www.kaggle.com/currie32/project-gutenbergs-top-20-books
- https://www.yelp.com/dataset
- https://www.kaggle.com/PromptCloudHQ/imdb-data