In this recipe, we will join all the lessons of the previous recipes and will search the files in the directory for a particular keyword. This is a recap of the rest of the recipes in this chapter and includes a script that searches different kinds of files.
Scanning documents for a keyword
Getting ready
Be sure to include all the following modules in the requirements.txt file and install them into your virtual environment:
beautifulsoup4==4.6.0
Pillow==5.1.0
PyPDF2==1.26.0
python-docx==0.8.6
Check that the directory to search has the following files (all are available in GitHub in the Chapter04/documents directory). Note that file5.pdf and file6.pdf are copies of document-1.pdf, for simplicity. file1.txt to file4.txt are empty...