We need to load raw sentences in text format and iterate over them using an underlying iterator built for that purpose. A text corpus can also be preprocessed, for example by converting it to lowercase. Stop words can be specified while configuring the Word2Vec model. In this recipe, we will extract and load text data from various data-input scenarios.
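To make the idea concrete, here is a minimal sketch of line-by-line sentence loading with a lowercase preprocessing step. It uses only the Java standard library and does not reproduce any particular library's iterator classes; the class name `SentenceLoadingSketch` and the method `loadSentences` are hypothetical names chosen for illustration, and the sketch assumes the corpus holds one sentence per line.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.UnaryOperator;

// Hypothetical helper for illustration only; not part of any library API.
final class SentenceLoadingSketch {

    // Reads one sentence per line (an assumption about the corpus layout),
    // trims it, applies the given preprocessor, and skips empty results.
    static List<String> loadSentences(Reader source, UnaryOperator<String> preProcessor)
            throws IOException {
        List<String> sentences = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String sentence = preProcessor.apply(line.trim());
                if (!sentence.isEmpty()) {
                    sentences.add(sentence);
                }
            }
        }
        return sentences;
    }

    public static void main(String[] args) throws IOException {
        // An in-memory corpus stands in for a raw text file.
        Reader corpus = new StringReader("The quick brown Fox\n\n  Jumps over the lazy DOG  \n");
        List<String> sentences = loadSentences(corpus, s -> s.toLowerCase(Locale.ROOT));
        sentences.forEach(System.out::println);
    }
}
```

Passing the preprocessor as a function keeps the reading logic separate from the text clean-up, so the lowercase step could be swapped for another transformation without touching the loop.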
Reading and loading text data
Getting ready
Choose one of the iterator approaches in steps 1 to 5, depending on the kind of data you have and how you want to load it.
How to do it...
- Create a sentence iterator...