Text augmentation libraries
There are many more Python open source image augmentation libraries than text augmentation libraries. Some libraries are more adaptable to a particular category than others, but in general, it is a good idea to pick one or two and become proficient in them.
The well-known libraries are Nlpaug, Natural Language Toolkit (NLTK), Generate Similar (Gensim), TextBlob, TextAugment, and AugLy:
- Nlpaug is a library used for textual augmentation for DL. The goal is to improve DL model performance by generating textual data. The GitHub link is https://github.com/makcedward/nlpaug.
- NLTK is a platform used for building Python programs to work with human language data. It provides interfaces to over 50 corpora and lexical resources, such as WordNet. NLTK contains text-processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning. The GitHub link is https://github.com/nltk/nltk.
- Gensim is a popular open source...