Text analysis is the process of deriving information from text. Information is typically derived through techniques such as IR, NLP, and SL, and it involves structuring text, deriving patterns with the structured data, and finally evaluating and interpreting the output. The basic models used for text analysis are the bag-of-words models, the vector space model, and the semantic parsing model.
The bag-of-words model is a simplified text representation in which a text (a review in our case) is represented as the set of its terms (words), disregarding grammar and word order but keeping multiplicity (hence the term bag). After transforming the text into a bag-of-words and structuring into a corpus (a structured collection of the text data), we can calculate various measures to characterize the text into a vector space. The bag-of-words model...