In this section, we will learn about some basic statistical modeling approaches to build an information retrieval system using term frequency-inverse document frequency (TF-IDF), which we can use with the NLP pipelines to build fully functional chatbots. Also, later on, we will learn to build a much more advanced conversational bot that can extract a specific piece of information, such as location, capture time, and so on, using NER.
Building conversational bots
What is TF-IDF?
TF-IDFs are a way to represent documents as feature vectors. But what are they? TF-IDFs can be understood as a modification of the raw term frequency (TF) and inverse document frequency (IDF). The TF is the count of how often a particular word occurs...