Different IR algorithms you will learn
Information retrieval is sometimes referred to as IR. You will learn several algorithms in this chapter that are:
- Distance based: Two documents are matched based on their proximity, calculated by several distance metrics on the vector representation of the document
- Set based: Two documents are matched based on their proximity, calculated by several set based/fuzzy set based metrics based on the bag of words (BoW) model of the document
Don't worry if some of the phrases in this section don't make sense right now. By the end of this chapter, you will have a thorough understanding of these techniques and how to use them.