Chapter 7. Entity Recognition
Extracting information out of unstructured text data is a tedious process, because of the complex nature of natural language. Even after advancements in the field of Natural language processing (NLP), we are far from the point where any unrestricted text can be analyzed and the meaning can be extracted for general purposes. However, if we just focus on a specific set of questions, we can extract a significant amount of information from the text data. Named entity recognition helps identify the important entities in a text, to be able to derive the meaning from the unstructured data. It is a vital component of NLP applications, for example, question-answering systems, product discovery on e-commerce websites, and so on.
In this chapter, we will cover the following topics:
- Entity extraction
- Coreference and relationship extraction
- Sentence boundary detection
- Named entity recognition