Overview of text summarization
The core idea in summarization is to condense long-form text or articles into a short representation. The shorter representation should contain the main idea of crucial information from the longer form. A single document can be summarized. This document could be long or may contain just a couple of sentences. An example of a short document summarization is generating a headline from the first few sentences of an article. This is called sentence compression. When multiple documents are being summarized, they are usually related. They could be the financial reports of a company or news reports about an event. The generated summary could itself be long or short. A shorter summary would be desirable when generating a headline. A lengthier summary would be something like an abstract and could have multiple sentences.
There are two main approaches when summarizing text:
- Extractive summarization: Phrases or sentences from the articles are selected...