3.2. Datasets recommendation for data linking
Coreference resolution is a common thread across many communities, which is referred to as entity matching, entity disambiguation, cross-document coreference, duplicate record detection or record linkage. These terms all describe the process of determining the presence of different and heterogeneous descriptions of the same real-world objects and also the process of determining links and relations among these descriptions in order to make their correspondence explicit. Coreference resolution can build on a large body of related work across different communities. For example within database communities, we refer the interested reader to the works of Winkler et al. [WIN 06] on record linkage and Elmagarmid et al. [ELM 07] for duplicate detection. In the natural language processing field, we cite the survey of Soon et al. [SOO 01] where coreference resolution can be seen as the task of finding all expressions that refer to the same entity in...