NoSQL Data Models

You're reading from NoSQL Data Models: Addresses severe issues related to NoSQL data models

Product type Paperback
Published in Aug 2018
Publisher Wiley
ISBN-13 9781786303646
Length 278 pages
Edition 1st Edition
Author (1):
Olivier Pivert

Table of Contents (11 chapters)

Preface
1 NoSQL Languages and Systems
2 Distributed SPARQL Query Processing: a Case Study with Apache Spark
3 Doing Web Data: from Dataset Recommendation to Data Linking
4 Big Data Integration in Cloud Environments: Requirements, Solutions and Challenges
5 Querying RDF Data: a Multigraph-based Approach
6 Fuzzy Preference Queries to NoSQL Graph Databases
7 Relevant Filtering in a Distributed Content-based Publish/Subscribe System
List of Authors
Index
End User License Agreement

3.4. Techniques applied to the data linking process

Identity link discovery (also called linkset discovery) identifies equivalent resources across different datasets in a three-step process: preparing the data (preprocessing, step 1), aligning resources (instance matching, step 2), and fixing erroneous links generated between some of them (post-processing, step 3). First, the resources need to be represented in a uniform manner. This preprocessing is necessary when the datasets use different vocabularies, when resource values are expressed in different languages, or when the number of resources and properties to be compared is too high. To establish links, resources are then compared on their values. This comparison can be done at different levels, from the URIs of the resources to the description of their neighborhoods in the RDF graph. Finally, once equivalent resources are connected, some systems perform an additional step to evaluate the generated links and therefore...
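
To make the three steps concrete, here is a minimal Python sketch under several assumptions that are not taken from the chapter. Resources are reduced to a single label value keyed by a hypothetical example.org URI. Preprocessing is approximated by accent and case normalization, and instance matching by a plain string-similarity ratio with an arbitrary threshold. Post-processing is approximated by a one-to-one filter that keeps the best-scoring link per resource. Real linking tools may compare at other levels (URIs, graph neighborhoods) and use different repair strategies.

```python
import unicodedata
from difflib import SequenceMatcher


def normalize(value):
    """Step 1 (preprocessing): strip accents, lowercase, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", value)
    without_accents = "".join(c for c in decomposed if not unicodedata.combining(c))
    return " ".join(without_accents.lower().split())


def match(source, target, threshold=0.6):
    """Step 2 (instance matching): score every pair of labels, keep those above the threshold.

    The threshold value is an illustrative assumption.
    """
    candidates = []
    for s_uri, s_label in source.items():
        for t_uri, t_label in target.items():
            score = SequenceMatcher(None, normalize(s_label), normalize(t_label)).ratio()
            if score >= threshold:
                candidates.append((s_uri, t_uri, score))
    return candidates


def keep_one_to_one(candidates):
    """Step 3 (post-processing): greedily keep the best link per resource.

    This one-to-one filter is an assumed heuristic for discarding likely
    erroneous links, not the method described in the chapter.
    """
    links, used_sources, used_targets = [], set(), set()
    for s_uri, t_uri, score in sorted(candidates, key=lambda c: c[2], reverse=True):
        if s_uri not in used_sources and t_uri not in used_targets:
            links.append((s_uri, t_uri, score))
            used_sources.add(s_uri)
            used_targets.add(t_uri)
    return links


# Hypothetical datasets: URI -> label value.
source = {
    "http://example.org/a/1": "Étienne  Dupont",
    "http://example.org/a/2": "Marie Curie",
}
target = {
    "http://example.org/b/x": "etienne dupont",
    "http://example.org/b/y": "Marie Curie-Sklodowska",
    "http://example.org/b/z": "marie curie",
}

candidates = match(source, target)
links = keep_one_to_one(candidates)
for s_uri, t_uri, score in links:
    print(s_uri, "<->", t_uri, round(score, 2))
```

In this toy data, "Marie Curie" also passes the threshold against "Marie Curie-Sklodowska", so matching yields three candidate links; the post-processing filter keeps only the higher-scoring pair for that resource. The pairwise loop is quadratic in the number of resources, which illustrates why preprocessing matters when there are too many resources and properties to compare.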
