Microsoft SQL Server 2014 Business Intelligence Development Beginner's Guide

Get to grips with Microsoft Business Intelligence and Data Warehousing technologies using this practical guide

Product type: Paperback
Published: May 2014
ISBN-13: 9781849688888
Length: 350 pages
Authors (2): Reza Rad, Abolfazl Radgoudarzi
Table of Contents

Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
1. Data Warehouse Design
2. SQL Server Analysis Services Multidimensional Cube Development
3. Tabular Model Development of SQL Server Analysis Services
4. ETL with Integration Services
5. Master Data Management
6. Data Quality and Data Cleansing
7. Data Mining – Descriptive Models in SSAS
8. Identifying Data Patterns – Predictive Models in SSAS
9. Reporting Services
10. Dashboard Design
11. Power BI
12. Integrating Reports in Applications
Index

Matching


Data is naturally fuzzy. There are many reasons for this: typing mistakes, abbreviations, and so on. The following screenshot (sourced from Microsoft) shows an example of two records for the same person:

From a human point of view, both records shown in the preceding screenshot belong to the same person; they differ only in some abbreviations and string formats. From the computer's point of view, however, these records are different; in other words, they are not exactly identical.
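
The gap between the human view and the computer view can be illustrated with a minimal Python sketch. The two records below are hypothetical (they are not the values from the screenshot), and `difflib.SequenceMatcher` from the Python standard library stands in for a similarity measure; it is not the algorithm DQS uses.

```python
from difflib import SequenceMatcher

# Hypothetical records for the same person; illustrative values only,
# not taken from the screenshot referenced in the text.
record_a = "Robert Smith, 12 Main Street, New York"
record_b = "Rob Smith, 12 Main St., NY"


def similarity(a: str, b: str) -> float:
    """Return a ratio in [0, 1]; 1.0 means the strings are identical ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# Strict comparison: what a naive program sees.
exactly_equal = record_a == record_b

# Graded comparison: closer to how a person judges the two records.
score = similarity(record_a, record_b)

print("Exactly equal:", exactly_equal)
print("Similarity ratio:", round(score, 2))
```

The strict comparison reports the records as different, while the graded score falls somewhere between 0 and 1. Fuzzy matching relies on that second kind of measure.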

The data matching component of DQS compares domain values against a similarity threshold. The data steward creates matching policies in the Knowledge Base; each matching policy contains one or more matching rules, which define how records are matched to each other. In a matching rule, the type of similarity can be defined as prerequisite, exact match, or similar. Matching rules can be tuned incrementally as new data arrives, under the data steward's supervision.
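
As a rough illustration of how such a rule could behave, the following Python sketch models a single matching rule with the three similarity types named above. Everything in it is an assumption made for illustration: the `DomainRule` class, the weights, the 0.8 threshold, the sample records, and the use of `difflib` are hypothetical and do not reproduce DQS's internal algorithm or API.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class DomainRule:
    """One line of a hypothetical matching rule (not a DQS type)."""
    domain: str          # name of the domain (column) to compare
    kind: str            # "prerequisite", "exact", or "similar"
    weight: float = 0.0  # share of the score; unused for prerequisites


def text_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity ratio in [0, 1] (stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def matching_score(rec_a: dict, rec_b: dict, rules: list) -> float:
    """Combine the per-domain comparisons into one weighted score."""
    score = 0.0
    for rule in rules:
        a, b = rec_a[rule.domain], rec_b[rule.domain]
        if rule.kind == "prerequisite":
            # A prerequisite must agree exactly, otherwise there is no match.
            if a != b:
                return 0.0
        elif rule.kind == "exact":
            # All-or-nothing contribution.
            score += rule.weight * (1.0 if a == b else 0.0)
        else:
            # "similar": contribution proportional to the similarity ratio.
            score += rule.weight * text_similarity(a, b)
    return score


def is_match(rec_a: dict, rec_b: dict, rules: list, threshold: float = 0.8) -> bool:
    """Records match when the score reaches the (assumed) threshold."""
    return matching_score(rec_a, rec_b, rules) >= threshold


# Hypothetical rule and records, for illustration only.
rules = [
    DomainRule("country", "prerequisite"),
    DomainRule("email", "exact", weight=0.4),
    DomainRule("name", "similar", weight=0.6),
]
a = {"country": "US", "email": "r.smith@example.com", "name": "Robert Smith"}
b = {"country": "US", "email": "r.smith@example.com", "name": "Rob Smith"}
c = {"country": "CA", "email": "r.smith@example.com", "name": "Robert Smith"}

print(is_match(a, b, rules))  # name differs slightly, other domains agree
print(is_match(a, c, rules))  # prerequisite domain differs
```

In this sketch, tuning a rule means adjusting the weights or the threshold and re-running it against sample data, which is loosely analogous to the incremental tuning the data steward performs.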

There are four...
