SQL S ERVER D ATA Q UALITY S ERVICES Marc Jellinek Principal Consultant – Neudesic

Slides:



Advertisements
Similar presentations
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Advertisements

Data Quality Services + Whats new in SSIS in SQL Server 2012 James Beresford
1er Simposio Latinoamericano Data Quality Fundamentals Miguel Angel Granados Troncoso.
Power BI Sites and Mobile BI. What You Will Learn Sharing and Collaboration Introducing Power BI Exploring Power BI Features and Services Partner Opportunities.
Setting Big Data Capabilities Free How to Make Business on Big Data? Stig Torngaard, Partner Platon.
Implementing enterprise governance can sometimes feel like trying to corral an exuberant crowd.
November 10 th, 2011 DQS MATCHING G ADI P ELEG, S ENIOR P ROGRAM M ANAGER SQL S ERVER D ATA Q UALITY S ERVICES Microsoft SQL Server 2012.
SQL Server Data Quality Services A knowledge driven Data Quality Solution.
Chapter 17: Client/Server Computing Business Data Communications, 4e.
Integrate into existing systems with PowerShell integration modules Extend by building PS modules to enable integrating into other systems Optimize.
Introduction to Building a BI Solution 권오주 OLAPForum
DBI207 3 Data QualityIssueSample Data Problem Standard Are data elements consistently defined and understood ? Gender code = M, F, U in one system and.
Managing Master Data with MDS and Microsoft Excel
Enterprise Information Management (EIM): Bringing Together SSIS, DQS, and MDS Matt Masson Senior Program Manager Microsoft Corporation Matthew Roche Senior.
Master Data Services In SQL Server Denali Jeremy Kashel
Creating a SharePoint App with Microsoft Access Services
SharePoint Portal Server 2003 JAMES WEIMHOLT WEIDER HAO JUAN TURCIOS BILL HUERTA BRANDON BROWN JAMES WEIMHOLT INTRODUCTION OVERVIEW IMPLEMENTATION CASE.
November 10 th, 2011 DQS BOOTCAMP D AVID F AIBISH, S ENIOR P ROGRAM M ANAGER SQL S ERVER D ATA Q UALITY S ERVICES Microsoft SQL Server 2012.
Maintaining a Microsoft SQL Server 2008 Database SQLServer-Training.com.
Overview of SQL Server Alka Arora.
Crystal Hoyer Program Manager IIS Team Preview of features that will be announced at MIX09 Please do not blog, take pictures or video of session.
Get More Value from Your Reference Data—Make it Meaningful with TopBraid RDM Bob DuCharme Data Governance and Information Quality Conference June 9.
Using the WDK for Windows Logo and Signature Testing Craig Rowland Program Manager Windows Driver Kits Microsoft Corporation.
1 Keith Vicens, Managing Consultant CRM Housing Solution Extending Your Case Management Capabilities.
- 1 - Roadmap to Re-aligning the Customer Master with Oracle's TCA Northern California OAUG March 7, 2005.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
INTRODUCTION TO DATA QUALITY SERVICES Presentation by Tim Mitchell (Artis Consulting)
Microsoft and Community Tour 2011 – Infrastrutture in evoluzione Community Tour 2011 Infrastrutture in evoluzione.
HDNUG 27-March-2007 SQL Server 2005 Suite as a Business Intelligence Solution.
Cloud On Your Terms Breakthrough Insight Unlock new insights with pervasive data discovery across the organization Create business solutions fast, on.
© 2008 IBM Corporation ® IBM Cognos Business Viewpoint Miguel Garcia - Solutions Architect.
Microsoft SharePoint Server 2010 for the Microsoft ASP.NET Developer Yaroslav Pentsarskyy
DENALI SSIS AND DATA QUALITY ENHANCEMENTS Dr Greg Low Principal Mentor and CEO SolidQ Australia SESSION CODE: DAT307 (c) 2011 Microsoft. All rights reserved.
Chapter 17: Client/Server Computing Business Data Communications, 4e.
Atlanta User Group Introduction to: Data Quality & Master Data Management.
Master Data Management & Microsoft Master Data Services Presented By: Jeff Prom Data Architect MCTS - Business Intelligence (2008), Admin (2008), Developer.
Design and Implementation of a Rationale-Based Analysis Tool (RAT) Diploma thesis from Timo Wolf Design and Realization of a Tool for Linking Source Code.
Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process James Beresford Group Manager, Avanade DBI217.
© 2013, published by Flat World Knowledge Chapter 10 Understanding Software: A Primer for Managers 10-1.
Powered by Microsoft Azure, PointMatter Is a Flexible Solution to Move and Share Data between Business Groups and IT MICROSOFT AZURE ISV PROFILE: LOGICMATTER.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Einführung in.
MGT305 - Application Management in Private and Public Clouds Sean Christensen Senior Product Marketing Manager Microsoft Corporation MGT305.
MGT305 - Application Management in Private and Public Clouds Daniel Savage Microsoft Corporation MGT305 Kenan Owens Microsoft Corporation.
MAKING BUSINESS INTELLIGENT Brought to you by your local PASS Community! Self Service ETL with Power Query Welcome.
November 10 th, 2011 C LEANSING D ATA IN SSIS D AVID F AIBISH, S ENIOR P ROGRAM M ANAGER SQL S ERVER D ATA Q UALITY S ERVICES Microsoft SQL Server 2012.
Mastering Master Data Services Presented By: Jeff Prom BI Data Architect Bridgepoint Education MCTS - Business Intelligence, Admin, Developer.
Steve Simon MVP SQL Server BI
Data Platform and Analytics Foundational Training
Bought to you by.
DQS: Business Logic Meets Enterprise Integration
What’s New in SQL Server 2016 Master Data Services
Matt Masson Senior Program Manager Microsoft Corporation
Steve Simon MVP SQL Server BI
Extensible Platform Microsoft Dynamics 365
IBM INFOSPHERE MDM online Training in Bangalore
Welcome! Power BI User Group (PUG)
Business Intelligence for Project Server/Online
Swagatika Sarangi (Jazz), MDM Expert
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Welcome! Power BI User Group (PUG)
Introducing Qwory, a Business-to-Business Search Engine That’s Powered by Microsoft Azure and Detects Vital Contact Information for Businesses MICROSOFT.
06 | Managing Enterprise Data
TechEd /4/2018 3:19 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
TechEd /11/ :54 PM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered.
Data Quality in the BI Life Cycle
Chapter 17: Client/Server Computing
Technical Capabilities
Trust Your Data With Data Quality Services
TechEd /11/ :25 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered.
ArcGIS Online – The Road Ahead
Presentation transcript:

SQL S ERVER D ATA Q UALITY S ERVICES Marc Jellinek Principal Consultant – Neudesic

A BOUT M E Experience Principal Consultant - Neudesic Assistant Director (SQL Team) – Application Engineering at Ernst & Young IT Manager at MLB Network Sr. Technology Specialist at Microsoft Technologies Microsoft SQL Server 6.0, 6.5, 7.0, 2000, 2005, 2008, 2008 R2 and 2012 Relational Engine, Analysis Services, Integration Services and Reporting Services Marc Jellinek –

S ESSION O BJECTIVES Introduction to SQL Server Data Quality Services (DQS) Understanding the problem Demo Where do we go from here?

S ETTING THE STAGE Building from the SQL Server Series: Master Data Services in SQL Server 2012, presented by Patrick Gallucci (start at 6:26) PASS MDM/DQS Virtual Chapter Based on demos from “SQL Server 2012 Developers Update”

Data QualityIssueSample Data Problem Standard Are data elements consistently defined and understood? Gender code = M, F, U in one system and Gender code = 0, 1, 2 in another system Complete Is all necessary data present?20% of customers’ last name is blank, 50% of zip-codes are Accurate Does the data accurately represent reality or a verifiable source? A Supplier is listed as ‘Active’ but went out of business six years ago Valid Do data values fall within acceptable ranges? Salary values should be between 60, ,000 Unique Data appears several timesBoth John Ryan and Jack Ryan appear in the system – are they the same person? T HE D ATA Q UALITY P ROBLEM S PACE

W HAT ’ S T HE P ROBLEM My name is Marc Jellinek Marc <> “Mark”, “Marck” or “March” Jellinek <> “Jelinek”, “Jellineck”, “Jelineck”, “Jelliner”, “Jeliner” or “Jellyneck” RrRr

W HAT ’ S THE P ROBLEM JellinekJelinekJellineckJelineckJellinerJelinerJellyneck Marc Mark Marck March

T HE NIGHTMARE S CENARIO The Customer Dimension – Jellinek, Marc – Jellinek, Mark – Jellinek, Marck – Jellinek, March – Jelinek, Marc – Jelinek, Mark – Jelinek, Marck – Jelinek, March – Jellineck, Marc – Jellineck, Mark – Jellineck, Marck – Jellineck, March – Jelineck, Marc – Jelineck, Mark – Jelineck, Marck – Jelineck, March – Jelliner, Marc – Jelliner, Mark – Jelliner, Marck – Jelliner, March – Jelliner, Marc – Jelliner, Mark – Jelliner, Marck – Jelliner, March – Jellyneck, Marc – Jellyneck, Mark – Jellyneck, Marck – Jellyneck, March

A NALYTIC I MPACT Average Revenue per customer Average Profit per customer Number of customers Customers per Geography Customers by Income Customers by Gender Customers by Educational Level Customers by Product Bought

O BLIGATORY T RUISMS The accuracy of your reporting is determined by the accuracy of your data (Garbage In, Garbage Out) Decisions made based on data will only be as good as the data on which you are basing your decisions. You can’t manage what you can’t measure. Inaccurate measurements lead to interesting management challenges.

T HE D ATA Q UALITY S OLUTION S PACE Amend, remove or enrich data that is incorrect or incomplete. This includes correction, enrichment and standardization. Identifying, linking or merging related entries within or across sets of data. CleansingMatching ProfilingMonitoring Analysis of the data source to provide insight into the quality of the data and help to identify data quality issues. Tracking and monitoring the state of Quality activities and Quality of Data.

SQL S ERVER 2012 D ATA Q UALITY S ERVICES High quality data is critical to effective business intelligence and to business activities DQS is an on-premise Data Quality product in SQL Server 2012, extendible with knowledge from multiple parties thru Azure DataMarket Richer DQ knowledge and capabilities in the cloud will make it even easier to provide high quality data Data Quality Services (DQS) is a Knowledge-Driven data quality solution enabling IT Pros and data stewards to easily improve the quality of their data Included with SQL Server 2012 Enterprise and BI Editions

K EY D ATA Q UALITY S ERVICES C ONCEPTS Knowledge-Driven Semantics Knowledge Discovery Based on a Data Quality Knowledge Base (DQKB) Data Domains capture the semantics of your data Acquires additional knowledge the more you use it Open and Extendible Easy to use Add user-generated knowledge & 3 rd party reference data providers User experience designed for increased productivity

DQS A RCHITECTURE Matching Reference Data DQ Clients DQS UI DQ Server DQ Projects StoreCommon Knowledge Store Knowledge Base Store DQ Engine 3 rd Party / Internal MS DQ Domains Store MS DQ Domains Store Reference Data Services Reference Data Sets SSIS DQ Component DQ Active Projects MS Data Domains Local Data Domains Published KBs Knowledge Discovery Data Profiling & Exploration Cleansing Knowledge Discovery and Management Interactive DQ Projects Data Exploration Azure Market Place Categorized Reference Data Categorized Reference Data Services Reference Data API (Browse, Get, Update…) Reference Data API (Browse, Get, Update…) RD Services API (Browse, Set, Validate…) RD Services API (Browse, Set, Validate…) MDS Excel Add in

D ATA Q UALITY S ERVICES P ROCESSES Build Use DQ Projects Knowledge Management Match & De-dupe Correct & standardize Manage Knowledge Enterprise Data Reference Data Reference Data Cloud Services Integrated Profiling Notifications Progress Status Knowledge Base Discover / Explore Data

B ASIC D EFINITIONS Knowledge Base –Stores all the knowledge related to a specific type of data source –Container for domains Domain –Semantic representation of a type of data in a data field or column –Trusted values, invalid values and erroneous data –Synonym associations, term relationships, validation and business rules, matching policies Matching Rule –Set of rules and conditions that determine a match or duplicate

D ATA Q UALITY S ERVICES C OMPONENTS Data Quality Server Data Quality Client DQS Cleansing Component for SQL Server Integration Data Quality Processes in Master Data Management

D ATA Q UALITY S ERVICES C OMPONENTS Data Quality Server –SQL Server Databases DQS_MAIN –DQS Stored Procedures, the DQS Engine and published Knowledge Bases DQS_PROJECTS –Data required for knowledge base management and DQS project activities DQS_STAGING_AREA –Intermediate staging area where source data is copied and processed

D ATA Q UALITY S ERVICES C OMPONENTS Data Quality Client –Standalone application Designed for both data stewards and DQS Administrators Perform knowledge management, data quality projects and administration in one user interface Allows for domain management, matching policy creation, data cleansing, matching, profiling, monitoring and server administration. Can be installed on a remote computer

D ATA Q UALITY S ERVICES C OMPONENTS DQS Cleansing Component in SQL Server Integration Services –Performs data cleansing as a part of an SSIS package –Alternative to running a cleansing project within the Data Quality Services Client application Data Quality Processes within Master Data Services –Perform de-duplication on source data and master data within the Microsoft SQL Server Data Services Add-in for Microsoft Excel.

DEMO

R ESOURCES DBI207: Using Knowledge to Cleanse Data with Data Quality Services Data Quality Services Blog Books Online for SQL Server - Data Quality Services Install Data Quality Services Used Master Data Services Configuration Manager, set up IIS Troubleshoot Installation and Configuration Issues (Master Data Services in SQL Server SQL Server 2012 Developer Training Kit Web Installer SQL Server 2012 Update for Developers Training Workshop SQL Server 2012 Update for Developers Training Kit SQL Server 2012 Update for Developers Training Kit Content MSDN – Data Quality Services MSDN Discussions – Data Quality Services Technet – Data Quality Services PASS MDM/DQS Virtual Chapter

Driving Innovation with SAP On-Premise and Beyond Monish Nagisetty Daniel Sepp 7 June :00-11:00 PDT m/?id= Delivering a Semantic Model for Ad-Hoc Reporting Steve Muise21 June :00-11:00 PDT m/?id= Exploring the New Hadoop Implementation in Azure Jacob Saunders10 July :00-11:00 PDT m/?id= Amerishore – an Innovative, Socially Conscious Approach to Offshoring Tracy Derr17 July :00-11:00 PDT m/?id= Data Integration Improvements within SQL Server 2012 Kola Bolarin24 July :00-11:00 PDT m/?id= EDI: The Reports of My Death Have Been Greatly Exaggerated Abhilash Shanmug9 August :00-11:00 PDT m/?id=

THANK YOU