DBI207 3 Data QualityIssueSample Data Problem Standard Are data elements consistently defined and understood ? Gender code = M, F, U in one system and.

Slides:



Advertisements
Similar presentations
SIM348. “ConfigMgr appeared in Gartner client buying decisions more frequently than any other product in the market in 2010.”
Advertisements

WSV405. IPv6 Ready Logo Program
1er Simposio Latinoamericano Data Quality Fundamentals Miguel Angel Granados Troncoso.
OSP303. demo Status Bar Notification.
SQL Server Data Quality Services A knowledge driven Data Quality Solution.
SIM Separate solution install paths can be taken, stand alone and SCOM integrated. Both require core AVIcode web apps and DB’s.
DBI331. Cube Measure Group Measure Partition Cube Dimension Dimension Attribute Relationship Hierarchy Level Cube Attribute Cube Hierarchy Measure.
DBI209 demo Deploy & Future Proof Monitor & Manage Share & Collaborate Mash-up & Analyze Connect & ProvisionFind & Access.
OSP206. Experience Office as it was meant to be… without the complexity of setting up servers.
OSP203. Customer’s Need …is a continuous process of managing the life of an application through governance, development and maintenance.
Managing Master Data with MDS and Microsoft Excel
SIM346. General information about the software application.
DEV207. SSDT Database Services Database Services Analysis Services Reporting Services Integration Services.
DEV314. Entity Data Model demo Entity Data Model.
OSP202. Business Need Business Creates Application DeploySupport The SharePoint Application Lifecycle Business Self-Service.
DEV202 Before I get started... …is too expensive. …is too complex. …requires a server.
WSV314. MAP 5.5 Internet ExplorerWindows 7 Software Usage Tracking Heterogeneous Server & Database Inventory Windows Server 2008 R2 Hyper-V SQL Server.
WCL309. Demo.
SIM329. Certificate Enrollment Without CEP/CES Certificate Authority Active Directory Client Workstations LDAP RPC/DCOM.
OSP312 Beauty is, it’s entirely up to you.
OSP317. Built on SharePoint Leverage one or more out of the box or custom features. These features can typically live on there own Like any other.
November 10 th, 2011 DQS BOOTCAMP D AVID F AIBISH, S ENIOR P ROGRAM M ANAGER SQL S ERVER D ATA Q UALITY S ERVICES Microsoft SQL Server 2012.
EXL319. *Baseline for 80,000 user pool with 8 FEs and 1 BE Lync Server 2010 Capacity Calculator released.
WCL318. MAP 5.5 Internet ExplorerWindows 7 Software Usage Tracking Heterogeneous Server & Database Inventory Windows Server 2008 R2 Hyper-V SQL Server.
WPH203 Content Choice Discoverability demo.
SIM314 Introduction Transport Layer Summary Network Layer.
SIM335 Demo 6 7 NetApp Confidential - Internal Use Only.
demo.
SQL S ERVER D ATA Q UALITY S ERVICES Marc Jellinek Principal Consultant – Neudesic
DBI329. video.
DBI326. PhraseGoal “Data Mining”Inform actionable decisions “Machine Learning”Determine best performing algorithm.
OSP Addressing Critical Business Challenges 2. Increasing Productivity 3. Modern Organizational Reality 4. Connecting Data and People Business.
DPR302.
DENALI SSIS AND DATA QUALITY ENHANCEMENTS Dr Greg Low Principal Mentor and CEO SolidQ Australia SESSION CODE: DAT307 (c) 2011 Microsoft. All rights reserved.
OSP210 Talk, Show, Q&A -75 min.- Agenda Introducing.
2.
DEV331. class Tweet : TimelineItem {…} class DirectMessage : TimelineItem {…} class Notification : TimelineItem {…} … TimelineItem[] items = new.
WCL304.
DPR306. Process and tools Individuals and interactions over Following a plan Responding to change over Source: Comprehensive.
DPR305. Controller Model View Client Business Objects Server Business Objects Data.
OSP402 Required Slide Track PMs will supply the content for this slide, which will be inserted during the final scrub.
DEV211. The simplest way to create business applications for the desktop and the cloud.
Atlanta User Group Introduction to: Data Quality & Master Data Management.
DBI325. Monitoring Analytics Support will extend to Analysis Services in the Denali release.
SIM401. A. Datum Account Forest Trey Research Resource Forest Federation Trust Microsoft (Users) E-Company Store (Resource) Contoso(Users)Contoso(Users)Fabrikam(Resource)Fabrikam(Resource)
DPR301 demo Executable Requirements.
OSP318. ProfileSynchronizationServiceInstanceProfileSynchronizationServiceInstance Profile Service Instance Instance.
VIR326. Dell Compellent always puts the right data in the right place at the right time at the right cost. That’s Fluid Data.
DEV351.
DEV332. Required Slide Speakers, please list the Breakout Sessions, Interactive Discussions, Labs, Demo Stations and Certification Exam that.
Introducing Data Quality Services and its role in an Enterprise Information Management (EIM) Process James Beresford Group Manager, Avanade DBI217.
DEV321. demo Rule: Any slide about UX must be charcoal gray or black.
DEV203. Coded workflows Declarative workflows Web part hook-up Professional developerBusiness Analyst/Process Designer List definitions Event receivers.
Learn more: Download SCM: Join the TechNet Wiki community:
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Einführung in.
OSP-302. DescriptionUri All lists on a site.../_vti_bin/ListData.svc All Items in a named list.../_vti_bin/ListData.svc/MyList 2nd Item in the list.../_vti_bin/ListData.svc/MyList(2)
OSP319. Many Office integration options Excel & Excel Services REST InfoPath & Forms Services Access & Access Services Visio & SharePoint Designer.
DEV348. demo Valid HTML5 Syntax demo.
EXL Lync ‘out-of-the-box’ Add a little SDK magic…
WSV303. I live here... DC DNS DHCP WDS Clients DC DNS WDS/DHCP DC/DNS.
DEV354. Describe your data Create screens for common tasks Author business logic Customize screen layouts Define custom queries Create custom Silverlight.
SIM End users Web servers Application servers Data servers ? How do I know I have a problem? How do I isolate the problem? How do I diagnose.
WCL301. demo Basic Custom XML-file.
November 10 th, 2011 C LEANSING D ATA IN SSIS D AVID F AIBISH, S ENIOR P ROGRAM M ANAGER SQL S ERVER D ATA Q UALITY S ERVICES Microsoft SQL Server 2012.

DBI407. Oracle 10g CDF SSAS Cube Builder NAS SSAS Query Servers HW NLB Partition 1 Partition 2 Partition N Partition 1 Partition.
DEV353. Required Slide Speakers, please list the Breakout Sessions, Interactive Discussions, Labs, Demo Stations and Certification.
COS307. demo Required Slide Track PMs will supply the content for this slide, which will be inserted during the final scrub. Website:
Matt Masson Senior Program Manager Microsoft Corporation
Data Quality in the BI Life Cycle
Presentation transcript:

DBI207

3

Data QualityIssueSample Data Problem Standard Are data elements consistently defined and understood ? Gender code = M, F, U in one system and Gender code = 0, 1, 2 in another system Complete Is all necessary data present ?20% of customers’ last name is blank, 50% of zip-codes are Accurate Does the data accurately represent reality or a verifiable source? A Supplier is listed as ‘Active’ but went out of business six years ago Valid Do data values fall within acceptable ranges? Salary values should be between 60, ,000 Unique Data appears several timesBoth John Ryan and Jack Ryan appear in the system – are they the same person?

Cleansing MatchingProfiling Monitoring Monitoring Tracking and monitoring the state of Quality activities and Quality of Data Cleansing Amend, remove or enrich data that is incorrect or incomplete. This includes correction, standardization and enrichment. Profiling Analysis of the data source to provide insight into the quality of the data and help to identify data quality issues. Matching Identifying, linking or merging related entries within or across sets of data.

Data Quality Services (DQS) is a Knowledge-Driven data quality solution, enabling IT Pros and data stewards to easily improve the quality of their data

7 Based on a Data Quality Knowledge Base (DQKB) Knowledge-Driven Data Domains capture the semantics of your data Knowledge Discovery Acquires additional knowledge the more you use it Semantics Support use of user-generated knowledge and IP by 3 rd party reference data providers Open and Extendible Compelling user experience designed for increased productivity Easy to use

Build Use DQ Projects Knowledge Management Match & De-dupe Correct & standardize Knowledge Manage Discover / Explore Data / Connect Enterprise Data Reference Data Reference Data Cloud Services Integrated Profiling Notifications Progress Status Knowledge Base

Creating and managing the Data Quality Knowledge Bases Discover knowledge from your org’s data samples Exploration and integration with 3 rd party reference data Creating and managing the Data Quality Knowledge Bases Discover knowledge from your org’s data samples Exploration and integration with 3 rd party reference data Knowledge Management & Reference Data Correction, de-duplication and standardization of the data Cleansing & Matching Tools to monitor and control data quality processes Administration

demo

Domains Represent the data type Domains Represent the data type Values Rules & Relations 3 rd party Reference Data Knowledge Base Composite Domains Matching Policy Domains

Matching Reference Data DQ Clients DQS UI DQ Server DQ Projects StoreCommon Knowledge StoreKnowledge Base Store DQ Engine 3 rd Party MS DQ Domains Store MS DQ Domains Store Reference Data Services Reference Data Sets DQ Active Projects MS Data Domains Local Data Domains Published KBs Knowledge Discovery Data Profiling & Exploration Cleansing Knowledge Discovery and Management Interactive DQ Projects Data Exploration Future Clients – Excel, SharePoint… Azure Market Place Categorized Reference Data Categorized Reference Data Services Reference Data API (Browse, Get, Update…) Reference Data API (Browse, Get, Update…) RD Services API (Browse, Set, Validate…) RD Services API (Browse, Set, Validate…)

Easily cleanse and enrich data with Reference Data Services from DataMarket Open integration with external 3 rd party reference data providers Website that contains DQS knowledge available for downloading DataMarket 3 rd Party Reference Data Providers DQS Data Store Create domains from your own data sources Organization Data A set of data domains that come out of the box with DQS Out of the Box Knowledge

demo

Microsoft Confidential—Preliminary Information Subject to Change Reference Data Definition Values/Rules New Records Corrections & Suggestions Correct Records Invalid Records SSIS Data Flow Source + Mapping Data correction Component SSIS Package Destination Reference Data Services DQS Server

demo

Rich Knowledge Base Continuous improvement and knowledge acquisition Build once, reuse for multiple DQ improvements Focus on productivity and user experience Designed for business users Out-of-the-box knowledge Focus on cloud-based Reference Data User-generated knowledge Integration with SSIS Knowledge-driven Easy To Use Open & Extendible

Sessions On-Demand & CommunityMicrosoft Certification & Training Resources Resources for IT ProfessionalsResources for Developers Connect. Share. Discuss.

Scan the Tag to evaluate this session now on myTechEd Mobile