Achieving better Operations and Analytics

Slides:



Advertisements
Similar presentations
Bob Hoffman Technical Account Manager Eastern Area Boston User Group Getting Data Ready for WebFOCUS November 10, 2011.
Advertisements

Navigator Management Partners LLC Business Analysis Professional Development Day – Sep 2014 How to understand and deliver requirements to your Business.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
The Database Environment
Copyright 2007, Information Builders. Slide 1 The Relevance of Data Governance in Higher Education Tim Beckett Higher Education Solutions November 9, 2011.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
Implementing MDM for BI & Data Integration by Kabir Makhija.
Jeremy Kashel BI 200 End to End Master Data Management With SQL Server Master Data Services (MDS)
Managing Data Resources
Supply Chain Management
Recording / Financing Fixed Asset Acquisition Human Resources Purchasing Revenue Traditional files approach: separate systems (Legacy Systems) Expenditure.
Universe Design Concepts Business Intelligence Copyright © SUPINFO. All rights reserved.
Agenda 02/21/2013 Discuss exercise Answer questions in task #1 Put up your sample databases for tasks #2 and #3 Define ETL in more depth by the activities.
Copyright 2009, Information Builders. Slide 1 iWay Enterprise Information Management (EIM) Data Quality and Master Data Management Kam Wong Solutions Architect.
® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Lucius McInnis Technical Account Manager Eastern Area New York User Forum Getting Data Ready for WebFOCUS August 10, 2011.
Lucius McInnis, Systems Engineer – Client Services Group Kam Wong, Solutions Architect – iWay Software March 22, 2012 Getting Data Ready for WebFOCUS 1.
IWay Solutions - EIM Vincent Deeney – Solutions Architect 6/25/2009.
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Clients (and the interface level) Application Server (and the application level) Database Server (and the Database level)
Emerging Technologies Work Group Master Data Management (MDM) in the Public Sector Don Hoag Manager.
Pierre-Louis Usselmann, Ben Watt SOGETI Switzerland Master Data Services.
© 2007 by Prentice Hall 1 Introduction to databases.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Building Marketing Databases. In-House or Outside Bureau? Outside Bureau: Outside agency that specializes in designing and developing customized databases.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Workflow for ACT! and QuickBooks Graeme Leo Xact Software.
Master Data Management & Microsoft Master Data Services Presented By: Jeff Prom Data Architect MCTS - Business Intelligence (2008), Admin (2008), Developer.
University of Southern California Enterprise Wide Information Systems Customer Order Management Instructor: Richard W. Vawter.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
1 iWay DQC and iDP Kam Wong Solutions Architect Exploring Techniques of Data Quality and Profiling April 20, 2012 What Is Data Profiling? What Are Some.
BUILDING THE INFORMATION INFRASTRUCTURE. The Challenge  Information understanding through increased context and consistency of definition.  Information.
MBA/1092/10 MBA/1093/10 MBA/1095/10 MBA/1114/10 MBA/1115/10.
Mastering Master Data Services Presented By: Jeff Prom BI Data Architect Bridgepoint Education MCTS - Business Intelligence, Admin, Developer.
Enterprise Processes and Systems MIS 2000 Instructor: Bob Travica Updated 2016 Class 16.
Bartek Doruch, Managing Partner, Kamil Karbowiak, Managing Partner, Using Power BI in a Corporate.
Will Ingleby Solutions for Accounting
Introduction To DBMS.
Information Systems By Kundang K Juman, Ir. MMSI
Enterprise Processes and Systems
Databases.
MIS2502: Data Analytics Advanced Analytics - Introduction
Foundations of Information Systems in Business
Overview of MDM Site Hub
Implementing MDM for BI & Data Integration by Kabir Makhija
Lexmark MDM.
Trinity Health Presenters:
Introduction to Transaction Processing
Manajemen Data (2) PTI Pertemuan 6.
Data Student to Data Master
Achieving Operational Excellence and Customer Intimacy:Enterprise Applications Chapter 9 (10E)
Swagatika Sarangi (Jazz), MDM Expert
Accounting System Design
Sales Order Process.
Chapter 1 Database Systems
Lexmark MDM.
06 | Managing Enterprise Data
Collaborative Business Solutions
Supporting End-User Access
Accounting System Design
Data Quality in the BI Life Cycle
Metadata The metadata contains
Chapter 1 Database Systems
The Social Life of Information
Data Warehousing Concepts
Information Systems in Organizations 2
COIT 20253: Business Intelligence Using Big Data
Presentation transcript:

Achieving better Operations and Analytics through Master Data Management James Cotton Sr. Solution Architect European Headquarters

What is Master Data? Master data is data that is shared by multiple computer systems. The Information Difference Master data is information that is key to the operation of a business…persistent, non-transactional data that defines a business entity for which there is, or should be, an agreed-upon view across the organisation. Wikipedia Master data is the consistent and uniform set of identifiers and extended attributes that describes the core entities of the enterprise including customers, prospects, citizens, suppliers, sites, hierarchies and chart of accounts. Gartner Master data is often one of the key assets of a company. Microsoft

What is Master Data Management? Master data management is a technology-enabled discipline in which business and IT work together to ensure the uniformity, accuracy, stewardship, semantic consistency and accountability of the enterprise’s official shared master data assets.  Gartner Master Data Management comprises a set of processes, governance, policies, standards and tools that consistently defines and manages the master data. Wikipedia The creation of: The Golden Record Single Version of the Truth

Types of data in an organisation Unstructured Found in e-mail, white papers, magazine articles, corporate intranet portals, product specifications, marketing collateral, and PDF files Transactional Related to sales, deliveries, invoices, trouble tickets, claims, and other monetary and non-monetary interactions Metadata Data about other data and includes: report definitions, column descriptions in a database, log files, connections, and configuration files Hierarchical Stores the relationships between other data such as company organisational structures or product lines. Master Critical nouns of a business and fall generally into the groupings: people, places and things, The What, Why, and How of Master Data Management – Microsoft November 2006

Understanding Master Data Think of nouns and verbs Bob Smith buys a widget (SKU #A1234) and ships it to his home address The master data elements are the nouns and are people, things, and places The transactional data elements are verbs that describe what happens to those people, places, and things. Bob Smith widget (SKU #A1234) home address CRM Marketing ERP WMS Financial

Deciding what Master Data should be Managed Generally speaking, master data should meet the following requirements: Cardinality Volatility Lifetime Value Reuse

Master Data Management Name: Bob Smith Tel: 01323 456842 DOB: 23/10/71 Gender: M Name: Bob Smith Tel: 01323 456842 DOB: Gender: M Name: B Smith Tel: 01323 456842 DOB: 23/10/71 Gender: M Name: Bob Smith Tel: 01283 56982 DOB: 23/10/71 Gender: Name: Bob Smith Tel: 01323-456842 DOB: Gender: Male Name: B Smith Tel: (0)1323456842 DOB: 23-Oct-71 Gender: M Name: Smith, Bob Tel: (01283)56982 DOB: 23/10/1971 Gender: CRM Marketing ERP WMS Financial

The Current Landscape of MDM Systems Aberdeen Group – April 2012

Operational vs. Analytical Master Data Management Operational data is the lifeblood of an organisation Operational MDM centres on assuring ‘single view’ of master data in the core systems used by business users Sales, service, order management, manufacturing, purchasing, billing, accounts receivable, accounts payable, payroll, etc. Rely heavily on integration technologies to keep systems in sync

Operational vs. Analytical Master Data Management Analytical data is used to support a company's decision making Analytical MDM centres on assuring ‘single view’ of master data in the downstream data warehouse used most often to supply the data for a business intelligence (BI) solution for historical and predictive analysis Any data cleansing done inside an Analytical MDM solution is invisible to the transactional applications

Master Data Management - Value Across the Enterprise Operational Analytical Single Version of Truth = Better System synchronisation Consistency in transactional data Party/product data across all systems System integration/migration Cost reduction within the business process Data aggregation & analysis Marketing segmentation & analysis Risk management Financial reporting Cost reduction and time savings in analysis Maximum business value comes from managing both operational and analytical master data

Data Quality Improvement Concept Data Governance Share Communicate Analyse Propagate Data Distribution Build Match Merge Data Mastering Improve Standardise Enrich Data Quality Know Explore Profile Data Analysis Get Connect Orchestrate Data Integration Manage Control

Data Quality Improvement Concept Data Governance Communicate Analyse Propagate Data Distribution Build Match Merge Data Mastering Improve Standardise Enrich Data Quality Know Explore Profile Data Analysis Share Manage Control Get Connect Orchestrate Data Integration

Data Governance It embodies: Data quality Data management People It embodies: Data quality Data management Data policies Business process management Risk management It is about putting people in charge of fixing and preventing issues with data so that the enterprise can become more efficient.* It’s about using technology when necessary in many forms to help aid the process.* When companies desire, or are required, to gain control of their data, they empower their people, set up processes and get help from technology to do it.* *Sarsfield, Steve (2009). "The Data Governance Imperative", IT Governance. Process Technology

Data Quality Improvement Concept Data Governance Communicate Analyse Propagate Data Distribution Build Match Merge Data Mastering Improve Standardise Enrich Data Quality Know Explore Profile Data Analysis Share Manage Control Get Connect Orchestrate Data Integration

Data Integration - Batch

Data Integration – Real Time

Data Quality Improvement Concept Data Governance Communicate Analyse Propagate Data Distribution Build Match Merge Data Mastering Improve Standardise Enrich Data Quality Know Explore Profile Data Analysis Share Manage Control Get Connect Orchestrate Data Integration

Profiling Profiling – Technical (Pre-built) Basic Analysis Minimums Maximums Averages Counts Etc. Patterns / Masking Extremes Quantities Frequency Analysis Foreign Key Analysis Profiling – All Charting Grouping / Aggregate Drilldown / Interactive Displays

Monitoring Advanced Profiling ‘Custom’ analysis of data Defined by user and relevant to data context Multiple fields can be considered, e.g. Address lines Complex computation may be required Output is Binary (true/false) – Data Quality Indicators

Data Governance - Monitoring Portal DQ plan Profiling/DQIs Reports

Data Quality Improvement Concept Data Governance Communicate Analyse Propagate Data Distribution Build Match Merge Data Mastering Improve Standardise Enrich Data Quality Know Explore Profile Data Analysis Share Manage Control Get Connect Orchestrate Data Integration

Data Quality - Facts and Stats The amount of data you have doubles every 12 to 18 months Thomas Redman – Data-Driven The average amount of inaccurate data in an organisation increased by 30% last year. Experian Data Quality Survey 50% of Data Warehouse projects will fail or receive limited acceptance because of NOT proactively addressing data quality issues Thomas Redman – Data-Driven 75% of 250 CFOs Surveyed said “data quality significantly impedes performance.” Gartner Survey

Master Data Management Name: Bob Smith Tel: 01323 456842 DOB: Gender: M Name: B Smith Tel: 01323 456842 DOB: 23/10/71 Gender: M Name: Bob Smith Tel: 01283 56982 DOB: 23/10/71 Gender: Name: Bob Smith Tel: 01323-456842 DOB: Gender: Male Name: B Smith Tel: (0)1323456842 DOB: 23-Oct-71 Gender: M Name: Smith, Bob Tel: (01283)56982 DOB: 23/10/1971 Gender: CRM Marketing ERP WMS Financial

Cleansing Parsing Validation of Data Quality Enrichment Data parsed into components (pattern based) E.G. Jim Smith -> Jim + Smith Validation of Data Quality Validation against rules Validation against reference tables Enrichment Adding data Standardisation Transformation into standard format (Jim Smith -> James Smith) Standard and nonstandard abbreviations (Str. -> Street) Language-specific replacements Parsing Validation Enrichment Large number of domain oriented algorithms - examples: Name Address Credit Card number Bank account number Extension by custom validation steps Using complex function and rules including Levensthein distance SoundEx Industry standard functions Standardisation 25

Scoring Cleansing Parsing Validation Enrichment Standardisation 26

Data Before and After Cleansing Name ANNE PHILLIPS CHRISTINE HALL JOHN SMITH IAN SCOTT Gender F N   Male Date of Birth 14/11/1987 10/12/1940 10/01/1971 28.Oct.1956 Telephone 01569 274873 01491 24778 01598 867305 7801551340 Email phillips87@live.co.uk christine.hall@gmail.com JS@hotmail.com ian@@dfgmail.-.com Address Line 1 6 BOOTON COURT 56C HORNCHURCH ROAD 22 RINGMORE STREET 56 WOULD LANE Address Line 2 Address Line 3 Address Line 4 KIDDERMINSTER PLYMUTH ISLEWORTH Address Line 5 PORCESTERSHIRE DEVON LONDON MIDDLESEX Postcode DY102YZ PL5 2TF SE233DE TW7-5ED Score 210 300 600 Explanation ADDRESS_VALID GENDER_TAKEN_FROM_NAME ADDRESS_CORRECTED_MINOR EMAIL_INV DATE_STANDARDIZED GENDER_STANDARDIZED TELEPHONE_STANDARDIZED ADDRESS_CORRECTED_MAJOR out_first_name Anne Christine John Ian out_last_name Phillips Hall Smith Scott out_gender M out_birthdate 28/10/1956 out_telephone 07801 551340 out_email js@hotmail.com out_address_line_1 22 RINGMORE RISE 56 WOOD LANE out_address_line_2 out_address_line_3 out_address_line_4 out_post_town PLYMOUTH out_postcode DY10 2YZ SE23 3DE TW7 5ED Name IAN SCOTT out_first_name Ian out_last_name Scott Gender Male out_gender M Date of Birth 28.Oct.1956 out_birthdate 28/10/1956 Telephone 7801551340 out_telephone 07801 551340 Email ian@@dfgmail.-.com out_email Address Line 1 56 WOULD LANE out_address_line_1 56 WOOD LANE Address Line 2 out_address_line_2 Address Line 3 out_address_line_3 Address Line 4 ISLEWORTH out_address_line_4 Address Line 5 MIDDLESEX out_post_town Postcode TW7-5ED out_postcode TW7 5ED Score 600 Explanation EMAIL_INV DATE_STANDARDIZED GENDER_STANDARDIZED TELEPHONE_STANDARDIZED ADDRESS_CORRECTED_MAJOR Name JOHN SMITH out_first_name John   out_last_name Smith Gender out_gender M Date of Birth 10/01/1971 out_birthdate Telephone 01598 867305 out_telephone Email JS@hotmail.com out_email Address Line 1 22 RINGMORE STREET out_address_line_1 22 RINGMORE RISE Address Line 2 out_address_line_2 Address Line 3 out_address_line_3 Address Line 4 out_address_line_4 Address Line 5 LONDON out_post_town Postcode SE233DE out_postcode SE23 3DE Score 300 Explanation GENDER_TAKEN_FROM_NAME ADDRESS_CORRECTED_MINOR

Data Governance – Issue Resolution Is the score lower than the threshold? Yes No

Data Governance - Issue Management Portal DQ plan Profiling/DQIs Reports Issue data Issue Database Issue List Workflow Exception Mgt

Data Quality Improvement Concept Data Governance Communicate Analyse Propagate Data Distribution Build Match Merge Data Mastering Improve Standardise Enrich Data Quality Know Explore Profile Data Analysis Share Manage Control Get Connect Orchestrate Data Integration

Master Data Management Name: Bob Smith Tel: 01323 456842 DOB: 23/10/71 Gender: M Name: Bob Smith Tel: 01323 456842 DOB: Gender: M Name: B Smith Tel: 01323 456842 DOB: 23/10/71 Gender: M Name: Bob Smith Tel: 01283 56982 DOB: 23/10/71 Gender: Name: Bob Smith Tel: 01323-456842 DOB: Gender: Male Name: B Smith Tel: (0)1323456842 DOB: 23-Oct-71 Gender: M Name: Smith, Bob Tel: (01283)56982 DOB: 23/10/1971 Gender: CRM Marketing ERP WMS Financial

Matching Goal: Identify groups of records that in reality represent a single client or entity. Match & Merge This may not be so simple : Data comes from different sources Must handle data that is missing, wrong or conflicting There’s no single ‘correct’ solution

How many people are here? Cleansed data First Last G SIN Birth Date Address John Smith M 1978-12-16 22 Ringmore Street, London, SE23 3DE 095242434 74 Arnold Street, Boldon Colliery, Bolton, NE35 9BD 1978-11-16 095252433 3 Catalina Avenue, Pembroke Dock, SA72 6YB Smiht Jane Watson F 420347213 1982-01-01 J.

Match Cleansed data First Last G SIN Birth Date Address John Smith M 1978-12-16 22 Ringmore Street, London, SE23 3DE 095242434 74 Arnold Street, Boldon Colliery, Bolton, NE35 9BD 1978-11-16 095252433 3 Catalina Avenue, Pembroke Dock, SA72 6YB Smiht Jane Watson F 420347213 1982-01-01 J.

Merging Creating the Golden Record Can cherry pick the best fields or even the best record Using rules to determine the best field/record For example: The one from the ‘reference system’ The newest one The one of highest quality Aggregation functions SQL-like: count, sum, minimum, maximum, average Modus, concatenate Match & Merge

Match Cleansed data First Last G SIN Birth Date Address John Smith M 1978-12-16 22 Ringmore Street, London, SE23 3DE 095242434 74 Arnold Street, Boldon Colliery, Bolton, NE35 9BD 1978-11-16 095252433 3 Catalina Avenue, Pembroke Dock, SA72 6YB Smiht Jane Watson F 420347213 1982-01-01 J.

Merge Cleansed data First Last G SIN Birth Date Address Golden record John Smith M 1978-12-16 22 Ringmore Street, London, SE23 3DE 095242434 74 Arnold Street, Boldon Colliery, Bolton, NE35 9BD John Smith M 095242434 1978-12-16 22 Ringmore Street, London, SE23 3DE 74 Arnold Street, Boldon Colliery, Bolton, NE35 9BD Golden record First Last G SIN Birth Date Address The most frequent address The newest permanent address

Data Quality Improvement Concept Data Governance Communicate Analyse Propagate Data Distribution Build Match Merge Data Mastering Improve Standardise Enrich Data Quality Know Explore Profile Data Analysis Share Manage Control Get Connect Orchestrate Data Integration

Master Data Management Architectures Consolidated Master is Single Version of Truth Data Quality at Master Updates occur at Sources Updates propagated to Master Coexistence Master is Single Version of Truth Data Quality is ongoing Updates occur at Sources or Master Updates propagated to other Sources Registry Multiple Versions of Truth Data Quality is ongoing Updates occur at Sources Keys and Metadata in Registry Updates optionally propagated to other Sources Centralised Master is Single Version of Truth Data Quality at Master Updates occur at Master Updates propagated to Sources

Data Distribution

Intelligence

Summary It’s a process and it’s iterative Enable the process via technology Start small with an eye on the long-term Understand that requirements will change over time Know that Information Builders can help you

Data Quality Challenge

Thank you!