1 iWay DQC and iDP Kam Wong Solutions Architect Exploring Techniques of Data Quality and Profiling April 20, 2012 What Is Data Profiling? What Are Some.

Slides:



Advertisements
Similar presentations
Module 12: Enabling Business Activity Monitoring.
Advertisements

EiS – Education iT Services “Our passion in EiS is to make a real difference in education and ultimately children’s lives by providing innovative solutions.
Darrell W. Gunter EVP / CMO Collexis Holdings, Inc. March 23, 2010 Spring Conference CONTENT: Uncovering the Value and Benefits of Semantic Technology.
Bob Hoffman Technical Account Manager Eastern Area Boston User Group Getting Data Ready for WebFOCUS November 10, 2011.
© Copyright IBM Corporation 2014 Getting started with Rational Engineering Lifecycle Manager queries Andy Lapping – Technical sales and solutions Joanne.
Copyright 2009, Information Builders. Slide 1 Data Profiling Kam Wong Solutions Architect Information Builders May 5, 2010.
EXPERT SYSTEMS apply rules to solve a problem. –The system uses IF statements and user answers to questions in order to reason just like a human does.
Mapping Nominal Values to Numbers for Effective Visualization Presented by Matthew O. Ward Geraldine Rosario, Elke Rundensteiner, David Brown, Matthew.
Copyright 2007, Information Builders. Slide 1 The Relevance of Data Governance in Higher Education Tim Beckett Higher Education Solutions November 9, 2011.
4.2.3 Data Quality, Composite Indicators and Aggregation 1 DATA QUALITY, COMPOSITE INDICATORS AND AGGREGATION UPA Package 4, Module 2.
Microsoft Excel Working with Excel Lists, Subtotals and Pivot Tables.
Data Quality Class 3. Goals Dimensions of Data Quality Enterprise Reference Data Data Parsing.
Chapter Physical Database Design Methodology Software & Hardware Mapping Logical Design to DBMS Physical Implementation Security Implementation Monitoring.
Introduction to Structured Query Language (SQL)
Introduction to Structured Query Language (SQL)
© 2013 IBM Corporation Information Management Discovering the Value of IBM InfoSphere Information Analyzer IBM Software Group 1Discovering the Value of.
A Guide to SQL, Seventh Edition. Objectives Retrieve data from a database using SQL commands Use compound conditions Use computed columns Use the SQL.
Solve for y when x = 1, 2, 3 and 4. 1.) y = x ) y = 5x 4 3.) y = 3x Solve for y when x is -2, -1, 0, 1. Patterns and Functions Day 2.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Copyright 2009, Information Builders. Slide 1 iWay Enterprise Information Management (EIM) Data Quality and Master Data Management Kam Wong Solutions Architect.
Log-linear Models For 2-dimensional tables. Two-Factor ANOVA (Mean rot of potatoes) Bacteria Type Temp123 1=Cool 2=Warm.
Data Governance Data & Metadata Standards Antonio Amorin © 2011.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Lucius McInnis Technical Account Manager Eastern Area New York User Forum Getting Data Ready for WebFOCUS August 10, 2011.
Creating the Foundation for Enterprise Information Management.
Lucius McInnis, Systems Engineer – Client Services Group Kam Wong, Solutions Architect – iWay Software March 22, 2012 Getting Data Ready for WebFOCUS 1.
Chapter 3 Single-Table Queries
CHAPTER 1 Basic Statistics Statistics in Engineering
Data Access Patterns Some of the problems with data access from OO programs: 1.Data source and OO program use different data modelling concepts 2.Decoupling.
AP STATISTICS Section 4.2 Relationships between Categorical Variables.
Exploring Microsoft Access Chapter 4 Relational Databases, External Data, Charts, and the Switchboard.
RELATIONSHIPS Generally there are two main database types: flat-file and relational.
The introduction to SPSS Ⅱ.Tables and Graphs for one variable ---Descriptive Statistics & Graphs.
IE 423 – Design of Decision Support Systems Database development – Relationships and Queries.
Database Design. The process of developing database structures from user requirements for data a structured methodology Structured Methodology - a number.
Project 6 Using The Analysis ToolPak To Analyze Sales Transactions Jason C. H. Chen, Ph.D. Professor of Management Information Systems School of Business.
Executive Invitation – Oracle Data Finder Service Oracle Corporation.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Cleansing Ola Ekdahl IT Mentors 9/12/08.
DATABASE TRANSACTION. Transaction It is a logical unit of work that must succeed or fail in its entirety. A transaction is an atomic operation which may.
The Universal Database Design The Future of Data For Database Architects.
Creating Trends, Histograms, Profiles, and Statistics using PQView Express.
…optimise your IT investments Data Discovery Understanding data relationships Philip Howard Research Director – Bloor Research.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
Detecting Group Differences: Mining Contrast Sets Author: Stephen D. Bay Advisor: Dr. Hsu Graduate: Yan-Cheng Lin.
Database Design. The process of developing database structures from user requirements for data a structured methodology Structured Methodology - a number.
Demonstrate knowledge of measures and displays used to compare data sets.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 7 (Part II) INTRODUCTION TO STRUCTURED QUERY LANGUAGE (SQL) Instructor.
CANE 2007 Spring Meeting Visualizing Predictive Modeling Results Chuck Boucek (312)
Introduction to the ABAP System. Slide 2 The Data Browser Allows us to look at the underlying table contents Use transaction code SE16.
Data Profiling 13 th Meeting Course Name: Business Intelligence Year: 2009.
A Guide to SQL, Eighth Edition Chapter Four Single-Table Queries.
DataJewel 1 : Tightly Integrating Visualization with Temporal Data Mining Mihael Ankerst, David H. Jones, Anne Kao, Changzhou Wang 1 US patent pending.
1 Chapter 3 Single Table Queries. 2 Simple Queries Query - a question represented in a way that the DBMS can understand Basic format SELECT-FROM Optional.
7 1 Database Systems: Design, Implementation, & Management, 7 th Edition, Rob & Coronel 7.6 Advanced Select Queries SQL provides useful functions that.
LM 5 Introduction to SQL MISM 4135 Instructor: Dr. Lei Li.
BUILDING THE INFORMATION INFRASTRUCTURE. The Challenge  Information understanding through increased context and consistency of definition.  Information.
Highline Class, Busn 218 Excel 2016: Spreadsheet Construction Charts 1.
IT 5433 LM3 Relational Data Model. Learning Objectives: List the 5 properties of relations List the properties of a candidate key, primary key and foreign.
Relational Databases Today we will look at: Different ways of searching a database Creating queries Aggregate Queries More complex queries involving different.
1 Chapter 13: Class Diagram Chapter 19 in Applying UML and Patterns Book.
Database Design.
Introduction to Computational Thinking
Point of View, Multiple Perspectives
Point of View, Multiple Perspectives
Point of View, Multiple Perspectives
SQL – Entire Select.
X y y = x2 - 3x Solutions of y = x2 - 3x y x –1 5 –2 –3 6 y = x2-3x.
Point of View, Multiple Perspectives
Analysis of Absolute Value Functions Date:______________________
Presentation transcript:

1 iWay DQC and iDP Kam Wong Solutions Architect Exploring Techniques of Data Quality and Profiling April 20, 2012 What Is Data Profiling? What Are Some of Data Profiling Techniques? How To Monitor Your Data?

What Is Data Profiling? Data profiling is about knowing your data It discovers relationship between data elements, whether they are in the same data source or across multiple, heterogeneous data sources. It performs statistical analysis against individual columns (as in relational database) discovering such things as the number of null values, patterns, whether the data matches the expected data type and so on. 2

What Are Some Of Data Profiling Techniques? 3  Profiling – Technical – Basic Analysis Minimums Maximums Averages Counts Etc. – Patterns / Masking – Domain – Extremes – Quantities – Frequency Analysis – Foreign Key Analysis – Charting – Grouping / Aggregate – Drilldown / Interactive Displays

How To Monitor Your Data? 4  View Profiles  Compare data quality over time – trend analysis  Monitor data quality index based on business rules

iWay Data Quality Management Life-Cycle 5

6 Demonstration

Thank-You 7