TOOLS FOR DATA GOVERNANCE PASSIONATE BY DATA AND THE ACCURACY OF THE RESULTS.

Slides:



Advertisements
Similar presentations
Chapter 1: The Database Environment
Advertisements

Chapter 10: Designing Databases
Data Modeling and Database Design Chapter 1: Database Systems: Architecture and Components.
Copyright 2008 Tieto Corporation Database merge. Copyright 2008 Tieto Corporation Table of contents Please, do not remove this slide if you want to use.
TOOLS FOR DATA GOVERNANCE PASSIONATE BY DATA AND THE PRECISION OF THE RESULTS.
The Database Environment
Prentice Hall, Database Systems Week 1 Introduction By Zekrullah Popal.
Chapter 3 Database Management
Introduction to Introduction to Database Systems Rose-Hulman Institute of Technology Curt Clifton.
1 Introduction The Database Environment. 2 Web Links Google General Database Search Database News Access Forums Google Database Books O’Reilly Books Oracle.
Data Management I DBMS Relational Systems. Overview u Introduction u DBMS –components –types u Relational Model –characteristics –implementation u Physical.
© 2007 by Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 8 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
TOURISM INFORMATION SYSTEM AAR Technologies and Software Solution Pvt. Ltd A Flexible Business Intelligence Tool.
Chapter 1: The Database Environment
Database Administration Chapter 16. Need for Databases  Data is used by different people, in different departments, for different reasons  Interpretation.
TOOLS FOR DATA GOVERNANCE PASSIONATE BY DATA AND THE PRECISION OF THE RESULTS.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
9 Feb 2004Mikko Mäkinen & Saija Ylönen Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Geneva, 9-11 February 2004, Topic (ii): Metadata.
CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS Modern Database Management 11 th Edition Jeffrey A. Hoffer, V. Ramesh, Heikki Topi © 2013 Pearson.
Chapter 5 Lecture 2. Principles of Information Systems2 Objectives Understand Data definition language (DDL) and data dictionary Learn about popular DBMSs.
Computers & Employment By Andrew Attard and Stephen Calleja.
FP OntoGrid: Paving the way for Knowledgeable Grid Services and Systems WP8: Use case 1: Quality Analysis for Satellite Missions.
Chapter 1: The Database Environment and Development Process
CS370 Spring 2007 CS 370 Database Systems Lecture 2 Overview of Database Systems.
Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.
Database Technical Session By: Prof. Adarsh Patel.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
Integrating Security Design Into The Software Development Process For E-Commerce Systems By: M.T. Chan, L.F. Kwok (City University of Hong Kong)
Help Desk System How to Deploy them? Author: Stephen Grabowski.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
Organizing Data and Information AD660 – Databases, Security, and Web Technologies Marcus Goncalves Spring 2013.
Chapter 7: Database Systems Succeeding with Technology: Second Edition.
© 2007 by Prentice Hall 1 Introduction to databases.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 9 th Edition Jeffrey A. Hoffer,
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Chapter 3: Computer Software. Stored Program Concept v The concept of preparing a precise list of exactly what the computer is to do (this list is called.
Explaining the statistical data warehouse (S-DWH)
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Collaborative Development of a Web Knowledgebase for a Liaison Program: “Back to the Future” Holly Harden, MLIS, Liaison Librarian Kathleen B. Oliver,
Technology In Action Chapter 11 1 Databases and… Databases and their uses Database components Types of databases Database management systems Relational.
Lecture # 3 & 4 Chapter # 2 Database System Concepts and Architecture Muhammad Emran Database Systems 1.
Chapter 1 Chapter 1: The Database Environment Modern Database Management 8 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden © 2007 by Prentice.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
CHAPTER 3 DATABASES AND DATA WAREHOUSES. 2 OPENING CASE STUDY Chrysler Spins a Competitive Advantage with Supply Chain Management Software Chapter 2 –
DataBase Management System What is DBMS Purpose of DBMS Data Abstraction Data Definition Language Data Manipulation Language Data Models Data Keys Relationships.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
FILES AND DATABASES. A FILE is a collection of records with similar characteristics, e.g: A Sales Ledger Stock Records A Price List Customer Records Files.
Data resource management
Database Administration
1 Database Systems Instructor: Nasir Minhas Assistant Professor UIIT PMAS-AAUR
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Object storage and object interoperability
TOOLS FOR DATA GOVERNANCE PASSIONATE BY DATA AND THE PRECISION OF THE RESULTS.
REPORTING SOFTWARE for BUILDING & INDUSTRIAL AUTOMATION.
1 Object-Oriented Analysis and Design with the Unified Process Figure 13-1 Implementation discipline activities.
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production.
Copyright © 2016 Pearson Education, Inc. CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS Modern Database Management 12 th Edition Jeff Hoffer,
V7 Foundation Series Vignette Education Services.
The Database Approach Muhammad Nasir
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
Internet Made Easy! Make sure all your information is always up to date and instantly available to all your clients.
Databases and Information Management
Introduction to Software Testing
CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Databases and Information Management
Chapter 1: The Database Environment
The Database Environment
Metadata The metadata contains
Presentation transcript:

TOOLS FOR DATA GOVERNANCE PASSIONATE BY DATA AND THE ACCURACY OF THE RESULTS

DOMAINDOMAIN REVER DBMS Data Access Processings Presentation Programs management (web server, transactional, jcl, …) Data DBMS Data Access Processings Presentation Programs management REVER Data are at the heart of the I.S. and are the elements allowing BUSINESS CONTINUITY Data are at the heart of the I.S. and are the elements allowing BUSINESS CONTINUITY

SOLUTIONS EVOLVE-EASY DEV-EASY DATA QUALITY S.E.A.L. DOC-EASY SHARED MASTERY SHARED MASTERY ÉVOLUTIONS WITHOUT RISK MEASURES CORRECTIONS DB-MAIN KNOWLEDGE MODELLING EXTRACTIONS ANONYMIZATIONS ACCESS LAYER DEVELOPMENT ACCELERATOR

ARCHITECTURE

CHARACTERISTICSCHARACTERISTICS  INDUSTRIAL  AUTOMATED  CONTROLS  Integrated in the proccesses  Applications independant REVER SOLUTIONS  ADAPTABLE  GÉNÉRIC  Methods  Tools  FLEXIBLE  PROGRAMMABLE  SERVICES  SUBCONTRACTING  SIDE BY SIDE  Training  Support  Follow-up

SOLUTIONS

1 OUT OF 2 COMPANY declares 1 OUT OF 4 COMPANY déclares « WE DO NOT KNOW WHICH ARE THE REAL USES OF OUR DATA » « THE DATA AT THE DISPOSAL OF DEVELOPERS WERE USED FOR OTHER PURPOSES » WHAT STUDIES TELL US* *Ponemon Institute

THE MANAGEMENT OF THE DATA USES 60 % OF THE TESTING TIME THE MANAGEMENT OF THE DATA USES 60 % OF THE TESTING TIME THE DATA RELATED TO A CONCEPT (customers, suppliers, products) ARE SPREAD IN VARIOUS TECHNICAL DATABASES THE DATA RELATED TO A CONCEPT (customers, suppliers, products) ARE SPREAD IN VARIOUS TECHNICAL DATABASES STUDIES (follow)

DEFINITIONS DATA Generic name covering the notions of:  « category» (name) designed by « column »  « value» (Smith) designed by « content » DATA Generic name covering the notions of:  « category» (name) designed by « column »  « value» (Smith) designed by « content » TABLE Collection of grouped columns to represent a concept TABLE Collection of grouped columns to represent a concept LINK All type of relations between columns There are numerous "types of link: dependency, referential, redundancy, … LINK All type of relations between columns There are numerous "types of link: dependency, referential, redundancy, …

REFERENTIAL LINK The column establishing a link between the content of 2 tables REFERENTIAL LINK The column establishing a link between the content of 2 tables REDONDANCY LINK The column which takes the content of another column at time T REDONDANCY LINK The column which takes the content of another column at time T ORDERS réf. Cli. Nbr Order NbrAmountDelivery ad ORD €PHOENIXCLI 001 ORD €NYCLI 001 DEFINITIONS ORDERS réf. Cli. Nbr Order NbrAmountDelivery ad ORD €PHOENIXCLI 001 ORD €NYCLI 001

DATABASE Technical « container » grouping a collection of tables DATABASE Technical « container » grouping a collection of tables COPY Reproduction of an "original" content for processing purposes COPY Reproduction of an "original" content for processing purposes PROCESS Term which denotes either manual processes, or automated processes, or any combination of manual and automated processes PROCESS Term which denotes either manual processes, or automated processes, or any combination of manual and automated processes DEFINITIONS

« clients » « payments» DOSSIER: tables collection linked directly or indirectly with a main table DOSSIER: tables collection linked directly or indirectly with a main table DEFINITIONS

CLIENTS BASE ORDERS BASE PAYMENTS BASE The notion of dossier is independent from the "technical" implementation and is mostly "transverse" in databases DEFINITIONS

S.E.A.L. : Select, Extract, Anonymize & Load

THE NEEDS

S.E.A.L

PRODUCTION DATABASES S.E.A.L. DATABASE. PRODUCTS ORDERS PAYMENTS CLIENTS « TECHNICAL » DESCRIPTION OF THE TABLES AND COLUMNS « TECHNICAL » DESCRIPTION OF THE TABLES AND COLUMNS Project manager DATABASES SELECTION DATABASES SELECTION ADDITION  REFERENTIAL LINKS  REDONDANCY LINKS ADDITION  REFERENTIAL LINKS  REDONDANCY LINKS « FUNCTIONAL » DESCRIPTION « FUNCTIONAL » DESCRIPTION THE DATA YOU HAVE

REDONDANCY LINKS REFERAL LINKS PRODUCTS ORDERS PAYMENTS CLIENTS THE DATA YOU HAVE

SELECT THE « NECESSARY AND SUFFICIENT » DATA FOR THE FORECASTED PROCESSINGS SELECT THE « NECESSARY AND SUFFICIENT » DATA FOR THE FORECASTED PROCESSINGS DEFINE THE DOSSIERS SELECT THE CONTENTS S.E.A.L. database « FUNCTIONAL » DESCRIPTION « FUNCTIONAL » DESCRIPTION TABLES LIST Ordered in THE ORDER OF THE PROCESSINGS TABLES LIST Ordered in THE ORDER OF THE PROCESSINGS COMBINATION OF THE SELECTION CRITERIA THE DATA « YOU WANT »

PROJECT M CAMPAIGN i CAMPAIGN j THE DATA « YOU WANT »

CAMPAIGN j SELECT the CONTENTS « Name clients = SMITH » SELECT the CONTENTS « Name clients = SMITH » THE DATA THAT « YOU WANT »

COPIES CONTROLS COLUMN NOT TO BE USED FOR SELECTING CONTENTS COLUMN YOU MAY NOT COPY LIMIT TO THE NUMBER OF DOSIERS TO BE COPIED e,g, minimum 100 dossiers LIMIT TO THE NUMBER OF DOSIERS TO BE COPIED e,g, minimum 100 dossiers THE PROTECTIONS

RULES MASKING LIST CALCULATION Specific functions ANONYMIZATION

COLUMNS RÉGLES PROJECTS/ CAMPAIGNS PROJECTS/ CAMPAIGNS Client name Client name Rule A (masking) Rule A (masking) PROJ M/ CAMP i PROJ M/ CAMP i PROJ M/ CAMP j PROJ M/ CAMP j Client name Client name Rule B (list) Rule B (list) Birth date Birth date Rule C (calculated) Rule C (calculated) PROJ M/ CAMP i PROJ M/ CAMP i PROJ M/ CAMP j PROJ M/ CAMP j Birth date Birth date Rule D (calculated) Rule D (calculated) ANONYMIZATION

EXTRACTION ENGINE ALLOWS THE EXTRACTION OF THE DOSSIERS EXTRACTION ENGINE ALLOWS THE EXTRACTION OF THE DOSSIERS GENERATION ENGINE ADD LINES AND 3POPULATE 3THE COLUMNS GENERATION ENGINE ADD LINES AND 3POPULATE 3THE COLUMNS ANONYMIZATION ENGINE ANONYMIZE THE CONTENTS ANONYMIZATION ENGINE ANONYMIZE THE CONTENTS STORAGE ENGINE GIVES THE RESULTING DOSSIERS STORAGE ENGINE GIVES THE RESULTING DOSSIERS REPORT ENGINE PRODUCES THE REPORTS AND STATISTICS REPORT ENGINE PRODUCES THE REPORTS AND STATISTICS THE ENGINES

THE ANONYMIZATION ENGINE

DATABASE FROM THE SOFTWARE PACKAGE (ERP,CRM,….) DATA TO BE PROCESSED ANONYMIZED DATA TO BE PROCESSED REAL CONTENTS FICTIVE CONTENTS CORRESPONDENCE REAL CONTENTS FICTIVE CONTENTS CORRESPONDENCE ANONYMIZATIONS EXPORT PROCESSED DATA PROCESSED ANONYMIZED DATA RE- IDENTIFICATION PROCESSING IMPORT PROCESSING EXAMPLE: SOFTWARE PACKAGES

YOU WANT TO COPY INTEGRATE ONE DOSSIER SEVERAL DOSSIERS SEVERAL DOSSIERS ALL THE CONTENTS from one or more databases S.E.A.L. FUNCTIONS In your applications or packages S.E.A.L. FUNCTIONS In your applications or packages THE S.E.A.L. PRODUCTS

COPY PART OF THE DATA COPY A COMPLETE DATABASE AN INDIVIDUAL DOSSIER SEVERAL DOSSIERS ONLY THE TABLES NEEDED FOR PROCESING BAN TO COPY CERTAIN COLUMNS OBLIGATION TO COPY A MINIMUM Nbr OF DOSSIERS CONTENTS ANONYMIZATION S.E.A.L. The products PROTECTING all COPIES of your DATA S.E.A.L. The products PROTECTING all COPIES of your DATA PROTECTIONS SUMMARY

FUNCTIONAL APPROACH INTUITIVE AND FRIENDLY INTERFACE RULES AND ANONYMIZATION DESCRIPTIONS RE-USE RULES AND ANONYMIZATION DESCRIPTIONS RE-USE PARTIAL COPIES DÉFINITIONS AND OPERATIONS STORED IN A SPECIALIZED DATABASE QUICK INSTALLATION AND CONFIGURATION MONO DATABASE MULTI DATABASES MONO DATABASE MULTI DATABASES RÉDUCTION OF THE TECHNICAL RESSOURCES MAINTAIN COHERENCE SIMPLE INCREASE OF PRODUCTIVITY COSTS DECREASE FUNCTIONALITIES TECHNICAL ADDED VALUE S.E.A.L. MAIN ADVANTAGES

THE MECHANISMS USED IN S.E.A.L. ARE INDEPENDENT FROM THE DATA "SEMANTICS" S.E.A.L. IS DIRECTLY USABLE BY EVERY TYPES OF"BUSINESSES" « AFFORDABLE » PRICING « AFFORDABLE » PRICING S.E.AL. ADVANTAGES (more)

THANK YOU FOR YOUR TIME