BUILDING THE INFORMATION INFRASTRUCTURE. The Challenge  Information understanding through increased context and consistency of definition.  Information.

Slides:



Advertisements
Similar presentations
EIM Framework EIM Vision & Strategy EIM Governance EIM Core Processes
Advertisements

Copyright 2007, Information Builders. Slide 1 The Relevance of Data Governance in Higher Education Tim Beckett Higher Education Solutions November 9, 2011.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
Oncor’s EIM Program.
OASIS Reference Model for Service Oriented Architecture 1.0
Managing Data Resources
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Clarity on the performance of IT Metricus at a Glance Metricus Metricus has been acknowledged for breaking new ground on IT performance management and.
Business Intelligence: Essential of Business
® IBM Software Group © IBM Corporation IBM Information Server Metadata Management.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
Demonstrating IT Relevance to Business Aligning IT and Business Goals with On Demand Automation Solutions Robert LeBlanc General Manager Tivoli Software.
Enterprise Architecture
Chapter © 2012 Pearson Education, Inc. Publishing as Prentice Hall.
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
Presented to: By: Date: Federal Aviation Administration Enterprise Information Management SOA Brown Bag #2 Sam Ceccola – SOA Architect November 17, 2010.
Getting Smarter with Information An Information Agenda Approach
What is Business Analysis Planning & Monitoring?
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Developing Enterprise Architecture
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
The Microsoft Office 2007 Enterprise Project Management Solution:
Integrated Capability Maturity Model (CMMI)
® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Get More Value from Your Reference Data—Make it Meaningful with TopBraid RDM Bob DuCharme Data Governance and Information Quality Conference June 9.
Chapter 1: Business Intelligence and its Impacts
Understanding Data Warehousing
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Database Design - Lecture 1
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Data Profiling
- 1 - Roadmap to Re-aligning the Customer Master with Oracle's TCA Northern California OAUG March 7, 2005.
Human Resource Management Lecture 27 MGT 350. Last Lecture What is change. why do we require change. You have to be comfortable with the change before.
The Challenge of IT-Business Alignment
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Emerging Technologies Work Group Master Data Management (MDM) in the Public Sector Don Hoag Manager.
What is a Business Analyst? A Business Analyst is someone who works as a liaison among stakeholders in order to elicit, analyze, communicate and validate.
Delivering business value through Context Driven Content Management Karsten Fogh Ho-Lanng, CTO.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
A Strategic Business Imperative Cypress Management Group Corporation Victor Brown Managing Partner 10/19/20151Managing Master Data © 2009 CMGC.
ISV Innovation Presented by ISV Innovation Presented by Business Intelligence Fundamentals: Data Cleansing Ola Ekdahl IT Mentors 9/12/08.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
+ Chapter 9: Management of Business Intelligence © Sabherwal & Becerra-Fernandez.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
EPA Enterprise Data Architecture Metadata Framework Assessment Kevin J. Kirby, Enterprise Data Architect EPA Enterprise Architecture Team
© 2007 IBM Corporation IBM Information Management Accelerate information on demand with dynamic warehousing April 2007.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
7 Strategies for Extracting, Transforming, and Loading.
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
0 ©2015 U.S. Education Delivery Institute While there is no prescribed format for a good delivery plan, it should answer 10 questions What a good delivery.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
Data Profiling 13 th Meeting Course Name: Business Intelligence Year: 2009.
Chapter © 2012 Pearson Education, Inc. Publishing as Prentice Hall.
Leadership Guide for Strategic Information Management Leadership Guide for Strategic Information Management for State DOTs NCHRP Project Information.
SAP NetWeaver Business Intelligence SAP Netweaver Business Warehouse (SAP NetWeaver BW) the name of the Business Intelligence,
Introduction to Data Warehousing
Accounting System Design
Data Quality By Suparna Kansakar.
Accounting System Design
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
Metadata The metadata contains
David Gilmore & Richard Blevins Senior Consultants April 17th, 2012
Presentation transcript:

BUILDING THE INFORMATION INFRASTRUCTURE

The Challenge  Information understanding through increased context and consistency of definition.  Information trust by managing the quality of the data and improving its quality over time.  Information reach to all required data when it is needed, with a full range of delivery strategies.

Data Models  An essential component of any data warehouse infrastructure—and the key to breaking down information silos—is a data model that specifies how data is structured and how it is accessed for analysis and reporting. The model provides technical and business data definitions that become the blueprint for a successful data warehouse.

Data Models  The term data model is used in two related senses. It can be a description of the objects represented by a computer system together with their properties and relationships; these are typically "real world" objects such as products, suppliers, customers, and orders. In the second sense, database model it means a collection of concepts and rules used in defining data models: for example the relational model, network model etc.

Data Models

 Data models are often used as an aid to communication between the business people defining the requirements for a computer system and the technical people defining the design in response to those requirements.  The main aim of data models is to support the development of Information systems by providing the definition and format of data.

Data Models  The primary obstacle in achieving a holistic view of the organization centers is the absence of an enterprise data model.

Data Profiling Data profiling is defined as a specific technique or process, usually automated, to statistically measure and infer data content and relationships based on the actual underlying data, as well as to validate the data against technical and business rules.

Data Profiling  The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. It can also be used to determine whether or not metadata accurately describes the actual values in the source data. The data profiling process cannot identify inaccurate data; it can only identify business rules violations and anomalies.

Data Profiling  Data profiling utilizes different kinds of descriptive statistics such as minimum, maximum, mean, mode, standard deviation, and frequency as well as other aggregates such as count and sum. Additional metadata information obtained during data profiling could be data type, length, discrete values, uniqueness, occurrence of null values, typical string patterns etc.

Data Profiling  The benefits of data profiling is to improve data quality, shorten the implementation cycle of major projects, and improve understanding of data for the users. Discovering business knowledge embedded in data itself is one of the significant benefits derived from data profiling.

Data Profiling  Find out whether existing data can easily be used for other purposes.  Improve the ability to search the data by tagging it with keywords, descriptions, or assigning it to a category.  Assess the risk involved in integrating for new applications.  Assess whether metadata accurately describes the actual values in the source database  Have an enterprise view of all data.  Does the data really conform to the semantic and structural definition defined?  Do any integrity constraint violations exist?

Traceability and Auditability  It is the identification of the ultimate origin of the information, its path from one database to another, and the type of operations between them. It strongly influences the trust users have in the information.  Data lineage analysis helps identify where information comes from and determine whether it can be trusted or needs to be further validated, cleansed, or transformed.

Database Inventory Finally, like any corporate asset, data needs to be inventoried. ■ Which data is managed by which system? ■ What relationships exist between data in different systems? ■ How do I map data together for new uses? ■ What should my data look like to allow me to use it to optimize my business? ■ How does the business use the data? ■ What meaning does it have to the business? ■ Which systems are best sources for specific pieces of information? ■ What data quality issues do I have?

Database Inventory  It requires the identification of repositories and databases that contain relevant information. Once these data sources have been found, business analysts can identify the relevant elements in them and, at a high level, can identify how they map to the new integrated structures of the business analytic solution.

Data Cleansing Data cleansing includes the standardization of data (e.g., to establish addresses with the same format), the enrichment of information (e.g., through adding missing postal codes in addresses), and the removal of duplicates. 1. The data is focused at a specific task. 2. The data is good enough to meet that need. 3. The context of data creation is different from data use.

Integrating and Transforming Information We need data transformation and data cleansing capabilities to extract information from information sources; transform, aggregate, and cleanse it; and load it into a warehouse for consumption by business applications. In some cases, it is neither possible (e.g., due to business or federal policies) nor feasible to integrate the information by copying it into a common repository. The only integration option is virtual and on-demand information integration. In this scenario, only when it is requested is the information pulled from existing sources, integrated, and then returned, without being stored along the way.

Metadata Management and Deployment  An information integration approach needs to be built on a unified metadata infrastructure that enables shared understanding between the different user roles involved in a data integration project, including business, operational, and technical domains.

Enterprise Data Warehousing and Appliances  Today’s data warehouses need to be tied into the operational systems in a bi-directional fashion, so that analysis from the warehouse feeds the day-to- day operations of the business.  This is sometimes called operational BI and is described by IBM as an element of dynamic data warehousing.

Master Data Management  Master data is core business information that is generally spread across multiple applications and locations, such as customer data and/or product data. Due to that fact that it is usually managed by multiple lines of business applications, it is usually not managed on an enterprise basis.

Master Data Management  While creating an integrated view of master data in the data warehouse is one way to get a single version of the truth, it does little for the operational applications.

Master Data Management  Master Data Management is a solution that provides an industry-established model, a repository for storing data, and a large library of SOA web services to read, insert, update, and manage each of the fields of the master data.

Master Data Management Master data exists in the data warehouse as well as in the operational systems.

MDM Solution  Data warehousing and MDM complement each other in developing enterprise data that can be leveraged for analysis and reporting. An MDM solution gathers and manages the common information for core business entities, such as customers, products, and locations.  When combined with historical information in the data warehouse, master data can provide an in- depth view of a customer’s behavior and preferences.

Information Lifecycle Management  Information lifecycle management is essentially the ability to monitor, analyze, and manage the lifecycle of applications, systems, and processes in the data warehouse environment.  The ability to control and optimize this environment includes setting policies to optimize the use of system resources (CPU, memory, disk).

Information Lifecycle Management Effective lifecycle management of the warehouse must include: ■ Storage optimization: Minimizing storage and memory requirements while not impacting performance ■ Performance management: Monitoring and analyzing the environment in order to optimize performance ■ Data retention: Policy-based lifecycle management of data to reduce storage and retrieval costs ■ Workload management: Manage and control computing resources to support varying workload requests

INFORMATION AGENDA

Information Agenda  The Information Agenda provides a proven approach for helping companies turn their information into New Intelligence.

Information Agenda

Information Strategy  The information strategy establishes those principles that will guide the organization’s efforts to create and exploit information to support its goals, objectives, and initiatives.  The organization should identify its corporate objectives and initiatives for innovation and differentiation.  The information strategy should be revisited regularly to ensure it stays aligned with the organization’s business strategy.

Roadmaps  The information roadmaps identify the sequence of iterative projects and objectives that return tangible results.  Information Infrastructure: To manage information as a strategic asset over time, companies must commit to an enterprise-  level information infrastructure.

Information Governance  Once you have your objectives aligned, roadmaps defined, and your enterprise information infrastructure in place, management becomes the key to using the Information.  Agenda to sustain the value of information over time.