Data Warehousing, Access, Analysis, Mining, and Visualization

Slides:



Advertisements
Similar presentations
Supporting End-User Access
Advertisements

Decision Support Systems: An Overview
1 CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization.
Sistem (Pengantar) Penunjang Keputusan Kecerdasan Bisnis (KB) 1/20 KECERDASAN BISNIS (KB) Sifat dan sumber data Pengumpulan data, masalah dan kualitas.
4.1 Opening Vignette: Data Warehousing and DSS at Group Health Cooperative 2-3 million data records are processed monthly How to use for decision support?
Sistem (Pengantar) Penunjang Keputusan Perdagangan Elektronik 1/20 PERDAGANGAN ELEKTRONIK Tinjauan Mekanisme Aplikasi B2C Riset pasar, E-CRM dan periklanan.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Chapter 3 Database Management
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
Database Access and Techniques, most notably Data Mining.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 3-1 Chapter 3 Decision Support Systems:
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization.
1 Chapter 4 Data Management: Warehousing, Access and Visualization MSS foundation New concepts Object-oriented databases Intelligent databases Data warehouse.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc.
Data Warehousing, Access, Analysis, Mining, and Visualization
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
1 CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization.
DECISION SUPPORT SYSTEM ARCHITECTURE: The data management component.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Case 2: Emerson and Sanofi Data stewards seek data conformity
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
1 CHAPTER 4 Data Management. 2 Data Warehousing, Access, Analysis, Mining, and Visualization n MSS foundation n Many new concepts n Object-oriented databases.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Architecture of Decision Support System
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 3-1 Chapter 3 Decision Support Systems:
DATA RESOURCE MANAGEMENT
CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization 2 1.
1 CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
Advanced Database Concepts
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
1 CHAPTER 8 Enterprise Decision Support Systems. Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition, Copyright.
Primary Decision Support Technologies Management Support Systems (MSS)
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 3: Decision Support Systems Concepts, Methodologies, and Technologies:
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
Chapter 3 Decision Support Systems: An Overview
Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc.
Data Resource Management
Intelligent Systems Development
Advanced Applied IT for Business 2
Decision Support System by Simulation Model (Ajarn Chat Chuchuen)
Chapter 13 The Data Warehouse
ENHANCING MANAGEMENT DECISION MAKING
Chapter 5 Data Management
Decision Support and Business Intelligence Systems (9th Ed
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Warehouse.
Chapter 4 Data Management: Warehousing, Access and Visualization
Chapter 16 Designing Distributed and Internet Systems
MANAGING DATA RESOURCES
Supporting End-User Access
Data Warehousing Data Model –Part 1
Decision Support Systems: An Overview
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Chapter 3 Database Management
Chapter 3 Decision Support Systems: An Overview
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Presentation transcript:

Data Warehousing, Access, Analysis, Mining, and Visualization CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization

Data Warehousing, Access, Analysis, Mining, and Visualization MSS foundation Many new concepts Object-oriented databases Intelligent databases Data warehouse Data mining Online analytical processing Multidimensionality Internet / Intranet / Web Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Warehousing, Access, Analysis, and Visualization What to do with all the data that organizations collect, store, and use? (Information overload!) Solution Data warehousing Data access Data mining Online analytical processing (OLAP) Data visualization Data sources Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

The Nature and Sources of Data Data: Raw Information: Data organized to convey meaning Knowledge: Data items organized and processed to convey understanding, experience, accumulated learning, and expertise Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ DSS Data Items Documents Pictures Maps Sound Animation Video Can be hard or soft Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ Data Sources Internal External Personal Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Collection, Problems, and Quality Problems (Table 4.1) Quality: determines usefulness of data Intrinsic data quality Accessibility data quality Representation data quality Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Quality Issues in Data Warehousing Uniformity Version Completeness check Conformity check Genealogy check (drill down) Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

The Internet and Commercial Database Services For external data The Internet: major supplier of external data Commercial Data Banks: sell access to specialized databases Can add external data to the MSS in a timely manner and at a reasonable cost Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

The Internet and Commercial Databases Servers Use Web Browsers to Access vital information by employees and customers Implement executive information systems Implement group support systems (GSS) Database management systems provide data in HTML, on Web servers directly Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Database Management Systems in DSS DBMS: Software program for entering (or adding) information into a database; updating, deleting, manipulating, storing, and retrieving information A DBMS + modeling language to develop DSS DBMS to handle LARGE amounts of information Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Database Organization and Structure Relational databases Hierarchical databases Network databases Object-oriented databases Multimedia-based databases Document-based databases Intelligent databases Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ Data Warehousing Physical separation of operational and decision support environments Purpose: to establish a data repository making operational data accessible Transforms operational data to relational form Only data needed for decision support come from the TPS Data are transformed and integrated into a consistent structure Data warehousing (information warehousing): solves the data access problem End users perform ad hoc query, reporting analysis and visualization Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Warehousing Benefits Increase in knowledge worker productivity Supports all decision makers’ data requirements Provide ready access to critical data Insulates operation databases from ad hoc processing Provides high-level summary information Provides drill down capabilities Yields Improved business knowledge Competitive advantage Enhances customer service and satisfaction Facilitates decision making Help streamline business processes

Data Warehouse Architecture and Process Two-tier architecture Three-tier architecture Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Warehouse Components Large physical database Logical data warehouse Data mart Decision support systems (DSS) and executive information system (EIS) Can feed OLAP Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Marts

Copyright 2001, Prentice Hall, Upper Saddle River, NJ DW Suitability For organizations where Data are in different systems Information-based approach to management in use Large, diverse customer base Same data have different representations in different systems Highly technical, messy data formats Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Characteristics of Data Warehousing 1. Data organized by detailed subject with information relevant for decision support 2. Integrated data 3. Time-variant data 4. Non-volatile data Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

OLAP: Data Access and Mining, Querying, and Analysis Online analytical processing (OLAP) DSS and EIS computing done by end-users in online systems Versus online transaction processing (OLTP) Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ OLAP Activities Generating queries Requesting ad hoc reports Conducting statistical and other analyses Developing multimedia applications Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ OLAP uses the data warehouse and a set of tools, usually with multidimensional capabilities Query tools Spreadsheets Data mining tools Data visualization tools Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ Using SQL for Querying SQL (Structured Query Language) Data language English-like, nonprocedural, very user friendly language Free format Example: SELECT Name, Salary FROM Employees WHERE Salary >2000 Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ Data Mining for Knowledge discovery in databases Knowledge extraction Data archeology Data exploration Data pattern processing Data dredging Information harvesting Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Major Data Mining Characteristics and Objectives Data are often buried deep Client/server architecture Sophisticated new tools--including advanced visualization tools--help to remove the information “ore” End-user miner empowered by data drills and other power query tools with little or no programming skills Often involves finding unexpected results Tools are easily combined with spreadsheets, etc. Parallel processing for data mining Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Mining Application Areas Marketing Banking Retailing and sales Manufacturing and production Brokerage and securities trading Insurance Computer hardware and software Government and defense Airlines Health care Broadcasting Law enforcement

Intelligent Data Mining Use intelligent search to discover information within data warehouses that queries and reports cannot effectively reveal Find patterns in the data and infer rules from them Use patterns and rules to guide decision making and forecasting Five common types of information that can be yielded by data mining: 1) association, 2) sequences, 3) classifications, 4) clusters, and 5) forecasting

Main Tools Used in Intelligent Data Mining Case-based Reasoning Neural Computing Intelligent Agents Other Tools Decision trees Rule induction Data visualization Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Data Visualization and Multidimensionality Data Visualization Technologies Digital images Geographic information systems Graphical user interfaces Multidimensions Tables and graphs Virtual reality Presentations Animation Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ Multidimensionality 3-D + Spreadsheets (OLAP has this) Data can be organized the way managers like to see them, rather than the way that the system analysts do Different presentations of the same data can be arranged easily and quickly Dimensions: products, salespeople, market segments, business units, geographical locations, distribution channels, country, or industry Measures: money, sales volume, head count, inventory profit, actual versus forecast Time: daily, weekly, monthly, quarterly, or yearly Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Multidimensionality Limitations Extra storage requirements Higher cost Extra system resource and time consumption More complex interfaces and maintenance Multidimensionality is especially popular in executive information and support systems Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Geographic Information Systems (GIS) A computer-based system for capturing, storing, checking, integrating, manipulating, and displaying data using digitized maps Spatially-oriented databases Useful in marketing, sales, voting estimation, planned product distribution Available via the Web Can use with GPS Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Copyright 2001, Prentice Hall, Upper Saddle River, NJ Virtual Reality An environment and/or technology that provides artificially generated sensory cues sufficient to engender in the user some willing suspension of disbelief Can share data and interact Can analyze data by creating a landscape Useful in marketing, prototyping aircraft designs VR over the Internet through VRML Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Business Intelligence on the Web Can capture and analyze data from Web Tools deployed on Web Decision Support Systems and Intelligent Systems, Efraim Turban and Jay E. Aronson, 6th edition Copyright 2001, Prentice Hall, Upper Saddle River, NJ

Summary Data for decision making come from internal and external sources The database management system is one of the major components of most management support systems Familiarity with the latest developments is critical Data contain a gold mine of information if they can dig it out Organizations are warehousing and mining data Multidimensional analysis tools and new enterprise-wide system architectures are useful OLAP tools are also useful

Summary (cont’d.) New data formats for multimedia DBMS Internet and intranets via Web browser interfaces for DBMS access Built-in artificial intelligence methods in DBMS