Database Access and Techniques, most notably Data Mining.

Slides:



Advertisements
Similar presentations
Database Management Character, file, field, record, database??? 1.
Advertisements

Supporting End-User Access
1 CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization.
Sistem (Pengantar) Penunjang Keputusan Kecerdasan Bisnis (KB) 1/20 KECERDASAN BISNIS (KB) Sifat dan sumber data Pengumpulan data, masalah dan kualitas.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Chapter 3 Database Management
MP3 / MD740 Strategy & Information Systems Oct. 13, 2004 Databases & the Data Asset, Types of Information Systems, Artificial Intelligence.
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Chapter 13 The Data Warehouse
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization.
Lecture-8/ T. Nouf Almujally
Business Intelligence Instructor: Bajuna Salehe Web:
Chapter 13 – Data Warehousing. Databases  Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age  Information,
M ODULE 5 Metadata, Tools, and Data Warehousing Section 4 Data Warehouse Administration 1 ITEC 450.
Chapter 4 Data, Text, and Web Mining
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
1 Chapter 4 Data Management: Warehousing, Access and Visualization MSS foundation New concepts Object-oriented databases Intelligent databases Data warehouse.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
MPIII Database Technologies Relational Concepts Data Warehouses & Marts Queries, OLAP, Data Mining.
MD703 Final Class: Database Internal / External Integration Outsourcing.
Data Warehouse & Data Mining
Database Systems – Data Warehousing
MD240 - MIS Oct. 4, 2005 Databases & the Data Asset Harrah’s & Allstate Cases.
Data Warehousing, Access, Analysis, Mining, and Visualization
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
1 CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
DECISION SUPPORT SYSTEM ARCHITECTURE: The data management component.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
Datawarehouse Objectives
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
BUS1MIS Management Information Systems Semester 1, 2012 Week 6 Lecture 1.
Data Warehousing.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
CISB594 – Business Intelligence
1 CHAPTER 4 Data Management. 2 Data Warehousing, Access, Analysis, Mining, and Visualization n MSS foundation n Many new concepts n Object-oriented databases.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
Data Warehouse. Group 5 Kacie Johnson Summer Bird Washington Farver Jonathan Wright Mike Muchane.
CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization 2 1.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
MIS2502: Data Analytics Advanced Analytics - Introduction.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
Advanced Database Concepts
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
Business intelligence systems. Data warehousing. An orderly and accessible repositery of known facts and related data used as a basis for making better.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Chapter 2 Data, Text, and Web Mining. Data Mining Concepts and Applications  Data mining (DM) A process that uses statistical, mathematical, artificial.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
MIS2502: Data Analytics Advanced Analytics - Introduction
Chapter 13 The Data Warehouse
Data Warehousing, Access, Analysis, Mining, and Visualization
Chapter 13 – Data Warehousing
المحاضرة 4 : مستودعات البيانات (Data warehouse)
Data Warehouse and OLAP
Data Warehousing and Data Mining
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Introduction of Week 9 Return assignment 5-2
Data Warehouse and OLAP
Presentation transcript:

Database Access and Techniques, most notably Data Mining.

Tools: Computers and IT. VB, VBA, Excel, InterDev, Etc. Humans: Decision Making Process Algorithms: Math/Flow Chart stuff that helps the tools help the humans make decisions. DSS Data: Facts pertinent to the decision at hand.

Warehouses & Marts Data Warehouse –a database designed to support decision-making in an organization. It is batch-updated and structured for fast online queries and exploration. Data warehouses may aggregate enormous amounts of data from many different operational systems. Data Mart –a database focused on addressing the concerns of a specific problem or business unit (e.g. Marketing, Engineering). Size doesn’t define data marts, but they tend to be smaller than data warehouses.

Data Warehouses & Data Marts TPS & other operational systems Data Warehouse Data Mart (Marketing) Data Mart (Engineering) 3rd party data = query, OLAP, mining, etc. = operational clients

Data Warehousing Physical separation of operational and decision support environments Purpose: to establish a data repository making operational data accessible Transforms operational data to relational form Only data needed for decision support come from the TPS Data are transformed and integrated into a consistent structure Data warehousing (or information warehousing): a solution to the data access problem End users perform ad hoc query, reporting analysis and visualization

Data Warehousing Benefits Increase in knowledge worker productivity Supports all decision makers’ data requirements Provide ready access to critical data Insulates operation databases from ad hoc processing Provides high-level summary information Provides drill down capabilities Yields –Improved business knowledge –Competitive advantage –Enhances customer service and satisfaction –Facilitates decision making –Help streamline business processes

DW Suitability For organizations where Data are in different systems Information-based approach to management in use Large, diverse customer base Same data have different representations in different systems Highly technical, messy data formats

Transform Data from TPS to Warehouse Consolidate data –e.g. from multiple TPS around the country/world “Scrub” the data –keep definitions consistent (e.g. translate part numbers/product names if they differ per country) Calculate fields (decrease processor load) Summarize fields (decrease processor load) De-normalize data (ease of use)

OLAP: Data Access and Mining, Querying and Analysis Online Analytical processing (OLAP) –DSS and EIS computing done by end-users in online systems –Versus online transaction processing (OLTP) OLAP Activities –Generating queries –Requesting ad hoc reports –Conducting statistical analyses –Building multimedia applications

OLAP uses the data warehouse and a set of tools, usually with multidimensional capabilities Query tools Spreadsheets Data mining tools Data visualization tools

Using SQL for Querying SQL (Structured Query Language) Data language English-like, nonprocedural, very user friendly language Free format Example: SELECTName, Salary FROMEmployees WHERESalary >2000

Query Tools & OLAP Query Tools –user-lead discovery. Can return individual records or summaries. Requests are formulated in advance (e.g. “show me all delinquent accounts in the northeast region during Q1”). OLAP - Online Analytical Processing –user-lead discovery. Data is explored via “drill down” into the data by selecting variables to summarize on. Results are usually reported in a cross-tab report or graph (e.g. “show me a tabular breakdown of sales by business unit, product type, and year”).

OLAP Online Analytical Processing. (example of cross-tab results presented below) 1. business unit 2. product type 3. year

Data Mining automated information discovery process, uncovers important patterns in existing data –can use neural networks or other approaches. Requires ‘clean’, reliable, consistent data. Historical data must reflect the current environment. e.g. “What are the characteristics that identify when we are likely to lose a customer?”

Data Mining For Knowledge discovery in databases Knowledge extraction Data archeology Data exploration Data pattern processing Data dredging Information harvesting

Data Mining Examples Market Segmentation - e.g. Dayton Hudson Direct Marketing - e.g. Chase Market basket analysis - e.g. Wal-Mart Customer Churn - e.g. Fleet Bank Fraud Detection - e.g. Bank of America Cost Reduction Prospecting - e.g. Merk Medco.

Major Data Mining Characteristics and Objectives Data are often buried deep Client/server architecture Sophisticated new tools--including advanced visualization tools--help to remove the information “ore” Massaging and synchronizing data Usefulness of “soft” data End-user minor is empowered by “data drills” and other power query tools with little or no programming skills Often involves finding unexpected results Tools are easily combined with spreadsheets etc. Parallel processing for data mining

Data Mining Application Areas Marketing Banking Retailing and sales Manufacturing and production Brokerage and securities trading Insurance Computer hardware and software Government and defense Airlines Health care Broadcasting Law Enforcement … almost everywhere!!!

Stupid Data-Miner Tricks Ad-Hoc Theories –when an oddity jumps out of the data, it’s tempting to develop a theory for it. Sometimes findings are just statistical flukes. Using Too Many Variables –the more factors considered, the more likely a relationship will be found - valid or not. Not Taking No for an Answer –it’s OK to stop looking if you can’t find anything. There are no silver bullets. Limited or incorrect interpretation