Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.

Slides:



Advertisements
Similar presentations
An Introduction to Data Mining
Advertisements

Chapter 1 Business Driven Technology
Supporting End-User Access
Introduction to Databases
Unit 7: Store and Retrieve it Database Management Systems (DBMS)
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Data warehouse example
1 DATA MINING: DEFINITIONS AND DECISION TREE EXAMPLES Emily Thomas Director of Planning and Institutional Research.
Managing Data Resources
Chapter 9 Competitive Advantage with Information Systems for Decision Making © 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Business Intelligence Tracy Hartley Zack Johnson Marissa McGee Tracy Hartley Zack Johnson Marissa McGee.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining By Archana Ketkar.
Knowledge Discovery Centre: CityU-SAS Partnership 1 Speakers: Prof Y V Hui, CityU Dr H P Lo, CityU Dr Sammy Yuen, CityU Dr K W Cheng, SAS Institute Mr.
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
DataMining By Guan Hang Su CS157A section 2 fall 2005.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Enterprise systems infrastructure and architecture DT211 4
3-1 Chapter Three. 3-2 Secondary Data vs. Primary Data Secondary Data: Data that have been gathered previously. Primary Data: New data gathered to help.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Dr. Awad Khalil Computer Science Department AUC
Data Mining By Jason Baltazar, Phil Cademas, Jillian Latham, Rachel Peeler & Kamila Singh.
Chapter 5: Data Mining for Business Intelligence
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Data Mining Techniques As Tools for Analysis of Customer Behavior
DATA MINING Team #1 Kristen Durst Mark Gillespie Banan Mandura University of DaytonMBA APR 09.
Data Mining Chun-Hung Chou
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Succeeding with Technology Database Systems Basic Data Management Concepts Organizing Data in a Database Database Management Systems Using Database Systems.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Banking on Analytics Dr A S Ramasastri Director, IDRBT.
Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.
Chapter 3 The Impact of Databases. What is a database? Flat file – Access is slow – Most older legacy systems Relational – Files are linked by a duplicate.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
1 Introduction to Neural Networks And Their Applications.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
The CRISP Data Mining Process. August 28, 2004Data Mining2 The Data Mining Process Business understanding Data evaluation Data preparation Modeling Evaluation.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Data Mining In contrast to the traditional (reactive) DSS tools, the data mining premise is proactive. Data mining tools automatically search the data.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
MIS2502: Data Analytics Advanced Analytics - Introduction.
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
CHAPTER 8 DATA MINING BASICS.
Knowledge Discovery and Data Mining 19 th Meeting Course Name: Business Intelligence Year: 2009.
ERP and Related Technologies
Data Warehousing Data Mining Privacy. Reading FarkasCSCE Spring
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Introduction to Computing Lecture # 13
MIS2502: Data Analytics Advanced Analytics - Introduction
Data and Applications Security Introduction to Data Mining
Social Media Data Mining
Chapter 12: Automated data collection methods
DATA MINING.
Supporting End-User Access
Data Warehousing Data Mining Privacy
Presentation transcript:

Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento

Introduction What is Data Mining? Data Mining is the process of collecting large amounts of raw data and transforming that data into useful information. Data Warehousing? A Data Warehouse is a computerized collection of mined data.

Objectives Explore the business applications of data mining & warehousing Explore the business applications of data mining & warehousing Explain the advantages & disadvantages Explain the advantages & disadvantages Uncover software used in data mining. Uncover software used in data mining. Find what data mining is used for. Find what data mining is used for. Discover current trends, regulation, and future uses of the technology. Discover current trends, regulation, and future uses of the technology.

What is Data Mining? Data mining is the practice of searching through large amounts of computerized data to find useful patterns or trends (American Heritage Dictionary, 2008).

Data Mining Applications  Banking  Detect Fraudulent Activity  Insurance  Risk Assessment  Medicine/Healthcare  Enhance Research  Retail  Track consumer buying trends

Data Mining Software Applications - SPSS/ SPSS Clementine is a data mining toolkit with a wide variety of mining methods to choose from. This package is designed to be used by the normal person with the desire to do their own data mining projects -Previously known as yale, RapidMiner is a data mining suite which makes a wide range of techniques available.. Its easy to use interface makes it accessible for general use, while its flexibility and extensibility make it suitable for academic use and adds a number of useful visualization methods. -‘SAS Enterprise Miner,’ a tool provided in the package suite, has an easy to use data-flow GUI. A wide variety of data mining techniques are supported including: Decision Trees; Neural Networks; Market Basket Analysis; Clustering; Web Log Analysis; Regression; and Rule Induction.

Cross-Industry Standard Process for Data Mining - Understanding the business - Understanding the data - Data preparation - Modeling - Evaluation - Deployment CRISP-DM

Data Warehousing Advantages Access to information Access to information Data Inconsistency Data Inconsistency Decrease Computing Cost Decrease Computing Cost Productivity Increase Productivity Increase Increase company profits Increase company profits

Data Warehousing Disadvantages Data must be cleaned, loaded, and extracted Data must be cleaned, loaded, and extracted 80% of the overall process 80% of the overall process User Variability User Variability Proper Training Proper Training Difficult to Maintain Difficult to Maintain Incongruence among systems Incongruence among systems

Data Mining Advantages Improves Customer Satisfaction/service Improves Customer Satisfaction/service Saves Time and Money Saves Time and Money Increases Sales Effectiveness Increases Sales Effectiveness Increases profitability Increases profitability

Data Mining Disadvantages Require skilled technical users to interpret and analyze data from warehouse Require skilled technical users to interpret and analyze data from warehouse Validity of the patterns Validity of the patterns Related to real world circumstances Related to real world circumstances Unable to Identify Casual Relationships Unable to Identify Casual Relationships Reserved for the few instead of the many Reserved for the few instead of the many

Current Issues Data Quality Duplicated records Duplicated records Lack of Data Standards Lack of Data Standards Human Error Human ErrorInoperability Lack of communications among existing systems Lack of communications among existing systems Mission Creep

Trends & Current Issues 4 Major Trends Data – growing amount collected to be sifted Hardware – growing performance & storage Scientific Computing – theory, experiment, simulation Business – Meet higher standard in order to foresee risks, opportunities, and benefits for the company Growing quickly due to renewal with new methodology frequently discovered Applications of uses & methodology to medical, marketing, operations, & others The government is closely reviewing the uses of data mining, due to the possibilities both good and bad Counterterrorism data mining has been done, but in some instances has been deemed a violation of privacy

Future Research Possibilities Government’s uses for data mining. Government’s uses for data mining. National Security National Security Terrorism Detection Terrorism Detection Identity theft through data mining. Identity theft through data mining.

Conclusion/Analysis Data mining is the extraction of information that can predict future trends & behaviors Data mining is the extraction of information that can predict future trends & behaviors Requires a large amount of data to be collected, and then stored in data warehouse Requires a large amount of data to be collected, and then stored in data warehouse Possible violation of privacy in some circumstances Possible violation of privacy in some circumstances Government is getting involved with regulation, despite the counterterrorism program being a possible violation Government is getting involved with regulation, despite the counterterrorism program being a possible violation

Questions Questions Questions