Data Mining.

Slides:



Advertisements
Similar presentations
Overview of Data Mining & The Knowledge Discovery Process Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Advertisements

modified by Marius Bulacu
1 Introduction and Review CS 636 – Adv. Data Mining.
Data Mining: Concepts and Techniques
Data Mining Knowledge Discovery in Databases Data 31.
Dr. Tahar Kechadi Dr. Joe Carthy
Data Mining By Archana Ketkar.
Data Mining – Intro.
Data mining By Aung Oo.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
CS-470: Data Mining Fall Organizational Details Class Meeting: 4:00-6:45pm, Tuesday, Room SCIT215 Instructor: Dr. Igor Aizenberg Office: Science.
Data Mining Chapter 26.
10 Data Mining. What is Data Mining? “Data Mining is the process of selecting, exploring and modeling large amounts of data to uncover previously unknown.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Chapter 1. Introduction Motivation: Why data mining?
Data Mining Techniques As Tools for Analysis of Customer Behavior
Business Intelligence
Data Mining: Introduction. Why Data Mining? l The Explosive Growth of Data: from terabytes to petabytes –Data collection and data availability  Automated.
Data Mining: Concepts and Techniques
Data Mining Techniques As Tools for Analysis of Customer Behavior Lecture 2:
Data Warehousing/Mining 1 Data Warehousing/Mining Comp 150 DW Chapter 1. Introduction Instructor: Dan Hebert.
Chapter 1 Introduction to Data Mining
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
Introduction Pertemuan 01 Matakuliah: M0614 / Data Mining & OLAP Tahun : Feb
2015年10月18日星期日 2015年10月18日星期日 2015年10月18日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
October 18, 2015 Data Mining: Concepts and Techniques 1 DATA MINING Motivation: Why data mining? What is data mining? Data Mining: On what kind of data?
2015年10月22日星期四 2015年10月22日星期四 2015年10月22日星期四 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Introduction to Data-Mining Marko Grobelnik Institut Jozef Stefan.
Data Mining: Concepts and Techniques. Overview 1.Introduction 2.Data Preprocessing 3.Data Warehouse and OLAP Technology: An Introduction 4.Advanced Data.
Han: Introduction to KDD 1 Introduction to Knowledge Discovery and Data Mining ©Jiawei Han and Micheline Kamber Intelligent Database Systems Research Lab.
1 Knowledge Discovery from DataBases (KDD) A.K.A. Data Mining & by other names as well Carlo Zaniolo UCLA CS Dept.
January 8, 2016Data Mining: Concepts and Techniques1 Data Mining: Trends and Applications.
January 17, 2016Data Mining: Concepts and Techniques 1 What Is Data Mining? Data mining (knowledge discovery from data) Extraction of interesting ( non-trivial,
Conclusions. Why Data Mining? -- Potential Applications Database analysis and decision support – Market analysis and management target marketing, customer.
Academic Year 2014 Spring Academic Year 2014 Spring.
February 13, 2016 Data Mining: Concepts and Techniques 1 1 Data Mining: Concepts and Techniques These slides have been adapted from Han, J., Kamber, M.,
Business Intelligence Introduction & Overview. 2 of 25 Examples: Telecommunications Huge amount of data is collected daily: –Transactional data (about.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Data Warehousing/Mining 1. 2 Chapter 1. Introduction v Motivation: Why data mining? v What is data mining? v Data Mining: On what kind of data? v Data.
CENG 514. Data mining (knowledge discovery from data) – Extraction of interesting ( non-trivial, implicit, previously unknown and potentially useful)
2016年6月12日星期日 2016年6月12日星期日 2016年6月12日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Lecture-2 Bscshelp.com.  Why Data Mining and What Kinds of Data Can Be Mined?  Potential Applications 2.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
July 7, 2016 Data Mining: Concepts and Techniques 1 1.
Data Mining.
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining – Intro.
DATA MINING BY: PRADEEP AGRAWAL MBA (SEC – A) ALLIANCE UNIVERSITY – SCHOOL OF BUSINESS.
Data warehouse & Data Mining: Concepts and Techniques
Introduction C.Eng 714 Spring 2010.
Introduction to Data Mining
Data Mining: Concepts and Techniques
Data Warehousing and Data Mining
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Data Mining Concepts and Techniques
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Presentation transcript:

Data Mining

What is Data Mining/KDD Data mining (knowledge discovery from data) Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amount of data

What is Data Mining By definition is the process of extracting previously unknown data from large databases and using it to make orgnisational decisions. Is concerned with the discovery of hidden knowledge. Usually works on large volumes of data Is useful in making critical organisationnal decisions, particularly those of strategic nature

Data Mining Data Mining referred using a number of names: Data Fishing, Data Dredging (1960…): Used by statisticians (as bad name) Knowledge Discovery in Databases (1989…): Used by AI, Machine Learning Community Business Intelligence (1990…): Business management term Also data archaeology, information harvesting, information discovery, knowledge extraction, data/pattern analysis, etc.

Data Mining: On What Kinds Of Data? Relational database Data warehouse Transactional database Advanced database and information repository Object-relational database Spatial and temporal data Time-series data Stream data Multimedia database Text databases & WWW

Data Mining Functionalities Concept description Generalize, summarize, and contrast data characteristics, e.g., dry vs. wet regions Association (correlation and causality) Nappies & Beer Classification and Prediction Construct models that describe and distinguish classes or concepts for future prediction Predict some unknown or missing numerical values

Data Mining Functionalities Cluster analysis Class label is unknown: Group data to form new classes, e.g., cluster houses to find distribution patterns Outlier analysis Outlier: a data object that does not comply with the general behavior of the data Noise or exception? No! useful in fraud detection and rare event analysis Other pattern-directed or statistical analyses

Data Mining is Multidisciplinary Statistics Pattern Recognition Neurocomputing Machine Learning AI Data Mining Databases KDD

Why we Need Data Mining Data explosion problem Automated data collection tools and mature database technology lead to huge amounts of data accumulated We are drowning in data, but starving for knowledge! Solution: Data warehousing and data mining Data warehousing and on-line analytical processing Mining interesting knowledge (rules, regularities, patterns, constraints) from data in large databases

Potential Applications Data analysis and decision support Market analysis and management Risk analysis and management Fraud detection and detection of unusual patterns Other applications Text mining (email, documents) and Web mining Stream data mining DNA and bio-data analysis

Stages of KDD Evaluation & Presentation Data Mining Knowledge Evaluation & Presentation Data Mining Selection & Transformation Data Warehouse Cleaning & Integration Databases

Issues and Challenges of Data Mining Data mining methodology Mining different kinds of knowledge from diverse data types, e.g., bio, stream, Web Performance: efficiency, effectiveness, and scalability Pattern evaluation: the interestingness problem Incorporation of background knowledge Handling noise and incomplete data Parallel, distributed and incremental mining methods Integration of the discovered knowledge with existing one: knowledge fusion

Issues and Challenges of Data Mining User interaction Data mining query languages and ad-hoc mining Expression and visualization of resultant knowledge Interactive mining of knowledge at multiple levels of abstraction Applications and social impacts Domain-specific data mining & invisible data mining Protection of data security, integrity, and privacy

Market Analysis And Management Where does the data come from? Credit card transactions, loyalty cards, discount coupons, customer complaint calls, etc Target marketing Find clusters of “model” customers who share the same characteristics Determine customer purchasing patterns over time Cross-market analysis Associations/co-relations between product sales, & prediction based on such association

Market Analysis And Management (cont…) Customer profiling What types of customers buy what products (clustering or classification) Customer requirement analysis Identifying the best products for different customers Predict what factors will attract new customers Provision of summary information Multidimensional summary reports Statistical summary information (data central tendency and variation)

Corporate Analysis & Risk Management Finance planning and asset evaluation Cash flow analysis and prediction Contingent claim analysis to evaluate assets Cross-sectional and time series analysis (financial-ratio, trend analysis, etc.) Resource planning Summarize and compare the resources and spending Competition Monitor competitors and market directions Group customers into classes and a class-based pricing procedure Set pricing strategy in a highly competitive market

Fraud Detection & Mining Unusual Patterns Applications: Health care, retail, credit card service, telecommunications Auto insurance: ring of collisions Money laundering: suspicious monetary transactions Medical insurance Professional patients, ring of doctors, and ring of references Unnecessary or correlated screening tests Telecommunications: phone-call fraud Phone call model: destination of the call, duration, time of day or week. Analyze patterns that deviate from an expected norm Retail industry Analysts estimate that 38% of retail shrink is due to dishonest employees Anti-terrorism Approaches: Clustering, model construction, outlier analysis, etc.