Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.

Slides:



Advertisements
Similar presentations
Decision Tree Approach in Data Mining
Advertisements

Data Mining Sangeeta Devadiga CS 157B, Spring 2007.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Week 9 Data Mining System (Knowledge Data Discovery)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
DataMining By Guan Hang Su CS157A section 2 fall 2005.
Neural Networks in Data Mining “An Overview”
Data Mining: A Closer Look
Chapter 5 Data mining : A Closer Look.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Business Intelligence
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Dr. Awad Khalil Computer Science Department AUC
Data Mining Techniques
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
DATA MINING Team #1 Kristen Durst Mark Gillespie Banan Mandura University of DaytonMBA APR 09.
1 An Introduction to Data Mining Hosein Rostani Alireza Zohdi Report 1 for “advance data base” course Supervisor: Dr. Masoud Rahgozar December 2007.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Outline What Neural Networks are and why they are desirable Historical background Applications Strengths neural networks and advantages Status N.N and.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
MIS2502: Data Analytics Advanced Analytics - Introduction.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
Data Mining and Decision Support
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Prepared by Fayes Salma.  Introduction: Financial Tasks  Data Mining process  Methods in Financial Data mining o Neural Network o Decision Tree  Trading.
Data Mining.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining – Intro.
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Data Mining 101 with Scikit-Learn
Introduction C.Eng 714 Spring 2010.
Adrian Tuhtan CS157A Section1
Sangeeta Devadiga CS 157B, Spring 2007
Data Warehousing and Data Mining
C.U.SHAH COLLEGE OF ENG. & TECH.
Data Science introduction.
Data Mining: Introduction
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Presentation transcript:

Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu

 Definition  General Concept  Foundations  Evolution  Applications  Challenges  Algorithms  Classical  Next Generations Introduction to Data Mining

What is Data Mining?  Data mining is the process for the non-trivial extraction of implicit, previously unknown and potentially useful information from data stored in repositories using pattern recognition technologies as well as statistical and mathematical methods. Introduction to Data Mining

Foundations  Massive data collection  Powerful multiprocessor computers  Data mining algorithms

Introduction to Data Mining Evolution

Introduction to Data Mining Applications  Industry  Retails  Health maintenance group  Telecommunications  Credit card  Web mining  Sports and entertainment solutions

Introduction to Data Mining Challenges  Ability to handle different types of data  Graceful degeneration of data mining algorithms  Valuable data mining results  Representation of data mining requests and results  Mining at different abstraction levels  Mining information from different sources of data  Protection of privacy and data security

Introduction to Data Mining Hierarchy of Choices and Decisions  Business goal  Collecting, cleaning and preparing data  Prediction  Model type and algorithms

Introduction to Data Mining Data Description  Descriptions of data characteristics in elementary and aggregated form  Summarization  Visualization

Introduction to Data Mining Predictive Data Mining  Predictive modeling is a term used to describe the process of mathematically or mentally representing a phenomenon or occurrence with a series of equations or relationships.

Introduction to Data Mining Prediction: Classification  Classification predicts class membership  Pre-classify (using classification algorithms)  Test to determine the quality of the model  Predict (using effective classifier)

Introduction to Data Mining Prediction: Regression  Regression takes a numerical dataset and develops a mathematical formula that fits the data.  When you're ready to use the results to predict future behavior, you simply take your new data, plug it into the developed formula and you get a prediction!

Introduction to Data Mining Algorithms  Classical Techniques  Statistics  Neighborhoods  Clustering  Next Generations  Decision Tree  Neural Network  Rule Induction

Introduction to Data Mining Statistics  Classical Statistics:  Related to the collection and description of data  Believes: there exists an underlying pattern of data distribution  Objective: find the best guess  Data Mining:  Employs statistical methods  Needs to analyze huge amounts of data  Beyond traditional statistics

Introduction to Data Mining Neighborhoods  Basic idea:  For a new problem, look for the similar problems (neighborhoods) that have been solved  Key point: find the neighborhood  Calculate the distance: how far is good to be considered as a neighbor?  Which class the new problem belong to?  Large computational load:  New calculation for each new case

Introduction to Data Mining Clustering  Elements grouped together according to different characteristics  Every cluster share same values (homogenous)  Problem: Control the number of cluster  Hierarchical clustering: flexibility  Non-hierarchical clustering: given by user  Used most frequently for:  Consolidating data into a high-level of view  Group records into likely behaviors

Introduction to Data Mining Decision Tree  A way of representing a series of rules that lead to a class or value  Structure:  Decision node, branches, leaves  Example: A loan officer wants to determine the credit of applicants

Introduction to Data Mining Decision Tree (continued)  Help to induce the tree and its rules to make predictions

Introduction to Data Mining Neural Networks  Efficiently modeling large and complex problems with hundreds of predictor variables  Structure:  Input layer, hidden layer, output layer  Activation function between nodes  Requires training and testing of relations

Introduction to Data Mining Neural Networks (continued)  Example:

Introduction to Data Mining Rule Induction  A method to derive a set of rules to classify cases  For example, rule induction can be used to discover patterns relating decisions (e.g., credit card application)  Rules may not cover all possible situations

Introduction to Data Mining