데이터 마이닝 기술(Ⅰ) 데이터 마이닝(data mining)의 정의 대량의 실제 데이터로부터 이전에 잘 알려지지는 않았지만

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

Overview of Data Mining & The Knowledge Discovery Process Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
1. Abstract 2 Introduction Related Work Conclusion References.
SAK 5609 DATA MINING Prof. Madya Dr. Md. Nasir bin Sulaiman
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Data Mining: A Closer Look
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Chapter 5 Data mining : A Closer Look.
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
10 Data Mining. What is Data Mining? “Data Mining is the process of selecting, exploring and modeling large amounts of data to uncover previously unknown.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Business Intelligence, Data Mining and Data Analytics/Predictive Analytics By: Asela Thomason IS 495 Summer 2015.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining Chun-Hung Chou
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Forecast Anything! The Seven Data Mining Models Andy Cheung ISV Developer Evangelist Microsoft Hong Kong.
Data Mining Techniques As Tools for Analysis of Customer Behavior Lecture 2:
Chapter 13 Genetic Algorithms. 2 Data Mining Techniques So Far… Chapter 5 – Statistics Chapter 6 – Decision Trees Chapter 7 – Neural Networks Chapter.
Overview of Data Mining Methods Data mining techniques What techniques do, examples, advantages & disadvantages.
Data Clustering 1 – An introduction
Data Mining By Dave Maung.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Overview of Methods Data mining techniques What techniques do, examples, advantages & disadvantages.
Introduction to Data-Mining Marko Grobelnik Institut Jozef Stefan.
Business Intelligence - 2 BUS 782. Topics Data warehousing Data Mining.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining Copyright KEYSOFT Solutions.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Show Me Potential Customers Data Mining Approach Leila Etaati.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Data Mining is the process of analyzing data and summarizing it into useful information Data Mining is usually used for extremely large sets of data It.
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
Data Mining – Intro.
DATA MINING © Prentice Hall.
RESEARCH APPROACH.
MIS 451 Building Business Intelligence Systems
Data Mining 101 with Scikit-Learn
Data Mining Techniques So Far…
Adrian Tuhtan CS157A Section1
Sangeeta Devadiga CS 157B, Spring 2007
כריית נתונים.
Supporting End-User Access
Understanding Customer Behaviors with Information Technologies
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining: Concepts and Techniques
Welcome! Knowledge Discovery and Data Mining
Presentation transcript:

데이터 마이닝 기술(Ⅰ) 데이터 마이닝(data mining)의 정의 대량의 실제 데이터로부터 이전에 잘 알려지지는 않았지만 묵시적이고 잠재적으로 유용한 정보를 추출하는 작업 cf) KDD(Knowledge Discovery in Database) 데이터베이스로부터 지식을 추출하는 전 과정

데이터 마이닝 기술(Ⅱ) 전문가 시스템 기계학습 KDD Data Minig 데이터 베이스 통계학 가시화

데이터 마이닝 기술(IV) 데이터 마이닝 기법 연관규칙(association rule) K-최단 인접(K-nearest neighbor) 의사결정트리(decision tree) 신경망(neural network) 유전자 알고리즘(genetic algorithm) 통계적 기법(statistical technique)

데이터 마이닝 기술(V) 데이터 마이닝 주요 작업(primaty tasks) 분류화(classification) 군집화(clustering) 특성화(characterization, summerization) 경향분석(trend analysis) 연관규칙 탐사(association) – Monket Basket Analysis 패턴분석(pattern analysis) Estimation Prediction Text Mining Web Mining Web Contents Web Log

데이터 마이닝 기술(VII) 응용 분야 Marketing & retail Banking Finance Insurance Medicine & health (Genetics) Quality control Transportation Geo – Spaetial Applications

DM Tasks Classifications (1/2) ○▱△ large ○○○ ○▱Ⅹ medium △△△ △○Ⅹ small ⅩⅩ objects predefined classes

DM Tasks Classification (2/2) (ex) news [ International ] [ domestic ] [ sports ] [ culture ] : credit application [ high ] [ medium ] [ low ] water sample data [ 일급수 ] [ 이급수 ] [ 꾸정물 ] (alg) Decision Trees, Memory Based Reasoning

DM Tasks Estimation (1/2) Attr 1 Attr 2 : (Continuous) Value cf. classification maps to discrete categories

DM Tasks Estimation (2/2) (ex) 나이, 성별, 혈압, … 잔여수명 나이, 성별, 직업, … 연 수입 나이, 성별, 직업, … 연 수입 지역, 수량, 인구, … 오염농도 (alg) neural net (*) estimating future value is called Prediction.

DM Tasks [○△] Association (1/2) Market Basket Analysis Determine which things go together [○○△☓] [○□△] [☆□] [○☆☓△] ⋮ [○△]

DM Tasks Association (2/2) Water Sample [NO2, C2H5OH,] Eg) Shopping list Cross - Selling (super market, (shelf, catalog, home shopping, CF, ⋯) E-shopping,etc.) Water Sample [NO2, C2H5OH,] Alg) Association rules

DM Tasks Clustering (1/4) Cf. classification - predefined category clustering - find new category & explain the category G1 G2 G3 G4 Heterogeneous population Homogeneous subgroups (cluster)

DM Tasks Clustering (2/4) eg) symptoms → disease customer info → selective sales 토양 data (수질) Note : Clustering is dependent to the features used. card 예 : number, color, suite…

DM Tasks Clustering (3/4) Clustering is useful for Exception finding Calling card fraud detection Credit card fraud etc. exceptions

DM Tasks Clustering (4/4) alg) K-means → K clusters Note) directed vs non-directed KDD