Data Mining GyuHyeon Choi. ‘80s  When the term began to be used  Within the research community.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

Prof. Carolina Ruiz Department of Computer Science Worcester Polytechnic Institute INTRODUCTION TO KNOWLEDGE DISCOVERY IN DATABASES AND DATA MINING.
Logistics Network Configuration
Principal Component Analysis
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Week 9 Data Mining System (Knowledge Data Discovery)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
1 Data Mining Techniques Instructor: Ruoming Jin Fall 2006.
Chapter 14 The Second Component: The Database.
Data mining By Aung Oo.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Business Intelligence
WHT/ HPCC Systems Flavio Villanustre VP, Products and Infrastructure HPCC Systems Risk Solutions.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
1 Data Mining Books: 1.Data Mining, 1996 Pieter Adriaans and Dolf Zantinge Addison-Wesley 2.Discovering Data Mining, 1997 From Concept to Implementation.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Machine Learning Lecture 1. Course Information Text book “Introduction to Machine Learning” by Ethem Alpaydin, MIT Press. Reference book “Data Mining.
Data Mining By Dave Maung.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Last Words DM 1. Mining Data Steams / Incremental Data Mining / Mining sensor data (e.g. modify a decision tree assuming that new examples arrive continuously,
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, March 29, 2000.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
27-18 września Data Mining dr Iwona Schab. 2 Semester timetable ORGANIZATIONAL ISSUES, INDTRODUCTION TO DATA MINING 1 Sources of data in business,
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization 2 1.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
CENG 770. Data mining (knowledge discovery from data) – Extraction of interesting ( non-trivial, implicit, previously unknown and potentially useful)
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
The KDD Process for Extracting Useful Knowledge from Volumes of Data Fayyad, Piatetsky-Shapiro, and Smyth Ian Kim SWHIG Seminar.
Term Project Proposal By J. H. Wang Apr. 7, 2017.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
What Is Cluster Analysis?
MIS2502: Data Analytics Advanced Analytics - Introduction
INTELLIGENT SYSTEMS BUSINESS MOTIVATION BUSINESS INTELLIGENCE
Eick: Introduction Machine Learning
School of Computer Science & Engineering
Introduction to Data Mining
MIS 451 Building Business Intelligence Systems
Introduction C.Eng 714 Spring 2010.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
An Enhanced Support Vector Machine Model for Intrusion Detection
What is Pattern Recognition?
MIS5101: Data Analytics Advanced Analytics - Introduction
Advanced Embodiment Design 26 March 2015
Data Warehousing and Data Mining
Principal Component Analysis
Supporting End-User Access
CSE 635 Multimedia Information Retrieval
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Christoph F. Eick: A Gentle Introduction to Machine Learning
Welcome! Knowledge Discovery and Data Mining
CSCE 4143 Section 001: Data Mining Spring 2019.
Promising “Newer” Technologies to Cope with the
PolyAnalyst™ text mining tool Allstate Insurance example
Presentation transcript:

Data Mining GyuHyeon Choi

‘80s  When the term began to be used  Within the research community

‘80s  Definition  A set of mechanisms and techniques  To extract hidden information from data  SQL style is not data mining

‘90s  Definition  Sub-process within KDD  (Knowledge Discovery in Databases)  Different with data preparation, analysis, and visualization  Other parts of KDD

‘90s  Became popular significantly  ACM SIGKDD annual conference, 1995  European PKDD conference, 1995  Pacific/Asia PAKDD conference, 1997

‘90s  Contribution of technological advances  Processing power  Data storage capability

‘90s  Processing of large volumes of data  Even using desktop machines  Commercial enterprises started to maintain data  To support commercial activities  But not to mine

‘90s  Large super market chains  Introduction of customer loyalty cards  To record customer purchases  Started mining purchasing patterns

Present  Mining non-standard data  Text mining  Image mining  Graph mining

Present  Collective term  Different with data preparation, analysis, and visualization  Even called as “big data”

Present  Domain of AI and KE  Artificial Intelligence  Knowledge Engineering

Present  Application  Rather than a technology

Before We Go  Data mining techniques  Pattern extraction  Clustering  Classification

Before We Go  Examples of classification

Problem  Curse of dimensionality  Data in high-dimensional spaces

Problem  Earlier classification (maybe)

Problem  Current classification

Example  Clustering

Example  Anomaly detection

Example  Recommender system

Problem (again)  Current classification

Topic  Complexity  The more complex society  The more complex data mining

Dimension Reduction  Use most significant dimensions  Cannot satisfy people’s demand  Waste of storage

Principal Component Analysis  Orthogonal transformation  Computationally expensive  Still doubtful to satisfy people’s demand

Next Solution?  Currently no solutions  Or no problem

Future Solution  No algorithmic solution

Future Solution  Advanced hardware  Super-supercomputer

Future Problem  Human being as data?  Dimensions of the human Being

Future Problem  Can people be satisfied?  More and more sophisticated demand  More and more dimensions of data

Frans Coenen, The Knowledge Engineering Review, Vol. 26:1, 25-29, Cambridge University Press, 2011 Curse of dimensionality, Wikipedia Andrew Tarantola, The Quantum D-Wave 2 Is 3,600 Times Faster than a Super Computer, GIZMODO, April 2015 Peter Rüst, Dimensions of the Human Being and of Divine Action, Volume 57, Number 3, September 2005