Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.

Slides:



Advertisements
Similar presentations
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb.
Civil and Environmental Engineering Carnegie Mellon University Sensors & Knowledge Discovery (a.k.a. Data Mining) H. Scott Matthews April 14, 2003.
Week 9 Data Mining System (Knowledge Data Discovery)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Lecture 2 Themes in this session Knowledge discovery in databases
Data Mining By Archana Ketkar.
Lecture 4 TIES445 Data mining Nov-Dec 2007 Sami Äyrämö.
Data Mining – Intro.
Data mining By Aung Oo.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Business Intelligence
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Geographic Data Mining Marc van Kreveld Seminar for GIVE Block 1, 2003/2004.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
CSCI 347 / CS 4206: Data Mining Module 01: Introduction Topic 03: Stages in Data Mining.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining Techniques
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Business Intelligence, Data Mining and Data Analytics/Predictive Analytics By: Asela Thomason IS 495 Summer 2015.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining Chun-Hung Chou
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
2 Outline of the presentation Objectives, Prerequisite and Content Brief Introduction to Lectures Discussion and Conclusion Objectives, Prerequisite and.
1 SHIM 413 Database Applications for Healthcare Fall 2006 Slides by H. T. Bao.
Chapter 1 Introduction to Data Mining
Knowledge Discovery and Data Mining Evgueni Smirnov.
Data Mining Process A manifestation of best practices A systematic way to conduct DM projects Different groups has different versions Most common standard.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Knowledge Discovery and Data Mining Evgueni Smirnov.
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
Data Mining By Dave Maung.
Principles of Data Mining. Introduction: Topics 1. Introduction to Data Mining 2. Nature of Data Sets 3. Types of Structure Models and Patterns 4. Data.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, March 29, 2000.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Friday, 14 November 2003 William.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
Data Mining and Decision Support
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
KNOWLEDGE DISCOVERY & DATA MINING Abhishek M. Mehta ROLL NO:24.
The KDD Process for Extracting Useful Knowledge from Volumes of Data Fayyad, Piatetsky-Shapiro, and Smyth Ian Kim SWHIG Seminar.
Data Mining – Intro.
DATA MINING © Prentice Hall.
Introduction to Data Mining
Introduction C.Eng 714 Spring 2010.
Data Mining: Concepts and Techniques Course Outline
Sangeeta Devadiga CS 157B, Spring 2007
CSE591: Data Mining by H. Liu
Data Warehousing and Data Mining
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Data Warehousing Data Mining Privacy
Data Mining: Concepts and Techniques
CSE591: Data Mining by H. Liu
Presentation transcript:

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge Discovery : An Overview Course: CIS 864 Class: 6 Presenter: MANMOHAN K. UTTARWAR On: Monday, January 29, 2001

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Contents Reasoning Advances KDD:Definition Data Mining & KDD The KDD Process Stages Of KDD Primary Tasks of Data Mining Components Of Data Mining Algorithm Popular Data Mining methods Application Issues Of Data Mining Guidelines for Selecting Potential KDD Applications Research & Application Challenges for KDD

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence An Overview Explosive growth of Business,Govt. & Scientific databases. Ability to Interpret & Digest Data Inappropriate Tools & Techniques for Database Analysis

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Advances in Storage Technology Wal-Mart : 20 Million Trans/Day Health care : Multi Gigabyte Mobil Oil : 100 TB of Oil Exploration Data NASA: EOS –generates 50GB /hr Remotely Sensed Image Data

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence KDD: Definition KDD: Definition The non-trivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data non-trivial process Multiple process valid Justified patterns/models novel Previously unknown useful Can be used understandable by human and machine

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Data Mining & KDD Data Mining -Step in KDD process -Consists of particular Data Mining algorithms Under specified computational efficiency limitations produces specific enumeration of patterns KDD process is the process of using Data Mining methods (Algorithms) to extract deemed knowledge according to the specifications of measures and threshold using the database along with any required preprocessing, sub sampling & transformations of that database.

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence The KDD Processes Preliminaries Developing an understanding of the application Domain Creating a Target Data Set Choosing the Data Mining Tasks Preprocessing Data cleaning and Pre-processing Data Reduction and Projection Choosing the Data Mining Algorithms Data Mining Application Interpretation of Mining Patterns Consolidating discovered Knowledge

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Stages of KDD

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Tasks in KDD Process

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Classification Deviation and change detection Summarization Regression finding the description of several predefined classes and classify a data item into one of them. maps a data item to a real-valued prediction variable. identifying a finite set of categories or clusters to describe the data. finding a compact description for a subset of data finding a model which describes significant dependencies between variables. discovering the most significant changes in the data Primary Tasks of Data Mining Primary Tasks of Data Mining Clustering Dependency Modeling

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Primary Tasks of Data Mining Classification Regression Clustering Dependency modeling Change & Deviation Detection Summarization

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Primary Tasks of Data Mining

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Components of Data Mining Algorithms Model Representation - L describing discoverable patterns Model Evaluation - meets criteria of KDD process Search Methods - Parameter Search - Model Search

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Popular Data Mining Methods Decision Trees & Rules Non-Linear Regression & Classification Methods Example-Based Methods Probabilistic Graphical Dependency Model Relational Learning Model

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Non-Linear & Nearest Neighbor Classifier

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence SEMMA Process (Simple DM Algo.)

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Application Issues of KDD Database Marketing Analysis and Selection of Stocks Scientific applications such as -- Astronomy -- Molecular Biology -- Global Climate Change Modeling Fraud Detection & Prevention

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Guidelines for Selecting a Potential KDD Application Practical Criteria -- Potential for significant impact of an application -- No Good alternatives exists -- Organizational support -- Potential for privacy / legal issues Technical Criteria -- Availability of Significant Data -- Relevance of attributes -- Low noise levels -- Confidence intervals -- Prior Knowledge

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Challenges for KDD Larger Database High Dimensionality Over-fitting Assessing Statistical Significance Changing Data & Knowledge Missing & Noisy Data Complex Relationships between Fields Understandability of Patterns User Interaction & Prior Knowledge Integration with other Systems

Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Conclusions KDD – a desirable end product Many approaches exists - have advantages & problems Barrier in obtaining Quality Data