Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge.

Slides:



Advertisements
Similar presentations
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Advertisements

Introduction to Data Mining with XLMiner
Mining the Data Ira M. Schoenberger, FACHCA Senior Administrator 2011 AHCA/NCAL Quality Symposium Friday February 18, 2011.
Week 9 Data Mining System (Knowledge Data Discovery)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
SLIDE 1IS 257 – Fall 2008 Data Mining and the Weka Toolkit University of California, Berkeley School of Information IS 257: Database Management.
Building Knowledge-Driven DSS and Mining Data
Data Mining – Intro.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Warehousing by Industry Chapter 4 e-Data. Retail Data warehousing’s early adopters Capturing data from their POS systems  POS = point-of-sale Industry.
Dr. Awad Khalil Computer Science Department AUC
Chapter 5: Data Mining for Business Intelligence
Overview DM for Business Intelligence.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Business Intelligence, Data Mining and Data Analytics/Predictive Analytics By: Asela Thomason IS 495 Summer 2015.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining Chun-Hung Chou
Chapter 3: Marketing Intelligence Copyright © 2010 Pearson Education Canada1.
Chapter 7 DATA, TEXT, AND WEB MINING Pages , 311, Sections 7.3, 7.5, 7.6.
The CRISP-DM Process Model
Knowledge Management Assessment of an Organization
Copyright © 2008 by Nelson, a division of Thomson Canada Limited SECONDARY DATA RESEARCH IN A DIGITAL AGE Chapter 6 Part 2 Designing Research Studies.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Data Mining By : Tung, Sze Ming ( Leo ) CS 157B. Definition A class of database application that analyze data in a database using tools which look for.
Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Data Mining By Dave Maung.
Technology In Action Chapter 11 1 Databases and… Databases and their uses Database components Types of databases Database management systems Relational.
The CRISP Data Mining Process. August 28, 2004Data Mining2 The Data Mining Process Business understanding Data evaluation Data preparation Modeling Evaluation.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Part II Tools for Knowledge Discovery Ch 5. Knowledge Discovery in Databases Ch 6. The Data Warehouse Ch 7. Formal Evaluation Technique.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
An Investigation of Commercial Data Mining Presented by Emily Davis Supervisor: John Ebden.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
1-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Week 11 Knowledge Discovery Systems: Systems That Create Knowledge.
Data Mining Copyright KEYSOFT Solutions.
Customer Relationship Management (CRM) Chapter 4 Customer Portfolio Analysis Learning Objectives Why customer portfolio analysis is necessary for CRM implementation.
Knowledge Discovery and Data Mining 19 th Meeting Course Name: Business Intelligence Year: 2009.
Monday, February 22,  The term analytics is often used interchangeably with:  Data science  Data mining  Knowledge discovery  Extracting useful.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Chapter 2 Data, Text, and Web Mining. Data Mining Concepts and Applications  Data mining (DM) A process that uses statistical, mathematical, artificial.
Copyright  2007 McGraw-Hill Pty Ltd PPTs t/a Marketing Research 2e by Lukas, Hair, Bush and Ortinau Slides prepared by Judy Rex 19-1 Chapter Nineteen.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
Data mining in web applications
Quantitative Methods for Business Studies
KNOWLEDGE MANAGEMENT (KM) Session # 38
Data Based Decision Making
DATA MINING © Prentice Hall.
Web Mining Ref:
Data and Applications Security Introduction to Data Mining
Knowledge Discovery Systems: Systems That Create Knowledge
Knowledge Discovery Systems: Systems That Create Knowledge
Sangeeta Devadiga CS 157B, Spring 2007
Week 11 Knowledge Discovery Systems & Data Mining :
PROJECTS SUMMARY PRESNETED BY HARISH KUMAR JANUARY 10,2018.
KNOWLEDGE MANAGEMENT (KM) Session # 37
Presentation transcript:

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter Objectives  To explain how knowledge is discovered  To describe knowledge discovery systems, including design considerations, and how they rely on mechanisms and technologies  To explain data mining (DM) technologies  To discuss the role of DM in customer relationship management

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Knowledge Synthesis through Socialization To discover tacit knowledge Socialization enables the discovery of tacit knowledge through joint activities  between masters and apprentices  between researchers at an academic conference

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Knowledge Discovery from Data – Data Mining Another name for Knowledge Discovery in Databases is data mining (DM). Data mining systems have made a significant contribution in scientific fields for years. The recent proliferation of e-commerce applications, providing reams of hard data ready for analysis, presents us with an excellent opportunity to make profitable use of data mining.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Data Mining Techniques Applications Marketing – Predictive DM techniques, like artificial neural networks (ANN), have been used for target marketing including market segmentation. Direct marketing – customers are likely to respond to new products based on their previous consumer behavior. Retail – DM methods have likewise been used for sales forecasting. Market basket analysis – uncover which products are likely to be purchased together.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Data Mining Techniques Applications Banking – Trading and financial forecasting are used to determine derivative securities pricing, futures price forecasting, and stock performance. Insurance – DM techniques have been used for segmenting customer groups to determine premium pricing and predict claim frequencies. Telecommunications – Predictive DM techniques have been used to attempt to reduce churn, and to predict when customers will attrition to a competitor. Operations management – Neural network techniques have been used for planning and scheduling, project management, and quality control.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Designing the Knowledge Discovery System – CRISP DM 1.Business Understanding – To obtain the highest benefit from data mining, there must be a clear statement of the business objectives. 2.Data Understanding – Knowing the data well can permit the designer to tailor the algorithm or tools used for data mining to his/her specific problem. 3.Data Preparation – Data selection, variable construction and transformation, integration, and formatting 4.Model building and validation – Building an accurate model is a trial and error process. The process often requires the data mining specialist to iteratively try several options, until the best model emerges. 5.Evaluation and interpretation – Once the model is determined, the validation dataset is fed through the model. 6.Deployment – Involves implementing the ‘live’ model within an organization to aid the decision making process.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall CRISP-DM Data Mining Process Methodology

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall 1. Business Understanding process a.Determine Business objectives – To obtain the highest benefit from data mining, there must be a clear statement of the business objectives. b.Situation Assessment – T he majority of the people in a marketing campaign who receive a target mail, do not purchase the product. c.Determine Data Mining Goal – Identifying the most likely prospective buyers from the sample, and targeting the direct mail to those customers, could save the organization significant costs. d.Produce Project Plan – This step also includes the specification of a project plan for the DM study.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall 2. Data Understanding process a.Data collection – Defines the data sources for the study, including the use of external public data, and proprietary databases. b.Data description – Describes the contents of each file or table. Some of the important items in this report are: number of fields (columns) and percent of records missing. c.Data quality and verification – Define if any data can be eliminated because of irrelevance or lack of quality. d.Exploratory Analysis of the Data – Use to develop a hypothesis of the problem to be studied, and to identify the fields that are likely to be the best predictors.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall 3. Data Preparation process a.Selection – Requires the selection of the predictor variables and the sample set. b.Construction and transformation of variables – Often, new variables must be constructed to build effective models. c.Data integration – The dataset for the data mining study may reside on multiple databases, which would need to be consolidated into one database. d.Formatting – Involves the reordering and reformatting of the data fields, as required by the DM model.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall 4. Model building and Validation process a.Generate Test Design – Building an accurate model is a trial and error process. The data mining specialist iteratively try several options, until the best model emerges. b.Build Model – Different algorithms could be tried with the same dataset. Results are compared to see which model yields the best results. c.Model Evaluation – In constructing a model, a subset of the data is usually set-aside for validation purposes. The validation data set is used to calculate the accuracy of predictive qualities of the model.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall 5. Evaluation and Interpretation process a.Evaluate Results – Once the model is determined, the predicted results are compared with the actual results in the validation dataset. b.Review Process – Verify the accuracy of the process. c.Determine Next Steps – List of possible actions decision.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall 6. Deployment process a.Plan Deployment – This step involves implementing the ‘live’ model within an organization to aid the decision making process.. b.Produce Final Report – Write a final report. c.Plan Monitoring and Maintenance – Monitor how well the model predicts the outcomes, and the benefits that this brings to the organization. d.Review Project – Experience, and documentation.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall The Iterative Nature of the KDD process

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Data Mining Techniques 1.Predictive Techniques  Classification: Data mining techniques in this category serve to classify the discrete outcome variable.  Prediction or Estimation: DM techniques in this category predict a continuous outcome (as opposed to classification techniques that predict discrete outcomes). 2.Descriptive Techniques  Affinity or association: Data mining techniques in this category serve to find items closely associated in the data set.  Clustering: DM techniques in this category aim to create clusters of input objects, rather than an outcome variable.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Web Data Mining - Types 1.Web structure mining – Examines how the Web documents are structured, and attempts to discover the model underlying the link structures of the Web.  Intra-page structure mining evaluates the arrangement of the various HTML or XML tags within a page  Inter-page structure refers to hyper-links connecting one page to another. 2.Web usage mining (Clickstream Analysis) – Involves the identification of patterns in user navigation through Web pages in a domain.  Processing, Pattern analysis, and Pattern discovery 3.Web content mining – Used to discover what a Web page is about and how to uncover new knowledge from it.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Data Mining and Customer Relationship Management CRM is the mechanisms and technologies used to manage the interactions between a company and its customers. The data mining prediction model is used to calculate a score: a numeric value assigned to each record in the database to indicate the probability that the customer represented by that record will behave in a specific manner.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Barriers to the use of DM Two of the most significant barriers that prevented the earlier deployment of knowledge discovery in the business relate to:  Lack of data to support the analysis  Limited computing power to perform the mathematical calculations required by the DM algorithms.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Case Study An application of Rule Induction to real estate appraisal systems  In this case, we seek specific knowledge that we know can be found in the data in databases, but which can be difficult to extract.  Procedure to create the decision tree:  Data preparation and preprocessing  Tree construction  House pruning  Paired leaf analysis

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Case Study An application of Rule Induction to real estate appraisal systems AttributeInduction ResultsExpert EstimateDifference Living Area$15 - $31$15 - $ % Bedrooms$ $5212$ $ % Bathrooms$ $5718$ $ % Garage$ $4522$ $ % Pool$ $11697$ $ % Fireplace$ $4180$ $ % Year Built % % % Summary of Induction Results

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Case Study An application of Rule Induction to real estate appraisal systems Partial Decision Tree Results for Real Estate Appraisal

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Case Study An application of Web Content mining to Expertise Locator Systems  NASA Expert Seeker Web Miner demo  A KM system that locates experts based on published documents requires: Automatic method for identifying employee names. A method to associate employee names with skill keywords embedded in those documents.

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Conclusions In this Chapter we: Described knowledge discovery systems, including design considerations, and how they rely on mechanisms and technologies Learned how knowledge is discovered:  Through through socialization with other knowledgeable persons  Trough DM by finding interesting patterns in observations, typically embodied in explicit data Explained data mining (DM) technologies Discussed the role of DM in customer relationship management

Becerra-Fernandez, et al. -- Knowledge Management 1/e -- © 2004 Prentice Hall Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge