Enterprise systems infrastructure and architecture DT211 4

Slides:



Advertisements
Similar presentations
IT 433 Data Warehousing and Data Mining
Advertisements

Decision Tree Approach in Data Mining
1 Chapter 34 Data Mining Transparencies © Pearson Education Limited 1995, 2005.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
1. Abstract 2 Introduction Related Work Conclusion References.
Data Mining: A Closer Look Chapter Data Mining Strategies.
Basic Data Mining Techniques Chapter Decision Trees.
Data Mining Knowledge Discovery in Databases Data 31.
Data Mining.
Basic Data Mining Techniques
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining: A Closer Look Chapter Data Mining Strategies (p35) Moh!
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
DataMining By Guan Hang Su CS157A section 2 fall 2005.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Data Mining: A Closer Look
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Chapter 5 Data mining : A Closer Look.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Chapter 35 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Chapter 7 Decision Tree.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Basic Data Mining Techniques
Dr. Awad Khalil Computer Science Department AUC
Chapter 5: Data Mining for Business Intelligence
Data Mining Techniques
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Business Intelligence, Data Mining and Data Analytics/Predictive Analytics By: Asela Thomason IS 495 Summer 2015.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Chapter 13 Genetic Algorithms. 2 Data Mining Techniques So Far… Chapter 5 – Statistics Chapter 6 – Decision Trees Chapter 7 – Neural Networks Chapter.
Data Mining and Application Part 1: Data Mining Fundamentals Part 2: Tools for Knowledge Discovery Part 3: Advanced Data Mining Techniques Part 4: Intelligent.
Inductive learning Simplest form: learn a function from examples
Introduction To Data Mining. What Is Data Mining? A toolA tool Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful)
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 2 Data Mining: A Closer Look Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Data Mining In contrast to the traditional (reactive) DSS tools, the data mining premise is proactive. Data mining tools automatically search the data.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
MIS2502: Data Analytics Advanced Analytics - Introduction.
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
CHAPTER 8 DATA MINING BASICS.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
Data Mining Copyright KEYSOFT Solutions.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Basic Data Mining Techniques Chapter 3-A. 3.1 Decision Trees.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 3 Basic Data Mining Techniques Jason C. H. Chen, Ph.D. Professor of MIS School of Business.
Chapter 2 Data, Text, and Web Mining. Data Mining Concepts and Applications  Data mining (DM) A process that uses statistical, mathematical, artificial.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Data Mining is the process of analyzing data and summarizing it into useful information Data Mining is usually used for extremely large sets of data It.
Data Mining Transparencies
MIS2502: Data Analytics Advanced Analytics - Introduction
Data Mining 101 with Scikit-Learn
Adrian Tuhtan CS157A Section1
Supporting End-User Access
Data Mining The process of extracting valid, previously unknown, comprehensible, and actionable information from large databases and using it to make.
Kenneth C. Laudon & Jane P. Laudon
Presentation transcript:

Enterprise systems infrastructure and architecture DT211 4 Data Mining Enterprise systems infrastructure and architecture DT211 4

Note for next year Data mining in Laudon and laudon and Kopec paper gives a few more good ideas

Data Mining The process of extracting valid, previously unknown, comprehensible, and actionable information from large databases and using it to make crucial business decisions. Involves the analysis of data and the use of software techniques for finding hidden and unexpected patterns and relationships in sets of data.

Data Mining Data mining tools uses ,e.g. AI techniques, to help: predict future trends: , Segment datasets “Product” association allowing businesses to make proactive, knowledge-driven decisions.

Data mining: A.I. techniques. The most commonly used techniques A.I. techniques in data mining are: Decision trees: Tree-shaped structures that represent sets of decisions. These decisions generate rules for the classification of a dataset. Nearest neighbour method: A technique that classifies each record in a dataset based on a combination of the classes of the k record(s) most similar to it in a historical dataset. Sometimes called the k-nearest neighbour technique; a clustering technique Rule induction: The extraction of useful if-then rules from data based on statistical significance. Artificial neural networks: Predictive models that learn through training and resemble biological neural networks in structure.

How Data Mining Works For example, say that you are the director of marketing for a insurance company and you'd like to acquire some new customers You could just randomly go out and mail coupons to the general population. However you would not achieve the required result. Alternatively As the marketing director you have access to a lot of information about all of your customers: their age, sex, income range and credit card insurance.

How Data Mining Works Customers Prospects General information (e.g. demographic data) Known Proprietary information (e.g. customer transactions) Target The goal in prospecting is to make some decisions about the information in the lower right hand quadrant based on the model that we build going from Customer General Information to Customer Proprietary Information.

An Algorithm for Building Decision Trees Consider the following using decision trees. The following is decision tree algorithm: 1. Let T be the set of training instances. 2. Choose an attribute that best differentiates the instances in T. 3. Create a tree node whose value is the chosen attribute. -Create child links from this node where each link represents a unique value for the chosen attribute. -Use the child link values to further subdivide the instances into subclasses. 4. For each subclass created in step 3: -If the instances in the subclass satisfy predefined criteria or if the set of remaining attribute choices for this path is null, specify the classification for new instances following this decision path. -If the subclass does not satisfy the criteria and there is at least one attribute to further subdivide the path of the tree, let T be the current set of subclass instances and return to step 2.

Customers who earn between 50 K to 60 K have a life insurance policy. How Data Mining Works For instance, a simple model for a Insurance company might be: Customers who earn between 50 K to 60 K have a life insurance policy. This model could then be applied to the general population to target those for the life insurance promotion. The tree can be more complex e.g. See figure opposite

Data Mining Operations Data mining operations include: Predictive modelling: decision trees, regression analysis… Database segmentation: clustering techniques Link analysis: decision trees, association rules

Predictive Modeling Applications of predictive modelling include direct marketing and use techniques like decision trees. uses observations to form a model of the important characteristics of some phenomenon: e.g. those traits associated with those who will buy property Simple decision tree example

Database Segmentation Aim is to partition a database into an unknown number of segments, or clusters, of similar records. Uses clustering techniques in order to group data Applications of database segmentation include credit card fraud….

Database Segmentation using a Scatterplot

Link Analysis Aims to establish links between records, or sets of records, in a database; one such example would be association discovery…. Applications include product affinity analysis. Finds items that imply the presence of other items in the same event.

Link Analysis - Associations Discovery Affinities between items are represented by association discovery. e.g. ‘When a customer rents property for more than 2 years and is more than 25 years old, in 40% of cases, the customer will buy a property. This association happens in 35% of all customers who rent properties’.

Examples of Applications of Data Mining Retail / Marketing Predicting response to mailing campaigns Market basket analysis Banking: Detecting patterns of fraudulent credit card use. Insurance Claims analysis Medicine Identifying successful medical therapies for different illnesses

Data mining in conclusion Two critical factors for success with data mining are: a large, well-integrated data warehouse and a well-defined understanding of the business process within which data mining is to be applied (e.g. customer prospecting, retention, campaign management etc.).

Sample types questions Discuss, using suitable examples how data mining can contribute to companies making a proactive knowledge driven decisions which could help with formulation of a companies strategy.