D ATA M INING A N O VERVIEW BY : J OSEPH C ASABONA Data Warehouse-->

Slides:



Advertisements
Similar presentations
Chapter 1 Business Driven Technology
Advertisements

1 Chapter 34 Data Mining Transparencies © Pearson Education Limited 1995, 2005.
Shipi Kankane Prashanth Nakirekommula.  Applying analytics and risk- management capabilities to health insurance through LexisNexis data platforms. 
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining Jessica Jackson Kimberli Klein Kevin Wood.
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining Concepts 1.1 COT5230 Data Mining Week 1 Data Mining Concepts M O N A S H A U S T R A L I A ’ S I N T E R N A T I O N A L U N I V E R S I T.
Data Mining – Intro.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
Data Mining: A Closer Look
Business Intelligence
Chapter 35 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Data Mining CS 157B Section 2 Keng Teng Lao. Overview Definition of Data Mining Application of Data Mining.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Discussion of: A Taxonomy to Guide Research on the Application of Data Mining to Fraud Detection in Financial Statement Analysis Severin Grabski Department.
6/22/2006 DATA MINING I. Definition & Business-Related Examples Mohammad Monakes Fouad Alibrahim.
Dr. Awad Khalil Computer Science Department AUC
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
© Negnevitsky, Pearson Education, Introduction, or what is data mining? Introduction, or what is data mining? Data warehouse and query tools Data.
1 Data Mining Books: 1.Data Mining, 1996 Pieter Adriaans and Dolf Zantinge Addison-Wesley 2.Discovering Data Mining, 1997 From Concept to Implementation.
Business Intelligence Solutions for the Insurance Industry DAT – 13 Data Warehousing Rasool Ahmed.
Data mining: some basic ideas Francisco Moreno Excerpts from Fundamentals of DB Systems, Elmasri & Navathe and other sources.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Introduction, or what is data mining? Introduction, or what is data mining? Data warehouse and query tools Data warehouse and query tools Decision trees.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Sigur Ecommerce Pvt. Ltd.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
DATA MINING By Cecilia Parng CS 157B.
Foundations of Business Intelligence: Databases and Information Management.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Advanced Database Concepts
DATA MINING IN THE CORPORATE WORLD BY RYANN A. WARD.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Data Mining Transparencies
Data Mining.
DATA MINING © Prentice Hall.
Introduction to Data Mining
MIS 451 Building Business Intelligence Systems
Adrian Tuhtan CS157A Section1
Sangeeta Devadiga CS 157B, Spring 2007
Data Analysis.
Data Warehousing Data Mining Privacy
Data Mining The process of extracting valid, previously unknown, comprehensible, and actionable information from large databases and using it to make.
Welcome! Knowledge Discovery and Data Mining
CSE591: Data Mining by H. Liu
Business Intelligence
Presentation transcript:

D ATA M INING A N O VERVIEW BY : J OSEPH C ASABONA Data Warehouse-->

O VERVIEW What is Data Mining? Introduction to KDD Type of Data found using Data Mining The 4 Goals of Data Mining Case Study: MetLife

W HAT IS D ATA M INING ? Definition: The mining or discovery of new information in terms of patterns or rules from vast amounts of data Adds more functionality than a DBMS Creates relationships within the data One step in the KDD Process

KDD Stands for "Knowledge Discovery in Databases" Six step process that helps us organize and extract new data from already existing data The six steps are: data selection, cleansing, enrichment, transformation, mining, and report generation.

KDD CONT. Selection and cleaning grab and validate the data to make sure it's good, complete, and proper. Enrichment will add more to the data from other sources. Transformation then limits the data in some way

D ATA M INING Result is new information the user would not know just by standard querying. Can be in the form of: o Association Rules o Sequential Patterns o Classification Trees

T HE F OUR G OALS OF D ATA M INING Prediction: Using current data to make prediction on future activities Identification: "Data patterns can be used to identify the existence of an item, an event, or an activity"

T HE F OUR G OALS CONT. Classification: Breaking the data down into categories based on certain attributes. Optimization: Using the mined data to make optimizations on resources, such as time, money, etc.

D ATA M INING E XAMPLES Most have been consumer bases Applicable in most industries Next: Case Study on MetLife

C ASE S TUDY : M ET L IFE Company Profile MetLife, Inc. is a leading provider of insurance and other financial services to millions of individual and institutional customers throughout the United States. Established in 1863, Metlife now has offices all over the US and the world, and offers ten different types of insurances and financial services.

C ASE S TUDY : M ET L IFE Industry: Insurance and Financial Services How they use Data Mining: Fraud Detection

C ASE S TUDY : M ET L IFE Project first started in 2001 MetLife set out to build $50 Million relational database This project would consolidate data from 30 business world wide.

C ASE S TUDY : M ET L IFE Around same time, it was reported that $30 Million of insurance money went to fraudulent claims. MetLife teamed up with Computer Sciences Corporation (CSC) to o License their data mining tool (called Fraud Investigator), o "an early fraud detection system"

C ASE S TUDY : M ET L IFE By 2003, MetLife's data mining operation was in full swing. They were able to detect fraud in a fraction of the time it would take in man hours One example is detecting rate evasion

C ASE S TUDY : M ET L IFE Rate evasion is lying about where you live to pay lower premiums. Metlife used data mining to detect rate evasion by matching ZIP codes with phone numbers to see if the cities matched. In 2.5 hours, Metlife found 107 fraudulent claims, all linked to a rate-evasion ring in NY and Massachusetts.

Q UESTIONS /C OMMENTS ?