Presentation is loading. Please wait.

Presentation is loading. Please wait.

D ATA M INING A N O VERVIEW BY : J OSEPH C ASABONA Data Warehouse-->

Similar presentations


Presentation on theme: "D ATA M INING A N O VERVIEW BY : J OSEPH C ASABONA Data Warehouse-->"— Presentation transcript:

1 D ATA M INING A N O VERVIEW BY : J OSEPH C ASABONA Data Warehouse-->

2 O VERVIEW What is Data Mining? Introduction to KDD Type of Data found using Data Mining The 4 Goals of Data Mining Case Study: MetLife

3 W HAT IS D ATA M INING ? Definition: The mining or discovery of new information in terms of patterns or rules from vast amounts of data Adds more functionality than a DBMS Creates relationships within the data One step in the KDD Process

4 KDD Stands for "Knowledge Discovery in Databases" Six step process that helps us organize and extract new data from already existing data The six steps are: data selection, cleansing, enrichment, transformation, mining, and report generation.

5 KDD CONT. Selection and cleaning grab and validate the data to make sure it's good, complete, and proper. Enrichment will add more to the data from other sources. Transformation then limits the data in some way

6 D ATA M INING Result is new information the user would not know just by standard querying. Can be in the form of: o Association Rules o Sequential Patterns o Classification Trees

7 T HE F OUR G OALS OF D ATA M INING Prediction: Using current data to make prediction on future activities Identification: "Data patterns can be used to identify the existence of an item, an event, or an activity"

8 T HE F OUR G OALS CONT. Classification: Breaking the data down into categories based on certain attributes. Optimization: Using the mined data to make optimizations on resources, such as time, money, etc.

9 D ATA M INING E XAMPLES Most have been consumer bases Applicable in most industries Next: Case Study on MetLife

10 C ASE S TUDY : M ET L IFE Company Profile MetLife, Inc. is a leading provider of insurance and other financial services to millions of individual and institutional customers throughout the United States. Established in 1863, Metlife now has offices all over the US and the world, and offers ten different types of insurances and financial services.

11 C ASE S TUDY : M ET L IFE Industry: Insurance and Financial Services How they use Data Mining: Fraud Detection

12 C ASE S TUDY : M ET L IFE Project first started in 2001 MetLife set out to build $50 Million relational database This project would consolidate data from 30 business world wide.

13 C ASE S TUDY : M ET L IFE Around same time, it was reported that $30 Million of insurance money went to fraudulent claims. MetLife teamed up with Computer Sciences Corporation (CSC) to o License their data mining tool (called Fraud Investigator), o Develop @First, "an early fraud detection system"

14 C ASE S TUDY : M ET L IFE By 2003, MetLife's data mining operation was in full swing. They were able to detect fraud in a fraction of the time it would take in man hours One example is detecting rate evasion

15 C ASE S TUDY : M ET L IFE Rate evasion is lying about where you live to pay lower premiums. Metlife used data mining to detect rate evasion by matching ZIP codes with phone numbers to see if the cities matched. In 2.5 hours, Metlife found 107 fraudulent claims, all linked to a rate-evasion ring in NY and Massachusetts.

16 Q UESTIONS /C OMMENTS ?


Download ppt "D ATA M INING A N O VERVIEW BY : J OSEPH C ASABONA Data Warehouse-->"

Similar presentations


Ads by Google