Dental Insurance Fraud Detection With Predictive Modeling SOA Spring Health Meeting May 29, 2008 Jonathan Polon FSA www.claimanalytics.com.

Slides:



Advertisements
Similar presentations
Improving Disability Claims Management with Predictive Modeling May 15, 2008 Claim Analytics Inc. Barry Senensky FSA FCIA MAAA Jonathan Polon FSA
Advertisements

Endodontics and the Dental Student [Insert presenter name, degree] [Insert practice name] [Insert date]
Evaluating Inforce Blocks Of Disability Business With Predictive Modeling SOA Spring Health Meeting May 28, 2008 Jonathan Polon FSA
© 2014 Fair Isaac Corporation. Confidential. This presentation is provided for the recipient only and cannot be reproduced or shared without Fair Isaac.
Disability Pricing and Dental Fraud Detection: Supervised and Unsupervised Learning October 11, 2007 Jonathan Polon FSA Claim Analytics Inc.
Predictive Modeling for Disability Pricing May 13, 2009 Claim Analytics Inc. Barry Senensky FSA FCIA MAAA Jonathan Polon FSA
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
BA 555 Practical Business Analysis
Multi-Scale Analysis for Network Traffic Prediction and Anomaly Detection Ling Huang Joint work with Anthony Joseph and Nina Taft January, 2005.
Data Mining: A Closer Look Chapter Data Mining Strategies (p35) Moh!
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Part I: Classification and Bayesian Learning
Predictive Modeling and Dental Fraud Detection November 22, 2007 Jonathan Polon FSA Barry Senensky FSA FCIA MAAA Claim Analytics Inc.
CORRELATIO NAL RESEARCH METHOD. The researcher wanted to determine if there is a significant relationship between the nursing personnel characteristics.
Data Mining: A Closer Look
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Chapter 5 Data mining : A Closer Look.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Dr. Awad Khalil Computer Science Department AUC
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
Data Mining Techniques
Overview DM for Business Intelligence.
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
© 2010 IBM Corporation © 2011 IBM Corporation September 6, 2012 NCDHHS FAMS Overview for Behavioral Health Managed Care Organizations.
1 Reading Report 9 Yin Chen 29 Mar 2004 Reference: Multivariate Resource Performance Forecasting in the Network Weather Service, Martin Swany and Rich.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Data Mining and Application Part 1: Data Mining Fundamentals Part 2: Tools for Knowledge Discovery Part 3: Advanced Data Mining Techniques Part 4: Intelligent.
Anomaly detection with Bayesian networks Website: John Sandiford.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
1 Statistical Techniques Chapter Linear Regression Analysis Simple Linear Regression.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
Data Mining – Best Practices Part #2 Richard Derrig, PhD, Opal Consulting LLC CAS Spring Meeting June 16-18, 2008.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Part I Data Mining Fundamentals Chapter 1 Data Mining: A First View Jason C. H. Chen, Ph.D. Professor.
1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 12 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Dimension Reduction in Workers Compensation CAS predictive Modeling Seminar Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc.
Predictive Modeling Spring 2005 CAMAR meeting Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Principal Component Analysis
Clustering Algorithms Minimize distance But to Centers of Groups.
Monday, February 22,  The term analytics is often used interchangeably with:  Data science  Data mining  Knowledge discovery  Extracting useful.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Machine Learning Supervised Learning Classification and Regression K-Nearest Neighbor Classification Fisher’s Criteria & Linear Discriminant Analysis Perceptron:
CSE 4705 Artificial Intelligence
Unsupervised Learning
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Introduction to Data Mining
Validating the PRIDIT method for determining hospital quality with outcomes data Robert Lieberthal, PhD, Dominique Comer, PharmD, Katherine O’Connell,
Dimension Reduction in Workers Compensation
Outlier Processing via L1-Principal Subspaces
Data and Applications Security Introduction to Data Mining
Principal Component Analysis (PCA)
Understanding Research Results: Description and Correlation
Dr. Morgan C. Wang Department of Statistics
Descriptive Statistics vs. Factor Analysis
What is Regression Analysis?
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Azure Machine Learning
Unsupervised Learning
Presentation transcript:

Dental Insurance Fraud Detection With Predictive Modeling SOA Spring Health Meeting May 29, 2008 Jonathan Polon FSA

Intro to Predictive Modeling Supervised vs Unsupervised Learning Modeling Procedures Dental Insurance Fraud Detection Examples of Atypical Dentists Agenda

Introduction to Predictive Modeling

A Part of Everyday Life Have you used a predictive model today? Mail sorting Credit card processing Credit scores Weather forecasting Grocery shopping

What is Predictive Modeling Harnesses power of modern computers to find hidden patterns in data Used extensively in industry Many possible uses in insurance: Claim management Pricing Reserving Fraud detection

Predictive Modeling Tools Some common techniques Generalized linear models Neural networks Genetic algorithms Random forests Stochastic gradient boosted trees Support vector machines

SupervisedVs Unsupervised Learning

Supervised Learning In the training dataset, there is a known outcome associated with each record Learning objective: build a model that can accurately estimate the outcomes from the predictor variables for each record Eg, Sports Betting: predict winners based on observations from past games Commonly referred to as predictive modeling

Unsupervised Learning In the training dataset, there is not an outcome associated with any record Learning objective: self-organization or clustering; finding structure in the data. Eg, Grocery Stores: define types of shoppers and their preferences

ModelingDentalProcedures

Why Model Procedures? There’s more to diagnosis than category within categories, severity varies similarities can exist between procedures of different categories how do we extract more information?

Scoring Procedures Create a series of metrics for each procedure some relative values from 0 – 10 Some absolute values for example: -unbundling- specialized -major problem- emergency allows every procedure to be compared to every other procedure

Scoring Procedures - Example Caries, trauma, pain control Pulpotomy, molars, emergency Root canals, 3 canals, calcified CategoryRestorativeEndodontics Specialized008 Major work5810 EmergencyNoYesNo

Dental Fraud Detection Using Unsupervised Learning

Research Project 200,000 dental claim records from 2004 General practitioners in Ontario Applied modern pattern detection techniques to identify dentists with atypical claims histories Research paper: Dental Insurance Claims: Identification of Atypical Claims Activity

Traditional Tools Rule-based Strong at identifying claims that match known types of fraudulent activity Limited to identifying what is known Typically analyze at the level of a single claim, in isolation

Pattern Detection Tools Analyze millions of claims to reveal patterns –Technology learns what is normal and what is atypical – not limited by what we already know Categorize each claim, reveals dentists with a high percentage of atypical claims –Immediately highlights new questionable behaviors –Finds new large dollar schemes –Also identifies frequent repetition of small dollar abuses

Methodology – 3 Perspectives 1. By Claim e.g. Joe Green’s semi-annual check-up 2. By Tooth e.g. all work done in 2004 on Joe’s bicuspid by Dr. Brown 3. By General Work e.g. all general work (exams, fluoride, radiographs, scaling and polishing) done on Joe in 2004 by Dr. Brown

Methodology – 2 Techniques 1. Principal components analysis 2. Clustering

PCACluster Claim by claim Tooth by tooth General work Methodology – Summary

Principal Components Analysis Allows reduction of multi-dimensional data to lower dimensions while maximizing amount of information preserved Reducing to 2 dimensions allows data points to be plotted on a chart Powerful approach for identifying outliers

PCA: Simple Example Lots of variance between points around both axes

PCA: Simple Example Create new axes, X’ and Y’: rotate original axes 45 o

PCA: Simple Example In the new axes, very little variance around Y’ Y’ contains little “information”

PCA: Simple Example Can set all Y’ values to 0 – ie, ignore Y’ axis Result: reduce to 1 dimension with little loss of info ♦ Original ● Revised

PCA: Identifying Atypical Dentists

1.Begin at the individual transaction level 2.Determine the average transaction for each dentist 3.Graph quickly isolates dentists that are outliers

Dentist Average 90 th percentile PCA: Identifying Atypical Dentists

PCA: Summary 1.Visualization allows for easy and intuitive identification of dentists that are atypical 2.Manual investigation required to understand why a dentist or claim is atypical

Clustering Techniques Categorization tools Organize claims into several groups of similar claims Applicable for profiling dentists by looking at the percentage of claims in each cluster We apply two different clustering techniques: K-Means and Expectation-Maximization

Clustering Example

Example: Clusters found – by Claim 1.Major dental problems 2.Moderate dental problems 3.Age > 28, minor work performed 4.Work includes unbundled procedures 5.Minor work and expensive technologies 6.Age < 28, minor work performed

Clusters: Identifying Atypical Dentists 1. Begin at the individual claim level 2. Calculate the proportion of claims in each cluster – in total, and for each dentist 3. Isolate dentists with large deviations from average

Clusters: Identifying Atypical Dentists

Clustering Techniques: Summary 1.Clustering provides an easy to understand method to profile dentists 2.Unlike PCA, clustering tells why a given dentist is considered atypical 3.Effectively identifies atypical activity at the dentist level

Pilot Project Results PCA identified 68 dentists that are atypical Clusters identified 182 dentists that are atypical 36 dentists are identified as atypical by both PCA and Clusters In total, 214 of 1,644 dentists are identified as atypical (13%) Billings by atypical dentists were $2.5 MM out of $16.1 MM billed by all dentists (15%)

ExamplesOf Atypical Dentists

Analysis: Dentist A is far beyond norms in clusters 6 and 8; both indicate high charges in ‘general dentistry’ What we discovered: Each of Dentist A’s patients is being billed for at least 30 minutes of polishing and 45 minutes of scaling Dentist A

Dentist B Analysis : Dentist B is far beyond norms in cluster 7 (age > 19, major work) What we discovered: Frequent extractions and anesthesia  Dentist B looks like an oral surgeon, but is a general practitioner

Dentist C Analysis: Very high proportion in Cluster 1, suggesting many high-ticket visits What we discovered: First, Dentist C performs an inordinate amount of scaling (average of 1 hour p.p.). Second, Dentist C has emergency examinations with atypically high frequency

Dentist M Analysis: Dentist M has a disproportionate amount of work beyond the 90 th percentile of work by all dentists on all teeth What we discovered: Dentist M appears to utilize lab work very heavily

More Atypical Dentists Frequent use of panoramic x-rays Frequent use of nitrous oxide – including with procedures rarely associated with anesthesia Large number of extractions, often using multiple types of sedation Very high proportion of claims for crowns and endodontic work Individual instruction on oral hygiene provided with very atypical frequency

Dental Fraud Summary Highly effective in identifying dentists with claim portfolios significantly different from the norm Enables experts to quickly identify and focus on those dentists with atypical claims activity Pattern detection:

Questions?