1 Data Mining in Pharmaceutical Marketing and Sales Analysis Pavel Brusilovskiy, PhD Merck.

Slides:



Advertisements
Similar presentations
Statistics for Improving the Efficiency of Public Administration Daniel Peña Universidad Carlos III Madrid, Spain NTTS 2009 Brussels.
Advertisements

Data Mining vs. Statistics
Brief introduction on Logistic Regression
Statistical Tests Karen H. Hagglund, M.S.
Data Analysis Statistics. Inferential statistics.
Lecture 3: Chi-Sqaure, correlation and your dissertation proposal Non-parametric data: the Chi-Square test Statistical correlation and regression: parametric.
Statistical Methods Chichang Jou Tamkang University.
Data Mining.
Data Analysis Statistics. Inferential statistics.
Data Mining – Intro.
Causal Models, Learning Algorithms and their Application to Performance Modeling Jan Lemeire Parallel Systems lab November 15 th 2006.
Today Concepts underlying inferential statistics
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Spreadsheet Modeling & Decision Analysis A Practical Introduction to Management Science 5 th edition Cliff T. Ragsdale.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Leedy and Ormrod Ch. 11 Gray Ch. 14
Data Mining Techniques
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 24 Statistical Inference: Conclusion.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Research Terminology for The Social Sciences.  Data is a collection of observations  Observations have associated attributes  These attributes are.
Simple Linear Regression
Chapter 12 Multiple Regression and Model Building.
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
Chapter 6 Regression Algorithms in Data Mining
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Multivariate Data Analysis CHAPTER seventeen.
1 Multivariate Analysis (Source: W.G Zikmund, B.J Babin, J.C Carr and M. Griffin, Business Research Methods, 8th Edition, U.S, South-Western Cengage Learning,
Analyzing and Interpreting Quantitative Data
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.
Regression Models Fit data Time-series data: Forecast Other data: Predict.
Kano Model & Multivariate Statistics Dr. Surej P John.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
EDCI 696 Dr. D. Brown Presented by: Kim Bassa. Targeted Topics Analysis of dependent variables and different types of data Selecting the appropriate statistic.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Chapter 6: Analyzing and Interpreting Quantitative Data
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 12 Multiple.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Analyzing Statistical Inferences July 30, Inferential Statistics? When? When you infer from a sample to a population Generalize sample results to.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Impact of Sales Force Structure Change on Products Performance Pilot Study Business Intelligence Solutions June,
Classification Ensemble Methods 1
Data Mining and Decision Support
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Chapter Seventeen Copyright © 2004 John Wiley & Sons, Inc. Multivariate Data Analysis.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Appendix I A Refresher on some Statistical Terms and Tests.
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Data Mining – Intro.
KAIR 2013 Nov 7, 2013 A Data Driven Analytic Strategy for Increasing Yield and Retention at Western Kentucky University Matt Bogard Office of Institutional.
Understanding Statistical Inferences
Welcome! Knowledge Discovery and Data Mining
Chapter 12 Analyzing Semistructured Decision Support Systems
Presentation transcript:

1 Data Mining in Pharmaceutical Marketing and Sales Analysis Pavel Brusilovskiy, PhD Merck

2 Contents What is Data Mining? Data Mining vs. Statistics: what is the difference? Why Data Mining is important tool in pharmaceutical marketing research and sales analysis? Case Study

3 What is the Data Mining? “The magic phrase to put in every funding proposal you write to NSF, DARPA, NASA, etc” “Data Mining is a process of torturing the data until they confess” “The magic phrase you use to sell your…..  - database software  - statistical analysis software  - parallel computing hardware  - consulting services”

4 Data Mining  is a cutting edge technology to analyze diverse, multidisciplinary and multidimensional complex data  is defined as the non-trivial iterative process of extracting implicit, previously unknown and potentially useful information from your data Data mining could identify relationships in your multidimensional and heterogeneous data that cannot be identified in any other way Successful application of state-of-the-art data mining technology to marketing, sales, and outcomes research problems (not to mention drug discovery) is indicative of analytic maturity and the success of a pharmaceutical company

5 Data Mining and Related Fields Visualization Machine Learning Statistics Database Data Mining Is Data Mining extension of Statistics?

6 Statistics vs. Data Mining: Concepts FeatureStatisticsData Mining Type of ProblemWell structuredUnstructured / Semi-structured Inference RoleExplicit inference plays great role in any analysis No explicit inference Objective of the Analysis and Data Collection First – objective formulation, and then - data collection Data rarely collected for objective of the analysis/modeling Size of data setData set is small and hopefully homogeneous Data set is large and data set is heterogeneous Paradigm/ApproachTheory-based (deductive)Synergy of theory-based and heuristic- based approaches (inductive) Signal-to-Noise RatioSTNR > 30 < STNR <= 3 Type of AnalysisConfirmativeExplorative Number of variablesSmallLarge

7 Statistics vs. Data Mining: Regression Modeling FeatureStatisticsData Mining Number of inputsSmallLarge Type of inputsInterval scaled and categorical with small number of categories (percentage of categorical variables is small) Any mixture of interval scaled, categorical, and text variables MulticollinearityWide range of degree of multicollinearity with intolerance to multicollinearity Severe multicollinearity is always there, tolerance to multicollinearity Distributional assumptions, homoscedasticity, outliers, missing values Intolerance to distrubitional assumption violation, homoscedasticity, Outliers/leverage points, missing values Tolerance to distributional assumption violation, outliers/leverage points, and missing values Type of modelLinear / Non-linear / Parametric / Non- Parametric in low dimensional X-space (intolerance to uncharacterizable non- linearities) Non-linear and non-parametric in high dimensional X-space with tolerance to uncharacterizable non-linearities

8 What is an unstructured problem? Well-structured Business ProblemUnstructured Business Problem Definition Can be described with a high degree of completeness Cannot be described with a high degree of completeness Can be solved with a high degree of certaintyCannot be resolved with a high degree of certainty Experts usually agree on the best method and best solution Experts often disagree about the best method and best solution Can be easily and uniquely translated into quantitative counterpart Cannot be easily and uniquely translated into quantitative counterpart Goal Find the best solutionFind reasonable solution Complexity Ranges from very simple to complexRanges from complex to very complex Example Project: Sample size calculation Key business question: what is the physician sample size (PCP vs. OBGYN) to detect five script difference of Product A. sales? How to translate this business question into quantitative counterpart? No data No variables Project: Customer Feedback Study Key business question: Is there any relationship between customer perception and performance? How to translate this business question into quantitative counterpart? Data: 800 observations/interactions and 400 variables/attributes Variables are differently scaled

9 What are differences between Data Mining and Statistics? Statistical analysis is designed to deal with well structured problems:  Results are software and researcher independent  Inference reflects statistical hypothesis testing Data mining is designed to deal with unstructured problems  Results are software and researcher dependent  Inference reflects computational properties of data mining algorithm at hand

10 Is Data Mining extension of Statistics? Data Mining and Statistics: mutual fertilization with convergence Statistical Data Mining (Graduate course, George Mason University) Statistical Data Mining and Knowledge Discovery (Hardcover) by Hamparsum Bozdogan (Editor)Hamparsum Bozdogan  An overview of Bayesian and frequentist issues that arise in multivariate statistical modeling involving data mining Data Mining with Stepwise Regression (Dean Foster, Wharton School)  use interactions to capture non-linearities  use Bonferroni adjustment to pick variables to include  use the sandwich estimator to get robust standard errors

11 When data mining technology is appropriate? Data mining technology is appropriate if:  The business problem is unstructured  Accurate prediction is more important than the explanation  The data include the mixture of interval, nominal, ordinal, count, and text variables, and the role and the number of non-numeric variables are essential  Among those variables there are a lot of irrelevant and redundant attributes  The relationship among variables could be non-linear with uncharacterizable nonlinearities  The data are highly heterogeneous with a large percentage of outliers, leverage points, and missing values  The sample size is relatively large Important marketing, sales, and outcomes research studies have the majority of these features

12 Accurate prediction is more important than the explanation

13 Case Study: Effectiveness Evaluation of Vaccine Sales Force New lunched vaccine (10 months in marketplace) Sales force structure: two sales team  Team_1  Team_2 Some locations are visited only by Team_1 (1-up promotion), others – by both teams (2-up promotion) Promotion is on doctors/HCP level, but sales is on location level Business question: What is the effectiveness of 2-up promotion?

14 Dependent Variables and Study Design Dependent variables (criteria to judge):  Total dosage purchased (shipped or ordered?)  Total dosage purchased per promotion dollar  Probability of making a purchase  Time to the first purchase  Frequency of purchase, etc. Study Design:  Test (2-up promotion locations) – Control (1-up promotion locations) Consider Vaccine sales as two criteria problem:  Estimate outcome for Test and Control groups, taking into account difference in sales difference in promotion cost Use pre-period data to match Test and Control groups and post- period data to compare sales and promotion cost

15 Independent Variables 47 input variables  Demographics of a location Geography, specialty, potentials, potential decile, average age, reimbursement, believes, etc.  Promotion activities Number of direct interactions with HCP by Team_1 Number of direct interactions with HCP by Team_2 Percentage of direct interaction with decision maker by Team_1 Percentage of direct interaction with decision maker by Team_2 Number of phone interactions with HCP by Team_1 Number of phone interactions with HCP by Team_2, etc.

16 Variables Distribution Private Dosage Potential Public dosage Potential Number of Sales Calls Vaccine Dosage Sales

17 Methodology Form Test - Control groups, using only pre-period data and propensity score methodology with greedy one-to- one matching technique on propensity score Develop models for the post-period data for total vaccine sales, controlling for  “location demographics” variables  promotion variables in pre-period  sales in pre-period Estimate the difference in sales for Test and Control groups, taking into account promotion cost

18 Propensity Score Propensity score  is the predicted probability of receiving the treatment (probability of belonging to a test group)  is a function of several differently scaled covariates Propensity_Score = f (location demographics variables, promotion variables, sales variables), 0 < f < 1 where f is a non-parametric non-linear multivariate function A sample matched on propensity score will be similar across all covariates used to calculate propensity score

19 Propensity score with SAS EM Neural Net was the best modeling paradigm

20 Finding and Business Implication There is no algorithmically significant difference in sales between Test and Control groups, but promotion cost for Control group is two times lower than for Test group. In other words, 2-up structure does not produce desired/expected outcome for Vaccine performance Team / Group Mean of Sales Call Test5.29 Control2.76

21 Phone call / Sales call response curve for Vaccine sales, constructed by TreeNet Number of Purchases Number of Purchases Number of Purchases has a strong diminishing returns effect when the number of sale calls becomes greater than thirty five and the number of Phone calls becomes greater than 2 Number of Phone callsNumber of Sales calls

22 Phone call and Sales call response surface for Vaccine sales,, constructed by TreeNet Sales call is the most effective when a location gets two Phone calls Number of Purchase has a strong diminishing returns effect when the number of Sales calls becomes greater than thirty five and the number of Phone calls becomes greater than 2 Number of Phone calls Number of Sales calls

23 Reference David J. Hand, Data Mining: Statistics and More? The American Statistician, May 1998, Vol. 52 No. 2 Friedman, J.H Data Mining and Statistics. What’s connection? Proceedings of the 29 th Symposium on the Interface: Computing Science and Statistics, May 1997, Houston, Texas Padhraic Smyth (2000), An Introduction to Data Mining, Elumetric.com Inc Doug Wielenga (2007), Identifying and Overcoming Common Data Mining Mistakes, SAS Global Forum Paper