Credit Risk Models Cross-Validation – Is There Any Added Value? Croatian Quants Day, Zagreb, June 6, 2014. Vili Krainz


The views expressed during this presentation are solely those of the author.

Introduction
- Credit risk – the risk that one party to a financial contract will fail to perform its obligation, partially or entirely (default)
- Example – bank loans
- The need to assess the level of credit risk leads to credit risk rating models (credit scorecards)
- Problem – to determine the functional relationship between obligor or loan characteristics X_1, X_2, ..., X_n (risk drivers) and the binary event of default (0/1), in the form of a latent variable, the probability of default (PD)

Scorecard Development Process
Potential risk drivers – retail application example
- Sociodemographic characteristics: age, marital status, residential status...
- Economic characteristics: level of education, profession, years of work experience...
- Financial characteristics: monthly income, monthly income averages...
- Stability characteristics: time at current address, time in current job...
- Loan characteristics: installment amount, approved limit amount, loan maturity...

Scorecard Development Process
Univariate analysis – analysis of each individual characteristic
- Fine classing – division of numeric variables into a number (e.g. 20) of subgroups, analysis of the general trend
- Coarse classing – grouping into larger classes (2-5) to optimize predictiveness, subject to certain conditions (logical, monotonic trend, robust enough...)

Age         Bad rate
< 30        3.47%
[30, 55]    2.86%
> 55        1.73%
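The bad rate per coarse class, as in the age table above, can be computed as in the following minimal sketch. The function names and the toy data are assumptions made for illustration; only the three age classes come from the slide.

```python
from collections import defaultdict


def age_class(age):
    """Map an age to one of the three coarse classes from the slide."""
    if age < 30:
        return "<30"
    if age <= 55:
        return "[30, 55]"
    return ">55"


def bad_rate_by_class(values, defaults, classer):
    """Return {class label: share of records with default == 1}."""
    totals = defaultdict(int)
    bads = defaultdict(int)
    for value, default in zip(values, defaults):
        label = classer(value)
        totals[label] += 1
        bads[label] += default
    return {label: bads[label] / totals[label] for label in totals}


# Toy data, invented for illustration only (not the presentation's data).
ages = [22, 25, 28, 29, 35, 40, 50, 55, 60, 70]
defaults = [1, 0, 1, 0, 0, 1, 0, 0, 0, 0]
rates = bad_rate_by_class(ages, defaults, age_class)
```

On this toy data the bad rate falls from class to class, which is the kind of monotonic trend the coarse classing conditions ask for.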

Scorecard Development Process
Multivariate analysis
- Correlation between characteristics
- Logit model – most widely used
- Logistic regression (with selection process)
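For reference, the standard form of the logit model can be written as follows. The coefficient symbols \beta_0, ..., \beta_n are notation assumed here; the slide transcript does not preserve the formula itself.

```latex
\mathrm{PD} = P(Y = 1 \mid X_1, \dots, X_n)
            = \frac{1}{1 + e^{-(\beta_0 + \beta_1 X_1 + \dots + \beta_n X_n)}}
\qquad\Longleftrightarrow\qquad
\ln\frac{\mathrm{PD}}{1 - \mathrm{PD}} = \beta_0 + \beta_1 X_1 + \dots + \beta_n X_n
```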

Scorecard Model Predictiveness
- The goal of a scorecard model is to discriminate between good and bad applications
- Predictive power is most commonly measured by the Gini index (a.k.a. Accuracy Ratio, Somers' D)
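For a binary target, the Gini index equals 2 * AUC - 1, where AUC is the share of (bad, good) pairs in which the bad record receives the higher risk score. A minimal pure-Python sketch follows; the function name is an assumption, and scores are assumed to be oriented so that a higher score means higher default risk.

```python
def gini_index(scores, defaults):
    """Gini = 2 * AUC - 1; assumes a higher score means higher default risk."""
    bad_scores = [s for s, d in zip(scores, defaults) if d == 1]
    good_scores = [s for s, d in zip(scores, defaults) if d == 0]
    if not bad_scores or not good_scores:
        raise ValueError("need at least one good and one bad record")
    concordant = 0.0
    for b in bad_scores:
        for g in good_scores:
            if b > g:
                concordant += 1.0   # bad ranked as riskier than good
            elif b == g:
                concordant += 0.5   # ties count as half
    auc = concordant / (len(bad_scores) * len(good_scores))
    return 2.0 * auc - 1.0
```

The pairwise loop is quadratic in sample size; it is written for readability rather than speed.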

Scorecard Model Cross-Validation
- At the start of model development, the whole data sample is split randomly (70/30, 75/25, 80/20...)
- The bigger sample is used for model development, while the smaller sample is used for cross-validation
- The model's predictive power (Gini index) is measured on the independent validation sample
- This is done to avoid overfitting
- The predictive power shouldn't be much lower on the validation sample than on the development sample; if it isn't, the validation is considered successful
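The split and the pass/fail check can be sketched as below. The function names, the seed and the tolerance of 0.1 are assumptions for illustration; the slides do not define how much lower is "much lower".

```python
import random


def split_sample(n_records, dev_share=0.75, seed=0):
    """Randomly split record indices into development and validation sets."""
    indices = list(range(n_records))
    random.Random(seed).shuffle(indices)
    cut = round(n_records * dev_share)
    return indices[:cut], indices[cut:]


def validation_passes(gini_dev, gini_val, tolerance=0.1):
    """True when the validation Gini is at most `tolerance` below the development Gini.

    The tolerance value is an assumption, not a rule stated in the slides.
    """
    return gini_dev - gini_val <= tolerance


dev_idx, val_idx = split_sample(1000, dev_share=0.75, seed=42)
```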

What If Validation Fails?
- Is it possible if everything is done "by the book"?
- Does that mean that:
  - Something was done wrong in the model development process?
  - The sample is not suitable for modeling at all?
  - The process needs to be repeated?

Monte Carlo Simulations
- Real (masked) publicly available retail application data (Thomas, L., Edelman, D. and Crook, J., Credit Scoring and Its Applications. Philadelphia: SIAM.)
- 1000 simulations of the model development process in R
- Pre-selection of characteristics based on business logic and correlation
- In each simulation:
  - Stratified random sampling (75/25) on several characteristics, including the target variable (default indicator)
  - Fine classing of the numeric variables
  - Coarse classing of all the variables using code that simulates the modeler's decisions
  - Stepwise logistic regression using AIC
  - Measurement of the Gini index on the development and validation samples
- One reference model was built on the whole data sample
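The structure of such a simulation loop can be sketched in Python. The original work was done in R on real data with fine and coarse classing and stepwise logistic regression. Everything in this sketch is a simplified stand-in: the population is synthetic, the split is stratified on the target only, and the "model" is just the bad rate of each class on the development sample. All names and parameter values are assumptions.

```python
import random


def gini_index(scores, defaults):
    """Gini = 2 * AUC - 1; a higher score means higher default risk."""
    bad = [s for s, d in zip(scores, defaults) if d == 1]
    good = [s for s, d in zip(scores, defaults) if d == 0]
    wins = sum((b > g) + 0.5 * (b == g) for b in bad for g in good)
    return 2.0 * wins / (len(bad) * len(good)) - 1.0


def stratified_split(defaults, dev_share, rng):
    """Split record indices separately within goods and bads (target only)."""
    dev, val = [], []
    for flag in (0, 1):
        idx = [i for i, d in enumerate(defaults) if d == flag]
        rng.shuffle(idx)
        cut = round(len(idx) * dev_share)
        dev += idx[:cut]
        val += idx[cut:]
    return dev, val


def fit_class_bad_rates(classes, defaults, idx):
    """Toy stand-in for model development: score of a class = its bad rate."""
    totals, bads = {}, {}
    for i in idx:
        totals[classes[i]] = totals.get(classes[i], 0) + 1
        bads[classes[i]] = bads.get(classes[i], 0) + defaults[i]
    return {c: bads[c] / totals[c] for c in totals}


def simulate(classes, defaults, n_sims=50, dev_share=0.75, seed=1):
    """Return the development-minus-validation Gini difference per simulation."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_sims):
        dev, val = stratified_split(defaults, dev_share, rng)
        model = fit_class_bad_rates(classes, defaults, dev)

        def gini_on(idx):
            scores = [model.get(classes[i], 0.0) for i in idx]
            return gini_index(scores, [defaults[i] for i in idx])

        diffs.append(gini_on(dev) - gini_on(val))
    return diffs


# Synthetic population, invented for illustration (not the presentation's data).
data_rng = random.Random(0)
true_pd = {"A": 0.05, "B": 0.10, "C": 0.20}
classes = [data_rng.choice("ABC") for _ in range(600)]
defaults = [1 if data_rng.random() < true_pd[c] else 0 for c in classes]

diffs = simulate(classes, defaults)
share_large = sum(abs(d) > 0.1 for d in diffs) / len(diffs)
```

Here `share_large` is the fraction of runs in which the two Gini values differ by more than 0.1. Its value on this synthetic population says nothing about the figures reported in the presentation.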

Results
- In 12.5% of cases the difference in Gini index between the development and validation samples is bigger than 0.1
- Pearson's chi-square test – all characteristics in all 1000 samples are representative at the 5% significance level
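A representativeness check of this kind can be sketched as a Pearson chi-square goodness-of-fit test comparing the class counts in a sample with the class shares in the whole data set. The exact test set-up used in the presentation is not given, so the framing, the names and the numbers below are assumptions. The critical values are the standard 5% quantiles of the chi-square distribution.

```python
def chi_square_statistic(sample_counts, population_shares):
    """Pearson statistic: sum of (observed - expected)^2 / expected over classes."""
    n = sum(sample_counts.values())
    stat = 0.0
    for category, share in population_shares.items():
        expected = n * share
        observed = sample_counts.get(category, 0)
        stat += (observed - expected) ** 2 / expected
    return stat


# 5% critical values of the chi-square distribution by degrees of freedom.
CRITICAL_5PCT = {1: 3.841, 2: 5.991, 3: 7.815}

# Invented numbers for illustration only.
population_shares = {"<30": 0.30, "[30, 55]": 0.50, ">55": 0.20}
sample_counts = {"<30": 80, "[30, 55]": 120, ">55": 50}

stat = chi_square_statistic(sample_counts, population_shares)
df = len(population_shares) - 1
representative = stat <= CRITICAL_5PCT[df]  # fail to reject at the 5% level
```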

Results
- Idea: compare the scores from each simulation model with those of the reference model (built on the whole sample) and relate the result to the differences in Gini
- If there is a strong connection – we strive to get a model similar to the reference model
- Wilcoxon paired (signed-rank) test
  - H_0: the median difference between the pairs is zero
  - H_1: the median difference is not zero
- In other words, the alternative hypothesis states that one model produces systematically different (higher or lower) scores than the other
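The signed-rank statistic for two sets of paired scores can be sketched as follows. This is a minimal pure-Python version using the large-sample normal approximation without tie or continuity correction; the function name and return layout are assumptions, and a statistics package would normally be used instead.

```python
import math


def wilcoxon_signed_rank(x, y):
    """Return (w_plus, w_minus, z, p_two_sided) for paired samples x and y.

    Zero differences are dropped; tied absolute differences get average ranks.
    z and p use the normal approximation (no tie or continuity correction).
    """
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    pos = 0
    while pos < n:
        end = pos
        # Extend the block while the next absolute difference is tied.
        while end + 1 < n and abs(diffs[order[end + 1]]) == abs(diffs[order[pos]]):
            end += 1
        avg_rank = (pos + end) / 2.0 + 1.0
        for k in range(pos, end + 1):
            ranks[order[k]] = avg_rank
        pos = end + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    mean = n * (n + 1) / 4.0
    sd = math.sqrt(n * (n + 1) * (2 * n + 1) / 24.0)
    z = (w_plus - mean) / sd
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    return w_plus, w_minus, z, p_two_sided
```

A small p-value would indicate that one model's scores are systematically shifted relative to the other's, which is what H_1 above describes.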

Results
- Correlation: 0.68


From the Simulations...
- Even when the modeling job is done right, validation can fail by chance
- We like the Gini index on the development sample to be "similar" to the one on the validation sample; when it is, we tend to get a model that is more similar to the reference model. So why not develop on the whole sample in the first place?
- Regardless of the validation results and the difference in Gini, predictive power on the whole data sample does not vary much

Instead of a Conclusion...
- Does this method of cross-validation bring any added value?
- It may be more important to check that all the modeling steps have been performed carefully and properly, and that best practices have been used, in order to avoid overfitting
- Can any cross-validation method offer real assurance, or does the only real test come with future data?

Thank You!