Predicting Mortgage Pre-payment Risk. Introduction Definition Borrower pays off the loan before the contracted term loan length. Lender loses future part.

Slides:



Advertisements
Similar presentations
An Improved Categorization of Classifiers Sensitivity on Sample Selection Bias Wei Fan Ian Davidson Bianca Zadrozny Philip S. Yu.
Advertisements

Two-sample tests. Binary or categorical outcomes (proportions) Outcome Variable Are the observations correlated?Alternative to the chi- square test if.
Huffman code and ID3 Prof. Sin-Min Lee Department of Computer Science.
Financing PKS Consultancy. Overdraft Loan – (Finance for Lending Scheme) Invoice Finance Factoring Invoice Discounting Payroll Finance Single Invoice.
Bab /44 Bab 4 Classification: Basic Concepts, Decision Trees & Model Evaluation Part 1 Classification With Decision tree.
Classification: Definition Given a collection of records (training set ) –Each record contains a set of attributes, one of the attributes is the class.
Decision Tree.
Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Introduction to Data Mining by Tan, Steinbach,
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
Classification Techniques: Decision Tree Learning
Introduction to Data Mining with XLMiner
Lecture Notes for Chapter 4 Introduction to Data Mining
Classification: Decision Trees, and Naïve Bayes etc. March 17, 2010 Adapted from Chapters 4 and 5 of the book Introduction to Data Mining by Tan, Steinbach,
1. Abstract 2 Introduction Related Work Conclusion References.
Chapter 8 Logistic Regression 1. Introduction Logistic regression extends the ideas of linear regression to the situation where the dependent variable,
Decision Tree Rong Jin. Determine Milage Per Gallon.
Tree-based methods, neutral networks
Lecture 5 (Classification with Decision Trees)
Discriminative Naïve Bayesian Classifiers Kaizhu Huang Supervisors: Prof. Irwin King, Prof. Michael R. Lyu Markers: Prof. Lai Wan Chan, Prof. Kin Hong.
Introduction to Home Mortgages. Outline Process to purchase a home Process to purchase a home –Pre-approval Programs Programs What do Lenders look for?
Data Mining: A Closer Look
Chapter 5 Data mining : A Closer Look.
Part 10: Qualitative Data 10-1/21 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Decision Tree Models in Data Mining
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Application of SAS®! Enterprise Miner™ in Credit Risk Analytics
Spreadsheet Modeling & Decision Analysis
Bayesian Networks 4 th, December 2009 Presented by Kwak, Nam-ju The slides are based on, 2nd ed., written by Ian H. Witten & Eibe Frank. Images and Materials.
DATA MINING : CLASSIFICATION. Classification : Definition  Classification is a supervised learning.  Uses training sets which has correct answers (class.
by B. Zadrozny and C. Elkan
1 Data Mining Lecture 3: Decision Trees. 2 Classification: Definition l Given a collection of records (training set ) –Each record contains a set of attributes,
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Decision Trees Jyh-Shing Roger Jang ( 張智星 ) CSIE Dept, National Taiwan University.
Neural Networks Automatic Model Building (Machine Learning) Artificial Intelligence.
Jia-Jyun Dong, Yu-Hsiang Tung, Chien-Chih Chen, Jyh-Jong Liao, Yii-Wen Pan 學 生 :陳奕愷 Logistic regression model for predicting the failure probability of.
Business Intelligence and Decision Modeling Week 9 Customer Profiling Decision Trees (Part 2) CHAID CRT.
Classification Techniques: Bayesian Classification
Jennifer Lewis Priestley Presentation of “Assessment of Evaluation Methods for Prediction and Classification of Consumer Risk in the Credit Industry” co-authored.
Fast Kernel-Density-Based Classification and Clustering Using P-Trees Anne Denton Major Advisor: William Perrizo.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Logistic Regression. Linear Regression Purchases vs. Income.
USE RECIPE INGREDIENTS TO PREDICT THE CATEGORY OF CUISINE Group 7 – MEI, Yan & HUANG, Chenyu.
Text Categorization With Support Vector Machines: Learning With Many Relevant Features By Thornsten Joachims Presented By Meghneel Gore.
Regress-itation Feb. 5, Outline Linear regression – Regression: predicting a continuous value Logistic regression – Classification: predicting a.
Lecture Notes for Chapter 4 Introduction to Data Mining
Logistic Regression Saed Sayad 1www.ismartsoft.com.
Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate.
Competition II: Springleaf Sha Li (Team leader) Xiaoyan Chong, Minglu Ma, Yue Wang CAMCOS Fall 2015 San Jose State University.
1 Classification: predicts categorical class labels (discrete or nominal) classifies data (constructs a model) based on the training set and the values.
BAYESIAN LEARNING. 2 Bayesian Classifiers Bayesian classifiers are statistical classifiers, and are based on Bayes theorem They can calculate the probability.
BY International School of Engineering {We Are Applied Engineering} Disclaimer: Some of the Images and content have been taken from multiple online sources.
Classification Tree Interaction Detection. Use of decision trees Segmentation Stratification Prediction Data reduction and variable screening Interaction.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Usman Roshan Dept. of Computer Science NJIT
EDUCAUSE Annual Conference
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
DECISION TREES An internal node represents a test on an attribute.
Privacy-Preserving Data Mining
Fast Kernel-Density-Based Classification and Clustering Using P-Trees
Regression Techniques
Classification Techniques: Bayesian Classification
Dr. Morgan C. Wang Department of Statistics
Data Mining for Business Analytics
Machine Learning Interpretability
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Somi Jacob and Christian Bach
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Analysis for Predicting the Selling Price of Apartments Pratik Nikte
Predicting Loan Defaults
Presentation transcript:

Predicting Mortgage Pre-payment Risk

Introduction Definition Borrower pays off the loan before the contracted term loan length. Lender loses future part of the income stream associated with the loan. Inspiration & Previous work Oded Netzer’s talk on text-mining techniques for business applications – loan default prediction.

Breakdown of the Problem Target To pin-point among borrower profiles, who all likely to prepay the mortgage. Data UCI Machine Learning Repository : Credit Approval Dataset – 690 observations; 15 attributes. German Credit Dataset – 1000 observations, 20 attributes. Give Me Some Credit - Kaggle credit-scoring competition - very large.

Prospective Solutions Techniques for categorization of data Decision Tree - recursively separates observations in branches to construct a tree to improve prediction accuracy, with use of measurements like information gain ratio, Gini index etc. Naïve Bayes Classifier - calculates a set of probabilities by counting the frequency and combinations of values in the dataset. Logistic Regression - measures the relationship between the one dependent binary variable and one or more independent variables. by estimating probabilities using a logistic function. Appropriate when the dependent variable is binary.

Evaluation of Results Measure of success How clear the proposed categorization scheme is proposed. How applicable the scheme is in enabling lenders in managing prepayment risk by providing a useful structure for early mitigation targeting.