Data Mining: Neural Network Applications by Louise Francis CAS Annual Meeting, Nov 11, 2002 Francis Analytics and Actuarial Data Mining, Inc.

Slides:



Advertisements
Similar presentations
Inference for Regression
Advertisements

1 Regression Models & Loss Reserve Variability Prakash Narayan Ph.D., ACAS 2001 Casualty Loss Reserve Seminar.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Model assessment and cross-validation - overview
BY EVAN FRISCIA AND PARTH THAKKAR Introduction to Technical Analysis.
An Introduction to Stochastic Reserve Analysis Gerald Kirschner, FCAS, MAAA Deloitte Consulting Casualty Loss Reserve Seminar September 2004.
Econometric Details -- the market model Assume that asset returns are jointly multivariate normal and independently and identically distributed through.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Simple Linear Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 13 Forecasting.
Lecture 20 Simple linear regression (18.6, 18.9)
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
1 Simple Linear Regression and Correlation Chapter 17.
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
Correlation and Regression Analysis
Classification and Prediction: Regression Analysis
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Introduction to Linear Regression and Correlation Analysis
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Correlation.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Chapter 6 & 7 Linear Regression & Correlation
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Introduction to Generalized Linear Models Prepared by Louise Francis Francis Analytics and Actuarial Data Mining, Inc. October 3, 2004.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
1 Combining GLM and data mining techniques Greg Taylor Taylor Fry Consulting Actuaries University of Melbourne University of New South Wales Casualty Actuarial.
Regression. Types of Linear Regression Model Ordinary Least Square Model (OLS) –Minimize the residuals about the regression linear –Most commonly used.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Chapter 16 Data Analysis: Testing for Associations.
4-1 Operations Management Forecasting Chapter 4 - Part 2.
GoldSim Technology Group LLC, 2006 Slide 1 Sensitivity and Uncertainty Analysis and Optimization in GoldSim.
Dimension Reduction in Workers Compensation CAS predictive Modeling Seminar Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc.
Ch 5-1 © 2004 Pearson Education, Inc. Pearson Prentice Hall, Pearson Education, Upper Saddle River, NJ Ostwald and McLaren / Cost Analysis and Estimating.
Predictive Modeling Spring 2005 CAMAR meeting Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Chapter1: Introduction Chapter2: Overview of Supervised Learning
Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Dealing with continuous variables and geographical information in non life insurance ratemaking Maxime Clijsters.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Data Mining: Neural Network Applications by Louise Francis CAS Convention, Nov 13, 2001 Francis Analytics and Actuarial Data Mining, Inc.
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Bump Hunting The objective PRIM algorithm Beam search References: Feelders, A.J. (2002). Rule induction by bump hunting. In J. Meij (Ed.), Dealing with.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
1 Simple Linear Regression Chapter Introduction In Chapters 17 to 19 we examine the relationship between interval variables via a mathematical.
KAIR 2013 Nov 7, 2013 A Data Driven Analytic Strategy for Increasing Yield and Retention at Western Kentucky University Matt Bogard Office of Institutional.
Data Mining CAS 2004 Ratemaking Seminar Philadelphia, Pa.
Statistics 200 Lecture #5 Tuesday, September 6, 2016
Chapter 3: Describing Relationships
Dimension Reduction in Workers Compensation
Linear Regression and Correlation Analysis
Elementary Statistics
Chapter 12 Curve Fitting : Fitting a Straight Line Gab-Byung Chae
Introduction to Predictive Modeling
Chapter 3 Statistical Concepts.
Simple Linear Regression
Presentation transcript:

Data Mining: Neural Network Applications by Louise Francis CAS Annual Meeting, Nov 11, 2002 Francis Analytics and Actuarial Data Mining, Inc.

Objectives of Presentation Introduce insurance professionals to neural networks Show that neural networks are a lot like some conventional statistics Indicate where use of neural networks might be helpful Show practical examples of using neural networks Show how to interpret neural network models

A Common Actuarial Application: Loss Development

An Example of a Nonlinear Function

Conventional Statistics: Regression One of the most common methods of fitting a function is linear regression Models a relationship between two variables by fitting a straight line through points Minimizes a squared deviation between an observed and fitted value

Neural Networks Also minimizes squared deviation between fitted and actual values Can be viewed as a non-parametric, non-linear regression

The Feedforward Neural Network

The Activation Function The sigmoid logistic function

The Logistic Function

Simple Example: One Hidden Node

Function if Network has One Hidden Node

Development Example: Incremental Payments Used for Fitting

Two Methods for Fitting Development Curve Neural Networks Simpler model using only development age for prediction More complex model using development age and accident year GLM model Example uses Poisson regression Like OLS regression, but does not require normality Fits some nonlinear relationships See England and Verrall, PCAS 2001

The Chain Ladder Model

Common Approach: The Deterministic Chain Ladder

GLM Model A Stochastic Chain Ladder Model

Hidden Nodes for Paid Chain Ladder Example

NN Chain Ladder Model with 3 Nodes

Universal Function Approximator The feedforward neural network with one hidden layer is a universal function approximator Theoretically, with a sufficient number of nodes in the hidden layer, any continuous nonlinear function can be approximated

Neural Network Curve with Dev Age and Accident Year

GLM Poisson Regression Curve

How Many Hidden Nodes for Neural Network? Too few nodes: Don’t fit the curve very well Too many nodes: Over parameterization May fit noise as well as pattern

How Do We Determine the Number of Hidden Nodes? Use methods that assess goodness of fit Hold out part of the sample Resampling Bootstrapping Jacknifing Algebraic formula Uses gradient and Hessian matrices

Hold Out Part of Sample Fit model on 1/2 to 2/3 of data Test fit of model on remaining data Need a large sample

Cross-Validation Hold out 1/n (say 1/10) of data Fit model to remaining data Test on portion of sample held out Do this n (say 10) times and average the results Used for moderate sample sizes Jacknifing similar to cross-validation

Bootstrapping Create many samples by drawing samples, with replacement, from the original data Fit the model to each of the samples Measure overall goodness of fit and create distribution of results Used for small and moderate sample sizes

Jackknife of 95% CI for 2 and 5 Nodes

Another Complexity of Data: Interactions

Technical Predictors of Stock Price A Complex Multivariate Example

Stock Prediction: Which Indicator is Best? Moving Averages Measures of Volatility Seasonal Indicators The January effect Oscillators

The Data S&P 500 Index since 1930 Open High Low Close

Moving Averages A very commonly used technical indicator 1 week MA of returns 2 week MA of returns 1 month MA of returns These are trend following indicators A more complicated time series smoother based on running medians called T4253H

Volatility Measures Finance literature suggests volatility of market changes over time More turbulent market -> higher volatility Measures Standard deviation of returns Range of returns Moving averages of above

Seasonal Effects

Oscillators May indicate that market is overbought or oversold May indicate that a trend is nearing completion Some oscillators Moving average differences Stochastic

Stochastic and Relative Strength Index Stochastic based on observation that as prices increase closing prices tend to be closer to upper end of range %K = (C – L5)/(H5 – L5) C is closing prince, L5 is 5 day low, H5 is 5 day high %D = 3 day moving average of %K RS =(Average x day’s up closes)/(Avg of x day’s down Closes) RSI = 100 – 100/(1+RS)

Measuring Variable Importance Look at weights to hidden layer Compute sensitivities: a measure of how much the predicted value’s error increases when the variables are excluded from the model one at a time

Neural Network Result Variable Importance 1. Smoothed return 2. %K (from stochastic) 3. Smoothed %K 4. 2 Week %D 5. 1 Week range of returns 6. Smoothed standard deviation 7. Month R 2 was.13 or 13% of variance explained

Understanding Relationships Between Predictor and Target Variables: A Tree Example

What are the Relationships between the Variables?

Visualization Method for Understanding Neural Network Functions Method was published by Plate et al. in Neural Computation, 2000 Based on Generalized Additive Models Detailed description by Francis in “Neural Networks Demystified”, 2001 Hastie, Tibshirini and Friedman present a similar method

Visualization Method is essentially a way to approximate the derivatives of the neural network model

Neural Network Result for Smoothed Return

Neural Network Result for Oscillator

Neural Network Result for Smoothed Return and Oscillator

Neural Network Result for Standard Deviation

Conclusions Neural Networks are a lot like conventional statistics They address some problems of conventional statistics: nonlinear relationships, correlated variables and interactions Despite black block aspect, we now can interpret them Find further information, including many references, at Neural Networks for Statistical Modeling, M. Smith Reference for Kohonen Networks: Visual Explorations in Finance, Guido Debok and Teuvo Kohone (Eds), Springer, 1998