Practical Model Selection and Multi-model Inference using R Presented by: Eric Stolen and Dan Hunt.

Slides:



Advertisements
Similar presentations
Correlation and regression
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Practical Model Selection and Multi-model Inference using R Modified from on a presentation by : Eric Stolen and Dan Hunt.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Class 17: Tuesday, Nov. 9 Another example of interpreting multiple regression coefficients Steps in multiple regression analysis and example analysis Omitted.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Linear Regression with One Regression
Chapter Topics Types of Regression Models
Introduction to statistical estimation methods Finse Alpine Research Center, September 2010.
Statistics 303 Chapter 10 Least Squares Regression Analysis.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Pertemua 19 Regresi Linier
Correlation and Regression Analysis
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Logistic Regression with “Grouped” Data Lobster Survival by Size in a Tethering Experiment Source: E.B. Wilkinson, J.H. Grabowski, G.D. Sherwood, P.O.
Logistic regression for binary response variables.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Practical Model Selection and Multi-model Inference using R Presented by: Eric Stolen and Dan Hunt.
Regression and Correlation Methods Judy Zhong Ph.D.
Inference for regression - Simple linear regression
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
BRIEF REVIEW OF STATISTICAL CONCEPTS AND METHODS.
Lecture 3: Inference in Simple Linear Regression BMTRY 701 Biostatistical Methods II.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
BRIEF REVIEW OF STATISTICAL CONCEPTS AND METHODS.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Logistic Regression. Linear Regression Purchases vs. Income.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Linear Models Alan Lee Sample presentation for STATS 760.
Chapter 10: Determining How Costs Behave 1 Horngren 13e.
Logistic regression (when you have a binary response variable)
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Model Selection Information-Theoretic Approach UF 2015 (25 minutes) Outline: Why use model selection Why use model selection AIC AIC AIC weights and model.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Quiz 3. Model selection Overview Objectives determine the “choice” of model Modeling for forecasting Likelihood ratio test Akaike Information Criterion.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Chapter 15 Multiple Regression Model Building
AP Statistics Chapter 14 Section 1.
Chapter 3: Describing Relationships
Ch3: Model Building through Regression
Wildlife Population Analysis What are those βs anyway?
Regression Analysis Week 4.
STAT120C: Final Review.
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Regression Part II.
Chapter 3: Describing Relationships
Presentation transcript:

Practical Model Selection and Multi-model Inference using R Presented by: Eric Stolen and Dan Hunt

Indigo snake home range radio telemetry study of eastern indigo snake home range sizes in central FL Response variable was ln(home range) (95% fixed-kernel estimate) sex, landcover type, and length of time (weeks) General linear model Interested in effect sizes and predictions

Indigo snake home range Science questions: –“Is there evidence for a difference in home range size between habitats?” –“Is there evidence for a difference in home range size between sexes?” –“Is there an effect of length of time tracked? –“Is the effect of habitats type different between the sexes (is there an interaction term)?” –“Does the data support estimating an effect of habitats type with 3 levels or 2 levels?”

Models – Indigo Snake example SEX land cover type 3 levels (lc1) land cover type 2 levels(lc2) weeks SEX + lc1 SEX + lc2 SEX + weeks lc1 + weeks lc2 + weeks SEX + lc1 + weeks SEX + lc2 + weeks SEX + lc2 + SEX * lc1 SEX + lc2 + SEX * lc2 SEX + lc2 + weeks + SEX * lc1 SEX + lc2 + weeks + SEX * lc2

Models – Indigo Snake example SEX land cover type 3 levels (lc1) land cover type 2 levels(lc2) weeks SEX + lc1 SEX + lc2 SEX + weeks lc1 + weeks lc2 + weeks SEX + lc1 + weeks SEX + lc2 + weeks SEX + lc1 + SEX * lc1 SEX + lc2 + SEX * lc2 SEX + lc1 + weeks + SEX * lc1 SEX + lc2 + weeks + SEX * lc2

Models – Indigo Snake example SEX land cover type 3 levels (lc1) land cover type 2 levels(lc2) weeks SEX + lc1 SEX + lc2 SEX + weeks lc1 + weeks lc2 + weeks SEX + lc1 + weeks SEX + lc2 + weeks SEX + lc1 + SEX * lc1 SEX + lc2 + SEX * lc2 SEX + lc1 + weeks + SEX * lc1 SEX + lc2 + weeks + SEX * lc2

Models – Indigo Snake example SEX land cover type 2 levels(lc2) weeks SEX + lc2 SEX + weeks lc2 + weeks SEX + lc2 + weeks SEX + lc2 + SEX * lc2 SEX + lc2 + weeks + SEX * lc2

Models – Indigo Snake example SEX land cover type 2 levels(lc2) weeks SEX + lc2 SEX + weeks lc2 + weeks SEX + lc2 + weeks SEX + lc2 + SEX * lc2 SEX + lc2 + weeks + SEX * lc2

Models – Indigo Snake example SEX land cover type 2 levels(lc2) weeks SEX + lc2 SEX + weeks lc2 + weeks SEX + lc2 + weeks SEX + lc2 + SEX * lc2 SEX + lc2 + weeks + SEX * lc2

I-T mechanics AICc i = -2*log e (Likelihood of model i given the data) + 2*K (n/(n-K-1)) or = AIC + 2*K*(K+1)/(n-K-1) (where K = the number of parameters estimated and n = the sample size)

I-T mechanics AICc min = AICc for the model with the lowest AICc value  i = AICc i – AICc min

Model Probability (also Bayesian posterior model probabilities) evidence ratio of model i to model j = w i / w j I-T mechanics

Least Squares Regression AIC = n log e (   ) + 2*K (n/(n-K-1)) Where    RSS / n (explain offset for constant part)

I-T mechanics Counting Parameters: K = number of parameters estimated Least Square Regression K = number of parameters + 2 (for intercept & 

I-T mechanics Counting Parameters: K = number of parameters estimated Logistic Regression K = number of parameters + 1 (for intercept 

I-T mechanics Counting Parameters: Non-identifiable parameters

Comparing Models

Evidence Ratio = 3.03

Comparing Models Evidence Ratio =4.28 ( ) / ( )

Mathematical details What types of models can be compared within a single I-T analysis? –Data must be fixed (including response) –Must be able to calculate maximum likelihood –(ways to deal with quasi-likelihood) –Models do not need to be nested –In some cases AIC is additive

Model Fitting Preliminaries Understanding the data/variables Avoid data dredging! safe data screening practices Detect outliers, scale issues, collinearity Tools in R

–Generalized linear models lm glm –Packages Design Package –FE Harrell Regression Modeling Strategies with Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer. CAR package –Fox, J An R and S-plus Companion to Applied Regression. Sage Publications.

Tools in R –Model formula Ex) –Output summary(model4) model4$aic Model4$coefficients model4 <- glm(help~age2 + sex + mom_dad + suburb + brdeapp + matepp + density + I(density^2), family=binomial,data=choices)

Tools in R Fitting the model set – –R program does the work Trouble-shooting Export results

Model Checking –Global model must fit –Models used for inference must meet assumptions, –Look for numerical problems Tools in R

Interpretation of models for inference Case 1: One or a few models best models Examining model parameters and predictions –Effects –Prediction graphing results –nomograms –Presenting Results Anderson, D. R., W. A. Link, D. H. Johnson, and K. P. Burnham Suggestions for presenting the results of data analysis. Journal of Wildlife Management 65:

Model selection uncertainty Model-average prediction Model-average parameter estimates Multi-model Inference

Model Averaging Predictions

Model-averaged prediction Model Averaging Predictions

Prediction from model i Model Averaging Predictions

Weight model i Model Averaging Predictions

Model-averaged parameter estimate Model Averaging Parameters

Unconditional Variance Estimator

Snake Example