Mann-Whitney U = Wilcoxon rank sum is the non-parametric test equivalent to t-test ssd<-read.table("dimorphism.txt",header=T) attach(ssd) names(ssd) [1]

Slides:



Advertisements
Similar presentations
Analysis of variance and statistical inference.
Advertisements

Brief introduction on Logistic Regression
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Chapter 13 Multiple Regression
From last time….. Basic Biostats Topics Summary Statistics –mean, median, mode –standard deviation, standard error Confidence Intervals Hypothesis Tests.
Lecture 23: Tues., Dec. 2 Today: Thursday:
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Multiple Regression
Chapter 4 Multiple Regression.
Final Review Session.
Nemours Biomedical Research Statistics April 2, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Linear Regression and Correlation Analysis
Biol 500: basic statistics
Lecture 11 Multivariate Regression A Case Study. Other topics: Multicollinearity  Assuming that all the regression assumptions hold how good are our.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Topic 3: Regression.
Handling Categorical Data. Learning Outcomes At the end of this session and with additional reading you will be able to: – Understand when and how to.
Multiple Linear Regression
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Today Concepts underlying inferential statistics
Intro to Parametric Statistics, Assumptions & Degrees of Freedom Some terms we will need Normal Distributions Degrees of freedom Z-values of individual.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Simple Linear Regression
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
Simple Linear Regression Models
Analysis of Covariance Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Lecture 12 Model Building BMTRY 701 Biostatistical Methods II.
Choosing and using statistics to test ecological hypotheses
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Discussion of time series and panel models
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Simple Linear Regression (OLS). Types of Correlation Positive correlationNegative correlationNo correlation.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Nonparametric Statistics
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Remember the equation of a line: Basic Linear Regression As scientists, we find it an irresistible temptation to put a straight line though something that.
Matrix form of Linear Regression The F distribution ANOVA approach to Linear Regression ANOVA approach to t-test (One way ANOVA with two levels)
Stats Methods at IC Lecture 3: Regression.
Nonparametric Statistics
Chapter 15 Multiple Regression Model Building
Nonparametric Statistics
Presentation transcript:

Mann-Whitney U = Wilcoxon rank sum is the non-parametric test equivalent to t-test ssd<-read.table("dimorphism.txt",header=T) attach(ssd) names(ssd) [1] "binomial" "sex" "svl" Is there a difference in body size (SVL) between males and females of true lizards (Lacertidae)? male<-svl[sex=="male"] female<-svl[sex=="female"] wilcox.test(male,female,paired=FALSE) Wilcoxon rank sum test with continuity correction data: male and female W = 15396, p-value = alternative hypothesis: true location shift is not equal to 0 Non-parametric tests in R

ssd<-read.table("dimorphism.txt",header=T) attach(ssd) names(ssd) [1] "binomial" "sex" "svl" male<-svl[sex=="male"] female<-svl[sex=="female"] wilcox.test(male,female,paired=TRUE) Wilcoxon rank sum test with continuity correction data: male and female V = , p-value = 1.773e-06 alternative hypothesis: true location shift is not equal to 0 Wilcoxon two-sample (=Wilcoxon signed-rank) test non-parametric equivalent for paired t-test Is there a difference in body size (SVL) between males and females of true lizards (Lacertidae) when you compare between sexes of the same species? Non-parametric tests in R

names(ssd) [1] "binomial" "sex" "svl" wilcox.test(male,female,paired=TRUE) Wilcoxon rank sum test with continuity correction data: male and female V = , p-value = 1.773e-06 alternative hypothesis: true location shift is not equal to 0 Is there a difference in body size (SVL) between males and females of true lizards (Lacertidae) when you compare between sexes of the same species? Wilcoxon two-sample (=Wilcoxon signed-rank) test non-parametric equivalent for paired t-test Here we have a problem: we want the test to be according to the species name but we didn’t declared it any where Non-parametric tests in R

We will use recast: this will transform the data to a matrix (something you will usually try to avoid in other statistical programs library(reshape2) sex<-recast(ssd,binomial~sex,measure.var = "svl") names(sex) [1] "binomial" "female" "male" Notice that there is a change in names compared to the pervious slide Wilcoxon two-sample (=Wilcoxon signed-rank) test non-parametric equivalent for paired t-test Is there a difference in body size (SVL) between males and females of true lizards (Lacertidae) when you compare between sexes of the same species? Non-parametric tests in R

sex<-recast(ssd,binomial~sex,measure.var = "svl") names(sex) sex wilcox.test(sex$female,sex$male,paired=TRUE) [1] "binomial" "female" "male" Wilcoxon signed rank test with continuity correction data: sex$female and sex$male V = , p-value = 1.773e-06 Non-parametric tests in R Wilcoxon two-sample (=Wilcoxon signed-rank) test non-parametric equivalent for paired t-test We will use recast: this will transform the data to a matrix (something you will usually try to avoid in other statistical programs Is there a difference in body size (SVL) between males and females of true lizards (Lacertidae) when you compare between sexes of the same species?

sex<-recast(ssd,binomial~sex,measure.var = "svl") names(sex) sex wilcox.test(sex$female,sex$male,paired=TRUE) p.s. this is also a way to do paired t-test t.test(sex$female,sex$male,paired = T) [1] "binomial" "female" "male" Non-parametric tests in R Wilcoxon two-sample (=Wilcoxon signed-rank) test non-parametric equivalent for paired t-test We will use recast: this will transform the data to a matrix (something you will usually try to avoid in other statistical programs Is there a difference in body size (SVL) between males and females of true lizards (Lacertidae) when you compare between sexes of the same species?

Kruskal-Wallis the equivalent to one-way ANOVA Kruskal-Wallis rank sum test data: clutch by type Kruskal-Wallis chi-squared = , df = 2, p-value = The code for this test is very similar to ANOVA test but instead of aov you write kruskal.test kruskal.test(clutch~type) island<-read.csv("island_type_final2.csv",header=T) Attach(island) Non-parametric tests in R

Spearman test and Kendall’s-tau test are equivalent to correlation test cor.test(clutch, mass, method="spearman") #or cor.test(clutch, mass, method="kendall") The code here is a variation of the common correlation test, by adding the definition of a non-parametric test in the ‘method’ argument Instead of writing: cor.test(clutch,mass)we will write: Spearman's rank correlation rho data: clutch and mass S = , p-value < 2.2e-16; rho Kendall's rank correlation tau data: clutch and mass z = 9.747, p-value < 2.2e- 16; tau We will get respectively: Non-parametric tests in R

Generalized linear models (GLM) We will use GLM when our response variable is not continuous (counts, proportions, binary etc.) – or when the parametric test assumptions (normal distribution, equality of variance) are not met GLM is structured from three parts 1. linear predictor; 2. link function; 3. error distribution The first is the parameter value, the second refers to transformation (for example “identity” when there is no transformation and “log” for logarithmic transformation) and the third refers to the distribution of the residuals – for example gama, binomial and normal distribution A unique case where link=identity and error=normal the GLM is a linear model

Generalized linear models (GLM) Structured from three parts: 1. linear predictor; 2. link function; 3. error distribution modelX<-glm(clutch~log10(age)+asin(lat),family=Gamma) Log linkarcsin linkGamma errors The first is the parameter value, the second refers to transformation (for example “identity” when there is no transformation and “log” for logarithmic transformation) and the third refers to the distribution of the residuals – for example gama, binomial and normal distributions A unique case where link=identity and error=normal the GLM is a linear model

Non-linear models Sometimes it is clear that the relationship between the predictor and the response is not linear We can test models that know how to deal with this type of data structure. For example: add quadratic equation to the model Response = a(predictor) 2 +b(predictor)+c and we can test if the quadratic model is better than the linear using AIC or anova() More explanation in the model selection part of the presentation model2<-lm(y~x+I(x^2))

Non-linear models We can test breaking point models that have different linear equations for different values of the predictor Y = A 1.x + K 1 for x < breakpoint Y = A 2.x + K 2 for x > breakpoint Losos & Schluter Analysis of an evolutionary species-area relationship. Nature 408: Sometimes it is clear that the relationship between the predictor and the response is not linear

Multiple predictors Life is complicated, what can we do. Sometimes what is interest us is affected by more than one variable Smith, R. J Statistics of sexual size dimorphism. Journal of Human Evolution 36: The heartbeat of lizards, for example, is affected by their body size and environmental temperature, and also from the time and speed their were moving lately

Smith, R. J Statistics of sexual size dimorphism. Journal of Human Evolution 36: We can explain the variable that interests us (heartbeat) if we have data on the predicting variables The assumption is that when we put all three of them to the equation we see the effect of each one when the other two help constant Multiple predictors Life is complicated, what can we do. Sometimes what is interest us is affected by more than one variable The heartbeat of lizards, for example, is affected by their body size and environmental temperature, and also from the time and speed their were moving lately

Smith, R. J Statistics of sexual size dimorphism. Journal of Human Evolution 36: The assumption is correct when we don’t have high correlation between the predictors Multiple predictors We can explain the variable that interests us (heartbeat) if we have data on the predicting variables The assumption is that when we put all three of them to the equation we see the effect of each one when the other two help constant

Which test should we choose? If we have a few predictor variable (lets say 4) and they are all categorical the test for them will be ANOVA (lets say 4-way ANOVA) If we have a few predictor variables (lets say seven) and they are all continuous the test for them will be Multiple Regression

How do we write a test with a few explanatory variables? We use the ‘+’ between the predictors. model<-lm(Grade~days_studied+professor_age+prayer_number+reconstruction, data=marks) summary(model) lm(y~a+b+c) For example a test that tries to predict the grades of a course based on how much we studied, the age of the lecturer, how much we prayed and whether there are test from previous years

If we have a few predictor variables (at least 2) – part of them (at least one) are categorical and part of them (at least one) are continuous the test for them will be ANCOVA (analysis of co-variance) Which test should we choose?

How will it look graphically? ANOVA Regression ANCOVA For example I measured the length of three teeth of the common fox, males and females, through their distribution range pFMSDFSS Intercept sex tooth Error Two significant variables: a difference between the teeth and the sexes Upper carnassial Bottom carnassial Upper canine ניב שן שסע תחתונ ה שן שסע עליונה Vulpes vulpes

ANOVA Regression ANCOVA For example: the length of the teeth in the common fox as a function of latitude Pt Std. ErrorEstimate > Intercept > Latitude We can see evidence for Bergman’s rule But it is easy to notice that it’s a horrible model: the canines are smaller than both of the carnassial R-squared: 0.015, F = 32.83, 1 & 2161 DF; p < *Regression line is a model for the relationship between the predictor and the response How will it look graphically?

Graphical ANCOVA It is easy to understand it graphically: in the example there is a single variable on the X and a response variable with two levels – continuous and categorical (dashed and full lines) Null hypothesis a. response Continuous predictor response b. Continuous predictor Categorical significant, Cont. not significant d. response Continuous predictor Both significant Continuous predictor c. response And thanks to Daniel for the plots Categorical not Significant, Cont. significant

ANOVA Regression ANCOVA Example: Teeth length in common fox, as a function of latitude (X axis) and sex (color) and which tooth it is (shapes) pFMSSSDffactor > sex > tooth > Latitude Residuals All the variables are significant This models explains 96.3% of the variation We have here 6 regression lines How will it look graphically?

Reading ANCOVA results in R Response = intercept + a for level 1 of the 1 st categorical predictor variable or + b for level 2 of the 1 st + c for level 1 of the 2 nd categorical predictor or d for level 2… +k*(value of the continuous predictor variable) + error For example, if we go back to foxes pt Std. ErrorEstimatefactor > Intercept (tooth c) > tooth_m > tooth_p > sex_male > Latitude According to the model tooth P length of a male in Tel Aviv = *0.043 Example without interactions Tooth / sex Latitude

When the predictor is categorical, R compares all the factors to the intercept of the first factor in the alphabet Here the predictor is island type and “Continental” is the first in the alphabet So mean clutch size* on continental islands is 0.33 and on land bridge islands is = The difference between continental and Land bridge is significant t=2.309, p=0.021 Estimate Std. Errort valuePr(>|t|) (Intercept) <2e-16*** typeLand_bridge * typeOceanic * After logarithmic transformation, no one lays third of an egg island<-read.csv("island_type_final2.csv",header=T) levels(type) [1] "Continental" "Land_bridge" "Oceanic" model<-lm(clutch~type, data=island) summary(model) How to read lm results in R

When the predictor is continuous, R reports its slope with its SE and, t and p values for it Here the predictors that explain clutch size are island area and latitude So clutch size increases in (the units are log 10 eggs) with the increase in each latitudinal degree and decreases in units with the increase in island area (but the decrease is not significant t=0.098, p=0.92) The effect of latitude is significant (t = 3.259, p= ) island<-read.csv("island_type_final2.csv",header=T) model3<-lm(clutch~area+lat,data=island) summary(model3) Estimate Std. Errort valuePr(>|t|) (Intercept) E-05*** area lat **

How to read lm results in R In log 10 mean clutch size of a lizard on New Caledonia (latitude 21, log island area 4.27 sq km, we will ignore it for a second that area was not significant) will be: Intercept+slope*area+slope*latitude (slope) *4.27 (area) (slope) *21 latitude = Estimate Std. Errort valuePr(>|t|) (Intercept) E-05*** area lat ** Or 2.38 eggs (10 in the power of 0.377) When the predictor is continuous, R reports its slope with its SE and, t and p values for it

How to read lm results in R In ANCOVA we have both categorical and continuous predictors, R reports intercept for the first and slopes for the second, with their SE, t and p values accordingly model4<-lm(brood~mass+lat+type,data=island) summary(model4) EstimateStd. Errort valuePr(>|t|) (Intercept) E-13*** mass E-09*** lat E-06*** typeLand_bridge * typeOceanic Here we predict the number of broods in a year with mass, latitude and 3 categories of island types (continental, Land bridge and Oceanic)

How to read lm results in R ANCOVA: categorical and continuous predictors Residual standard error: on 72 degrees of freedom (242 observations deleted due to missingness) Multiple R-squared: 0.478, Adjusted R-squared: F-statistic: on 4 and 72 DF, p-value: 1.25e-09 We can see here the R 2 values, df, F value etc. of the model Notice that R ignored empty (=NA) cells model4<-lm(brood~mass+lat+type,data=island) summary(model4) EstimateStd. Errort valuePr(>|t|) (Intercept) E-13*** mass E-09*** lat E-06*** typeLand_bridge * typeOceanic

How to read lm results in R model4<-lm(brood~mass+lat+type,data=island) summary(model4) Because alphabetically continental<Land_bridge<Oceanic our intercept is for the first category: species on continental/ So the number of yearly broods of species on continental islands is significantly larger than of species on Land bridge islands and not significantly larger than of species on oceanic islands (notice: the difference is negative) In addition number of yearly broods decreases with the mass and with the increase in latitude (negative slope: higher brood frequency to small lizards on tropical islands EstimateStd. Errort valuePr(>|t|) (Intercept) E-13*** mass E-09*** lat E-06*** typeLand_bridge * typeOceanic

In the categorical variables we have a problem: R calculates only the difference between each category and the first category in the alphabetic order. Here a comparison between land bridge islands and oceanic islands to continental islands. But R doesn’t report the difference between land bridge islands and oceanic islands. Moreover, it doesn’t give use SE and the difference from zero for both of these categories, just the difference from continental islands and the SE for this test (not the SE of the category it self) model4<-lm(brood~mass+lat+type,data=island) summary(model4) EstimateStd. Errort valuePr(>|t|) (Intercept) E-13*** mass E-09*** lat E-06*** typeLand_bridge * typeOceanic relevel

We can outsmart R, we can tell it what will be the first category that he will be comparing the rest to using the function relevel : Or relevel (2) model4<-lm(brood~mass+lat+type,data=island) summary(model4) model4a<-lm(brood~mass+lat+relevel(type, " Land_bridge " ),data=island) summary(model4a) model4b<-lm(brood~mass+lat+relevel(type, " Oceanic " ),data=island) summary(model4b)

relevel (3) Notice that the general model parameters stayed the same Residual standard error: on 72 degrees of freedom (242 observations deleted due to missingness) Multiple R-squared: 0.478, Adjusted R-squared: F-statistic: on 4 and 72 DF, p-value: 1.25e-09 model4a<-lm(brood~mass+lat+relevel(type, " Land_bridge " ),data=island) summary(model4a) Estimate Std. ErrorT valuetPr(>|t|) (Intercept) E-08*** mass E-09*** lat E-06*** relevel(type,Land_bridge)Continental * relevel(type,Land_bridge)Oceanic We can outsmart R, we can tell it what will be the first category that he will be comparing the rest to using the function relevel :

PredictorResponsetestIn R CategoricalSuccess/failureBinomial**binom.test CategoricalCountsChi-square/Gchisq.test CategoricalcontinuousANOVA*aov continuous Regression/correlationlm continuousCategorical/countsChi-square/ANOVAlm Categorical, multiple predictors continuousMulti-way ANOVAaov continuous, multiple predictors continuousMultiple regressionlm Both categorical & continuous predictors continuousANCOVAlm If the assumptions of the parametric models (equality of variance, normal distribution of the residuals) are met: *t-test if there are only 2 categories ** or logistic regression: Choosing the right model

Interactions It is easy to understand it graphically: in the example there is a categorical variable with two levels (dashed and full lines) and a continuous variable (on the X axis) Null hypothesis a. response Continuous predictor response c. categorical significant, continuous not e. response Continuous predictor b. response Continuous predictor Both significant, no interaction d. response Continuous predictor Both significant with interaction f. response Continuous predictor Continuous significant, categorical not, there is interaction Continuous significant, categorical not, there is no interaction

Interactions in R We use ‘+’ between the predictor variables. For interaction we will use ‘:’. If the predictor variable has interaction and main effect we will use ‘*’. For example the model that tries to predict grade of a course according to how much we studied, age of the lecturer, how much we prayed and if there are test from previous years Here we asked for two interactions: between prays and past year tests, and between lecturer age and prayer lm(y~a*b) lm(y~a+b+c+a:b) model<- lm(Grade~days_studied+professor_age+prayer_number*reconstruction_exist +professor_age:prayer_number, data=grades)

Important: The basic assumption of multi predictor tests is that there is no correlation between the predictors High correlation between two predictors is called multi-co- linearity and can be expressed by the tolerance (1-R2) or by Variance Inflation Factors (VIF=1/tolerance) If there is a strong multi-co-linearity then the model is not stable and parameter estimation might be incorrect Don’t add to your model predictors with high correlation among them

Always, Always, Always the more predictors we’ll add the more variance will be explained The ratio is monotonous – and trivial: in the worst case the parameter estimate of additional variables will be zero (for example number of species = *latitude + 0*number of mandates of religious parties in the same area But the parameter estimate will never be exactly zero – it will just be very small – lets say a species is added for every 5120 mandates added to Shas, or a species is subtracted for every 974 mandates that are added to the Bait Hayehudi Our R 2 raises from 0.45 to – is it worth it? Model selection

With every statistical question we can explain 100% of the variance by have the same number of variable as the sample size Example? What is your height? Model selection But, what the predictive ability of this model gives us for the next datum?

The more predictor variables we add more of the variance will be explained Our goal as scientists is to explain the maximal number of phenomena with the minimal number of predictor variables Have you heard about Occam's razor? If we have many predictors we very much like to know if it’s worth to complicate our life for them Model selection

We can test which predictors in the model are significant Model selection We will start with a very complicated model and we’ll remove each time the predictor (or the interaction) that has the highest p-value – until all p-values are lower than 0.05 (or any other threshold). The final model will be MAM = minimum adequate model Example: we are trying to explain the clutch size of different lizard species (response variable: clutch size) with data for body mass, their environmental temperature, elevation, and the number of broods Backwards (stepwise) elimination 1. Using the p value

Estimatesetp (Intercept) * mass ** temp elevation broods * We can just test which variables in the model are significant Model selection Lets start with the most complicated model: clu<-read.table(“eggs.txt”,header=T) model1<-lm(clutch~mass+temp+elevation+broods, data=clu) Example: we are trying to explain the clutch size of different lizard species (response variable: clutch size) with data for body mass, their environmental temperature, elevation, and the number of broods

Estimatesetp (Intercept) *** mass ** elevation broods * Model selection We will remove the temperature and run the model again: model2<-lm(clutch~mass+elevation+broods) We can just test which variables in the model are significant Example: we are trying to explain the clutch size of different lizard species (response variable: clutch size) with data for body mass, their environmental temperature, elevation, and the number of broods

Estimatesetp (Intercept) *** mass *** broods * Model selection Clutch size is affected by body mass and the number of yearly broods, and that’s it model3<-lm(clutch~mass+broods) All the variables are significant STOP! model3 = MAM We can just test which variables in the model are significant We will remove the elevation and run the model again:

forward addition Model selection We can start with the simplest model, and add a new variables each time, and leave it in the model if its p- value is lower than 0.05 (or any other threshold) model1a<-lm(clutch~mass) model2a<- lm(clutch~mass+broods) model3a<- lm(clutch~mass+broods+elevation) When we get to a model with non-significant predictors (model3a in our example) we will stop and choose the previous model (model2a in our example) as MAM We can just test which variables in the model are significant

forward addition Model selection Notice: not all the possible variation among the predictors (and their interactions) are tested in forward addition and backward elimination, it is possible that the best combination was not tested On the other hand the number of models increases in the power of the number of predictors we use in the model, so its not practical to try all the possible combinations unless you have a strong some computer and some time We can just test which variables in the model are significant We can start with the simplest model, and add a new variables each time, and leave it in the model if its p- value is lower than 0.05 (or any other threshold)

Akaike Information Criterion Comparing two models based on two parameters: how “good” is the model (the accuracy in which the reality in it is described) compared to its complexity (how many parameters we estimated) AIC = 2k-2ln(L) K is the number of parameters and L is the maximum likelihood of the model (without getting into details in this case it expresses the residual sum of squares – the smaller it is the better the model. We can also write [AIC = 2k+n[ln(RSS)) Model selection Alternative way for model selection Hirotsugu Akaike The lower the AIC the better the model

Akaike Information Criterion AIC = 2k-2ln(L) Model selection Hirotsugu Akaike AIC rewards descriptive accuracy via the maximum likelihood (High L), and penalizes lack of parsimony according to the number of free parameters (high K) Notice that the model support will be stronger with the decrease in the AIC In R model comparison based on AIC is very simple AIC(model1,model2,model3)

Lets go back to the lizards Model selection model1<-lm(clutch~mass+temp+elevation+broods) model2<-lm(clutch~mass+elevation+broods) model3<-lm(clutch~mass+broods) AIC(model1,model2,model3) dfAIC model model model Here we can see that model 3 is the best (it has the lowest AIC score) Example: we are trying to explain the clutch size of different lizard species (response variable: clutch size) with data for body mass, their environmental temperature, elevation, and the number of broods

AIC doesn’t allow us to test how good is one model just to compare between two models that are based on the same data The AIC score is meaningless by itself : we can’t compare AIC score of two models that ask different questions or based on different data Akaike Information Criterion Moreover, rule of thumb is that you can’t decide which model is better if their AIC difference is lower than 2 AIC(model1,model2,model3)

Akaike Information Criterion dfAIC∆AIC model model model We will arrange the models based on their AIC scores from the lowest (the best) to the highest and calculate for each the difference between each AIC score and the lowest AIC score – to get the ΔAIC of each model. We can’t say that model 3 is better than model 2 because the difference between their AIC score is lower than 2 AIC(model1,model2,model3) Rule of thumb is that you can’t decide which model is better if their AIC difference is lower than 2

Akaike Information Criterion Notice: rule of thumb does not apply to the model with the best model nested within it AIC(model1,model2,model3) AIC = 2k-2ln(L) This is because if we add a parameter to the best model it will never increase the AIC score in more than 2 In nested models we can say that a simpler model is as good as a more complicated model with ΔAIC of 2 or lower – but not that a more complicated model is as good as nested model with ΔAIC of 2 or lower Arnold Uninformative parameters and model selection using Akaike's information criterion. Journal of Wildlife Management, 74: Rule of thumb is that you can’t decide which model is better if their AIC difference is lower than 2

Model selection: AIC and other animals There is no reason, and it is wrong, to calculate the AIC score for a single test (this is similar to saying that a basketball team sinked in a specific game 79 points – it has no value if we don’t know how many point the opponent team had) The model with the lowest AIC can defiantly have variables with p-values higher than 0.05 (AIC is relatively permissive for the parameters it allows) AIC and p-values come from different statistical philosophies and you shouldn’t mix them* *but see Johnson Revised standards for statistical evidence. PNAS 110: , who suggest the philosophies can be reconciled – and that p values <<0.05 should be used See also a lively debate about p values and AIC in Ecology (95 #3, 2014: )

1.AICc BIC: -2*ln L + k*ln(n) Correction to AIC for models with small sample size (חבילת R 2. AIC weights “Akaike weights are used in model averaging. They represent the relative likelihood of a model. To calculate them, for each model first calculate the relative likelihood of the model, which is just exp(-0.5 * ∆AIC score for that model). The Akaike weight for a model is this value divided by the sum of these values across all models.” † † Aho et al Model selection for ecologists: the worldviews of AIC and BIC. Ecology, 95: †† Wagenmakers & Farrell Baysian Information Criterion, BIC Less permissive 1 than AIC to large number of parameters †† AICc = AIC+(2k*[k+1]/[n-k-1]) 1. Notice : k is multiplied by sample size! In the AIC sample size is not incorporated Model selection: variation on the AIC theme

Remember for each research: "No statistical procedure can substitute for serious thinking about alternative evolutionary scenarios and their credibility" Westoby, Leishman & Lord On misinterpreting 'phylogenetic correction. J. of Ecology 83:

YNET : מספר הרקטות שנורו לישראל, כמה יורטו ע"י כיפת ברזל (וכמה לא יורטו): לפני ואחרי הכניסה הקרקעית

תרגיל בית לשימוש בשעת אזעקות: מדלו ב-R האם הכניסה הקרקעית הורידה את ירי הרקטות לעבר ישראל? האם היא הורידה את הירי המדוייק (=רקטות ש"כיפת ברזל" טרחה ליירט) או את הפרופורציה שלו מכל הירי? האם הזמן עד כה פעל לטובתנו?