Some dates check out “outline version 3.0.pdf” Return reviews to reviewees (use track changes and “comments) by 20 Sep – send also to Lee Hsiang Revised.

Slides:



Advertisements
Similar presentations
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Advertisements

Day 6 Model Selection and Multimodel Inference
458 Model Uncertainty and Model Selection Fish 458, Lecture 13.
Introduction to statistical estimation methods Finse Alpine Research Center, September 2010.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
EEM332 Design of Experiments En. Mohd Nazri Mahmud
Chapter 2 Simple Comparative Experiments
BCOR 1020 Business Statistics
Lecture 4 Model Selection and Multimodel Inference
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
The Method of Likelihood Hal Whitehead BIOL4062/5062.
On Model Validation Techniques Alex Karagrigoriou University of Cyprus "Quality - Theory and Practice”, ORT Braude College of Engineering, Karmiel, May.
Chapter 8 Introduction to Hypothesis Testing
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
Essential Statistics Chapter 131 Introduction to Inference.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Lecture 4 Model Selection and Multimodel Inference.
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
CHAPTER 9 Testing a Claim
answers to me Assignment*Which commandments did Sepkoski (1984) break, do you think his inferences hold (if so, to what extent)? R assignment(s)
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Inferential Statistics Psych 231: Research Methods in Psychology.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
Model Comparison. Assessing alternative models We don’t ask “Is the model right or wrong?” We ask “Do the data support a model more than a competing model?”
Lecture 4 Model Selection and Multimodel Inference
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
Chapter 2 Simple Comparative Experiments
Simulation-Based Approach for Comparing Two Means
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
Analysis based on normal distributions
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
CHAPTER 9 Testing a Claim
Model Comparison.
Discrete Event Simulation - 4
Essential Statistics Introduction to Inference
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
Psych 231: Research Methods in Psychology
Lecture 4 Model Selection and Multimodel Inference
CHAPTER 10 Comparing Two Populations or Groups
Psych 231: Research Methods in Psychology
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Psych 231: Research Methods in Psychology
CHAPTER 9 Testing a Claim
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
Lecture 4 Model Selection and Multimodel Inference
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 9 Testing a Claim
Wildlife Population Analysis
Presentation transcript:

Some dates check out “outline version 3.0.pdf” Return reviews to reviewees (use track changes and “comments) by 20 Sep – send also to Lee Hsiang Revised version to Lee Hsiang by 27 Sep Writing assignment (Sepkoski and 10 commandments: ASAP) All R assignments in one annotated R file by 20 Sep noon; results (including plots), description of data you downloaded and interpretation in a separate pdf file.

All models are wrong but some are useful. -Box 1976 J. Am. Stat. Assoc. 71:

Introduction to likelihood, AIC and model selection

4 Models Model – an idea (formalized or not) about how the world (or a part of it) works. – may be conceptual, verbal or mathematical. – competing hypotheses can be represented as different models. – mathematical models can be developed to project the consequences of hypotheses Parameter – a true characteristic of the “population” of interest Estimator – an equation or process used to produce a parameter estimate from observed data

5 Illustrating likelihood (Steve Wang’s e.g.) Weather (unknown) Total ColdHot Clothing (Observed) Jacket Tshirts Total1.0 Rows: statement about an unknown and unobserved state Likelihood (weather|clothing) Columns: state of observed data Probability(clothing|weather)

Illustration of MLE (Steve Wang’s example from 2010) Sampled 297 gastropod shells from mid Palaeocene Alabama. 138 had drill holes. Of all gastropods that are in that locality (including those not sampled or not preserved), what proportion drilled? Let this = p Let n = sampled gastropods, let x = drilled of those sampled

Illustration of MLE (Steve Wang’s example from 2010) Sampled 297 gastropod shells from mid Palaeocene Alabama. 138 had drill holes. Of all gastropods that are in that locality (including those not sampled or not preserved), what proportion drilled? Let this = p Let n = sampled gastropods, let x = drilled of those sampled L(p)=P(x|p) The likelihood of the true parameter is the probability of the observed data (x) given the parameter

Binomial distribution A gastropod sampled can be of two states: drilled or not drilled. Hence this data can be modeled using a binomial distribution Repeat n trials under the same conditions (= picking up gastropods) Each trial has two outcomes, drilled or NOT drilled The probability of being drilled is p and NOT drilled is 1-p Not A A

Binomial distribution

n = 297 (total gastropods) X = 138 (drilled)

Find MLE

Foote (2003). "Origination and extinction through the Phanerozoic: A new approach." Journal of Geology 111(2): Hunt (2007). "The relative importance of directional change, random walks, and stasis in the evolution of fossil lineages." Proceedings of the National Academy of Sciences of the United States of America 104(47): Wagner (2000). "Likelihood tests of hypothesized durations: determining and accommodating biasing factors." Paleobiology 26(3): Liow, Fortelius et al. (2008). "Higher origination and extinction rates in larger mammals." Proceedings of the National Academy of Sciences of the United States of America 105(16): And many many more Examples of Paleo Maximum Likelihood uses

Simple Likelihood Ratio example What can we do with a Likelihood? Compare two models using likelihood – by using a likelihood ratio, limited use, like hypothesis testing, but sometimes useful 779 ancestor-descendent mammal pairs in NAm (Cenozoic) 442 of descendants are larger than their ancestors (56.7%) Based in these data, is there a general size increase in mammals in NAm over time or not?

Likelihood Ratio Test (LRT)

#class exercise mammal body size example from Wang 2010 a=(choose(779, 442))*(0.5^442)*((1-0.5)^( )) b=(choose(779, 442))*(0.567^442)*(( )^( )) lambda=a/b -2*log(lambda)

H testing, L ratios, Model comparison Hypothesis testing compares 2 models, one null and one alternative (L ratio test as well). Sometimes the null makes sense, but often it does not, hence the only thing we are finding out with a significance test is whether we have enough data to show that null is false. Why is previous example ok?

Valid inference Fischer 1922: valid inference= 1.model specification 2.estimation of model parameters 3.estimation of precision In much of science, neither the model parameters nor the model is known! Hence problem with step missing model formulation and selection! We need to be able to formulate models and select among them, but how?

TRUTH Model 1 Model 2 Model 3 Model 4 KL dist 1 KL dist 2 KL dist 4 KL dist 3 Kullback-Leibler Distance Illustration (basis for AIC)

In practice Make a set of candidate models: think long and hard about “realistic” and non trivial-models (~4-20) Write them down and identify (or make) a global model A global model is one where all variables thought to be important are included If global model fits data adequately, the selected model that is more parsimonious will also fit the data (this is an empirical result) Note: if a really good model is not in the candidate set, we’re screwed* Compare the models (model comparison) There are many information criteria “out there” There we argue for the AIC (Akaike Information Criteria)

Akaike Information Criteria

K Distance between estimate and truth Spread of estimates

26 Akaike Information Criteria

TRUTH Model 1 Model 2 Model 3 Model 4 KL dist 1 KL dist 2 KL dist 4 KL dist 3 Kullback-Leibler Distance Illustration (basis for AIC)

Model 1 Model 2 Model 3 Model 4 ΔAIC

29  AIC = AIC –min(AIC)  AIC measures the distance between the AIC for the model being considered and the AIC of the model with the lowest AIC in the candidate set of models Information lost when given model is used instead of the top model Absolute AIC values have little meaning, they reflect sample size mostly A rule of thumb is –  AICc  2 substantial support and should be used for making inferences. –  AICc of about 4 to 7 have considerably less support –  AICc > 10 have essentially no support ΔAIC

30 AIC model weights

31 Adjusted AICs

R demo

33 References Readings: – Gary White’s lecture notes on Model selection – Johnson and Omland TREE 2004 (an easy to read over view on model selection written for biologists, but should be understandable to paleobiologists) Burnham and Anderson 2002 Model Selection and Multimodel Inference: A practical Information-Theoretic Approach – This is “the” book on model selection. The material for this lecture came largely out of Chapters 1-3

Continued R assignment Using the same data you downloaded, tabulate the number of occurrences per unit time (of your choice) and plot that (either for the entire taxonomic group, or split up into genera or species, depending on how much data you happen to have, using an R script Fit models of occurrences ~ time as you see fit after plotting out the data. We showed some non linear models in class, but you might like some linear models as well, check out Write a summary of your observations This is more a simple exercise in “semi-canned” model fitting just so you feel comfortable with models and comparing them using AIC. All the maximum likelihood is done for you in R, but hopefully you understand the principle from the short lecture. There are many model fitting and optimization “machines” in R. In preparation for next lecture, download: (only runs in windows)

weblinks by-maximum-likelihood/ by-maximum-likelihood/ And watch another TED talk ws_how_stats_fool_juries.html