Information criteria What function fits best? The more free parameters a model has the higher will be R 2. The more parsimonious a model is the lesser.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Analysis of variance and statistical inference.
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Kin 304 Regression Linear Regression Least Sum of Squares
Chapter 12 Simple Linear Regression
Outline input analysis input analyzer of ARENA parameter estimation
Day 6 Model Selection and Multimodel Inference
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Linear regression models
The General Linear Model. The Simple Linear Model Linear Regression.
Data Handling & Analysis Polynomials and model fit Andrew Jackson
458 Model Uncertainty and Model Selection Fish 458, Lecture 13.
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Least Square Regression
The Islamic University of Gaza Faculty of Engineering Civil Engineering Department Numerical Analysis ECIV 3306 Chapter 17 Least Square Regression.
Estimation of parameters. Maximum likelihood What has happened was most likely.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
2. Point and interval estimation Introduction Properties of estimators Finite sample size Asymptotic properties Construction methods Method of moments.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
The Method of Likelihood Hal Whitehead BIOL4062/5062.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
Random Sampling, Point Estimation and Maximum Likelihood.
Physics 270 – Experimental Physics. Standard Deviation of the Mean (Standard Error) When we report the average value of n measurements, the uncertainty.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Lecture 4 Model Selection and Multimodel Inference.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 11: Bayesian learning continued Geoffrey Hinton.
Analysis of Residuals Data = Fit + Residual. Residual means left over Vertical distance of Y i from the regression hyper-plane An error of “prediction”
Logistic Regression Database Marketing Instructor: N. Kumar.
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
Count based PVA Incorporating Density Dependence, Demographic Stochasticity, Correlated Environments, Catastrophes and Bonanzas.
Measures of Fit David A. Kenny January 25, Background Introduction to Measures of Fit.
Bangladesh Short term Discharge Forecasting time series forecasting Tom Hopson A project supported by USAID.
Simple Linear Regression (OLS). Types of Correlation Positive correlationNegative correlationNo correlation.
INTRODUCTION TO Machine Learning 3rd Edition
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
1 Standard error Estimated standard error,s,. 2 Example 1 While measuring the thermal conductivity of Armco iron, using a temperature of 100F and a power.
Chap 6 Further Inference in the Multiple Regression Model
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Specification: Choosing the Independent.
ES 07 These slides can be found at optimized for Windows)
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
The Instrumental Variables Estimator The instrumental variables (IV) estimator is an alternative to Ordinary Least Squares (OLS) which generates consistent.
Model Selection Information-Theoretic Approach UF 2015 (25 minutes) Outline: Why use model selection Why use model selection AIC AIC AIC weights and model.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Remember the equation of a line: Basic Linear Regression As scientists, we find it an irresistible temptation to put a straight line though something that.
Model Comparison. Assessing alternative models We don’t ask “Is the model right or wrong?” We ask “Do the data support a model more than a competing model?”
Linear model. a type of regression analyses statistical method – both the response variable (Y) and the explanatory variable (X) are continuous variables.
Stat 223 Introduction to the Theory of Statistics
Lecture 4 Model Selection and Multimodel Inference
STATISTICAL INFERENCE
Probability Theory and Parameter Estimation I
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Kin 304 Regression Linear Regression Least Sum of Squares
LINEAR AND NONLINEAR MODELS
Statistics in MSmcDESPOT
BPK 304W Regression Linear Regression Least Sum of Squares
Model Comparison.
Discrete Event Simulation - 4
Linear regression Fitting a straight line to observations.
Cross-validation for the selection of statistical models
Stock Prediction with ARIMA
Simple Linear Regression
Chapter 24 Comparing Two Means.
Determining Which Method to use
Presentation transcript:

Information criteria What function fits best? The more free parameters a model has the higher will be R 2. The more parsimonious a model is the lesser is the bias towards type I errors. We have to find a compromis between goodness of fit and bias! Model parameters few many Bias Explained variance The optimal number of model parameters

The Akaike criterion of model choice k: number of model parameters +1 L: maximum likelihood estimate of the model If the parameter errors are normal and independent we get n: number data points RSS: residual sums of squares If we fit using  2 : If we fit using R 2 : At small sample size we should use the following correction The preferred model is the one with the lowest AIC.

We get the surprising result that the seemingly worst fitting model appears to be the preferred one. A single outlier makes the difference. The single high residual makes the exponential fitting worse

Significant difference in model fit Approximately  AIC is statisticaly significant in favor of the model with thesmaller AIC at the 5% error benchmark if |  AIC| > 2. The last model is not significantly (5% level) different from the second model. AIC model selection serves to find the best descriptor of observed structure. It is a hypothesis generating method. It does not test for significance Model selection using significance levels is a hypothesis testing method. Significance levels and AIC must not be used together.

Literature