Regression of NFL Scores on Vegas Line – 2007 Regular Season.

Slides:



Advertisements
Similar presentations
Assumptions underlying regression analysis
Advertisements

Simple Linear Regression Analysis
Lecture 17: Tues., March 16 Inference for simple linear regression (Ch ) R2 statistic (Ch ) Association is not causation (Ch ) Next.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Forecasting Using the Simple Linear Regression Model and Correlation
Chapter 27 Inferences for Regression This is just for one sample We want to talk about the relation between waist size and %body fat for the complete population.
Inference for Regression
Regression Inferential Methods
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Regression Analysis Notes. What is a simple linear relation? When one variable is associated with another variable in such a way that two numbers completely.
Inference for Regression 1Section 13.3, Page 284.
Chapter 12 Simple Linear Regression
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
Chapter 10 Simple Regression.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Simple Linear Regression Analysis
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Slide Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Pertemua 19 Regresi Linier
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Correlation and Regression Analysis
Simple Linear Regression Analysis
Simple Linear Regression NFL Point Spreads – 2007.
Checking Regression Model Assumptions NBA 2013/14 Player Heights and Weights.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
The Examination of Residuals. The residuals are defined as the n differences : where is an observation and is the corresponding fitted value obtained.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
12.1 WS Solutions. (b) The y-intercept says that if there no time spent at the table, we would predict the average number of calories consumed to be
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Diagnostics – Part II Using statistical tests to check to see if the assumptions we made about the model are realistic.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Chapter 13 Simple Linear Regression
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
AP Statistics Chapter 14 Section 1.
Statistics for Managers using Microsoft Excel 3rd Edition
Checking Regression Model Assumptions
CHAPTER 29: Multiple Regression*
Checking Regression Model Assumptions
Unit 3 – Linear regression
PENGOLAHAN DAN PENYAJIAN
(Approximately) Bivariate Normal Data and Inference Based on Hotelling’s T2 WNBA Regular Season Home Point Spread and Over/Under Differentials
Diagnostics and Remedial Measures
Linear Regression and Correlation
3.2. SIMPLE LINEAR REGRESSION
Inference for Regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Regression of NFL Scores on Vegas Line – 2007 Regular Season

Problem Description Odds makers Place a Point Spread (differential) and a Over/Under (total) on all National Football League games Combining these two quantities, we can obtain a prediction for the final score of the game Let P A and P H be the odds makers Predicted scores for the Away and Home teams, respectively Spread [wrt Home Team] (PS)= P A – P H (Negative spreads for Home teams mean they are favored (giving points) Over/Under (OU) = P A + P H P A = (OU+PS)/2 P H = (OU-PS)/2

Data/Model Description Point Spreads, Over/Under, and Actual Scores obtained for all n=256 NFL games from 2007 season Predicted Scores obtained for each team in each game Regression is fit for each teams actual score (n=512 team games) as a function of predicted score, and home team indicator Residuals checked to see if errors are independent within games for the two teams Tests conducted to determine: If Home Team effect is sufficiently accounted for by odds makers If Odds makers are unbiased in their point predictions If relation between actual and predicted scores is linear

Week 1 Data Note for the first game: Spread = P A – P H = -6.5 (IND was favored to beat NO by 6.5 Points) Over/Under = P A + P H = 49.5 (Predicted Total Score was 49.5 points) P A = ( (-6.5))/2 = 21.5 P H = ( (-6.5))/2 = 28

Regression Model

Regression Results

Test of No Home Effect and Unbiasedness

Results of Test of No Home Effects and Unbiasedness No evidence to Conclude that E(Y) P

Fit of Simple Regression of Actual on Predicted Score Note, we clearly do not reject H 0 that the intercept is 0 and slope is 1, but will use this model to obtain Confidence Intervals for Mean Score and Prediction Intervals for Individual Game Scores at various levels of predicted scores

Confidence Intervals and Prediction Intervals

Residual Analysis Are the residuals consistent with the model assumptions: Normally Distributed Histogram, Normal Probability Plot, Wilks-Shapiro Test Linear relation between Actual and Predicted Scores Plot of Residuals versus Fitted, Lack-of-Fit F-test Constant Error Variance Plot of Residuals versus Fitted, Regress |resid| vs fitted Independent (e.g. Within Games and Within Teams Over Time) Correlation between Home/Away within games Non-Independent errors within Teams (Random Team effects) Autocorrelation among errors over time within teams

Normal Distribution of Residuals Correlation between residuals and their corresponding normal scores =.9952

Linearity of Regression No evidence to reject the hypothesis of a linear relation between Actual and Predicted scores

Equal (Homogeneous) Variance - I No overwhelming evidence of unequal variance based on graph

Equal (Homogeneous) Variance - II No evidence to reject the null hypothesis of equal variance among errors

Equal (Homogeneous) Variance There is some evidence of unequal variance, but keep in mind the sample size is huge. See plot for how weak the association is

Independence Between Home/Away Residuals Within Games No Evidence of associations between residuals within games

Testing For Random Team Effects - I No overwhelming evidence of team random effects

Testing for Random Team Effects - II No evidence of random Team Effects

Durbin-Watson Test Within Teams over Weeks Teams 2 and 9 have small DW values (positive autocorrelation). Team 22 displays negative autocorrelation (value above 4-d L ). Most teams show no autocorrelation