The Best Investment in this Economy (Safer than the S&P) Ana Burcroff Kathleen Fregeau Brett Koons Alistair Meadows March 3 rd 2009 – Team 7.

Slides:



Advertisements
Similar presentations
Regression of NFL Scores on Vegas Line – 2007 Regular Season.
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Inference for Regression
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
IB Math Studies – Topic 6 Statistics.
1 Multiple Regression Interpretation. 2 Correlation, Causation Think about a light switch and the light that is on the electrical circuit. If you and.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
PSY 307 – Statistics for the Behavioral Sciences
Multiple Regression Involves the use of more than one independent variable. Multivariate analysis involves more than one dependent variable - OMS 633 Adding.
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Matching level of measurement to statistical procedures
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Correlations and T-tests
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Business Statistics: Communicating with Numbers By Sanjiv Jaggia.
BCOR 1020 Business Statistics Lecture 24 – April 17, 2008.
Business Statistics - QBM117 Statistical inference for regression.
Chapter 7 Forecasting with Simple Regression
Chapter 9: Introduction to the t statistic
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression NFL Point Spreads – 2007.
Eric Huggins, Ph.D. Fort Lewis College Durango, CO.
Lecture 5 Correlation and Regression
Correlation and Linear Regression
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
AM Recitation 2/10/11.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
1 MADE WHAT IF SOME OLS ASSUMPTIONS ARE NOT FULFILED?
Confidence Intervals for the Regression Slope 12.1b Target Goal: I can perform a significance test about the slope β of a population (true) regression.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 22 Regression Diagnostics.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Data Analysis (continued). Analyzing the Results of Research Investigations Two basic ways of describing the results Two basic ways of describing the.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Part IV Significantly Different: Using Inferential Statistics
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Introduction to Statistics Introduction to Statistics Correlation Chapter 15 Apr 29-May 4, 2010 Classes #28-29.
Byron Gangnes Econ 427 lecture 3 slides. Byron Gangnes A scatterplot.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
NCSU WOLFPACK Men’s Basketball. Home & Away Games.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Data Analysis.
Copyright © 2011 Pearson Education, Inc. Regression Diagnostics Chapter 22.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
PRESENTATION TO MIS480/580 GABE HAZLEWOOD JOSH HOTTENSTEIN SCOTTIE WANG JAMES CHEN MAY 5, 2008 Betting in Super Bowl match ups.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Outline Sampling Measurement Descriptive Statistics:
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
The statistics behind the game
CHAPTER 3 Describing Relationships
Statistics for Managers using Microsoft Excel 3rd Edition
NCAA Basketball Tournament: Predicting Performance
Chapter 12: Regression Diagnostics
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
The statistics behind the game
When You See (This), You Think (That)
Presentation transcript:

The Best Investment in this Economy (Safer than the S&P) Ana Burcroff Kathleen Fregeau Brett Koons Alistair Meadows March 3 rd 2009 – Team 7

Determine the best predictors of the over-under and the spread in college basketball games and use that knowledge to make millions. Over-Under The Spread The Goal

Over/Under Analysis“Against the Spread” Analysis Games Analyzed A regression model was developed to predict the total points scored by the home and away teams. Two Regression models were developed to predict the point spread between the home and away teams. Source: Complete if required Rules of the Game We looked solely at NCAA Men’s Basketball: Only intra-divisional games within the following 6 conferences were considered

What is the spread?What is the over/under? The bookmaker specifies that one team will beat another by a certain amount of points  This amount is known as the spread and is usually quoted in half points to avoid ties, known as a push  Gamblers then wage on whether the actual margin of victory will be higher or lower than the spread  Example:  On Feb. 21, UNC was favored over Maryland by 12.5  If you bet on UNC, they would have to win by 13 points or more in order for you to win the bet  If UNC wins by anything less than 12.5 points, gamblers who bet on UNC lose their money although the team won. In addition to the spread, you can also bet the over/under  The bookmaker predicts that the combined score of the two teams will be a certain number  Bettors then wager on whether the actual score total will be higher or lower than that number We wanted to see if we could build a model that gave us “lock” picks – games where the model predicts that the spread is too high or low or the over/under is too high or low, so that we could bet accordingly. The basics for everyone who is not a degenerate gambler. The Spread and Over/Under

We considered away and home team offensive and defensive statistics along with how frequently the away team covers the spread and the over. Home Team StatisticsAway Team Statistics Offensive Statistics  Field Goal Attempts  Field Goal Percentage  3 Point Attempts  3 Point Percentage  Free Throw Attempts  Free Throw Percentage  Defensive Statistics  Average Points Allowed Offensive Statistics  Field Goal Attempts  Field Goal Percentage  3 Point Attempts  3 Point Percentage  Free Throw Attempts  Free Throw Percentage  Defensive Statistics  Average Points Allowed The Independent Variables Away Home

Bets are based on 3 models: one predicts over/under, two predict against the spread The models are based on historical data from different time periods Model 2 Against the Spread Model 3 Against the Spread Model 1 Over/Under The Models For all models: P value for F Test = 0 P value for all coefficients < 5% Adjusted R 2 Model 1: 41.9% Model 2: 55.4% Model 3: 49.9%

Model 2 - ATS Model 1 – Over/Under  Vegas O/U  Away Average Def.  Away Over %  Away ATS%  Away Win%  Away Field Goal Attempts  Away Free Throw %  Home 3PT Attempts  Home Free Throw %  Vegas Over/Under  Home Line (ATS)  Away ATS %  Away Win %  Away Field Goal %  Away 3PT %  Home Average Offense  Home Field Goal Attempts  Home Field Goal %  Home 3PT %  Home Free Throw Attempts  Home Avg. Opp. Power Rating The Independent Variables Model 3 - ATS  Home Line (ATS)  Away Win %  Away Field Goal %  Away 3PT %  Away Free Throw Attempts  Home Average Offense  Home Average Def.  Home ATS %  Home Field Goal Attempts  Home Field Goal %  Home 3PT %  Home Free Throw Attempts  Home Avg. Opp. Power Rating

How to Bet O/U Model - If predicted y hat indicates the game as a super lock or extra super lock – then bet. Against the Spread (Version 1) – If predicted y hat indicates the game as a lock, super lock, or extra super lock – then bet. Against the Spread (Version 2) - If predicted y hat indicates the game as a lock, super lock, or extra super lock – then bet.

Model 2 (ATS) Good BetBetter BetLockSuper Lock Extra Super Lock O/U +/- 0pts+/- 5pts+/- 10pts+/- 15pts+/- 20pts Win Losses Total Bet Amounts $200 Juice 10% Amount Invested $21,780 $11,660 $7,040 $880 $220 Account $60 $1,360 $3,460 $800 $200 Return 0.3% 11.7% 49.1% 90.9%

Models’ Fit with Data Model’s Fit with Data Last recorded actual data was 2/28

Model’s Fit with Data

Vegas O/U vs. Model’s Predicted Total Scoring Vegas O/U Model’s Predicted Total Scoring The Model’s predicted total scoring is positively correlated with the Vegas O/U. This correlation is stronger among all games than among solely the Super Locks and Extra Super Locks. All GamesSuper Locks and Extra Super Locks

Model’s Predicted Total Scoring Actual Total Scoring Vegas’ Predicted Total Scoring The Model predicts 21.8% of actual total scoring, while Vegas’ O/U explains 31.96% of total scoring when looking at all games analyzed. Model vs. Vegas’ Prediction Accuracy

Residual Analysis (Model Prediction vs. Actual Result) All Residuals vs. Model’s Prediction Residuals of Super Locks and Extra Super Locks vs. Model’s Prediction Residual vs. Prediction Scatterplot Residual Values Histogram Residuals have a relatively normal distribution with a mean of 5.6. The residuals of Super Locks and Extra Super Locks are not normally distributed and have a mean of -10.

Heteroscedasticity patterns imply that the model’s error generally increases as the expected total scoring increases. However, the model is better at making betting predictions when it underestimates the O/U for high scoring games and overestimates O/U for low scoring games. Model’s Predicted Scoring Difference between Predicted and Actual Super Locks and Extra Super Locks Positive Heteroscedasticity Negative Heteroscedasticity All GamesSuper Locks and Extra Super Locks Heteroscedasticity and Prediction Accuracy

Difference between Vegas’ O/U and Actual Difference between Predicted and Actual All GamesSuper Locks and Extra Super Locks The correlation (or lack of correlation) between the difference in the Model’s prediction and the actual outcome vs. the difference between Vegas’ O/U and the actual outcome implies that the model makes more accurate predictions when it is positively correlated (but not directly correlated) with Vegas’ O/U prediction. No significant correlation Significant positive correlation Correlation between Model’s Residual and Vegas’ Residual

Correlation Analysis A correlation matrix with all 26 variables generated only 16 coefficients greater than 0.5. Of those, the average correlation was , with a median of , and a max of (the relationship between the Home teams average offensive points, Home team field goal attempts).

Durban-Watson Calculation O/U Model’s D-stat Numerator26,808.5 Denominator16,908.2 D-stat1.6 Are the observations independent of one another – does the date of the game affect the volatility of the model (i.e. more uncertainty as March Madness approaches) Null Hypothesis: Correlation = 0 Alternative Hypothesis: Correlation >0 N = 95 X variables = 9 dL = dU = 1.85 Our d-stat is between the lower and upper critical values, so we do not reject the Null (but we don’t accept it either) and conclude that there is either no serial autocorrelation or weak serial autocorrelation in our data. Durbin-Watson Analysis

It’s the safest investment around!! We beat the S&P! We beat the professional handicapper! We’re making $$$!! Model Returns = 26.2%!! S & P 500 Returns = % year-to-date – that is scary! Dow closed below 7,000 for the first time in 11 years Model accuracy = 75.7%!! Handicappers pick bets Even good handicappers are right just over 50% of the time 52.38% is the magical number to be profitable So far the “company” has invested $11, bets Profit of $2,910 Invested Capital of only $3,000 ROIC = 97%!! Models’ Success-to-date

MAKE YOUR LIFE EASIER AND FILL YOUR POCKETS FULL OF MONEY!!!! FORGET ABOUT JOBS AND GAMBLE! Incredible Business Decision! Incredible Personal Decision! Money Won is Twice as Sweet as Money Earned!!!!

Questions