A Statistical Analysis of Basketball Comebacks

Slides:



Advertisements
Similar presentations
T HE ‘N ORMAL ’ D ISTRIBUTION. O BJECTIVES Review the Normal Distribution Properties of the Standard Normal Distribution Review the Central Limit Theorem.
Advertisements

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Home Cooking Home field advantage in the four major professional sports By Pat Maher And Mike Danziger.
Diploma in Statistics Introduction to Regression Lecture 4.11 Introduction to Regression Lecture 4.2 Indicator variables for estimating seasonal effects.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
1 Detection and Analysis of Impulse Point Sequences on Correlated Disturbance Phone G. Filaretov, A. Avshalumov Moscow Power Engineering Institute, Moscow.
Introduction Offensive strategies of the National Football League have seemingly shifted towards a “West Coast” style offense, relying more heavily on.
Psychology 202b Advanced Psychological Statistics, II February 15, 2011.
Ranking individuals by group comparison New exponentiel model Two methods for calculations  Regularized least square  Maximum likelihood.
Computers and Sports Fan Perspective By Robert Driver & Robert Shoemake.
Part 18: Regression Modeling 18-1/44 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
MEASURING AND PREDICTING UW BADGERS’S PERFORMANCE BY QUARTERBACK AND RUNNING BACK STATS By: Tyler Chu ECE 539 Fall 2013.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Test Topics 1)Notation and symbols 2)Determining if CLT applies. 3)Using CLT to find mean/mean proportion and standard error of sampling distribution 4)Finding.
Regression and Correlation Jake Blanchard Fall 2010.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Correlation and Prediction Error The amount of prediction error is associated with the strength of the correlation between X and Y.
1 Forecasting Formulas Symbols n Total number of periods, or number of data points. A Actual demand for the period (  Y). F Forecast demand for the period.
Illinois Sports Illinois Sports. Sports Sports  Basketball  Football  Volleyball  Baseball.
Linear Regression and the By: Nancy Thach & Alexis Ammons July 11, 2011.
By Josh Tabor and Christine Franklin Probability & Statistics w/Applications Honors, Course
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
8-5 Estimating mean differences. Comparing two populations Are two populations different? Really? Just a little? What if I wanted to compare the mean.
Economic and team performance data have different effects on different elements of ticket demand Unemployment is a strong predictor of attendance but not.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/20/12 Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory.
Least Squares Estimate Additional Notes 1. Introduction The quality of an estimate can be judged using the expected value and the covariance matrix of.
PRESENTATION TO MIS480/580 GABE HAZLEWOOD JOSH HOTTENSTEIN SCOTTIE WANG JAMES CHEN MAY 5, 2008 Betting in Super Bowl match ups.
Probability Head to Head; Football. Barcelona.. How likely are Barcelona to score more than 2 goals in the next match? We will base this experiment on.
Chapter 3 DATA PROCESS & ANALYSIS OF STATISTICS Dr. BALAMURUGAN MUTHURAMAN
Sport’s impact on spectators. Does sport have a more significant impact on spectators than on athletes?
Lectures' Notes STAT –324 Probability Probability and Statistics for Engineers First Semester 1431/1432 and 5735 Teacher: Dr. Abdel-Hamid El-Zaid Department.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
Copyright © 2009 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
High school football field John Elway has always loved playing sports. In high school Elway played for the Granada Hill Team.
ICPR2004 (24 July, 2004, Cambridge) 1 Probabilistic image processing based on the Q-Ising model by means of the mean- field method and loopy belief propagation.
CS Statistical Machine learning Lecture 7 Yuan (Alan) Qi Purdue CS Sept Acknowledgement: Sargur Srihari’s slides.
LONG CORNER KICKS IN THE ENGLISH PREMIER LEAGUE: DELIVERIES INTO THE GOAL AREA AND CRITICAL AREA INTRODUCTION Goals within soccer can be scored from open.
Forecasting the Outcomes of Football Bowl Games with the AP Poll Presenters: Caroline E. Meyer and P. Trent Teague Lincolnton High School.
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Alison Tiangson – Western Vance High School
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
American sports Prepared by the teacher of English Vovkotrub Larysa Valeriivna Katerinopol secondary school №2 Katerinopil district rada Cherkassy region.
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Regression Chapter 6 I Introduction to Regression
Linear Regression and Correlation Analysis
Statistical Data Analysis
The Research Experience for Teachers Program
Statistics with Stiles
Statistics in Applied Science and Technology
Objectives The student will be able to: use Sigma Notation
Prediction of NBA games based on Machine Learning Methods
Lectures' Notes STAT – 324  Probability and Statistics for Engineers First Semester 1431/1432 and 5735 Teacher: Dr. Abdel-Hamid El-Zaid Department.
خشنه اتره اهورهه مزدا شيوۀ ارائه مقاله 17/10/1388.
Analysis of MLS Season Data Using Poisson Regression with R
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
HUMANITIES/ARTS GREEN TITLE
SCIENCE/SOCIAL SCIENCE GREEN TITLE
Calculating Probabilities for Any Normal Variable
Psychological Momentum in Hockey: Does fighting affect outcome?
SCIENCE/SOCIAL SCIENCE YELLOW TITLE
Statistical Data Analysis
Dr. Uhave Kale Department of Physical Education
Do Revenues Effect Success Among Professional Sports Teams?
3.2. SIMPLE LINEAR REGRESSION
Lectures' Notes STAT – 324  Probability and Statistics for Engineers First Semester 1431/ and 5735 Teacher: Dr. Abdel-Hamid El-Zaid Department.
SCIENCE/SOCIAL SCIENCE GREEN TITLE
Table 2. Regression statistics for independent and dependent variables
Linear Regression and Correlation
Presentation transcript:

A Statistical Analysis of Basketball Comebacks Presenters: Jessica Jenkins and Adam Benoit Watauga High School Lincolnton High School

Inspiration What was the likelihood of the http://www.nba.com/hawks/sites/hawks/files/imagecache/image_gallery_default/photos/HWK_Tmac_Moment1.jpg What was the likelihood of the "Greatest" NBA Comeback of All Time?

Abstract Likelihood of an NBA comeback based on the time remaining and point differential Data mining strategies Visualization Modeling the empirical data with a function

Introduction NBA regular season games from 2002 – 2013 Two factors: time and point differential Modeled with exponential function

Literary Reviews Factors: possession, home-team advantage, current ranking Bill James’ formula Seconds = (lead – 3 ± 0.5)2 +0.5 if the leading team has the ball -0.5 if the trailing team has the ball Research prior to 2000

Empirical Data

Empirical Data

Empirical Data 95% Confidence Interval The most frequent probability of a comeback is 0.1420; however, the 95% confidence interval is from 0.1126 to 0.1757.

Function z (x,y) = 0.5e -Ry x + C -178.3099y x + 457.8600 z represents likelihood of a comeback, x represents time remaining, and y represents point differential The coefficient (R) and the constant (C) were found using nonlinear regression in MatLab z = 0.1128 for the previous example

Function and Empirical Data

Comparison of Comeback Probabilities at the start of the 4th Quarter for Deficits up to 20 Points Point Differential Empirical Estimate z(x,y) 1 0.4235 0.4298 2 0.3975 0.3694 3 0.3581 0.3175 4 0.3108 0.2729 5 0.2531 0.2346 6 0.2722 0.2016 7 0.1859 0.1733 8 0.1522 0.1489 9 0.1210 0.1280 Comparison of Comeback Probabilities at the start of the 4th Quarter for Deficits up to 20 Points Point Differential Empirical Estimate z(x,y) 10 0.1119 0.1100 11 0.0882 0.0946 12 0.0382 0.0813 13 0.0485 0.0699 14 0.0260 0.0601 15 0.0126 0.0516 16 0.0383 0.0444 17 0.0221 0.0381 18 0.0000 0.0328 19 0.0189 0.0282 20 0.0066 0.0242

Function and Empirical Data Deviation

Conclusions Function reasonably models the data Better predictor at specific points in game

Back to the Inspiration

Further Research Home-team advantage Quantify game momentum Possession Different Function

Acknowledgments Co-author Dr. Mitch Parry Dr. Rahman Tashakkori Dr. Mary Beth Searcy National Science Foundation ASU Computer Science Department Watauga High School Lincolnton High School Fellow RET Members

References [1] H. S. Stern, “A Brownian Motion Model for the Progress of Sports Scores” J. Amer. Stat. Assoc., vol. 89, no. 427, pp. 1128-1134, Sep. 1994. [2] Paramjit S. Gill , “Late-Game Reversals in Professional Basketball, Football, and Hockey” The Amer. Stats.,vol. 54, no. 2. pp. 94-99. May. 2000. [3] B. James, (2008, March, 17). The Lead is Safe. [Online]. Available: http://www.slate.com/articles/sports/sports_nut/2008/03/the_lead_is_safe.3.html [4] (2013, May, 29). Root-mean-square deviation. [Online]. Available: http://en.wikipedia.org/wiki/Root_mean_square_deviation [5] (2013, July, 17). ESPN NBA. [Online]. Available: http://espn.go.com/nba/