Jeremy Sylvain & Michael Schuckers

Slides:



Advertisements
Similar presentations
Chapter 5 Multiple Linear Regression
Advertisements

Section 10-3 Regression.
NHL Shot Probability Maps Michael Schuckers St. Lawrence University Powerpoint slides based upon poster presentation at 2010 Joint.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2013.
Introduction Offensive strategies of the National Football League have seemingly shifted towards a “West Coast” style offense, relying more heavily on.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Statistics for Managers Using Microsoft® Excel 5th Edition
SLIDE 1IS 240 – Spring 2010 Logistic Regression The logistic function: The logistic function is useful because it can take as an input any.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Simple Linear Regression NFL Point Spreads – 2007.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Wednesday PM  Presentation of AM results  Multiple linear regression Simultaneous Simultaneous Stepwise Stepwise Hierarchical Hierarchical  Logistic.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
(a.k.a: The statistical bare minimum I should take along from STAT 101)
Beyond +/-: A Rating System to Compare NHL Players Dennis F. Lock Michael E. Schuckers St. Lawrence University.
STAT E100 Section Week 3 - Regression. Review  Descriptive Statistics versus Hypothesis Testing  Outliers  Sample vs. Population  Residual Plots.
Model Selection1. 1. Regress Y on each k potential X variables. 2. Determine the best single variable model. 3. Regress Y on the best variable and each.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Managerial Economics Demand Estimation & Forecasting.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Functional Data Approach to Longitudinal Modeling in the National Hockey League Matthew J. Valente, David P. MacKinnon, and Hye Won Suk Arizona State University.
Making the NHL Playoffs Dustin Schneider Lawrence Mulcahy.
Introduction to Curve Fitting: The Line of Best Fit OBJ: To find the line of best fit for a set of data and find the correlation coefficient for a set.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
1 Chapter 4: Introduction to Predictive Modeling: Regressions 4.1 Introduction 4.2 Selecting Regression Inputs 4.3 Optimizing Regression Complexity 4.4.
The Impact of Power Plays in NHL Hockey, or: No Dogs Play Hockey Jordan Pedersen, Tom Geiger, and Waleed Khoury.
Logistic Regression Saed Sayad 1www.ismartsoft.com.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Controlling for Context S. Burtch © Traditional Points Plus/Minus Faceoffs Real Time Stats (hits, blocked shots, takeaways/giveaways) Ice Time.
1 Response Surface A Response surface model is a special type of multiple regression model with: Explanatory variables Interaction variables Squared variables.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Predicting the NHL Playoffs Daniel Boucher Tarek Bos-Jabbar.
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Chapter 15 Multiple Regression Model Building
The New York Rangers Come in the locker room and learn all about The New York Rangers. Presented by: Logan Byers Project: 12 12/17/2010.
Simple Linear Regression
Copyright © 2009 Pearson Education, Inc.
Basic Estimation Techniques
Conclusions and areas for further analysis
Multilevel Modeling in Hockey Analytics: Untangling Individual and Team Performance In Even-Strength, Power Play, and Short Handed Situations Sophie Jablansky.
R. E. Wyllys Copyright 2003 by R. E. Wyllys Last revised 2003 Jan 15
Regression Techniques
QM222 Class 18 Omitted Variable Bias
“The Art of Forecasting”
Predicting NHL Player Contracts
School of Computing Science
A Model Based Approach to Injuries
Warm-Up Math Social Studies P.E. Women Men 2 10
NBA Draft Prediction BIT 5534 May 2nd 2018
Basic Estimation Techniques
Analysis of MLS Season Data Using Poisson Regression with R
Linear Regression.
Model Trees for Identifying Exceptional Players in the NHL Draft
Rodney J. Paul – Syracuse University
Prediction of new observations
Chi-Square Goodness-of-Fit Tests
NHL Statistics Jonathan maratta.
Objectives Vocabulary
Loading up the Blue Line Eric Cantor
Sample Presentation – Mr. Linden
Linear Regression and Correlation
Presentation transcript:

Jeremy Sylvain & Michael Schuckers The Probability and Severity of Man Games Lost Due to Injury in an NHL regular Season Jeremy Sylvain & Michael Schuckers

Statistical models for a player’s Goal Statistical models for a player’s Injury probability Injury severity

Introduction: ● Teams that get injured struggle to make playoffs ● The amount of time missed due to injures affects decisions made by coaches and managers as to who to call up from farm teams ● Variables chosen were seen to potentially increase the probability and severity of injury

Background: ● In 2016-17, the correlation between percent cap hit of injured players per game and points per game is -0.31 (nhlinjuryviz.blogspot.com) ● In 2016-17 3 most injured teams VAN, WPG, and BUF, all failed to make the playoffs* ● The 3 least injured teams WSH, CGY, and STL all made the playoffs and WSH and STL both moved on to the second round* *mangameslost.com

Data: ● Data From mangameslost.com and hockey-reference.com ● Data collected from 2009-2016 to allow us to collect stable estimates Variable Definition INJ Games a player lost due to injury GP Games played by a player TOI-GM Average time on ice played per game in all situations HPG Hits a player gives divided by games played BPG Shots a player blocks divided by games played AGE Age of player as of January 1st, of each season in years AGE2 Age of a player squared in years

● Bobby Ryan 2013-2014 Example Variable INJ GP TOI.GM HPG BPG AGE AGE2 Stat 12.00 70.00 14.10 1.41 0.43 26.81 718.97

Percent of player with INJ>10 Average Games Missed for Data: ● Data all situations (ES, PP, PK) aggregated Season 2009-10 2010-11 2011-12 2012-13 2013-14 2014-15 2015-16 Number of Players 862 982 985 902 957 979 963 Percent of player with INJ>10 18.7% 21.9% 21.5% 13.6% 22.2% 22.1% 21.4% Average Games Missed for Injured Players 5.86 7.05 7.25 4.01 6.83 6.67 6.65

Data: ● Broke data down by both forwards and defensemen as different factors might effect the models differently

Analysis: ● HPG and BPG are collision factors ● Time on ice per game (TOI.GM) is included to account for exposure ● To account for susceptibility due to aging we include player’s age (AGE) and a quadratic term (AGE2)

Models

Injury Probability Model: ● Used logistic regression ● INJPROB=1, if INJ>0 ● Predicts the probability of injury ● Model is run for each season and position (FW or D) Probability Model (Logistic Regression): INJPROB~HPG+BPG+TOI.GM+AGE+ AGE2

Injury Severity model: ● To model the severity or length of an NHL injury we used a log linear regression model ● Model is based on a Poisson distribution ● Model is run for each position (FW or D) Severity Model (Poisson Regression): INJ~HPG+BPG+TOI.GM+AGE+AGE2

Fit models by position, by season, and for all seasons 2009- 2016 Results Fit models by position, by season, and for all seasons 2009- 2016 Report all season results by position Full results on poster

Coefficients and Their Significance Results: Injury Probability Model for Forwards ● TOI.GM significant in each season ● HPG(2), AGE(1), and AGE2(1) were significant some seasons ● TOI.GM, AGE, and AGE2, significant in the all seasons model Coefficients and Their Significance Forwards TOI.GM HPG BPG AGE  AGE2 All Seasons **0.1489 0.0767 0.0477 **0.2358 *-0.0028 NOTE: “*” level of significance, “**” (p<0.01) and “*” (p<0.05)

Coefficients and Their Significance for the Injury Probability Model Results: Injury Probability Model for Defense ● TOI-GM significant in each season except for 2012-2013 ● HPG(2) AGE(1) and AGE2(1) were significant in some seasons ● TOI.GM, HPG, AGE, AGE2 were significant in the all seasons model Coefficients and Their Significance for the Injury Probability Model Forwards TOI.GM HPG BPG AGE  AGE2 All Seasons **0.1363 **0.2094 0.0647 *0.2627 -0.0030 NOTE: “*” level of significance, “**” (p<0.01) and “*” (p<0.05)

Coefficients and Their Significance Results: Injury Severity Model for Forwards ● AGE and AGE2 were significant each season ●TOI.GM(6), HPG(6), and BPG(5) were significant in some seasons ● All variables were significant in the all seasons model Coefficients and Their Significance Forwards TOI.GM HPG BPG AGE  AGE2 All Seasons **0.0466 **0.0325 **-0.1300 **0.2894 **-0.0039 NOTE: “*” level of significance, “**” (p<0.01) and “*” (p<0.05)

Coefficients and Their Significance Results: Injury Severity Model for Defense ● AGE and AGE2 were significant each season ● TOI.GM(5), HPG(6), BPG(4), were significant in some seasons ● TOI.GM, HPG, AGE, and, AGE2 were significant in the all season models Coefficients and Their Significance Forwards TOI.GM HPG BPG AGE  AGE2 All Seasons **0.0190 **0.0613 0.0071 **0.3180 **-0.0042 NOTE: “*” level of significance, “**” (p<0.01) and “*” (p<0.05)

Application ● 2016-2017 Ottawa Senators

Apply model to each Ottawa Senator Simulation Fit all season model Apply model to each Ottawa Senator Predict injury probability, and injury severity Store total man games lost Repeat steps 3 and 4 many times (n=1000)

Model Simulation: ● Clarke MacArthur Removed

Discussion: ● Above models are groundwork for study of NHL injuries ● TOI.GM was the most significant factor in predicting probability of an injury ● AGE and AGE2 were the most important factors in predicting the severity of an injury

Issues: ● Players who are blocking shots are good at blocking shots ● Hitters are braced for the hit, potentially add frequency of being hit ● Special teams more or less dangerous? ● Modeling long term injuries

Extensions ● Mixture distributions ● Special teams ● Are players injury prone or unlucky?

Thank you j.sylvain01@gmail.com