Controlling for Context S. Burtch © 2014. Traditional Points Plus/Minus Faceoffs Real Time Stats (hits, blocked shots, takeaways/giveaways) Ice Time.

Slides:



Advertisements
Similar presentations
1.What is Pearson’s coefficient of correlation? 2.What proportion of the variation in SAT scores is explained by variation in class sizes? 3.What is the.
Advertisements

Hockey. Point awarded to a player or players for helping set up a goal; usually the last two offensive players to handle the puck prior to a goal being.
Chapter 8 Linear regression
Chapter 8 Linear regression
Hockey Terminology & Strategy Basics for Parents.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Improving Puck Possession Time & it’s impact on metrics Presented by: Josh Brandwene, Penn State Women’s Hockey.
Introduction Offensive strategies of the National Football League have seemingly shifted towards a “West Coast” style offense, relying more heavily on.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
MULTIPLE REGRESSION. OVERVIEW What Makes it Multiple? What Makes it Multiple? Additional Assumptions Additional Assumptions Methods of Entering Variables.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Describing the Relation Between Two Variables
Chapter 10 Simple Regression.
The Best Investment in this Economy (Safer than the S&P) Ana Burcroff Kathleen Fregeau Brett Koons Alistair Meadows March 3 rd 2009 – Team 7.
TRIVIA GAME. Question 1 When does a referee use the line change procedure?
Choose your own adventure: Hockey edition! By Joseph Reynolds.
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
Linear Regression and the By: Nancy Thach & Alexis Ammons July 11, 2011.
By: Jim Ogle Charles Henkel Kyle Marcangelo.  We are trying to find what team would win in a hypothetical game based on their stats for the
Chapter 16 Data Analysis: Testing for Associations.
Section 3.2C. The regression line can be found using the calculator Put the data in L1 and L2. Press Stat – Calc - #8 (or 4) - enter To get the correlation.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Unit 4 Lesson 6 Demonstrating Mastery M.8.SP.2 To demonstrate mastery of the objectives in this lesson you must be able to:  Know that straight lines.
Expanding Plus- Minus for Visual and Statistical Analysis of NBA Box-Score Data IEEE VIS 2013 Workshop on Sports Data Visualization Rob Sisneros, Mark.
Skill, Coaching, or Randomness: What Goes Into Open-Play Performance in the Offensive, Defensive, and Neutral Zones Josh Smolow
Residual Plots Unit #8 - Statistics.
LEAST-SQUARES REGRESSION 3.2 Role of s and r 2 in Regression.
…. a linear regression coefficient indicates the impact of each independent variable on the outcome in the context of (or “adjusting for”) all other variables.
Empowering analytics through data capture Cary Moretti, CTO | HockeyTech Ottawa Hockey Analytics, Carleton University, Ottawa the world’s premier provider.
What Is The Most Important Statistic In The NHL? Does a higher Corsi%, Save %, or Shot % increase your chances of winning a Stanley Cup?
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
The Next Frontier: How x/y tracking data is a game-changer for NHL analytics Presented by:Marc Appleby President, Solution Architect PowerScout Sports.
Predicting Free Agent Salaries with Traditional and Advanced Metrics Matt puckplusplus.com hockey-graphs.com.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Lopez – MA 276 MA 276: Sports and statistics Lecture 2: Statistics in baseball 0.
Predicting the NHL Playoffs Daniel Boucher Tarek Bos-Jabbar.
CHAPTER 3 Describing Relationships
Jeremy Sylvain & Michael Schuckers
CHAPTER 3 Describing Relationships
Multilevel Modeling in Hockey Analytics: Untangling Individual and Team Performance In Even-Strength, Power Play, and Short Handed Situations Sophie Jablansky.
Passing Networks in Hockey RIT Analytics Conference 2015 S Burtch
Section 3.2: Least Squares Regression
Goalies Are Voodoo. 1.
The Use and Effectiveness of the Dump and Change in the NHL
NHL Player Development Modelling
School of Computing Science
A Model Based Approach to Injuries
Day 13 Agenda: DG minutes.
residual = observed y – predicted y residual = y - ŷ
Stupid Goalie Stuff Nick Mercadante
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Game Changing Hockey Intelligence
. 17/01/2019 Team: Physical rating: Hard Date:
Maximizing NHL Player Usage Using a Linear Optimization Model
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Why is it so hard to evaluate individual defensive play?
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Algebra Review The equation of a straight line y = mx + b
CHAPTER 3 Describing Relationships
Presentation transcript:

Controlling for Context S. Burtch © 2014

Traditional Points Plus/Minus Faceoffs Real Time Stats (hits, blocked shots, takeaways/giveaways) Ice Time

Rate Statistics Possession Metrics (On-Ice) Shots For / Against Shot Attempts For / Against(aka Corsi) Unblocked Shot Attempts For / Against (aka Fenwick) Shot Attempt Differentials Shot Attempt Percentages

Not all minutes/situations are created equal – we are aware of Contextual impacts resulting from usage. We’re working within a dynamic system – teasing out impacts of individuals will never be a simplistic process.

Zone Starts Time on Ice Quality of Teammates Quality of Competition (more on this later) Face Off Wins/Losses Aging Score Effects Time Effects (in game) Shot Type / Location

Eyeballing all the various components REL Measures (On Ice - Off Ice differential) - Desjardins WOWY Charts – Tango (via Johnson) Usage Charts – Vollman Aging Trajectories / DELTA – Tango Heavy Lifter Index - Poplichak Score Adjusted Metrics – Tulsky Zone Start Adjusted Metrics – Tulsky Hextally – Thomas / Ventura

THoR – Schuckers Expected Goals – Parkatti/Pfeffer Delta SOT – Awad Goals Above Baseline – Thomas / Ventura delta Corsi – Burtch

Need to assess defenders better Defenders don’t seem to have a massive amount of impact on SV% or SH% Defenders DO seem to repeatably impact on Shot Attempt Differentials Initially patterned after HLI (Poplichak) – but specific to D men Unhappy with index values / basis for comparison / weighting Shifted to a regression model to predict outcomes Multivariate Linear Regression Model to predict Expected Corsi

Corsi For and Corsi Against are very weakly correlated – Offense does NOT equate to Defense at the team or individual skater level. This means we should regress CF and CA separately Position is Relevant (F and D impact CF/CA differently) QoT effects are very significant QoC effects are marginal to nonexistent This aligns with the work done by Tulsky and Johnson TOI is a significant correlate to results (better players tend to play more) Zone Starts are significant Faceoffs matter – but less than people think

Yearly Team Effects are very significant This is likely an indication of team quality / systems / coaching Age effects are hard to detect but appear to be present CONTEXT MATTERS VERY MUCH – It Explains Over 64% of observed results for Corsi.

dCorsi represents the residual (differential) between a player’s Observed Corsi and their Expected Corsi resulting from the discussed regression. This is an improvement on Corsi REL because it is determined directly from contextual factors while the player is ON the ice (the OFF Ice results aren’t weighted equivalently to the other factors).

1 SD of dCorsi for NHL skaters is ± , population mean µ = dCorsi is repeatable at a higher level than individual skater SH% or goaltender SV%. Year to Year 5v5 SV% for goalies with 500+ mins has an autocorrelation coefficient of r = , which translates to an r 2 of – the prior year explains 0.4% of the following year’s result. The r 2 year over year for Expected Corsi exceeds 43%, and for dCorsi is approximately 15%. dCorsi accounts for yearly team effects so players are not impacted negatively/positively by transitioning from one team to another year to year. Impacts of Coaching Effects Can Plausibly Be Observed Year to Year.

Tableau Visualizations have been created for tracking individual skaters and to make team comparisons. =yes yes THANKS!