Maximizing NHL Player Usage Using a Linear Optimization Model

Slides:



Advertisements
Similar presentations
MA-250 Probability and Statistics
Advertisements

Chapter 17 Overview of Multivariate Analysis Methods
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: What it Is and How it Works Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
A Bayesian hierarchical modeling approach to reconstructing past climates David Hirst Norwegian Computing Center.
© Copyright 2000, Julia Hartman 1 An Interactive Tutorial for SPSS 10.0 for Windows © by Julia Hartman Multiple Linear Regression Next.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
AP Statistics Chapter 8: Linear Regression
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
C HAPTER 2 S CATTER PLOTS, C ORRELATION, L INEAR R EGRESSION, I NFERENCES FOR R EGRESSION By: Tasha Carr, Lyndsay Gentile, Darya Rosikhina, Stacey Zarko.
Lecture 14 Multiple Regression Model
Then click the box for Normal probability plot. In the box labeled Standardized Residual Plots, first click the checkbox for Histogram, Multiple Linear.
Simple Linear Regression. Deterministic Relationship If the value of y (dependent) is completely determined by the value of x (Independent variable) (Like.
Slide 8- 1 Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Non-Linear Regression. How would we be able to find the line of best fit for data that is supposed to be curved? This is non-linear regression! In most.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Linear Discriminant Analysis (LDA). Goal To classify observations into 2 or more groups based on k discriminant functions (Dependent variable Y is categorical.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
Review #2.
Unit 4 Lesson 6 Demonstrating Mastery M.8.SP.2 To demonstrate mastery of the objectives in this lesson you must be able to:  Know that straight lines.
SWBAT: Calculate and interpret the residual plot for a line of regression Do Now: Do heavier cars really use more gasoline? In the following data set,
Frap Time! Linear Regression. You will have a question to try to answer and I’ll give you 15 minutes. Then we will stop, look at some commentary then.
Math 4030 – 11b Method of Least Squares. Model: Dependent (response) Variable Independent (control) Variable Random Error Objectives: Find (estimated)
Curve Fitting Pertemuan 10 Matakuliah: S0262-Analisis Numerik Tahun: 2010.
1.What is Pearson’s coefficient of correlation? 2.What proportion of the variation in SAT scores is explained by variation in class sizes? 3.What is the.
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
LEAST-SQUARES REGRESSION 3.2 Role of s and r 2 in Regression.
Logistic Regression Saed Sayad 1www.ismartsoft.com.
Differential Equations Linear Equations with Variable Coefficients.
Bellwork (Why do we want scattered residual plots?): 10/2/15 I feel like I didn’t explain this well, so this is residual plots redux! Copy down these two.
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
Residuals. Why Do You Need to Look at the Residual Plot? Because a linear regression model is not always appropriate for the data Can I just look at the.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Correlation and Linear Regression Chapter 13 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
An Interactive Tutorial for SPSS 10.0 for Windows©
Warm-up Get a sheet of computer paper/construction paper from the front of the room, and create your very own paper airplane. Try to create planes with.
CHAPTER 3 Describing Relationships
Lecture 24 Multiple Regression Model And Residual Analysis
CHAPTER 3 Describing Relationships
Does adding a fuel additive help gasoline mileage in automobiles?
The following data represents the amount of Profit (in thousands of $) made by a trucking company dependent on gas prices. Gas $
Multiple Regression.
Understanding Standards Event Higher Statistics Award
12 Inferential Analysis.
(Residuals and
Week 5 Lecture 2 Chapter 8. Regression Wisdom.
Statistical Methods For Engineers
Polynomial Regression
Counting Loops.
CHAPTER 3 Describing Relationships
BA 275 Quantitative Business Methods
Residuals and Residual Plots
12 Inferential Analysis.
Linear Programming Example: Maximize x + y x and y are called
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
CHAPTER 3 Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Disorders of the Eye Human Body Systems
CHAPTER 3 Describing Relationships
Presentation transcript:

Maximizing NHL Player Usage Using a Linear Optimization Model Dawson Sprigings

What is wrong with a linear model? You can use any metric you would like - Corsi/xG/WAR

Is a linear model the right choice? Theoretical Residual Analysis Residual = Observed - Predicted Example: Fake model to predict how many goals a player will score Model Predicts -> 5 Player Actually Scores -> 7 Residual -> 2

Theoretical

2. Residual Analysis

2. Residual Analysis - Cumulative Residual Plot

Solution: Non-Linear Model You can use any metric you would like - Corsi/xG/WAR

Finding a Time-On-Ice Cutoff You can use any metric you would like - Corsi/xG/WAR

Polynomial Regression

Basic Linear vs. Polynomial Regression

Linear vs. Polynomial - Cumulative Residual Plot

Should you play your best players together? Short Answer: NO Better Answer: It depends 7 Models All - Average 3 Bad Players 2 Bad Players 1 Bad Player 3 Good Players 2 Good Players 1 Good Player Good = CF%RelTM > 1 SD Bad = CF%RelTM < -1 SD

Good Player - Brad Marchand

Bad Player - Zac Rinaldo

Average Player - David Pastrnak

Diminishing Returns Model: Dependent Variable Actual Line CF% Independent Variable Estimated Line CF% Est. Line CF% = (P1 CF%RelTM + P2 CF%RelTM P3 CF%RelTM)/3 Model Beta Coefficient 3 Good Players 0.51 2 Good Players 0.87 1 Good Player 1.3 All Average 1.58 1 Bad Player 1.41 2 Bad Players 1 3 Bad Players 1.02

Linear Optimization Maximize under a certain set of constraints Gives us the “best” possible lines Can use “best” lines as a baseline to see how much value is being lost Unfair to compare actual lines to best Compare to “basic” model instead

Basic Model vs. New Model Uses straight line approach Output Sub-optimal lines Incorrect performance projections New Model Uses polynomial approach Uses 7 different models to account for line makeup “Best” lines Can apply to the lineup created by the basic model to give correct predictions

Basic Model vs. New Model - Effect Incorrect Projections Basic Model overestimated Minnesota Los Angeles Arizona Basic Model underestimated Florida Nashville Montreal Cost of Bad Lineups (Goals) Minnesota - 7.88 Los Angeles - 6.05 Carolina - 4.79 Boston - 3.25

Summary & Future Work Polynomial regression is more appropriate than standard linear model Try to spread talent throughout lineup to maximize impact Offense vs. Defense Improve the 7 models approach Apply different metrics xG / WAR

Thank You