Waging WAR: MLB Player Valuation Using Advanced Metrics

Slides:



Advertisements
Similar presentations
The “Sophomore Slump” Mike Kalis, Joe Hultzen, James Asimes.
Advertisements

Baseball Statistics By Krishna Hajari Faraz Hyder William Walker.
The Underpayment of Restricted Players in North American Sports Leagues DANIEL CIANCIOSO ARTICLE WRITTEN BY: ANTHONY KRAUTMANN, PETER VON ALLMEN AND DAVID.
Introduction Offensive strategies of the National Football League have seemingly shifted towards a “West Coast” style offense, relying more heavily on.
Chapter 6.  Just remember Karl “Carl” Pearson  Let’s run some correlations!  Analyze  Correlate ▪ Bivariate.
Crash Course in Correlation and Regression MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central.
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
2014 Milwaukee Brewers Franchise/Roster Evaluation By: Philip Sophinos, Professor: James Wible 2014 Milwaukee Brewers Franchise/Roster Evaluation By: Philip.
The Lognormal Distribution
CHAPTER 18 Models for Time Series and Forecasting
Non-continuous Relationships If the relationship between the dependent variable and an independent variable is non-continuous a slope dummy variable can.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
STAT E100 Section Week 3 - Regression. Review  Descriptive Statistics versus Hypothesis Testing  Outliers  Sample vs. Population  Residual Plots.
Sabermetrics- Advanced Statistics in the MLB. More On Base Percentage (OBP) measures the most important thing a batter can do at the plate: not make.
1919 New York Yankees Official Logo William Brennan Sports Finance February 6, 2014.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Measures of variability: understanding the complexity of natural phenomena.
Race and the Evaluation of Signal Callers in the National Football League David J. Berri (California State University, Bakersfield) Rob Simmons (Lancaster.
Economic and team performance data have different effects on different elements of ticket demand Unemployment is a strong predictor of attendance but not.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
LILA BERRY MACKENZIE BLONDIN ANGELA SCHONDEK FACTORS CORRELATED WITH 2013 STARTING PITCHERS’ SALARIES.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Class 13, October 15, 2015 Lessons 2.7 & 2.8.  By the end of this lesson, you should understand that: ◦ Each statistic—the mean, median, and mode—is.
Predicting Free Agent Salaries with Traditional and Advanced Metrics Matt puckplusplus.com hockey-graphs.com.
Was Babe Ruth Underpaid?. SWBAT O Define Consumer Price Index O Understand why it is important to use constant dollar figures to make comparisons O Understand.
Diversification, risk, return and the market portfolio.
Progress Report #2 By Adam Rothstein and Jesse Cox.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Heteroscedasticity Chapter 8
Waging WAR: MLB Player Valuation Using Advanced Metrics
Chapter 8 Linear Regression.
CHAPTER 3 Describing Relationships
Competitive Balance and Attendance in the MLB
3.2 Least-Squares Regression
CHAPTER 3 Describing Relationships
Inflation Part 3.
Earnings and Discrimination
Earnings and Discrimination
3.2 (part 1)
Lecture 1: Measures of Location
Section 9.2: Data Distributions and Outliers
Theories of Income Distribution
Theories of Income Distribution
Analysis of MLS Season Data Using Poisson Regression with R
The Consumer Price Index and Government Statistics
residual = observed y – predicted y residual = y - ŷ
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Undergraduated Econometrics
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Warm Up The table below shows data on the number of live births per 1000 women (aged years) from 1965 to (Hint: enter the year as the years.
© 2007 Thomson South-Western
Heteroskedasticity.
An Introduction to Statistical Process Control
Product moment correlation
CHAPTER 2 Modeling Distributions of Data
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Naming the top three pitching assets in baseball
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Earnings and Discrimination
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Methods Results Conclusion Introduction Research Question References
Do Revenues Effect Success Among Professional Sports Teams?
Business Statistics - QBM117
CHAPTER 3 Describing Relationships
Exponential Smoothing
Presentation transcript:

Waging WAR: MLB Player Valuation Using Advanced Metrics By Alexander Knorr, Sebastian Kadamany and Nick Vedder

Who we are and what motivates us UCD Economic Master’s Students Curious about how goods are valued and priced (Microeconomics) Using the idea of market value to identify disparity between current prices and competitive pricing

Value in the setting of MLB players Identifying players who most outperform their current salaries We estimate what players should be earning to identify price disparity We calculate a straightforward index to identify who outperforms their contract the most in (relative terms)

Data and WAR Used Baseball Reference for all variables: WAR, age and salary Unfortunately was not available for Minor Leagues (we’re too poor to purchase that data!) Used recent career statistics (2010 – 2014) which has 145,109 player – season observations Collapsed and summarized Data Salary as of 2014 Median WAR statistic Age as of 2014

Wins Above Replacement (WAR): Why reinvent the wheel? WAR isolates the individual effect a player has on winning, holding team, season and even stadium factors constant Identifies relative performance to an “average” replacement player

Model Specification (1) Stratified players by position (Fielder or Pitcher) Transformed Salary (dependent variable) using logarithms to normalize and provide relative interpretations Removed players who were earning exactly the 2010 rookie minimum to reduce the skew of salary Log Transform for Fielders

Model Specification (2) Used Linear Regression for estimating “true” salary WAR will drive true value of a player Experience terms will control for player experience as that is not explicitly accounted for in WAR Age start will attempt to temper the power of experience in predicting salary. Controlling for players “past their prime”

Results of Salary Estimation Fielders Pitchers

Aside: Controlling for High Correlation Between Experience and Salary By using the squared experience term, we allowed for players to lose predicted salary as they “passed their prime”

Ranking Simply calculated the ratio of predicted player salary to actual salary as of 2014 Creates the Salary Arbitrage Index (SAI) Rank by SAI

Conclusions Ranking the top 100 players by traditional statistics or even WAR does not identify opportunity The very best players are typically already paid appropriately Management always wants to find efficiency “Biggest bang for the buck” SAI was directly developed to identify opportunity in player prospecting