Lopez – MA 276 MA 276: Sports and statistics Lecture 2: Statistics in baseball 0.

Slides:



Advertisements
Similar presentations
Leveraging Big Data to Develop Next Generation Demand Side Management Programs and Energy Regulations Daniel Young, Energy Solutions Mike McGaraghan, Energy.
Advertisements

Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Baseball Statistics By Krishna Hajari Faraz Hyder William Walker.
Optimization of Batting Order Frank R. Zheng. A Quick Introduction to Baseball  Two teams alternate batting and fielding.  Batting team tries to score.
PREDICTING MLB CAREER SALARIES Stephanie Aube Mike Tarpey Justin Teal.
WAR on statistics Channing Burgess, Brandon Tiong, Daniel Smith Team TATUM.
James Parsons Mathematics Numerical Analysis Applied Numerical Analysis Sabermetrics.
CSE 219 COMPUTER SCIENCE III PROJECT INTRODUCTION: A FANTASY BASEBALL DRAFT KIT.
Baseball is a bat and ball sport played between two teams of nine players each. The goal is to score runs by hitting a thrown ball with a bat and touching.
SCOREKEEPING CLINIC Presented by: Anita Arnold 2/19/2015.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Simple Linear Regression NFL Point Spreads – 2007.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Two Sample Project Example 5/6/2013 Ms. Browne made this up Saber metrics: TX Rangers vs. SF Giants.
Sales Management Sales Forecasting Topic 13. Sales Forecasting What is it? Why do it? Qualitative vs Quantitative Goal = Accuracy Commonly Done by Marketing.
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
Moneyball in the Classroom Using Baseball to Teach Statistics Josh Tabor Canyon del Oro High School
Linear Trend Lines = b 0 + b 1 X t Where is the dependent variable being forecasted X t is the independent variable being used to explain Y. In Linear.
STAT E100 Section Week 3 - Regression. Review  Descriptive Statistics versus Hypothesis Testing  Outliers  Sample vs. Population  Residual Plots.
Sabermetrics- Advanced Statistics in the MLB. More On Base Percentage (OBP) measures the most important thing a batter can do at the plate: not make.
Case 2: Assessing the Value of Alex Rodriguez Teresa Sonka Gail Bernstein.
Hitting One Out of the Park Presentation by: Richie Veihl Derek Monroe.
Using Relationships to Make Predictions Pythagorean Formula: Predicted Winning Percentage.
MONEYBALL THE ART OF WINNING AN UNFAIR GAME. As a Player:  Five skills player  Can’t miss star  Regarded as a failure As a GM:  Third lowest salaries.
UCLA Team Offense. Offensive Philosophy  Get on Base  Create Scoring Opportunities  Quality AB’s in Crunch Time Situations.
Aim: Review for Exam Tomorrow. Independent VS. Dependent Variable Response Variables (DV) measures an outcome of a study Explanatory Variables (IV) explains.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Reminder Remember that both mean and standard deviation are not resistant measures so you want to take that into account when calculating the correlation.
Unit 4 Lesson 6 Demonstrating Mastery M.8.SP.2 To demonstrate mastery of the objectives in this lesson you must be able to:  Know that straight lines.
12/23/20151 Competition Information Packet. 12/23/ Goals of the Challenge The 2on2 Challenge was designed to create a skills competition that enhances.
Chapter 8: Simple Linear Regression Yang Zhenlin.
 A GM (general manager) essentially runs a baseball team  They are responsible for all the decisions a team makes with its roster.
Lopez – MA 276 MA 276: Sports and statistics Lecture 1: What is sports and statistics? 0.
Inference for Regression
Comparative Advantage and Specialization Sports and Trade
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
Famous Baseball Players By: Emily. Jackie Robinson Facts *Played for the Brooklyn Dodgers *Played *Won 4 awards Rookie of the Year, Most Valuable.
Scorekeeping Guide BASIC BATTING STATISTICS At Bats (AB) = Plate appearances minus [ BB + HBP + SB + SF + CI ] Hits = 1B + 2B + 3B + HR Batting Avg. =
Simple Linear Regression Relationships Between Quantitative Variables.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Statistics in Baseball How do players perform under different counts?
At Bats Hits Runs Doubles Triples Home Runs RBI’s Walks Batting Average Strikeouts.
The statistics behind the game
By: Jonathan Judge GLASC 2017
Purpose of Sports? Baseball vs. Football
Power and Multiple Regression
The Baltimore Orioles, Relationship of Wins and Loses, Batting Average, Earned Run Average, and Errors Stalanic Anu, Matthew Beeman, Jonathon Chudoba,
Baseball Overview Group Name: oneM2M TP30 Source: ATIS
Chapter 11: Simple Linear Regression
Building Linear Models from Data
Purpose of Sports? Baseball vs. Football
Math in Science In science we use the metric system to make measurements in the lab The basic unit of the metric system include: Gram (mass) Liter (volume)
Displaying and Describing Categorical Data
The statistics behind the game
I271B Quantitative Methods
Purpose of Sports? Baseball vs. Football
Examining Relationships in Data
What are models and simulations?
The Math of Baseball Will Cranford 11/1/2018.
Purpose of Sports? Baseball vs. Football
Purpose of Sports? Baseball vs. Football
Adam Smith on Trade From An Inquiry in the Nature and Causes of the Wealth of Nations (1776) “It is a maxim of every prudent mast of a family, never to.
A-Rod: Signing the Best Player in Baseball
Homoscedasticity/ Heteroscedasticity In Brief
Adam Smith on Trade “It is a maxim of every prudent mast of a family, never to attempt to make at home what will cost him more to make than to buy.
Homoscedasticity/ Heteroscedasticity In Brief
Science Fair – Baseball
Exploring Numerical Data
Jump-Shot Or Drive? (Using Mixed Strategy Nash Equilibria to Predict Player Behavior) By Patrick Long.
Presentation transcript:

Lopez – MA 276 MA 276: Sports and statistics Lecture 2: Statistics in baseball 0

Lopez – MA 276 Goals i)Overview of sabermetrics: What to look for? ii)Example: Runs created iii)Additional topics after lecture -Bunting/pitchouts, Pitch framingBunting/pitchoutsPitch framing -Defensive independent pitchingDefensive independent pitching 1 Tools i)Bivariate tools: scatter plots, r, R-squared ii)In-sample versus out-of-sample comparisons

Lopez – MA 276 What is sabermetrics? ‘Search for objective knowledge about baseball’ -Bill James Ex: Which player on the Red Sox contributed most to his team’s offense? Ex: Which player is your favorite? Ex: Which player deserves the MVP award? 2

Lopez – MA 276 Questions we’ll want to answer 1 – Is the metric important to success? 2 – How well does the metric measure a player’s contribution? 3 – Is the metric repeatable? 3

Lopez – MA 276 Is the metric important to success? What’s “important?” What’s “success?” -Examples in baseball Stolen bases Batting average Home runsWalks RBIsSlugging percentage 4

Lopez – MA 276 How well does the metric measure a player’s contribution? Stolen bases Batting average Home runsWalks RBIsSlugging percentage Which are impacted by a player’s teammates? Which are impacted by a player’s ballpark? Which are impacted by a player’s coach? Which are impacted by a player’s era? 5

Lopez – MA 276 Is the metric repeatable? Stolen bases Batting average Home runsWalks RBIsSlugging percentage How to judge repeatable? Why is repeatability (?) important? How does sample size fit in? 6

Lopez – MA 276 Ex: Runs created 7 Why runs created?

Lopez – MA 276 Ex: Runs created 8 General assumptions & expectations Different valuations to different types of hits Hitters only control their performance -What is assumed here? Hitters do not control when they hit Hitters do not control importance of at-bat relative to game’s outcome

Lopez – MA 276 Ex: Runs created 9

Lopez – MA 276 Ex: Runs created 10 Benefits of runs created Team level accuracy: - Basic version can predict a team’s run total within a 5% margin of error Individual talent: - Reflects individual performance only Repeatability? - To be determined in Thursday’s lab.

Lopez – MA 276 Ex: Runs created 11 Weaknesses of runs created What if clutch exists? Ballpark dependencies Opponent dependencies

Lopez – MA 276 Ex: Runs created 12 What’s it look like?

Lopez – MA 276 Ex: Runs created 13

Lopez – MA 276 Ex: Runs created 14 How do we describe the association between runs created and actual runs?

Lopez – MA 276 Ex: Runs created 15 What about the association between team runs and other team variables? Note: What does the select command do?

Lopez – MA 276 Ex: Runs created 16

Lopez – MA 276 Ex: Runs created 17 What about runs created against more popular but advanced metrics?

Lopez – MA 276 Ex: Runs created 18

Lopez – MA 276 What we’ve shown 1 – Is runs created important to success? -Yes. Strong link to team runs 2 – How well does the metric measure a player’s contribution? -Pretty well. Other advanced formulas exist -Adjustments possible 3 – Is the metric repeatable? -Let’s find out 19

Lopez – MA 276 Ex: Runs created 3 – Is the metric repeatable? Explanatory power vs. Predictive power 20

Lopez – MA 276 Ex: Runs created 3 – Is the metric repeatable? 21

Lopez – MA 276 Ex: Runs created 3 – Is the metric repeatable? 22

Lopez – MA 276 Ex: Runs created Implications: Other tools for assessing error: MSE: MAE: 23

Lopez – MA 276 Additional topics 24 Bunting, pitchouts Pitch framing Defensive independent pitching 1 – Importance 2 – Player-specific contributions 3 – Repeatability