Baserunner Scoring Percentage (BSP): an analysis using play-by- play data William Knapp and Dr. Jason A. Osborne, Dept. of Mathematics and Dept. of Statistics,

Slides:



Advertisements
Similar presentations
Methodology- Framework Homeruns Batting Average Base on Balls Runs Batted In Strikeouts Errors Double Plays Fielding % Salaries Performance.
Advertisements

Bunting and Sacrificing in MLB Lauren McNulty and Drew Jack.
© McGraw-Hill Higher Education. All Rights Reserved. Chapter 2F Statistical Tools in Evaluation.
Correlational Research 1. Spare the rod and spoil the child 2. Idle hands are the devil’s workplace 3. The early bird catches the worm 4. You can’t teach.
LINEAR REGRESSION: What it Is and How it Works Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
Chapter 8 Logistic Regression 1. Introduction Logistic regression extends the ideas of linear regression to the situation where the dependent variable,
Who Wins The Ironman? Best Swimmer, Cyclist, or Runner A Project Demonstration by Dr. Moffit.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Statistics. Overview 1. Confidence interval for the mean 2. Comparing means of 2 sampled populations (or treatments): t-test 3. Determining the strength.
STAT 104: Section 2 4 Oct, 2007 TF: Daniel Moon. TF Daniel Moon G2 (A.M. in Statistics Dept) Office hour:
1 BA 275 Quantitative Business Methods Simple Linear Regression Introduction Case Study: Housing Prices Agenda.
Reliability and Linking of Assessments. Figure 1 Differences Between Percentages Proficient or Above on State Assessments and on NAEP: Grade 8 Mathematics,
Linear Regression MARE 250 Dr. Jason Turner.
MARE 250 Dr. Jason Turner Correlation & Linear Regression.
Introduction to Linear and Logistic Regression. Basic Ideas Linear Transformation Finding the Regression Line Minimize sum of the quadratic residuals.
CORRELATIONAL RESEARCH I Lawrence R. Gordon Psychology Research Methods I.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Correlational Research Strategy. Recall 5 basic Research Strategies Experimental Nonexperimental Quasi-experimental Correlational Descriptive.
Biostat Didactic Seminar Series Analyzing Binary Outcomes: Analyzing Binary Outcomes: An Introduction to Logistic Regression Robert Boudreau, PhD Co-Director.
Graph the linear function What is the domain and the range of f?
4x + 2y = 18 a. (1,8) = 18 a. 4(1) + 2(8) = 18 b. (3,3)20 = 18 b. 4(3) + 2(3) = = = 18 15x + 5y = 5 a. (-2,7) NoYes b. (-1,4) a.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Biostatistics Unit 9 – Regression and Correlation.
C. A. Warm Up 1/28/15 SoccerBasketballTotal Boys1812 Girls1614 Total Students were asked which sport they would play if they had to choose. 1)Fill in the.
New Seats – Block 1. New Seats – Block 2 Warm-up with Scatterplot Notes 1) 2) 3) 4) 5)
Run the colour experiment where kids write red, green, yellow …first.
The Moneyball Effect: Teaching AP Statistics through Sports Paul Buckley Gonzaga College High School Washington, DC 1.
Biostat Didactic Seminar Series Correlation and Regression Part 2 Robert Boudreau, PhD Co-Director of Methodology Core PITT-Multidisciplinary Clinical.
Outline Class Intros Overview of Course & Series Example Research Projects Beginning R.
Scheduling the Optimal Baseball Line-up Stefanie Molin Christian Morales Sarah Daniels.
Part IV Significantly Different: Using Inferential Statistics
Scatterplot and trendline. Scatterplot Scatterplot explores the relationship between two quantitative variables. Example:
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Welcome to MM570 Psychological Statistics Unit 4 Seminar Dr. Srabasti Dutta.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
MARE 250 Dr. Jason Turner Linear Regression. Linear regression investigates and models the linear relationship between a response (Y) and predictor(s)
Section 4.2 Building Linear Models from Data. OBJECTIVE 1.
Logistic Regression. Linear Regression Purchases vs. Income.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Logistic Regression (Classification Algorithm)
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
Introduction to Statistical Modeling Wei Zhu Department of Applied Mathematics & Statistics State University of New York at Stony Brook
1.What is Pearson’s coefficient of correlation? 2.What proportion of the variation in SAT scores is explained by variation in class sizes? 3.What is the.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Welcome to MM570 Psychological Statistics Unit 4 Seminar Dr. Bob Lockwood.
Method 3: Least squares regression. Another method for finding the equation of a straight line which is fitted to data is known as the method of least-squares.
Chapter 4 Minitab Recipe Cards. Correlation coefficients Enter the data from Example 4.1 in columns C1 and C2 of the worksheet.
Simple linear regression and correlation Regression analysis is the process of constructing a mathematical model or function that can be used to predict.
4-5 Introduction to Scatter Plots 13 April Agenda What is a Scatter Plot? Correlation of Scatter Plots Line of Fit.
Scorekeeping Guide BASIC BATTING STATISTICS At Bats (AB) = Plate appearances minus [ BB + HBP + SB + SF + CI ] Hits = 1B + 2B + 3B + HR Batting Avg. =
MIS2502: Data Analytics Introduction to Advanced Analytics and R.
Multiple Regression Scott Hudson January 24, 2011.
Chapter 8 Linear Regression.
Scorekeeping Guide CPLL-Spring 2014
LEAST – SQUARES REGRESSION
Linear Regression Special Topics.
Regression Chapter 6 I Introduction to Regression
Building Linear Models from Data
2-7 Curve Fitting with Linear Models Holt Algebra 2.
Multiple Linear Regression
Sihua Peng, PhD Shanghai Ocean University
Correlation and Covariance
What’s the plan? First, we are going to look at the correlation between two variables: studying for calculus and the final percentage grade a student gets.
Cases. Simple Regression Linear Multiple Regression.
Bonus Slide!!! Things to “Carefully Consider”
Scorekeeping Guide Lacamas Little League--Spring 2009
Presentation transcript:

Baserunner Scoring Percentage (BSP): an analysis using play-by- play data William Knapp and Dr. Jason A. Osborne, Dept. of Mathematics and Dept. of Statistics, N.C. State University

Introduction What is BSP? Which teams have the best BSP? Effect of sacrifice bunts and stolen bases BSP as a predictor for runs scored and wins Methodology Linear and logistic regression Correlation

Data Acquisition Web-scraping and parsing with R and Java R functions: readLines() and regexp() Java: sort into arrays and change certain variables to binary

Analysis: Sacrifice and SB effects OutsBuntAttemptSBattemptEst. ProbStd Error 0No NoYes YesNo Yes No NoYes YesNo Yes No NoYes YesNo-- 2Yes --

Analysis: MLR on Runs and Wins Variables# of Predictorsrsqrpgrsqwlpct obp bsp hr lob obp bsp obp hr obp lob bsp hr bsp lob lob hr obp bsp hr lob

BSP vs Winning Percentage Scatter of all 30 teams over all 16 years Individual teams to show trends Houston needs some work