Forecasting the Winner of a Tennis Match Franc Klaassen University of Amsterdam (NL) Jan R. Magnus Tilburg University (NL) TST Congress, London July 29,

Slides:



Advertisements
Similar presentations
Unit 5 Sports. I.Speak out the names of sports in each picture baseballboxingswimming joggingtennisfootballbasketball.
Advertisements

1 Chapter 17 Adverbial Clauses The Williams Sisters.
Assignment 2 Observations
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Real life issues in the life of a fictional Subject Leader Game, set and match.
Jared Hockly - Western Springs College
Success – it’s not all about the hours ‘Purposeful Practice’
Kacper Gomoła 1e.  The Championships, The Wimbledon Championships or simply Wibledon, it have many names about world. Wimbledon is the oldest tennis.
© 2011 wheresjenny.com French Open. © 2011 wheresjenny.com French Open Rafael Nadal cruises into last 16 at French Open  Defending champion Rafael Nadal.
Empirical Analysis Doing and interpreting empirical work.
Disclaimer © 2015 John Straumann. All rights reserved. “Score It! Tennis” and other product names are or may be registered trademarks and/or trademarks.
Rendering of Realistic Grass (In Real Time) Sami Benzaid.
Game Playing CSC361 AI CSC361: Game Playing.
Brick Finding Ankara CM 2/4/2009 Dario Autiero. A large effort was put in the last months by a team of people in order to recover the events pending due.
Sampras v. Federer: Who is the best tennis player ever? Jeremy Polster & Justin Greenlee.
Stat 512 – Lecture 12 Two sample comparisons (Ch. 7) Experiments revisited.
Common Voting Rules as Maximum Likelihood Estimators Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science Department.
HELP Ministries – Final Presentation Team: Dave Hess, Ria Ghose, Bobby Lusk.
Forecasting a Tennis Match at the Australian Open
Disclaimer © 2015 John Straumann. All rights reserved. “Score It! Tennis” and other product names are or may be registered trademarks and/or trademarks.
Disclaimer © 2015 John Straumann. All rights reserved. “Score It! Tennis” and other product names are or may be registered trademarks and/or trademarks.
Business Intelligence Focus Groups June, Agenda Welcome Introductions Presentation on Business Intelligence Discussion Groups – Identifying Issues.
CS 6190 Finding a Research Topic. The Thesis Equation Topic + Advisor = Dissertation.
The t-test Inferences about Population Means when population SD is unknown.
A Principled Study of Design Tradeoffs for Autonomous Trading Agents Ioannis A. Vetsikas Bart Selman Cornell University.
1 Adversary Search Ref: Chapter 5. 2 Games & A.I. Easy to measure success Easy to represent states Small number of operators Comparison against humans.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Fundamentals of Game Design, 2 nd Edition by Ernest Adams Chapter 16: Sports Games.
Estimation and Hypothesis Testing. The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for.
Game Trees: MiniMax strategy, Tree Evaluation, Pruning, Utility evaluation Adapted from slides of Yoonsuck Choe.
Talks & Statistics (wrapping up) Psych 231: Research Methods in Psychology.
The possible outcomes are 2 + 2, 2 + 3, 2 + 4, 3 + 2, 3 + 3, 3 + 4, 4 + 2, 4 + 3, The probability of an even sum is ____. The probability of an.
Infusion - Unit 6 InFusion
Minimax Trees: Utility Evaluation, Tree Evaluation, Pruning CPSC 315 – Programming Studio Spring 2008 Project 2, Lecture 2 Adapted from slides of Yoonsuck.
Process Flowsheet Generation & Design Through a Group Contribution Approach Lo ï c d ’ Anterroches CAPEC Friday Morning Seminar, Spring 2005.
Modeling Resource Sharing Dynamic of VoIP users Over a WLAN Using a Game-Theoretic Approach Presented by Jaebok Kim.
TENNIS.
Psychology 202a Advanced Psychological Statistics September 1, 2015.
1 CSC 221: Computer Programming I Spring 2010 interaction & design  modular design: roulette game  constants, static fields  % operator, string equals.
I.The Writing Process overview. The Process of writing is a process of thinking It is not simply a sequence of steps a sequence of steps.
VB Games: Preparing for Memory Brainstorm controls & events Parallel structures (again), Visibility, LoadPicture, User-defined procedures, Do While/Loop,busy.
Maths on the move Cookie Count Room 13 – Year 2 Class 2007.
Revising your Paper. What is Revision? Revision literally means to “see again.” When you revise, you look at something from a fresh, critical perspective.
Short Read Mapper Evan Zhen CS 124. Introduction Find a short sequence in a very long DNA sequence Motivation – It is easy to sequence everyone’s genome,
By: Alicia Garcia.  Qualities ◦ “being able to serve hard enough so that the other person cant serve it back. It was all about placement” ◦ Being quick.
1 Markov Decision Processes Infinite Horizon Problems Alan Fern * * Based in part on slides by Craig Boutilier and Daniel Weld.
Roger Federer VS Rafael Nadal
Sample: Probability “Fair Game” Project (borrowed from Intel®, then adjusted)
ETM U 1 Analysis of Variance (ANOVA) Suppose we want to compare more than two means? For example, suppose a manufacturer of paper used for grocery.
IDEA STUDENT EVALUATION REPORTS Insight Improvement Impact ® Using IDEA as a Tool for Reflection about Your Teaching Raritan Valley Community College January.
© 2013 Catalyst The Case for Incrementality How to Measure the Real ROI of Your Marketing Programs By Marc Solomon, Director of Analytics, Catalyst 6/3/2013.
1 Some Guidelines for Good Research Dr Leow Wee Kheng Dept. of Computer Science.
Your Logo Here Do You Know Your Odds? Presented by: Your Name Here.
De-anonymizing Genomic Databases Using Phenotypic Traits Humbert et al. Proceedings on Privacy Enhancing Technologies 2015 (2) :
IntroductionCB and MUResultsConclusions IASE Annual conference. Gijón May 9-10, 2008 Data Measuring Competitive Balance and Match Uncertainty in Professional.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
World Congress on National Accounts and Economic Progress, Washington, D.C., May 2008 Capital and the National Accounts Paul Schreyer, OECD.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
© 2015 albert-learning.com French open FRENCH OPEN Founded1891; 124 years ago LocationParis,France.
MCMC Output & Metropolis-Hastings Algorithm Part I
Psychology 202a Advanced Psychological Statistics
Rules and regulations of tennis
Preface to the special issue on context-aware recommender systems
Lecture 12.
National Dong Hwa University
Level 3 1st4sport Certificate in Coaching Tennis
JMP 11 added new features and improvements to CCB and MSA.
Bell Ringer: Jan. 7, 2019 Which of the following is equivalent to (2x - 6y) + (8x - 5y)? F. 6x + y G. 10x - 11y H. 7x + 2y J. 8x - 5y K. 2x + 6y.
Using Microsoft Excel to model a tennis match
Forecasting a Tennis Match at the Australian Open
Presentation transcript:

Forecasting the Winner of a Tennis Match Franc Klaassen University of Amsterdam (NL) Jan R. Magnus Tilburg University (NL) TST Congress, London July 29, 2003

Overview Forecasting: one aspect of a larger tennis project Motivation for forecasting How to compute forecasts during a match? Forecasting in practice: graph of the 2003 Ladies’ Singles Wimbledon final Robustness of the graph Conclusion.

Tennis project Testing hypotheses (six papers): 7 th game is the most important game in a set: false Real champions win the big points: true. Service strategy (in progress): How to choose the strengths of 1 st and 2 nd services to maximize the probability of winning a point? Rule changes (one paper): How to reduce the service dominance? Presented at TST-1. Forecasting (two papers): Forecasting winner while match is in progress: TST-2.

Motivation for forecasting Forecasting the winner of a tennis match: Before a match Using odds from bookmakers Using statistical model, e.g., Boulier and Stekler (1999) Clarke and Dyte (2000) During a match Using statistical model  focus of our paper.

Why forecasting during match? TV spectators want information on: 1. Which player leads at this moment? 2. Who is most likely to win the match? 3. How did the match develop up to now (momentum, winning mood)?

TV spectators get info on Score: gives info on 1 (Leader): Yes 2 (Likely winner): Partially 4-6 for Agassi-Hewitt => Hewitt will probably win, 4-6 for Agassi-Henman => Agassi will still be the favorite 3 (Development up to now): Partially 5-5 can result after 4-4 (match in balance), but also after 5-0 (one player is in a winning mood)  Room for improvement regarding 2 and 3.

TV spectators also get info on Match/set stats (%1 st serve in,...): give info on 2 (Likely winner): Not much 3 (Development up to now): Partially Comparison of 2 nd set with 1 st set statistics gives some insight, but each statistic is too aggregate to give a clear picture. Note: summary stats provide detailed info on specific aspects of each player  useful, but beyond scope of our paper.  Still room for improvement regarding 2 and 3  Purpose of current paper.

Idea Present the probability that a player will win match; update it as match unfolds (real-time forecasting). Example: Agassi-Hewitt At start of match:Agassi wins with prob. 60% At 4-6:Agassi wins with prob. 30% At 4-6/0-3:Agassi wins with prob. 20%. Use graph to visualize the probs. of all points till now.

How to compute the forecasts during a match? Suppose: match between players A and B. Goal: Prob{A wins match} at each point up to now. This probability depends on 2 inputs (besides score): Prob{A wins match} at start of match Prob{A wins point on serve}+Prob{B wins point on serve}. Implementation using our computer program TENNISPROB: Choose the two inputs before the match and keep them constant Type in the score at each point  TENNISPROB gives Prob{A wins match} very quickly.

How to choose the two inputs? Prob{A wins match} at start of match We provide an estimate based on rankings (e.g., 80%), but one can easily improve/overrule that estimate if one has specific other info (injury problems, specific ability of surface,...) (e.g., 70%)  In the end there is one starting point of the graph (70%). Prob{A wins point on serve}+Prob{B wins point on serve} We provide an estimate based on rankings (e.g., 120%: both players win 60% of their points on service) No need for adjustment: the graph hardly depends on our choice  There is an estimate (120%).

Forecasting in practice: Serena-Venus Williams at Wimbledon 2003 Before the match starts, we choose inputs: Prob{Serena wins match}= 70% Prob{Serena wins point on serve}+ Prob{Venus wins point on serve}= 116%. Then the match starts and graph builds up Note:match has not yet been completed  graph does not use info on later points!

set 1

set 2

set 1 set 2set 3

set 1 set 2set 3

set 1 set 2set 3

Robustness of the graph Our choices for the two input probabilities may be not perfectly correct; is that a problem?  Does profile change a lot if one chooses: Starting probability:60% or 80% instead of 70%? Prob{Serena wins point on serve}+Prob{Venus wins point on serve}: 110% or 120%instead of 116%?

set 1 set 2set 3

set 1 set 2set 3

Conclusion We have introduced a robust method to forecast winner of match as match unfolds New: existing papers focus on forecasting at start of match, while we do it also for matches in progress Info on who will win match and on development of match till now Single line makes the information visible at a glance Graph can be generated instantly and for any match (not just at Wimbledon)  Graph is useful in addition to score & summary statistics. Potential application: present graph during change of ends  TV commentator can discuss match developments so far (turning points,..)

Future research So far: two input probs. are kept fixed during match: updating may improve graph, but value-added is unclear. Other aspects of tennis project: Service strategy Development of tennis over time: Has return indeed improved? In what respects has the women’s game changed? Differences between Wimbledon and other tournaments: Impact of surfaces: grass, clay, hard court  Need more data on grand slam/ATP/WTA tournaments.

Probability S. Williams wins match Point number set 1 set 2set 3