Introduction to Probability and Probabilistic Forecasting. Linking Science to Society. Simon Mason, International Research Institute for.

Presentation transcript:

Introduction to Probability and Probabilistic Forecasting
Linking Science to Society
Simon Mason, International Research Institute for Climate Prediction, The Earth Institute of Columbia University
AMS Short Course on Probabilistic Forecasting, San Diego, CA, January 9, 2005

Questions about the Future
We make forecasts to answer questions about the future:
- Will it rain in San Diego next weekend?
- Will it snow in San Diego next weekend?
- Will the Red Sox win the 2005 World Championship?
- Will Dick Cheney die a pauper?
- Will this surfer live to 50?

Probability
For most situations the future is uncertain. In cases where the answer to a question about the future is uncertain, we tend to use probabilities to express this uncertainty in the outcome.

Probability and Events
We refer to a specific outcome, or a specific combination of outcomes, as an event, and refer to the probability of this event. What do these terms mean?
event: a predefined outcome that forms the subject of a forecast (an outcome of interest). Examples:
- rain in San Diego this afternoon;
- tornado touch down anywhere in Iowa tomorrow;
- average LAX January temperature of below 10°C;
- NIÑO3 anomaly of more than +2°C by October;
- global warming of more than +1°C by 2050.

Probability and Events
Events can be:
- elementary (“hot”); or
- compound (“hot and dry”; rain two days in a row).
Events either occur or do not occur: there are only these two possible outcomes (but there may be some uncertainty as to whether the outcome has occurred). If the event does not occur, its complement occurs. Events need to be well-defined to avoid ambiguity. (What does “unfaithful” mean? What does “in San Diego” mean?)

Notation
An elementary event is often denoted by the letter $E$, and its complement by $\bar{E}$. To distinguish an elementary event from a second elementary event, subscripts may be used:
- first elementary event: $E_1$
- second elementary event: $E_2$
A compound event occurs when the first AND the second elementary event occur (or, more generally, when all elementary events occur): $E_1 \cap E_2$.

Probability and Events
Uncertainty is often expressed using phrases such as “it is likely”, “the chances are”, etc. Different degrees of uncertainty can be indicated: “possibly” indicates higher uncertainty than “probably”. Compare:
- will it rain in San Diego next weekend?
- will it snow in San Diego next weekend?
probability: a quantitative measure of the uncertainty in the event.

Probability and Uncertainty
Probabilities are used where there is uncertainty. Apart from ambiguity, there are two sources of uncertainty:
- our understanding is limited: we do not know for certain what will happen;
- there is some inherent randomness in the outcome: we cannot know for certain what will happen.

Probability and Uncertainty
But how can we quantify uncertainty?
- When the probability is 1, the event will definitely occur. It is impossible for the event not to happen.
- When the probability is 0, the event will definitely not occur. It is impossible for the event to happen.
- When the probability is between 0 and 1, the event may or may not happen.

Probability
- When the probability is close to 1, the event is more likely to occur than not to occur.
- When the probability is close to 0, the event is more likely not to occur than to occur.
- When the probability is 0.5, the event is just as likely to happen as not to happen.

Odds
When the probability of an event, $E$, is 0.75, the probability that the event will not happen (the complement of the event, $\bar{E}$) is: $P(\bar{E}) = 1 - 0.75 = 0.25$.
When the probability is 0.75, the event is three times more likely to happen than not to happen: $P(E) / P(\bar{E}) = 0.75 / 0.25 = 3$.
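The relationship between probability and odds described on this slide can be sketched in a few lines of Python. The function names are hypothetical and are not part of the original presentation; the only value taken from the slide is the probability of 0.75.

```python
def odds_from_probability(p: float) -> float:
    """Odds in favour of an event: P(E) divided by P(not E)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must satisfy 0 <= p < 1 (odds are undefined at p = 1)")
    return p / (1.0 - p)


def probability_from_odds(odds: float) -> float:
    """Inverse conversion: recover P(E) from the odds in favour of E."""
    if odds < 0.0:
        raise ValueError("odds must be non-negative")
    return odds / (1.0 + odds)


# The slide's example: a probability of 0.75 corresponds to odds of 3 to 1.
print(odds_from_probability(0.75))
print(probability_from_odds(3.0))
```

A probability of 0.5 gives odds of 1, matching the statement that such an event is just as likely to happen as not.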

Probability
But how do we determine how likely the event is compared to its complement? How do we obtain / calculate probabilities?

Interpretations of Probability: I
How do we obtain / calculate probabilities? What is the probability that it will rain in San Diego (at Lindbergh Field) on January 31, 2005? How often has it rained on the same day in previous years (1927 – 2003)? This relative frequency is the climatology.

Interpretations of Probability: I
What is the probability that it will rain in San Diego (at Lindbergh Field) on January 31, 2005?

Probability as Relative Frequency
What is the probability that it will rain in San Diego on January 31, 2005?
- Look for similar / identical situations.
- Repeat the experiment many times; only “unimportant” things are allowed to change.
Note that there may be sampling errors in the relative frequencies: uncertainty about the uncertainty! (The distribution of these sampling errors can be obtained using the binomial distribution.)
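As a rough illustration of the relative-frequency estimate and its binomial sampling error, the sketch below uses a hypothetical count of 17 wet days in a 77-year record. The count is an assumption chosen only because it gives a relative frequency near 0.22; it is not taken from the presentation, and the function names are likewise illustrative.

```python
import math


def relative_frequency(occurrences: int, trials: int) -> float:
    """Fraction of trials on which the event occurred."""
    if trials <= 0 or not 0 <= occurrences <= trials:
        raise ValueError("need 0 <= occurrences <= trials and trials > 0")
    return occurrences / trials


def binomial_standard_error(p_hat: float, trials: int) -> float:
    """Standard error of a relative frequency under the binomial model."""
    return math.sqrt(p_hat * (1.0 - p_hat) / trials)


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k occurrences in n independent trials."""
    return math.comb(n, k) * p**k * (1.0 - p) ** (n - k)


# Hypothetical record (assumption): 17 wet days out of 77 years.
wet_days, years = 17, 77
p_hat = relative_frequency(wet_days, years)
se = binomial_standard_error(p_hat, years)
print(f"relative frequency = {p_hat:.3f}, standard error = {se:.3f}")
```

The standard error shrinks only with the square root of the number of years, which is why a 77-year record still leaves noticeable uncertainty about the climatological probability. The binomial model also assumes that years are independent, which is itself an approximation.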

Interpretations of Probability: II
How do we obtain / calculate probabilities? What is the probability that it will rain in San Diego (at Lindbergh Field) tomorrow? How often has it rained on the same day in previous years with similar atmospheric conditions? Only unimportant things are allowed to change. This experiment has no precedent: today’s initial conditions are “important”, and they are unique.

Probability as Subjective Belief
The probability that it will rain in San Diego is best estimated by conditioning upon the current atmospheric state, a set of conditions that are unique.
- Make a forecast based on the physics of the atmosphere, and expert knowledge / experience.
- Produce an ensemble of forecasts based on sampling of known uncertainties in the physics of the atmosphere and / or in the initial conditions (Bright).
The probability now represents the degree to which we believe that it will rain in San Diego tomorrow.

Interpretations of Probability
So two interpretations of probability are:
- relative frequency interpretation: how often the event has occurred in similar situations in the past;
- subjective interpretation: how confident we are the event will occur this time.
But all probabilities could be defined as subjective because of the subjectivity in defining which situations are “similar”.

Probability as Relative Frequency
The 77-year climatology does not provide a good estimate of the probability that it will rain in San Diego tomorrow because there are some “important” differences between the 77 instances of January 10 and January 10, 2005. Sometimes we can improve upon climatological forecasts because of access to “important” information. But how do we know whether the information is important? Is the probability of the event different when these conditions are present compared to when they are not?

Conditional Probabilities
The relative frequency of rainfall on January 10 could be obtained by considering only those January 10s on which January 9 rainfall occurrence was the same as on January 9, 2005.
- If January 9 is wet: P(January 10 is wet)?
- If January 9 is dry: P(January 10 is wet)?
This conditional probability is different from the probability of a compound event, $P(E_1 \cap E_2)$, because we know that $E_2$ has (or has not) occurred already.
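The idea of restricting the record to years that match the known January 9 outcome can be sketched as follows. The ten-year record below is invented purely for illustration (the real 77-year series is not given in this transcript), and the function name is hypothetical.

```python
def conditional_wet_probability(previous_day_wet, target_day_wet, given_wet):
    """Relative frequency of a wet target day, using only the years in which
    the previous day's outcome equals `given_wet`."""
    matching = [
        target
        for previous, target in zip(previous_day_wet, target_day_wet)
        if previous == given_wet
    ]
    if not matching:
        raise ValueError("no years match the condition")
    return sum(matching) / len(matching)


# Invented ten-year record (assumption): True means rain was observed.
jan9 = [True, True, False, False, False, True, False, False, False, False]
jan10 = [True, False, False, False, True, True, False, False, False, False]

p_unconditional = sum(jan10) / len(jan10)
p_given_wet = conditional_wet_probability(jan9, jan10, given_wet=True)
p_given_dry = conditional_wet_probability(jan9, jan10, given_wet=False)
print(p_unconditional, p_given_wet, p_given_dry)
```

Conditioning shrinks the sample: only three of the ten invented years had a wet January 9, so the estimate conditioned on a wet day rests on far fewer cases than the unconditional one.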

Conditional Probabilities
[Venn diagram showing the compound event.]
For conditional probabilities, the outcome of January 9 is known already, and so the sample space is reduced.

Conditional Probabilities
What is the probability that it will rain in San Diego (at Lindbergh Field) on January 10, 2005, given that it has (or has not) rained on January 9, 2005?

Conditional Probabilities
What is the probability that it will rain in San Diego (at Lindbergh Field) on January 10, 2005, given that it has rained on January 9, 2005?

Conditional Probabilities
What is the probability that it will rain in San Diego (at Lindbergh Field) on January 10, 2005, given that it has not rained on January 9, 2005?

Updating Probabilities
Based on the occurrence of January 9 rainfall, the probability of rainfall on January 10 has been updated from 0.22. What if we now obtain a model forecast that states it will rain tomorrow, $E_3$? All we know about the model is that it has given a correct forecast 90% of the time over the last few days. How can we update our probability for rain tomorrow?

Bayes’ Theorem
What is the probability that it will rain in San Diego (at Lindbergh Field) on January 10, 2005, given that it has rained on January 9, 2005 AND that the model forecasts rain? (To simplify, conditions on $E_2$ are dropped.)

Bayes’ Theorem
All terms on the right are unknown. The priors, at least, are known …

Bayes’ Theorem
$P(E_3 \mid E_1)$ and $P(E_3 \mid \bar{E}_1)$ are likelihoods: they tell us how likely it is that rain was forecasted, assuming that it will / will not rain, respectively. Or: how often are rain days successfully forecasted / dry days unsuccessfully forecasted?

Bayes’ Theorem
We do not have exact values for the likelihoods on the right side of the equation, but if we assume that the model has no bias, given that it has been correct 90% of the time, we can infer that 90% of the rain days have been forecasted.
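A minimal sketch of the Bayesian update may help here. It takes the 90% figure from the slides as the likelihood of a rain forecast on rain days, and it additionally assumes that rain is wrongly forecasted on 10% of dry days; that second value is an assumption made for this example, not a number stated in the presentation. Because the updated prior is not preserved in this transcript, the climatological value of 0.22 is used purely for illustration. The function name is hypothetical.

```python
def posterior_rain_probability(prior: float, hit_rate: float,
                               false_alarm_rate: float) -> float:
    """Bayes' theorem for a binary event E1 (rain) and a forecast of rain E3.

    prior            : P(E1), probability of rain before seeing the forecast
    hit_rate         : P(E3 | E1), rain forecasted given that it rains
    false_alarm_rate : P(E3 | not E1), rain forecasted given that it stays dry
    """
    evidence = hit_rate * prior + false_alarm_rate * (1.0 - prior)
    if evidence == 0.0:
        raise ValueError("a rain forecast has zero probability under these inputs")
    return hit_rate * prior / evidence


# Illustrative inputs: prior of 0.22, likelihood of 0.9 from the slides,
# and an assumed false-alarm likelihood of 0.1.
posterior = posterior_rain_probability(prior=0.22, hit_rate=0.9,
                                       false_alarm_rate=0.1)
print(f"posterior probability of rain = {posterior:.3f}")
```

Under these assumed inputs the rain forecast raises the probability well above the prior, but not to 0.9: the posterior depends on the prior as well as on how often the model is right.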

Bayes’ Theorem
Bayes’ theorem allows us to update probabilities (posterior probabilities):
- The prior probabilities are simply the best estimate of the probabilities before considering the new information. They may already have been previously updated.
- The likelihoods indicate how likely the new information is, assuming a specific outcome. For example: how likely is it that the forecast would be for wet conditions, assuming that it is going to be wet / dry? (I.e., the hit and false alarm rates of the ROC.)

Conditional Probabilities
A problem with conditional probabilities is that the sample space is reduced, and so the errors in estimating the relative frequencies increase. These errors increase as the number of conditions is increased, and it is easy to reach the extreme case of having no previous cases with only “unimportant” differences from which to calculate the relative frequencies (number of possible states = $2^n$ for $n$ binary conditions). In numerical weather prediction the infinite dimensions of the current atmospheric state are important, and so the current initial conditions are unique.
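To see how quickly the sample thins out, the sketch below divides a 77-year record among the possible combinations of n binary conditions. It assumes, purely for illustration, that all combinations are equally likely, which real weather conditions are not; the function name is hypothetical.

```python
def expected_cases_per_state(n_years: int, n_conditions: int) -> float:
    """Average number of past cases per combination of binary conditions,
    assuming (unrealistically) that all 2**n combinations are equally likely."""
    if n_years < 0 or n_conditions < 0:
        raise ValueError("inputs must be non-negative")
    return n_years / 2**n_conditions


for n in range(0, 8):
    cases = expected_cases_per_state(77, n)
    print(f"{n} conditions: {2**n:4d} states, about {cases:.2f} cases each")
```

With seven binary conditions there are already more possible states than years in the record, so many states have no precedent at all.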

Conditional Probabilities
What if the probability of rainfall tomorrow depends on how much rainfall there is today rather than just its occurrence? In this case the outcome of the current event is not dependent upon another event measured on a binary scale. Jolliffe: some statistical models for calculating probabilities of events that are functions of continuous variables.

Conditional Probabilities
Similarly, many forecast verification procedures are based on conditional probabilities:
reliability: given a forecast of 90% chance of rain, how often does rain occur? $P(E \mid F = f)$
NB: notice that this involves a subjective interpretation of probability. We are verifying forecasts with similar levels of confidence, not forecasts with similar boundary / initial conditions.

Conditional Probabilities
resolution: can we expect a different outcome given a different forecast?
Note that reliability and resolution are often confused.
- Resolution: is $P(E)$ conditional upon the forecast?
- Reliability: if $F = f$, does $P(E \mid F = f) = f$?
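One common way to put numbers on these two questions is the reliability and resolution terms of the standard decomposition of the Brier score. The sketch below assumes that this is the sense intended here, which the transcript does not confirm, and uses invented forecasts and outcomes. Forecasts are grouped by their exact issued probability, and the function name is hypothetical.

```python
from collections import defaultdict


def reliability_and_resolution(forecasts, outcomes):
    """Return (reliability, resolution) for probability forecasts of a binary event.

    reliability: weighted mean of (f - P(E | F = f))**2; 0 is perfect.
    resolution : weighted mean of (P(E | F = f) - P(E))**2; larger is better.
    """
    if not forecasts or len(forecasts) != len(outcomes):
        raise ValueError("need equal-length, non-empty inputs")
    groups = defaultdict(list)
    for f, o in zip(forecasts, outcomes):
        groups[f].append(o)  # collect outcomes for each distinct forecast value
    n = len(forecasts)
    base_rate = sum(outcomes) / n  # unconditional P(E)
    reliability = resolution = 0.0
    for f, observed in groups.items():
        conditional_frequency = sum(observed) / len(observed)  # P(E | F = f)
        reliability += len(observed) * (f - conditional_frequency) ** 2
        resolution += len(observed) * (conditional_frequency - base_rate) ** 2
    return reliability / n, resolution / n


# Invented example: ten forecasts of 0.9 (nine verify) and ten of 0.1 (one verifies).
forecasts = [0.9] * 10 + [0.1] * 10
outcomes = [1] * 9 + [0] + [1] + [0] * 9
rel, res = reliability_and_resolution(forecasts, outcomes)
print(f"reliability = {rel:.3f}, resolution = {res:.3f}")
```

In this invented case the forecasts are perfectly reliable, because rain follows 90% of the 0.9 forecasts and 10% of the 0.1 forecasts, and they also have resolution, because the outcome frequency differs between the two forecast values.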

[Figure: diagram annotated with REL = $(BC)^2$, REL = $(EC)^2$, RES = $(AC)^2$.]
Note that the y-axis gives the conditional probability of the event given the forecast.

Reliability and Resolution
RELIABILITY: Are the forecast probabilities correct? Do the forecast probabilities reflect an appropriate level of confidence?
RESOLUTION: Does the outcome depend on the forecast? Do different forecast probabilities imply actual differences in the probability of an event?