Office 310 320.... Probability-Related Concepts How to Assign Probabilities to Experimental Outcomes Probability Rules Discrete Random Variables Continuous.

Slides:



Advertisements
Similar presentations
Presentation on Probability Distribution * Binomial * Chi-square
Advertisements

AP STATISTICS Simulation “Statistics means never having to say you're certain.”
ฟังก์ชั่นการแจกแจงความน่าจะเป็น แบบไม่ต่อเนื่อง Discrete Probability Distributions.
CHAPTER 13: Binomial Distributions
Probability Theory: Probability theory is the branch of mathematics concerned with analysis of random phenomena. (Encyclopedia Britannica) Probability.
Introduction to Probability and Statistics
Probability and Probability Distributions
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Parameters and Statistics Probabilities The Binomial Probability Test.
CHAPTER 6 Statistical Analysis of Experimental Data
Discrete Probability Distributions
Lecture Slides Elementary Statistics Twelfth Edition
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 2000 LIND MASON MARCHAL 1-1 Chapter Five Discrete Probability Distributions GOALS When you have completed.
Lecture 6: Descriptive Statistics: Probability, Distribution, Univariate Data.
Discrete Random Variables: The Binomial Distribution
CHAPTER 10: Introducing Probability
Chapter 6 Probability.
CHAPTER 6 Random Variables
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
Stat 1510: Introducing Probability. Agenda 2  The Idea of Probability  Probability Models  Probability Rules  Finite and Discrete Probability Models.
Chapter 5 Sampling Distributions
Chapter 6: Probability Distributions
Mean = 41.21Median = 42.5s = 7.59 x - 1s – x + 1s – Assignment #1.
Modeling and Simulation CS 313
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Rules of the game: Syllabus and time table are available at: Read Appendix.
Theory of Probability Statistics for Business and Economics.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Quality Improvement PowerPoint presentation to accompany Besterfield, Quality Improvement, 9e PowerPoint presentation to accompany Besterfield, Quality.
Applied Business Forecasting and Regression Analysis Review lecture 2 Randomness and Probability.
Using Probability and Discrete Probability Distributions
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Biostatistics Class 3 Discrete Probability Distributions 2/8/2000.
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Chapter 10 Introducing Probability BPS - 5th Ed. Chapter 101.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 6 Random Variables 6.3 Binomial and Geometric.
Inference: Probabilities and Distributions Feb , 2012.
Probability Distributions, Discrete Random Variables
Probability and Distributions. Deterministic vs. Random Processes In deterministic processes, the outcome can be predicted exactly in advance Eg. Force.
Mathematical Notation The mathematical notation used most often in this course is the summation notation The Greek letter is used as a shorthand way.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Probability Distribution. Probability Distributions: Overview To understand probability distributions, it is important to understand variables and random.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
AP Statistics From Randomness to Probability Chapter 14.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Probability Distributions Chapter 6.
Probability Distributions ( 확률분포 ) Chapter 5. 2 모든 가능한 ( 확률 ) 변수의 값에 대해 확률을 할당하는 체계 X 가 1, 2, …, 6 의 값을 가진다면 이 6 개 변수 값에 확률을 할당하는 함수 Definition.
Theoretical distributions: the other distributions.
Theoretical distributions: the Normal distribution.
Chapter Five The Binomial Probability Distribution and Related Topics
Binomial and Geometric Random Variables
Business Statistics Topic 4
Virtual University of Pakistan
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
CHAPTER 10: Introducing Probability
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 5 Sampling Distributions
Lecture Slides Elementary Statistics Twelfth Edition
Lecture 11: Binomial and Poisson Distributions
Introduction to Probability and Statistics
Lecture Slides Essentials of Statistics 5th Edition
Lecture Slides Essentials of Statistics 5th Edition
Chapter 5: Sampling Distributions
Chapter 11 Probability.
Presentation transcript:

Office

Probability-Related Concepts How to Assign Probabilities to Experimental Outcomes Probability Rules Discrete Random Variables Continuous Random Variables Probability Distribution & Functions Topics Covered

Concepts An event – Any phenomenon you can observe that can have more than one outcome (e.g., flipping a coin) An outcome – Any unique condition that can be the result of an event (e.g., flipping a coin: heads or tails), a.k.a simple event or sample points Sample space – The set of all possible outcomes associated with an event Probability is a measure of the likelihood of each possible outcome

Probability Distributions The usual application of probability distributions is to find a theoretical distribution Reflects a process that explains what we see in some observed sample of a geographic phenomenon Compare the form of the sampled information and theoretical distribution through a test of significance Geography: discrete random events in space and time (e.g. how often will a tornado occur?)

Discrete Probability Distributions Discrete probability distributions –The Uniform Distribution –The Binomial Distribution –The Poisson Distribution Each is appropriately applied in certain situations and to particular phenomena

The Uniform Distribution Source:

The Uniform Distribution Describes the situation where the probability of all outcomes is the same n outcomes  P(x i ) = 1/n e.g. flipping a coin: P(x heads ) = 1/2 = P(x tails ) P(xi)P(xi) heads xixi tails A uniform probability mass function

a<=x<=b otherwise x < a a<=x<=b x>b Source:

The Uniform Distribution A little simplistic and perhaps useless But actually well applied in two situations 1. The probability of each outcome is truly equal (e.g. the coin toss) 2. No prior knowledge of how a variable is distributed (i.e. complete uncertainty), the first distribution we should use is uniform (no assumptions about the distribution)

The Uniform Distribution However, truly uniformly distributed geographic phenomena are somewhat rare We often encounter the situation of not knowing how something is distributed until we sample it When we are resisting making assumptions we usually apply the uniform distribution as a sort of null hypothesis of distribution

The Uniform Distribution Example – Predict the direction of the prevailing wind with no prior knowledge of the weather system’s tendencies in the area We would have to begin with the idea that P(x North ) = 1/4 P(x East ) = 1/4 P(x South ) = 1/4 P(x West ) = 1/4 Until we had an opportunity to sample and find out some tendency in the wind pattern based on those observations P(xi)P(xi) N E SW 0.125

(Lillesand et al. 2004) (Binford 2005) Remote Sensing Supervised classification

The Binomial Distribution Provides information about the probability of the repetition of events when there are only two possible outcomes, –e.g. heads or tails, left or right, success or failure, rain or no rain … –Events with multiple outcomes may be simplified as events with two outcomes (e.g., forest or non-forest) Characterizing the probability of a proportion of the events having a certain outcome over a specified number of events

The Binomial Distribution A binomial distribution is produced by a set of Bernoulli trials (Jacques Bernoulli) The law of large numbers for independent trials – at the heart of probability theory Given enough observed events, the observed probability should approach the theoretical values drawn from probability distributions –e.g. enough coin tosses should approach the P = 0.5 value for each outcome (heads or tails)

How to test the law? A set of Bernoulli trials is the way to operationally test the law of large numbers using an event with two possible outcomes: (1) N independent trials of an experiment (i.e. an event like a coin toss) are performed (2) Every trial must have the same set of possible outcomes (e.g., heads and tails)

Bernoulli Trials (3) The probability of each outcome must be the same for all trials, i.e. P(x i ) must be the same each time for both x i values (4) The resulting random variable is determined by the number of successes in the trials (successes  one of the two outcomes) p = the probability of success in a trial q = (p –1) as the probability of failure in a trial p + q = 1

Bernoulli Trials Suppose on a series of successive days, we will record whether or not it rains in Chapel Hill We will denote the 2 outcomes using R when it rains and N when it does not rain nPossible Outcomes# of Rain DaysP(# of Rain Days) 1R 1p p N0 (1 - p) q 2RR2p 2 p 2 RN NR1 2[p*(1 – p)] 2pq NN0 (1 – p) 2 q 2 3RRR 3 p 3 p 3 RRN RNR NRR 2 3[p 2 *(1 – p)] 3p 2 q NNR NRN RNN 1 3[p*(1 – p) 2 ]3pq 2 NNN0 (1 – p) 3 q 3

Bernoulli Trials If we have a value for P(R) = p, we can substitute it into the above equations to get the probability of each outcome from a series of successive samples (e.g. p=0.2) nPossible Outcomes# of Rain DaysP(# of Rain Days) 1R 1 p=0.2 N0 q=0.8 2RR2 p 2 =0.04 RN NR1 2pq=0.32 NN0 q 2 =0.64 3RRR 3 p 3 =0.008 RRN RNR NRR 2 3p 2 q=0.096 NNR NRN RNN 1 3pq 2 =0.384 NNN0 q 3 =0.512

Bernoulli Trials 1 event,  = (p + q) 1 = p + q 2 events,  = (p + q) 2 = p 2 + 2pq + q 2 3 events,  = (p + q) 3 = p 3 + 3p 2 q + 3pq 2 + q 3 4 events,  = (p + q) 4 = p 4 + 4p 3 q + 6p 2 q 2 + 4pq 3 + q 4 Source: Earickson, RJ, and Harlin, JM Geographic Measurement and Quantitative Analysis. USA: Macmillan College Publishing Co., p A graphical representation: probability # of successes The sum of the probabilities can be expressed using the binomial expansion of (p + q) n, where n = # of events

The Binomial Distribution A general formula for calculating the probability of x successes (n trials & a probability p of success: where C(n,x) is the number of possible combinations of x successes and (n –x) failures: P(x) = C(n,x) * p x * (1 - p) n - x C(n,x) = n! x! * (n – x)!

The Binomial Distribution – Example e.g., the probability of 2 successes in 4 trials, given p=0.2 is: P(x) = 4! 2! * (4 – 2)! * (0.2) 2 *(1 – 0.2) P(x) = 24 2 * 2 * (0.2) 2 * (0.8) 2 P(x) = 6 * (0.04)*(0.64) =

The Binomial Distribution – Example Calculating the probabilities of all possible number of rain days out of four days (p = 0.2): The chance of having one or more days of rain out of four: P(1) + P(2) + P(3) + P(4) = xP(x)C(n,x)p x (1 – p) n – x 0P(0)1(0.2) 0 (0.8) 4 = P(1)4(0.2) 1 (0.8) 3 = P(2)6(0.2) 2 (0.8) 2 = P(3)4(0.2) 3 (0.8) 1 = P(4)1(0.2) 4 (0.8) 0 =0.0016

The Binomial Distribution – Example Naturally, we can plot the probability mass function produced by this binomial distribution: x i P(x i ) P(xi)P(xi) 1234 xixi 0

Source: The following is the plot of the binomial probability density function for four values of p and n = 100

Source:

Source:

Rare Discrete Random Events Some discrete random events in question happen rarely (if at all), and the time and place of these events are independent and random (e.g., tornados) The greatest probability is zero occurrences at a certain time or place, with a small chance of one occurrence, an even smaller chance of two occurrences, etc. heavily peaked and skewed: P(xi)P(xi) 1234 xixi 0

The Poisson Distribution In the 1830s, S.D. Poisson described a distribution with these characteristics Describing the number of events that will occur within a certain area or duration (e.g. # of meteorite impacts per state, # of tornados per year, # of hurricanes in NC) Poisson distribution’s characteristics: 1. It is used to count the number of occurrences of an event within a given unit of time, area, volume, etc. (therefore a discrete distribution)

The Poisson Distribution 2. The probability that an event will occur within a given unit must be the same for all units (i.e. the underlying process governing the phenomenon must be invariant) 3. The number of events occurring per unit must be independent of the number of events occurring in other units (no interactions) 4. The mean or expected number of events per unit (λ) is found by past experience (observations)

The Poisson Distribution Poisson formulated his distribution as follows: P(x) = e - * x x! To calculate a Poisson distribution, you must know λ where e = (base of the natural logarithm) λ = the mean or expected value x = 1, 2, …, n – 1, n # of occurrences x! = x * (x – 1) * (x – 2) * … * 2 * 1

The Poisson Distribution Poisson distribution P(x) = e - * x x! The shape of the distribution depends strongly upon the value of λ, because as λ increases, the distribution becomes less skewed, eventually approaching a normal-shaped distribution as it gets quite large  increases, the distribution becomes less skewed We can evaluate P(x) for any value of x, but large values of x will have very small values of P(x)

λ

The Poisson Distribution Poisson distribution P(x) = e - * x x! The shape of the distribution depends strongly upon the value of λ, because as λ increases, the distribution becomes less skewed, eventually approaching a normal-shaped distribution as l gets quite large increases, the distribution becomes less skewed We can evaluate P(x) for any value of x, but large values of x will have very small values of P(x)

The Poisson Distribution Poisson distribution P(x) = e - * x x! The Poisson distribution can be defined as the limiting case of the binomial distribution: constant

The Poisson Distribution The Poisson distribution is sometimes known as the Law of Small Numbers, because it describes the behavior of events that are rare We can observe the frequency of some rare phenomenon, find its mean occurrence, and then construct a Poisson distribution and compare our observed values to those from the distribution (effectively expected values) to see the degree to which our observed phenomenon is obeying the Law of Small Numbers:

Murder rates in Fayetteville Fitting a Poisson distribution to the 24-hour murder rates in Fayetteville in a 31-day month (to ask the question ‘Do murders randomly occur in time?’) # of Murders Days (Frequency) Total 31 days

Murder rates in Fayetteville = mean murders per day = 22 / 31 = 0.71 # of Murders Days# of Murders* # of Days Total 31 days22

Murder rates in Fayetteville = mean murders per day = 22 / 31 = 0.71

Murder rates in Fayetteville x (# of Murders)Obs. Frequency (F obs ) x*F obs F exp Total = mean murders per day = 22 / 31 = 0.71 We can compare F obs to F exp using a X 2 test to see if observations do match Poisson Dist.

Murder rates in Fayetteville

The Poisson Distribution Procedure for finding Poisson probabilities and expected frequencies: (1) Set up a table with five columns as on the previous slide (2) Multiply the values of x by their observed frequencies (x * F obs ) (3) Sum the columns of F obs (observed frequency) and x * F obs (4) Compute λ = Σ (x * F obs ) / Σ F obs (5) Compute P(x) values using the equation or a table (6) Compute the values of Fexp = P(x) * Σ F obs

Source: