More Examples: There are 4 security checkpoints. The probability of being searched at any one is 0.2. You may be searched more than once in total and all.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Chapter 6 – Normal Probability Distributions
Sampling Distributions and Sample Proportions
Statistics review of basic probability and statistics.
Normal Distribution * Numerous continuous variables have distribution closely resemble the normal distribution. * The normal distribution can be used to.
Sampling Distributions (§ )
1 Normal Probability Distributions. 2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores In.
Note 7 of 5E Statistics with Economics and Business Applications Chapter 5 The Normal and Other Continuous Probability Distributions Normal Probability.
Continuous Probability Distributions.  Experiments can lead to continuous responses i.e. values that do not have to be whole numbers. For example: height.
Review.
Introduction to the Continuous Distributions
Evaluating Hypotheses
Last Lecture: Histograms: –Definition –Interpretation in terms of probability –Estimate of distribution function Sample Means, Sample Medians, and Sample.
Lecture 3 Sampling distributions. Counts, Proportions, and sample mean.
The Normal Distribution
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
Statistical Analysis – Chapter 4 Normal Distribution
Chapter 6: Normal Probability Distributions
1 Sampling Distributions Presentation 2 Sampling Distribution of sample proportions Sampling Distribution of sample means.
Chapter 6 The Normal Probability Distribution
JMB Chapter 6 Lecture 3 EGR 252 Spring 2011 Slide 1 Continuous Probability Distributions Many continuous probability distributions, including: Uniform.
JMB Ch6 Lecture 3 revised 2 EGR 252 Fall 2011 Slide 1 Continuous Probability Distributions Many continuous probability distributions, including: Uniform.
Statistics 303 Chapter 4 and 1.3 Probability. The probability of an outcome is the proportion of times the outcome would occur if we repeated the procedure.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
AP Statistics Chapter 9 Notes.
Topic 4 - Continuous distributions
Random Variables & Probability Distributions Outcomes of experiments are, in part, random E.g. Let X 7 be the gender of the 7 th randomly selected student.
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
Each child born to a particular set of parents has probability of 0.25 having blood type O. Suppose these parents have 5 children. Let X = number of children.
Statistics for Data Miners: Part I (continued) S.T. Balke.
Theory of Probability Statistics for Business and Economics.
Modular 11 Ch 7.1 to 7.2 Part I. Ch 7.1 Uniform and Normal Distribution Recall: Discrete random variable probability distribution For a continued random.
Normal Distribution Introduction. Probability Density Functions.
Don’t forget HW due on Tuesday. Assignment is on web.
Central Limit Theorem Example: (NOTE THAT THE ANSWER IS CORRECTED COMPARED TO NOTES5.PPT) –5 chemists independently synthesize a compound 1 time each.
LECTURE 14 THURSDAY, 12 March STA 291 Spring
Exam 1 is two weeks from today (March 9 th ) in class 15% of your grade Covers chapters 1-6 and the central limit theorem. I will put practice problems,
Discrete distribution word problems –Probabilities: specific values, >, =, … –Means, variances Computing normal probabilities and “inverse” values: –Pr(X
On Thursday, I’ll provide information about the project Due on Friday after last class. Proposal will be due two weeks from today (April 15 th ) You’re.
Distributions of the Sample Mean
Sample Variability Consider the small population of integers {0, 2, 4, 6, 8} It is clear that the mean, μ = 4. Suppose we did not know the population mean.
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Section 10.1 Confidence Intervals
Exam 1 next Thursday (March 7 th ) in class 15% of your grade Covers chapters 1-6 and the central limit theorem I will put practice problems, old exams,
HW solutions are on the web. See website for how to calculate probabilities with minitab, excel, and TI calculators.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
EAS31116/B9036: Statistics in Earth & Atmospheric Sciences Lecture 3: Probability Distributions (cont’d) Instructor: Prof. Johnny Luo
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
Review: Large Sample Confidence Intervals 1-  confidence interval for a mean: x +/- z  /2 s/sqrt(n) 1-  confidence interval for a proportion: p +/-
Normal Distribution * Numerous continuous variables have distribution closely resemble the normal distribution. * The normal distribution can be used to.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
1 7.3 RANDOM VARIABLES When the variables in question are quantitative, they are known as random variables. A random variable, X, is a quantitative variable.
1 6. Mean, Variance, Moments and Characteristic Functions For a r.v X, its p.d.f represents complete information about it, and for any Borel set B on the.
Ch4: 4.3The Normal distribution 4.4The Exponential Distribution.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Chapter 7: Sampling Distributions Section 7.2 Sample Proportions.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Chapter 3 Probability Distribution Normal Distribution.
Estimating the Value of a Parameter Using Confidence Intervals
Announcements Exams are graded. I’ll hand them back at end of class. Solution will be on the web. Many people did better than I think they’d guess… Easy.
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
Distribution functions
Continuous Random Variable
Daniela Stan Raicu School of CTI, DePaul University
Welcome Back Please hand in your homework.
Sampling Distributions (§ )
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
Presentation transcript:

More Examples: There are 4 security checkpoints. The probability of being searched at any one is 0.2. You may be searched more than once in total and all searches are independent. What’s the probability of being searched at least one time? 50 geese in a flock of 200 are tagged by a wildlife biologist. The next year, 10 geese from the flock are captured. Assume the flock still has (the same) 200 geese and no tags are lost. What’s the probability that at least 5 of the recaptured geese have tags? Suppose a written test has 5 True/False questions. Passing = at least 3 correct answers and the test can be taken at most 3 times. (Assume no learning occurs between tests if one fails!) –If one randomly guesses what’s the probability of passing? –What’s the probability that someone who randomly guesses will eventually pass? An overloaded server receives an average of 25 s per second at 12:00PM. If it receives more than 30 s in a second, it will crash. What’s the probability of a crash at 12:00PM on a given day (based on the traffic in the previous 1 second)?

Answers to Examples 1.X = number of times searched. X has a binomial distribution with n=4 and p=0.2. We want Pr(X>0) = 1- Pr(X=0) 2.X = number of recaptured geese w/ tags. X has a hypergeometric distribution with N = 200, M = 50, n=10. We want Pr(X>=5) = Pr(X=5)+Pr(X=6)+Pr(X=7)+Pr(X=8)+Pr(X=9)+Pr(X=10) 3.X = number of questions right. X has a binomial distribution with n = 5 and p=0.5. Want Pr(X>=3) = Pr(X=3)+Pr(X=4)+Pr(X=5) 4.Pr eventually pass = Pr(Pass on first try or fail first and then pass or fail twice and then pass) = Pr(X>=3) + Pr(X =3) + Pr(X =3) 5.X = number of s in a second. X has a Poisson distribution with rate = 25 per second. Want Pr(X>30) = 1-Pr(X<=30) = Pr(X=0)+…+Pr(X=30) (in each case, once you know the distribution and the parameters, the Pr(X=k) can be calculated with the pdf.)

If you’re interested in polls, an interesting “statistics related” website is: Polls that ask questions w/ 2 answers are related to the binomial distribution: –n = number of people asked –p = probability of one of the answers –Note that a poll uses data to estimate p (i.e. estimate of p = number of yeses / n) From gallup.com (Feb 19, 2003) n = 483 Example: X = number of people who think “unfinished business is the reason. X has a Bin(483,0.31) distribution (assume 0.31 is the true p).

Example: Suppose 10 people are polled: –Is a terrorist attack at least somewhat likely at the Olympics? Suppose p=0.31 Q: What’s the probability that fewer than 9 people say yes? A: Let X ~ Bin(10,0.31) Want Pr(X<9) = 1-Pr(X=9)-Pr(X=10) =1-(10 choose 9)( )( ) -(10 choose 10)( )( ) = =

Example: Dietary Data As part of an epidemiological study, physicians measured the amount of folate in the diets of 545 people. What’s the probability that a new person’s folate consumption equals exactly 5.5? Histogram from observed sample Question about the random variable describing dietary folate of a new person.

In the folate example, if folate were measured accurately enough, the probability of seeing any exact value on a new person is zero. Note that this is different from random variables like “the number of questions right on a test, etc”. –The folate example gives an example of continuous data. –Probability can be applied to the probability that a continuous random variable is in an interval, but any particular value has zero probablity.

Chapter 6: Continuous Distributions & Normality Up to this point, all random variables have been discrete: –Possible values are integers (any integer or a subset): Binomial(n,p) random variables can be 0 or 1 or …or n. Poisson(rate) random variables can be 0 or 1 or … Hypergeometric(N,M,n) random variables can be 0 or 1 or …or n. PDFs give probabilities that the random variables take on any of these values CDFs give probabilities that the random variables are less than or equal to a certain value

Random variables that can take on any real number are continuous. Continuous random variables have probability density functions (pdfs) too. Again, they are models for how the random variables behave. The probability that a continuous random variable is in an interval is the area under the pdf in that interval.

PDF for the Folate Data (assume we know this function): Pr(5 < random person’s folate intake < 6) = 0.54 = shaded area (i.e. )

Continuous PDFs : –notation: f(x) –f(x) is greater than or equal to zero. –All the area under f(x) is 1. –i.e. –CDF:

Let a be a number. For a continuous random variable X:

Continuous pdfs will be known functions Most commonly used: –Normal or Gaussian distribution (“bell curve”) –We’ll see why this is so common in a few weeks. –2 parameters: mean  and std dev 

Mean = center of normal distribution 2 normal distibutions: Both have the same mean (0). Narrower one has a std dev of 2. Fatter one has std dev of 1. Smaller standard deviation means that the model says the data are more likely to be concentrated around the mean.

[1/(  sqrt(2  ))]e [-0.5((x-  )/  )2] The normal pdf is this functinon:

Determining normal probabilities: Suppose X has a normal distribution with mean 5 and std dev 2. Notation X~N(5,4) [notation uses N(mean,variance)] What’s the probability that X is less than 7? It turns out that no one can “solve” the integral that defines this probability. As a result, we need to use tables, computers, or calculators to compute normal probabilities.

7 Pr(X<7) = area under curve to left of x=7

Fact 1: Pr(X < its mean) = 1/2

Fact 2: Pr(X > its mean + a number) = Pr(X < its mean - same number)

Fact 3: Assume a > b. Pr(b< X < a) = Pr(X<a)-Pr(X<b) a b Area under curve Between a and b Is area under curve To the left of a minus The area under the curve to the left of b.

Fact 4: Pr(X > a) = 1-Pr(X < a)

Fact 5: Tables inside the cover of your book are given in terms of Pr(0 0 and Z~N(0,1)) (Tables with P(Z<a) are in Appendix 1) a

Table in book: (inside cover) Z … Pr(0 < Z < 0.13) = Ones and tenths places Hundredths place This is the upper left hand corner of the table.

Using Tables: 4 Easy Steps Want Pr(X<7) 1.Draw picture (next page) (allows use of common sense) 2.Translate X to a normal random variable with mean 0 and std dev 1 (called “Z”, a standard normal r.v.) –Do this by “centering and scaling”: Rule: If X~N(5,4) then (X-5)/2 ~N(0,1) 3.Manipulate to get in terms of Pr(Z<a) form –So, Pr(X<7) = Pr( (X-5)/2 < (7-5)/2) = Pr( Z < 1) where Z~N(0,1) 4.Look up in table: Pr(X<7) = Pr(Z<1) =

7 Pr(X<7) = area under curve to left of x=7

What’s Pr(X < 4)? Draw (on next page) Center and scale: –Pr(X<4) = Pr( (X-5)/2 < (4-5)/2 ) = Pr( Z < -1/2 ) Look up =

7 Pr(X<4) = area under curve to left of x=4