Random Variables and Probability Distributions

Slides:



Advertisements
Similar presentations
Sampling: Final and Initial Sample Size Determination
Advertisements

Normal Distribution; Sampling Distribution; Inference Using the Normal Distribution ● Continuous and discrete distributions; Density curves ● The important.
5 - 1 © 1997 Prentice-Hall, Inc. Importance of Normal Distribution n Describes many random processes or continuous phenomena n Can be used to approximate.
Normal distribution. An example from class HEIGHTS OF MOTHERS CLASS LIMITS(in.)FREQUENCY
ELEC 303 – Random Signals Lecture 18 – Statistics, Confidence Intervals Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 10, 2009.
Chapter 6 Continuous Random Variables and Probability Distributions
1 Sociology 601, Class 4: September 10, 2009 Chapter 4: Distributions Probability distributions (4.1) The normal probability distribution (4.2) Sampling.
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 18 = Start Chapter “The Normal Distribution and Other.
Chapter 7: Variation in repeated samples – Sampling distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 19 = More of Chapter “The Normal Distribution and Other.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Today Today: Chapter 8, start Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Random Variables and Probability Distributions
Chapter 11: Random Sampling and Sampling Distributions
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Chapter 4 Continuous Random Variables and Probability Distributions
Continuous Probability Distributions
© Copyright McGraw-Hill CHAPTER 6 The Normal Distribution.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Estimation of Statistical Parameters
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
1 Normal Random Variables In the class of continuous random variables, we are primarily interested in NORMAL random variables. In the class of continuous.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
4.3 NORMAL PROBABILITY DISTRIBUTIONS The Most Important Probability Distribution in Statistics.
Please turn off cell phones, pagers, etc. The lecture will begin shortly. There will be a quiz at the end of today’s lecture. Friday’s lecture has been.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
© 2010 Pearson Prentice Hall. All rights reserved. CHAPTER 12 Statistics.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
Chapter 5 Sampling Distributions. Introduction Distribution of a Sample Statistic: The probability distribution of a sample statistic obtained from a.
© 1999 Prentice-Hall, Inc. Chap Statistics for Managers Using Microsoft Excel Chapter 6 The Normal Distribution And Other Continuous Distributions.
Parameter versus statistic  Sample: the part of the population we actually examine and for which we do have data.  A statistic is a number summarizing.
Many times in statistical analysis, we do not know the TRUE mean of a population on interest. This is why we use sampling to be able to generalize the.
THE NORMAL DISTRIBUTION
Chapter 6 The Normal Distribution and Other Continuous Distributions
Topic 8: Sampling Distributions
Sampling Distributions
The normal distribution
Normal Distribution and Parameter Estimation
Normal Distribution.
Parameter versus statistic
4.3 Probability Distributions of Continuous Random Variables:
Chapter 6. Continuous Random Variables
BIOS 501 Lecture 3 Binomial and Normal Distribution
Chapter 7 Sampling Distributions.
Lecture 13 Sections 5.4 – 5.6 Objectives:
Probability and Distributions
Descriptive Statistics
Sampling Distribution Models
Chapter 7 Sampling Distributions.
Chapter 5: Normal Probability Distributions
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Chapter 7 Sampling Distributions.
4.3 Probability Distributions of Continuous Random Variables:
Chapter 7 Sampling Distributions.
CHAPTER 12 Statistics.
Reading Property Data Analysis – A Primer, Ch.8
Sampling Distributions (§ )
Continuous Random Variables
Chapter 7 Sampling Distributions.
Chapter 5 Continuous Random Variables and Probability Distributions
Presentation transcript:

Random Variables and Probability Distributions Random Variables - Random outcomes corresponding to subjects randomly selected from a population. Probability Distributions - A listing of the possible outcomes and their probabilities (discrete r.v.s) or their densities (continuous r.v.s) Normal Distribution - Bell-shaped continuous distribution widely used in statistical inference Sampling Distributions - Distributions corresponding to sample statistics (such as mean and proportion) computed from random samples

Discrete Probability Distributions Discrete RV - Random variable that can take on a finite (or countably infinite) set of discontinuous possible outcomes (Y) Discrete Probability Distribution - Listing of outcomes and their corresponding probabilities (y , P(y))

Example - Supreme Court Vacancies Supreme Court Vacancies by Year 1837-1975 Y  # Vacancies in Randomly selected year Source: R.J. Morrison (1977), “FDR and the Supreme Court: An Example of the Use of Probability Theory in Political History”, History and Theory, Vol. 16, pp 137-146

Parameters of a P.D. Mean (aka Expected Value) - Long run average outcome Standard Deviation - Measure of the “typical” distance of an outcome from the mean

Example - Supreme Court Vacancies

Normal Distribution Bell-shaped, symmetric family of distributions Classified by 2 parameters: Mean (m) and standard deviation (s). These represent location and spread Random variables that are approximately normal have the following properties wrt individual measurements: Approximately half (50%) fall above (and below) mean Approximately 68% fall within 1 standard deviation of mean Approximately 95% fall within 2 standard deviations of mean Virtually all fall within 3 standard deviations of mean Notation when Y is normally distributed with mean m and standard deviation s :

Normal Distribution

Example - Heights of U.S. Adults Female and Male adult heights are well approximated by normal distributions: YF~N(63.7,2.5) YM~N(69.1,2.6) Source: Statistical Abstract of the U.S. (1992)

Standard Normal (Z) Distribution Problem: Unlimited number of possible normal distributions (- < m <  , s > 0) Solution: Standardize the random variable to have mean 0 and standard deviation 1 Probabilities of certain ranges of values and specific percentiles of interest can be obtained through the standard normal (Z) distribution

Standard Normal (Z) Distribution Standard Normal Distribution Characteristics: P(Z  0) = P(Y  m ) = 0.5000 P(-1  Z  1) = P(m-s  Y  m+s ) = 0.6826 P(-2  Z  2) = P(m-2s  Y  m+2s ) = 0.9544 P(Z  za) = P(Z  -za) = a (using Z-table)

Finding Probabilities of Specific Ranges Step 1 - Identify the normal distribution of interest (e.g. its mean (m) and standard deviation (s) ) Step 2 - Identify the range of values that you wish to determine the probability of observing (YL , YU), where often the upper or lower bounds are  or - Step 3 - Transform YL and YU into Z-values: Step 4 - Obtain P(ZL Z  ZU) from Z-table

Example - Adult Female Heights What is the probability a randomly selected female is 5’10” or taller (70 inches)? Step 1 - Y ~ N(63.7 , 2.5) Step 2 - YL = 70.0 YU =  Step 3 - Step 4 - P(Y  70) = P(Z  2.52) = .0059 (  1/170)

Finding Percentiles of a Distribution Step 1 - Identify the normal distribution of interest (e.g. its mean (m) and standard deviation (s) ) Step 2 - Determine the percentile of interest 100p% (e.g. the 90th percentile is the cut-off where only 90% of scores are below and 10% are above) Step 3 - Turn the percentile of interest into a tail probability a and corresponding z-value (zp): If 100p  50 then a = 1-p and zp = za If 100p < 50 then a = p and zp = -za Step 4 - Transform zp back to original units:

Example - Adult Male Heights Above what height do the tallest 5% of males lie above? Step 1 - Y ~ N(69.1 , 2.6) Step 2 - Want to determine 95th percentile (p = .95) Step 3 - Since 100p > 50, a = 1-p = 0.05 zp = za = z.05 = 1.645 Step 4 - Y.95 = 69.1 + (1.645)(2.6) = 73.4

Statistical Models When making statistical inference it is useful to write random variables in terms of model parameters and random errors Here m is a fixed constant and e is a random variable In practice m will be unknown, and we will use sample data to estimate or make statements regarding its value

Sampling Distributions and the Central Limit Theorem Sample statistics based on random samples are also random variables and have sampling distributions that are probability distributions for the statistic (outcomes that would vary across samples) When samples are large and measurements independent then many estimators have normal sampling distributions (CLT): Sample Mean: Sample Proportion:

Example - Adult Female Heights Random samples of n = 100 females to be selected For each sample, the sample mean is computed Sampling distribution: Note that approximately 95% of all possible random samples of 100 females will have sample means between 63.0 and 64.0 inches