Random Variable Overview

Slides:



Advertisements
Similar presentations
Probability.
Advertisements

Chapter 5 Some Key Ingredients for Inferential Statistics: The Normal Curve, Probability, and Population Versus Sample.
Lecture 7 THE NORMAL AND STANDARD NORMAL DISTRIBUTIONS
4.3 NORMAL PROBABILITY DISTRIBUTIONS
Lecture 4 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Discrete Distributions
Ethan Cooper (Lead Tutor)
Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Commonly Used Distributions
MOMENT GENERATING FUNCTION AND STATISTICAL DISTRIBUTIONS
AP Statistics Chapter 7 – Random Variables. Random Variables Random Variable – A variable whose value is a numerical outcome of a random phenomenon. Discrete.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 4 Using Probability and Probability Distributions.
Psychology 10 Analysis of Psychological Data February 26, 2014.
Sections 4.1 and 4.2 Overview Random Variables. PROBABILITY DISTRIBUTIONS This chapter will deal with the construction of probability distributions by.
Probability Distributions
Discrete Probability Distributions
 The Law of Large Numbers – Read the preface to Chapter 7 on page 388 and be prepared to summarize the Law of Large Numbers.
The Binomial Probability
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 11: Random Sampling and Sampling Distributions
Discrete and Continuous Probability Distributions.
Week71 Discrete Random Variables A random variable (r.v.) assigns a numerical value to the outcomes in the sample space of a random phenomenon. A discrete.
Chapter 7: Random Variables
Chapter 6 Random Variables. Make a Sample Space for Tossing a Fair Coin 3 times.
Chapter 6: Probability Distributions
Statistics 303 Chapter 4 and 1.3 Probability. The probability of an outcome is the proportion of times the outcome would occur if we repeated the procedure.
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Lesson 7 - R Review of Random Variables. Objectives Define what is meant by a random variable Define a discrete random variable Define a continuous random.
Theory of Probability Statistics for Business and Economics.
Statistical Experiment A statistical experiment or observation is any process by which an measurements are obtained.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
5.3 Random Variables  Random Variable  Discrete Random Variables  Continuous Random Variables  Normal Distributions as Probability Distributions 1.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
Week 21 Conditional Probability Idea – have performed a chance experiment but don’t know the outcome (ω), but have some partial information (event A) about.
5-1 Random Variables and Probability Distributions The Binomial Distribution.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Mistah Flynn.
Probability Distribution
PROBABILITY AND STATISTICS WEEK 4 Onur Doğan. Random Variable Random Variable. Let S be the sample space for an experiment. A real-valued function that.
Measuring chance Probabilities FETP India. Competency to be gained from this lecture Apply probabilities to field epidemiology.
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
+ Chapter 5 Overview 5.1 Introducing Probability 5.2 Combining Events 5.3 Conditional Probability 5.4 Counting Methods 1.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Introduction: Random Variables What are random variables? Intro to probability distributions Discrete Continuous.
Chap 5-1 Discrete and Continuous Probability Distributions.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
THE NORMAL DISTRIBUTION
Probability Distributions ( 확률분포 ) Chapter 5. 2 모든 가능한 ( 확률 ) 변수의 값에 대해 확률을 할당하는 체계 X 가 1, 2, …, 6 의 값을 가진다면 이 6 개 변수 값에 확률을 할당하는 함수 Definition.
Theoretical distributions: the Normal distribution.
Chapter Five The Binomial Probability Distribution and Related Topics
CHAPTER 6 Random Variables
CHAPTER 6 Random Variables
3 Discrete Random Variables and Probability Distributions
Chapter 6- Random Variables
Business Statistics Topic 4
Chapter 4 – Part 3.
Simple Random Sample A simple random sample (SRS) of size n consists of n elements from the population chosen in such a way that every set of n elements.
Probability distributions
6.1: Discrete and Continuous Random Variables
CHAPTER 6 Random Variables
CHAPTER 6 Random Variables
CHAPTER 6 Random Variables
CHAPTER 6 Random Variables
CHAPTER 6 Random Variables
AP Statistics Chapter 16 Notes.
Discrete Distributions
CHAPTER 6 Random Variables
Chapter 6: Probability.
Presentation transcript:

Random Variable Overview What are random variables? Intro to probability distributions Discrete Continuous Linear transformations of RVs Combinations of RVs

What are random variables? Let X represent a quantitative variable that is measured or observed in an experiment. The value that X takes on in a given experiment is a random outcome. Counting the number of defective lightbulbs in a case of bulbs Measuring daily rainfall in inches Measuring the average depression score of computer science majors

Sample means, standard deviations, proportions, and frequencies are all random variables.

Discrete Continuous Two types of random variables The observations can take only a finite, countable number of values. The observations can take on any of the countless number of values in an interval The number of heads in four coin tosses The number of anorexics in a random sample of 500 people The average response time of a random sample of 200 depressed patients The average IQ of a random sample of 22 statistics students

In general, averages are continuous and counts are discrete. The average anger response The number of juvenile delinquents

Discrete Continuous What is a probability distribution? The probabilities associated with each specific value of the RV Continuous The probabilities associated with a range of values of the RV.

Discrete Probability Distributions

The Sample Space Suppose that we toss three coins. Let X = the number of heads appearing. X is a random variable taking on one of the values 0,1,2,3 Two balls are randomly chosen from an urn of blue and red balls. We win $1 for every blue and lose $1 for every red. Let X = our total winnings. X is a random variable taking on one of the values -2, 0, 2

The Sample Space Suppose that we toss two dice. Let X = the sum of the two tosses. X is a random variable taking on one of the values _________ Suppose that we toss two dice. Let X = the difference of the two tosses. X is a random variable taking on one of the values ___________ Sum: Y is 2,3,4,5,6,7,8,9,10,11,12 Difference: Y is 0, 1, 2, 3, 4, 5

Probability Distributions The probability distribution of X lists the values in the sample space and their associated probabilities. Suppose that we toss a fair die. Let X = the outcome of the toss. X is a random variable taking on one of the values 1, 2, 3, 4, 5, 6 xi pi 1 1/6 2 1/6 3 1/6 4 1/6 5 1/6 6 1/6

Probability Distributions The probability distribution of X lists the values in the sample space and their associated probabilities. probability outcome xi pi 1 1/6 2 1/6 3 1/6 4 1/6 5 1/6 6 1/6

Probability Distributions Suppose that we toss two coins. Let X = the number of heads. Make the probability distribution

Probability Distributions xi pi 0 1/4 1 2/4 2 1/4 Suppose that we toss two coins. Let X = the number of heads. Make the probability distribution probability outcome

Probability Distributions Sometimes you can estimate discrete probability distributions using a really large sample Cryptography: Frequencies of letters in a 1000 letter sample xi pi A 73/1000 B 9/1000 C 30/1000 D 44/1000 E 130/1000 F 28/1000 probability outcome

Expected Values The mean of a discrete probability distribution (called the “expected value”) can be found using this formula It is a weighted average of the possible values of X, each value being weighted by its probability of occurrence.

Expected Values xi pi 1 1/6 2 1/6 3 1/6 4 1/6 5 1/6 6 1/6 1 1/6 2 1/6 3 1/6 4 1/6 5 1/6 6 1/6 Suppose that we toss a fair die. Let X = the outcome of the toss. X is a random variable taking on one of the values 1, 2, 3, 4, 5, 6 What is the expected value?

Expected Values xi pi 10 3/10 -2 7/10 Suppose we draw one marble out of a bowl containing 3 green and 7 black marbles. We win $10 if we draw a green marble but we lose $2 if we draw a black marble. Let X = our winnings. What is the expected value of X? Should you play this game? xi pi 10 -2 3/10 7/10

Variance The variance of a discrete probability distribution can be found using this formula It is a weighted average of the squared deviations in X

xi pi 1 1/6 2 1/6 3 1/6 4 1/6 5 1/6 6 1/6 Suppose that we toss a fair die. Let X = the outcome of the toss. X is a random variable taking on one of the values 1, 2, 3, 4, 5, 6 μx = 3.5 σx2 = ?

xi pi ( xi-X)2 1 1/6 6.25 2 1/6 2.25 3 1/6 .25 4 1/6 .25 5 1/6 2.25 6 1/6 6.25 Suppose that we toss a fair die. Let X = the outcome of the toss. X is a random variable taking on one of the values 1, 2, 3, 4, 5, 6 μx = 3.5 σx2 = ?

Suppose that we toss a coin Suppose that we toss a coin. Let X = 1 if it’s heads and 0 if it’s tails. What is the expected value of X? What is the variance? xi pi ( xi-X)2 Expected value .50 Variance .25 μ = .50 σ2 = .25

Suppose that we toss 3 coins Suppose that we toss 3 coins. For every head we get $1 and for every tail we lose $1. Let X = our winnings. What is the expected value of X? What is the variance of X? xi pi ( xi-X)2 μ = 0 σ2 = 3

Known Discrete Distributions The bernoulli (heads versus tails) The binomial (# heads in n tosses) The poisson (# customers entering a post office in a day)

Continuous Probability Distributions

Continuous Probability Distributions We talk about probabilities for a range of values, not a particular value. Probability for a range of values is determined by the area under the probability distribution curve (use calculus or a table). Expected value Variance

Known Continuous Distributions The uniform distribution The normal distribution The t distribution The F distribution

Normal Distribution The probability distribution curve for the normal distribution N(µ,σ) is defined by this function Luckily, you can you can find the probabilities for this curve using Table E Expected value Variance

Standard Normal Distribution A normal distribution has mean μx and variance σx2 The standard normal distribution is a normal distribution that has been transformed to have mean 0 and variance 1 If raw scores are normally distributed, the distribution of z-scores will be standard normal Thus if raw scores are normally distributed, we can associate z-scores with standard normal probabilities (whether or not raw scores are normally distributed, a z-score accurately indexes/positions a score in terms of the number of standard deviations away from the mean) Normalized standard scores Normalized T scores, stanines, ..

- 3 - 2 - 1 1 2 3 Z Unusual Values Ordinary Interpreting Z Scores

Z-scores: Handy for thinking about the normal probability distribution If the distribution of raw scores is normal, the z distribution will be “standard normal” Z scores This is a probability density curve In particular, it is the “standard normal” probability distribution Probability corresponds to area under the curve Total area under the curve is 1

Standard Normal Distribution Area = 0.3413 Z scores 1 standard deviation includes about 68% of cases (34% on each side) 2 standard deviations includes about 95% of cases 3 standard deviations includes about 99.7% of cases ASSUMING RAW SCORES DISTRIBUTED NORMALLY

Using Appendix E.10 For positive z scores, gives you area under curve that corresponds to probability For negative z scores, use the complement rule μ z μ z μ z z mean to larger smaller z portion portion .00 0.0000 0.5000 0.5000 .50 0.1915 0.6915 0.3085 1.0 0.3413 0.8413 0.1587 1.5 0.4332 0.9332 0.0668 2.0 0.4772 0.9772 0.0228 2.5 0.4938 0.9938 0.0062 Note: Proportion and probability are equivalent.

Standard Normal Distribution Area found in Appx E.10 Area = 0.3413 1 2 3 -1 -2 -3 Area = 0.1587 1.58 Score (z ) ? Score (z ) 0.0571 0.4429

Exercise What is the probability of getting a Z greater than 1.96? What z-score will give you a probability of 5% in the upper tail?

Applications Let’s say the population of bartenders has an IQ of 100 and a standard deviation of 10 If we measure the IQ of any one bartender, how likely is it that her score would be greater than 80?

P(x > 80) = ? Step 1: Translate the score into z score Step 2: Use E.10 to get probability P(Z > -2) = 98%

Exercise Let’s say the population of bartenders has an IQ of 100 and a standard deviation of 10 If we measure the IQ of any one bartender, how likely is it that her score would be Greater than 80? Between 90 and 110? Greater than 115?

Transformations & Combinations

Rules for Expected Values Linear Transformations: If you add/subtract a constant to the RV, then add/subtract that number to the X If you mult/divide the RV by a constant, then mult/divide the X by that number Combining Two Random Variables If you add random variable X to random variable Y, then add X to Y If you subtract random variable X from random variable Y, then subtract X from Y

Rules for Variances of Random Variables Linear Transformations: If you add/subtract a constant to the RV, then nothing happens to X2 If you mult/divide the RV by a constant, then mult/divide the X2 by that number squared Combining Two Independent Random Variables If you add random variable X to random variable Y, then add X2 to Y2 If you subtract random variable X from random variable Y, then add X2 to Y2

Example 1 Suppose that we toss a coin. Let X = 1 if it’s heads and 0 if it’s tails. What is the expected value of X? X = ______ What is the variance of X? σx2 = _____ Now we go into a special “double or nothing” round. All dollar values are doubled in this round. What is the new expected value of X? x = ______ What is the new variance of X? σx2 = _____ .50 1.00 .25 1.00

Example 2 Suppose that we toss a coin. Let X = 1 if it’s heads and 0 if it’s tails. What is the expected value of X? X = ______ What is the variance of X? σx2 = _____ Now suppose we toss three coins. What is the expected value of all three tosses combined? X+Y+Z = ______ What is the new variance? σX+Y+Z = _____ .50 1.50 .25 .75

Example 3 50 vegetarians and 100 non-vegetarians participate in a study of cardiovascular health. On average, the vegetarians received a score of 80 with a standard deviation of 5. The non-vegetarians scored 70 points on average with a standard deviation of 10. A sneaky researcher tries to fudge the data by multiplying the scores of the non-vegetarians by 1.2 and adding 5 points. What happens to the mean and sd?

Example 4 100 pairs of male-female siblings participate in a study of repressive coping. For the women, the average repressive coping score was 6 with a standard deviation of .5. For men, the average repressive coping score was 5, with a standard deviation of .5. What is the average and standard deviation of the set of male-female difference scores?