Random Sampling Approximations of E(X), p.m.f, and p.d.f.

Slides:



Advertisements
Similar presentations
1 Random Sampling - Random Samples. 2 Why do we need Random Samples? Many business applications -We will have a random variable X such that the probability.
Advertisements

Sampling Distribution of a Sample Proportion Lecture 26 Sections 8.1 – 8.2 Wed, Mar 8, 2006.
1 Practical Psychology 1 Week 5 Relative frequency, introduction to probability.
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 7 Introduction to Sampling Distributions
© 2010 Pearson Prentice Hall. All rights reserved Sampling Distributions and the Central Limit Theorem.
CONTINUOUS RANDOM VARIABLES These are used to define probability models for continuous scale measurements, e.g. distance, weight, time For a large data.
Probability Distributions
Test 2 Stock Option Pricing
Scenario  You have last week’s closing stock price for a particular company. You are trying to place a value to a European call option which expires a.
Chapter 6 Introduction to Sampling Distributions
Probability Distributions Finite Random Variables.
Probability Distributions
Chapter 5: Probability Concepts
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
16 MULTIPLE INTEGRALS.
CHAPTER 6 Statistical Analysis of Experimental Data
Probability Distributions Random Variables: Finite and Continuous A review MAT174, Spring 2004.
Probability Distributions Random Variables: Finite and Continuous Distribution Functions Expected value April 3 – 10, 2003.
Random Sampling. In the real world, most R.V.’s for practical applications are continuous, and have no generalized formula for f X (x) and F X (x). We.
Probability Distributions Continuous Random Variables.
Variance Fall 2003, Math 115B. Basic Idea Tables of values and graphs of the p.m.f.’s of the finite random variables, X and Y, are given in the sheet.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Chapter Sampling Distributions 8 © 2010 Pearson Prentice Hall. All rights reserved.
5.4 The Central Limit Theorem Statistics Mrs. Spitz Fall 2008.
Random Sampling  In the real world, most R.V.’s for practical applications are continuous, and have no generalized formula for f X (x) and F X (x). 
Probability Distributions: Finite Random Variables.
© 2010 Pearson Education Inc.Goldstein/Schneider/Lay/Asmar, CALCULUS AND ITS APPLICATIONS, 12e – Slide 1 of 15 Chapter 12 Probability and Calculus.
L7.1b Continuous Random Variables CONTINUOUS RANDOM VARIABLES NORMAL DISTRIBUTIONS AD PROBABILITY DISTRIBUTIONS.
What We Know So Far… Data plots or graphs
Chapter 7: The Normal Probability Distribution
Chapter 6: Probability Distributions
Continuous Probability Distributions  Continuous Random Variable  A random variable whose space (set of possible values) is an entire interval of numbers.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Objectives 1.Describe the distribution of the sample mean: samples from normal populations 2.Describe.
Applications of Integration 6. Probability Probability Calculus plays a role in the analysis of random behavior. Suppose we consider the cholesterol.
Anthony J Greene1 Where We Left Off What is the probability of randomly selecting a sample of three individuals, all of whom have an I.Q. of 135 or more?
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Exam 1 next Thursday (March 7 th ) in class 15% of your grade Covers chapters 1-6 and the central limit theorem I will put practice problems, old exams,
Simulation Using computers to simulate real- world observations.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter The Normal Probability Distribution 7.
1 Topic 5 - Joint distributions and the CLT Joint distributions –Calculation of probabilities, mean and variance –Expectations of functions based on joint.
Lecture 6 Normal Distribution By Aziza Munir. Summary of last lecture Uniform discrete distribution Binomial Distribution Mean and Variance of binomial.
CY1B2 Statistics1 (ii) Poisson distribution The Poisson distribution resembles the binomial distribution if the probability of an accident is very small.
Sampling and estimation Petter Mostad
Using Probability Distributions for Project 2. Class Project How can probability distributions help us price a stock option? Last time we computed the.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter The Normal Probability Distribution 7.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Lecture 5 Introduction to Sampling Distributions.
Chapter 7 The Normal Probability Distribution 7.1 Properties of the Normal Distribution.
Sec 6.3 Bluman, Chapter Review: Find the z values; the graph is symmetrical. Bluman, Chapter 63.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Lesson 99 - Continuous Random Variables HL Math - Santowski.
Estimating the Value of a Parameter Using Confidence Intervals
Properties of the Normal Distribution
Distribution of the Sample Means
Chapter 7 Sampling Distributions.
The Normal Probability Distribution
Chapter 7 Sampling Distributions.
CONTINUOUS RANDOM VARIABLES AND THE NORMAL DISTRIBUTION
Chapter 7 Sampling Distributions.
AGENDA: DG minutes Begin Part 2 Unit 1 Lesson 11.
Chapter 7 Sampling Distributions.
Day 13 AGENDA: DG minutes Begin Part 2 Unit 1 Lesson 11.
Chapter 7 Sampling Distributions.
Presentation transcript:

Random Sampling Approximations of E(X), p.m.f, and p.d.f

Important Read through simulation slides for Thursday Homework #8 is due on Thursday Check web-page on Wednesday night -- print off any worksheets for simulation that might be there for Thursday Major mistakes on study guide put on-line; there is a new one there now –A different definition of the p.d.f and c.d.f for the uniform random variable then what was given in class but they mean the same thing. P.M.F versus P.D.F – need clarification because I mispoke

P.M.F versus P.D.F Either graph can be a histogram –I was assuming that the bin width will always be 1 for a finite random variable but that is not necessarily the case Take X = 0, ½, 1, etc. Probability Mass Function –The values along the y-axis of a histogram represent probabilities If you sum up the probabilities, they should add up to 1 every time (regardless of the bin width) Thus, to determine is a graph is a p.m.f, you need to add up the heights of the rectangles – if they add up to 1, then it is a p.m.f.

P.M.F versus P.D.F A probability density function can also be a histogram The values along the y-axis do not represent probabilities for a continuous random variable However, the area under graph must be equal to 1 –How can you check if a histogram represents an p.d.f? If the heights of the rectangles do not add up to 1, but the areas of the rectangles do sum to 1.

In conclusion Both a p.m.f and p.d.f graph can be histograms –To tell if a histogram represents a p.m.f, the sum of the HEIGHTS of the rectangles must equal 1 (because the heights represent probabilities) –To tell if a histogram represents an p.d.f, the sum of the heights of the rectangles do not equal but the area of the rectangles do. Let’s look at number 4b from the homework just turned in ….

Why do we use Random Sampling? In business, we identify a random variable We want its probability information Problem: We do not know its distribution OR expected value Solution: Estimate E(X) and estimate F X (x) and f X (x) using random sampling

Definitions A number x that results from a trial of the process is called an observation of X A set {x 1, x 2, ……, x n } of n independent observations of the same random variable X is called a random sample of size n.

Example #1 Suppose that X is the number of assembly line stoppages that occur during an 8-hour shift in a manufacturing plant. We could obtain a random sample of size 10 by watching the line for 10 different shifts and recording the number of stoppages during each 8-hour shift

Example #1 (continued) Looking above, we see the information recorded during the 10 different shift observations We can compute the sample mean of the observations The sample mean is denoted by

Statistics and Probability There is a difference between probabilities and statistics even though people use them interchangably A number that describes a sample is called a statistic THEOREM: The statistic can be used as an estimate of E(X). In general, the larger the sample size n, the better the estimate will be

Sample Mean We can find the mean of example #1

Approximating Probability Mass and Density Functions If we have a large enough sample, we can approximate functions I.e., we can approximate a p.m.f or a p.d.f depending on the random variable If we approximate a p.m.f or p.d.f, we can also look at the corresponding graphs

Example #3 Suppose that the assembly line discussed in Example #1 runs 24 hours a day, with workers in three shifts. Observations of the number of stoppages during an 8-hour shift were recorded for a nine month period. I.e., 819 different shifts were observed and recorded in the file Stoppages.xls.Stoppages.xls

Relative Frequencies Relative frequencies were plotted to obtain the histogram seen in Stoppages.xls The relative frequency of each value X in the sample gives an estimate for the probability that X will assume that value. WHY? How did we obtain the relative frequencies? A histogram will give a good approximation for the graph of f X

Continuous Random Variables A large random sample can also be used to approximate the p.d.f of a continuous random variable One way we can obtain our p.d.f is by looking at smaller and smaller bin widths of our data Use the HISTOGRAM function in Excel to find the approximation of the graph of the p.d.f

Example #4 The manager of the plant that was described in the the previous examples wants to get a better of understanding of the delays caused by stoppagesof the assembly line. So, in addition to knowing how many stoppages there are, the manager wants to know how long they last.

Example #4 (continued) Let T be the (exact) length of time, in minutes, that a randomly selected stoppage will last QUESTION: Is T a continuous random variable? In Stoppages.xls, the duration of each stoppage was also recorded for all 819 shifts Therefore, we have a random sample of observations for T

Example #4 (continued) Used the function HISTOGRAM in Excel to plot an approximation of the p.d.f., f T In Stoppages.xls, bin widths of 2 minutes are used Since our bin width is 2, to make the area under the graph be 1, we had to divide each relative frequency by 2 and then plot those “new” relative frequencies –Note: Here you are dividing the relative frequency by the bin width – not the frequency by the bin width as stated in class Thus, you find the relative frequency as you did before and then divide it by the bin width By connecting the midpoints of the tops of the rectangles gives us an approximate curve

Using the approximated p.d.f We can use our plot to calculate probabilities For example, if we wanted to know P(2<T  4), we could look at the corresponding area under the graph Note: P(2<T  4) corresponds to an area under the graph between (2,4] which is a rectangle So, to find our probability, find the area of the rectangle

Focus on the Project We have a continuous random variable R norm which gives the normalized ratio of weekly closing prices on Disney stock (class project) Option Focus.xls contains 417 values of R norm from 417 weekly closing ratios They are considered to be independent observations Thus, make up a random sample of size 417 for R norm

Focus on the Project We can calculate sample mean which we know should be equal to what? We can create a plot using the relative frequencies Note: If your bin width is greater than 1, you will have to divide the relative frequency by your bin width to make the area under the curve be 1 Graphing the midpoints at the tops of the bars will produce a line graph approximation for f norm

What should you do? Plot an approximation of the probability density function for the normalized ratios of weekly closing prices The plot should be a line graph, where you are connecting the midpoints of the tops of the bars Remember, if your bin width is greater than 1 you will have to divide the relative frequencies by that width before you plot Find the sample mean of the normalized ratios – you already know what it should equal