Ch7: Sampling Distributions 29 Sep 2011 BUSI275 Dr. Sean Ho HW3 due 10pm.

Slides:



Advertisements
Similar presentations
Sampling Distributions (§ )
Advertisements

1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.3 Estimating a Population mean µ (σ known) Objective Find the confidence.
Objectives Look at Central Limit Theorem Sampling distribution of the mean.
Chapter 7 Sampling and Sampling Distributions
Introduction to Inference Estimating with Confidence Chapter 6.1.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
Chapter Sampling Distributions and Hypothesis Testing.
Inference about a Mean Part II
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
BCOR 1020 Business Statistics
The Sampling Distribution Introduction to Hypothesis Testing and Interval Estimation.
Sampling Distributions & Point Estimation. Questions What is a sampling distribution? What is the standard error? What is the principle of maximum likelihood?
Chapter 11: Random Sampling and Sampling Distributions
Chapter 6 Sampling and Sampling Distributions
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Review of normal distribution. Exercise Solution.
Ch8: Confidence Intervals 4 Oct 2011 BUSI275 Dr. Sean Ho Dataset description due tonight 10pm HW4 due Thu 10pm.
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Chapter 8: Confidence Intervals
Populations, Samples, Standard errors, confidence intervals Dr. Omar Al Jadaan.
Stat 13, Thu 5/10/ CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When.
Sampling and Confidence Interval
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Topic 5 Statistical inference: point and interval estimate
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Ch6: Continuous Distributions 27 Sep 2011 BUSI275 Dr. Sean Ho HW3 due Thu 10pm Download and open: 06-Normal.xls 06-Normal.xls.
Stat 13, Tue 5/8/ Collect HW Central limit theorem. 3. CLT for 0-1 events. 4. Examples. 5.  versus  /√n. 6. Assumptions. Read ch. 5 and 6.
Sampling Distributions & Standard Error Lesson 7.
Statistical Sampling & Analysis of Sample Data
Sampling and Confidence Interval Kenneth Kwan Ho Chui, PhD, MPH Department of Public Health and Community Medicine
Ch5-6: Common Probability Distributions 31 Jan 2012 Dr. Sean Ho busi275.seanho.com HW3 due Thu 10pm Dataset description due next Tue 7Feb Please download:
Section 5.2 The Sampling Distribution of the Sample Mean.
Sample Variability Consider the small population of integers {0, 2, 4, 6, 8} It is clear that the mean, μ = 4. Suppose we did not know the population mean.
LECTURE 25 THURSDAY, 19 NOVEMBER STA291 Fall
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
READING HANDOUT #5 PERCENTS. Container of Beads Container has 4,000 beads 20% - RED 80% - WHITE Sample of 50 beads with pallet. Population - the 4,000.
Confidence Intervals Target Goal: I can use normal calculations to construct confidence intervals. I can interpret a confidence interval in context. 8.1b.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Chapter 10: Confidence Intervals
26134 Business Statistics Tutorial 11: Hypothesis Testing Introduction: Key concepts in this tutorial are listed below 1. Difference.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall
26134 Business Statistics Tutorial 12: REVISION THRESHOLD CONCEPT 5 (TH5): Theoretical foundation of statistical inference:
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Chapters 6 & 7 Overview Created by Erin Hodgess, Houston, Texas.
Ch3: Exploring Your Data with Descriptives 15 Sep 2011 BUSI275 Dr. Sean Ho HW1 due tonight 10pm Download and open “02-SportsShoes.xls”02-SportsShoes.xls.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
Introduction to inference Estimating with confidence IPS chapter 6.1 © 2006 W.H. Freeman and Company.
1 Probability and Statistics Confidence Intervals.
m/sampling_dist/index.html.
Ch5: Discrete Distributions 22 Sep 2011 BUSI275 Dr. Sean Ho HW2 due 10pm Download and open: 05-Discrete.xls 05-Discrete.xls.
Many times in statistical analysis, we do not know the TRUE mean of a population on interest. This is why we use sampling to be able to generalize the.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
And distribution of sample means
Introduction to Inference
Statistical Inference
Sampling Distributions
Estimating the Population Mean Income of Lexus Owners
Towson University - J. Jung
Chapter 7 Sampling Distributions.
Confidence intervals: The basics
Chapter 7 Sampling Distributions.
Sampling Distributions
Confidence intervals: The basics
Chapter 7 Sampling Distributions.
Sampling Distributions (§ )
Chapter 7 Sampling Distributions.
How Confident Are You?.
Presentation transcript:

Ch7: Sampling Distributions 29 Sep 2011 BUSI275 Dr. Sean Ho HW3 due 10pm

29 Sep 2011 BUSI275: sampling distributions2 Outline for today Sampling distributions Sampling distribution of the sample mean μ x and σ x Central Limit Theorem Uses of the SDSM Probability of sample avg above a threshold 90% confidence interval Estimating needed sample size

29 Sep 2011 BUSI275: sampling distributions3 Sampling error Sampling is the process of drawing a sample out from a population Sampling error is the difference between a statistic calculated on the sample and the true value of the statistic in the population e.g., pop. of 100 products; avg price is μ=$50 Draw a sample of 10 products, calculate average price to be x=$55 We just so happened to draw 10 products that are more expensive than the average Sampling error is $5 Pop. μ Sample x

29 Sep 2011 BUSI275: sampling distributions4 Sampling distribution 1) Draw one sample of size n 2) Find its sample mean x (or other statistic) 3) Draw another sample of size n; find its mean 4) Repeat for all possible samples of size n 5) Build a histogram of all those sample means In the histogram for the population, each block represents one observation In the histogram for the sampling distribution, each block represents one whole sample! Pop. sample x x x SDSM

29 Sep 2011 BUSI275: sampling distributions5 SDSM Sampling distribution of sample means Histogram of sample means (x) of all possible samples of size n taken from the population It has its own mean, μ x, and SD, σ x SDSM is centred around the true mean μ i.e., μ x = μ If μ=$50 and our sample of 10 has x=$55, we just so happened to take a high sample But other samples will have lower x On average, the x should be around $50

29 Sep 2011 BUSI275: sampling distributions6 Properties of the SDSM μ x = μ: centred around true mean σ x = σ/√n: narrower as sample size increases For large n, any sample looks about the same Larger n ⇒ sample is better estimate of pop σ x is also called the standard error If pop is normal, then SDSM is also normal If pop size N is finite and sample size n is a sizeable fraction of it (say >5%), need to adjust standard error:

29 Sep 2011 BUSI275: sampling distributions7 Central Limit Theorem In general, we won't know the shape of the population distribution, but As n gets larger, the SDSM gets more normal So we can use NORMDIST/INV to make calculations on it So, as sample size increases, two good things: Standard error decreases (σ x = σ/√n) SDSM becomes more normal (CLT) n n: x Population: x xx

29 Sep 2011 BUSI275: sampling distributions8 SDSM as n SDSM matches original population As n increases, SDSM gets tighter and normal Regardless of shape of original population! Note: pop doesn't get more normal; it does not change Only the sampling distribution changes Hemist

29 Sep 2011 BUSI275: sampling distributions9 SDSM: example Say package weight is normal: μ=10kg, σ=4kg Say we have to pay extra fee if the average package weight in a shipment is over 12kg If our shipment has 4 packages, what is the chance we have to pay fee? Standard error: σ x = 4/√4 = 2kg z = (x – μ x )/σ x = (12-10)/2 = 1 Area to right: 1-NORMSDIST(1)=15.87%  Or: 1 - NORMDIST(12, 10, 2, 1) 16 pkgs? Std err: σ x = 4/√16 = 1kg; z = (12-10)/1 = 2 Area to right: 1-NORMSDIST(2) = 2.28% n=4 n=16

29 Sep 2011 BUSI275: sampling distributions10 SDSM: example Assume mutual fund MER norm: μ=4%, σ=1.8% Broker randomly(!) chooses 9 funds We want to say, “90% of the time, the avg MER for the portfolio of 9 funds is between ___% and ___%.” (find the limits) Lower limit: 90% in middle ⇒ 5% in left tail NORMSINV(0.05) ⇒ z = Std err: σ x = 1.8/√9 = 0.6% z = (x – μ x ) / σ x ⇒ = (x – 4) / 0.6 ⇒ lower limit is x = 4 – (1.645)(0.6) = 3.01% Upper: x=μ+(z)(σ x ) = 4 + (1.645)(0.6) = 4.99% 90%

29 Sep 2011 BUSI275: sampling distributions11 MER example: conclusion We conclude that, if the broker randomly chooses 9 mutual funds from the population 90% of the time, the average MER in the portfolio will be between 3.01% and 4.99% This does not mean 90% of the funds have MER between 3.01% and 4.99%! 90% on SDSM, not 90% on orig. population If the portfolio had 25 funds instead of 9, the range on avg MER would be even narrower But the range on MER in the population stays the same

29 Sep 2011 BUSI275: sampling distributions12 SDSM: estimate sample size So: given μ, σ, n, and a threshold for x ⇒ we can find probability (% area under SDSM) Std err ⇒ z-score ⇒ % (use NORMDIST) Now: if given μ, σ, threshold x, and % area, ⇒ we can find sample size n Experimental design: how much data needed Outline: From % area on SDSM, use NORMINV to get z Use (x – μ) to find standard error σ x Use σ x and σ to solve for sample size n

29 Sep 2011 BUSI275: sampling distributions13 Estimating needed sample size Assume weight of packages is normally distributed, with σ=1kg We want to estimate average weight to within a precision of ±50g, 95% of the time How many packages do we need to weigh? NORMSINV(0.975) → z=±1.96 ±1.96 = (x – μ x ) / σ x. Don't know μ, but we want (x – μ) = ±50g ⇒ σ x = 50g / 1.96 So σ/√n = 50g / Solving for n: n = (1000g * 1.96 / 50g) 2 = 1537 (round up)

29 Sep 2011 BUSI275: sampling distributions14 TODO HW3 (ch3-4): due tonight at 10pm Remember to format as a document! HWs are to be individual work Dataset description due this Tue 4 Oct