Sampling Error.  When we take a sample, our results will not exactly equal the correct results for the whole population. That is, our results will be.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Chapter 6 Sampling and Sampling Distributions
Estimation in Sampling
Sampling: Final and Initial Sample Size Determination
Topics: Inferential Statistics
Topic 7 Sampling And Sampling Distributions. The term Population represents everything we want to study, bearing in mind that the population is ever changing.
Chapter 7 Sampling and Sampling Distributions
Sampling Distributions
Chapter 8 Estimation: Single Population
INFERENTIAL STATISTICS. By using the sample statistics, researchers are usually interested in making inferences about the population. INFERENTIAL STATISICS.
Inferential Statistics
Standard error of estimate & Confidence interval.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Estimating the Value of a Parameter Using Confidence Intervals 9.
Review of normal distribution. Exercise Solution.
Chapter 7 Confidence Intervals and Sample Sizes
Chapter 7 Estimation: Single Population
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Chapter 11: Estimation Estimation Defined Confidence Levels
Populations, Samples, Standard errors, confidence intervals Dr. Omar Al Jadaan.
Sampling and Confidence Interval
Estimation of Statistical Parameters
7-1 Estim Unit 7 Statistical Inference - 1 Estimation FPP Chapters 21,23, Point Estimation Margin of Error Interval Estimation - Confidence Intervals.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
Introduction to Statistical Inference Chapter 11 Announcement: Read chapter 12 to page 299.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.
Statistical Interval for a Single Sample
Statistical Sampling & Analysis of Sample Data
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Statistical estimation, confidence intervals
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Confidence Interval Estimation For statistical inference in decision making:
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
STT 315 Ashwini maurya This lecture is based on Chapter 5.4 Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Various Topics of Interest to the Inquiring Orthopedist Richard Gerkin, MD, MS BGSMC GME Research.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Confidence Interval Estimation For statistical inference in decision making: Chapter 9.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Confidence Intervals INTRO. Confidence Intervals Brief review of sampling. Brief review of the Central Limit Theorem. How do CIs work? Why do we use CIs?
1 Probability and Statistics Confidence Intervals.
9-1 ESTIMATION Session Factors Affecting Confidence Interval Estimates The factors that determine the width of a confidence interval are: 1.The.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Estimating a Population Proportion ADM 2304 – Winter 2012 ©Tony Quon.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Statistics for Business and Economics 7 th Edition Chapter 7 Estimation: Single Population Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Sampling Distributions
ESTIMATION.
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
Econ 3790: Business and Economics Statistics
Chapter 7 Sampling Distributions.
Estimating a Population Proportion
Chapter 7 Sampling Distributions.
GENERALIZATION OF RESULTS OF A SAMPLE OVER POPULATION
ESTIMATION.
Chapter 7 Sampling Distributions.
Presentation transcript:

Sampling Error

 When we take a sample, our results will not exactly equal the correct results for the whole population. That is, our results will be subject to errors.  Sampling error: A sample is a subset of a population. Because of this property of samples, results obtained from them cannot reflect the full range of variation found in the larger group (population). This type of error, arising from the sampling process itself, is called sampling error which is a form of random error.  Sampling error can be minimized by increasing the size of the sample. When n = N ⇒ sampling error = 0

 Non-sampling error (bias)  It is a type of systematic error in the design or conduct of a sampling procedure which results in distortion of the sample, so that it is no longer representative of the reference population.  We can eliminate or reduce the non-sampling error (bias) by careful design of the sampling procedure and not by increasing the sample size.

 Sources of non sampling errors:  Accessibility bias, volunteer bias, etc.  The best known source of bias is non response. It is the failure to obtain information on some of the subjects included in the sample to be studied.  Non response results in significant bias when the following two conditions are both fulfilled. 1. When non-respondents constitute a significant proportion of the sample (about 15% or more) 2. When non-respondents differ significantly from respondents.

 There are several ways to deal with this problem and reduce the possibility of bias: 1. Data collection tools (questionnaire) have to be pre-tested. 2. If non response is due to absence of the subjects, repeated attempts should be considered to contact study subjects who were absent at the time of the initial visit. 3. To include additional people in the sample, so that non respondents who were absent during data collection can be replaced (make sure that their absence is not related to the topic being studied).

ESTIMATION

 The sample from a population is used to provide the estimates of the population parameters.  A parameter is a numerical descriptive measure of a population ( μ is an example of a parameter).  A statistic is a numerical descriptive measure of a sample ( X is an example of a statistic).  To each sample statistic there corresponds a population parameter.  We use X, S2, S, p, etc. to estimate μ, σ2, σ, P (or π), etc

Sample statisticCorresponding population parameter X (sample mean)μ (population mean) S2 ( sample variance)σ2 ( population variance) S (sample Standard deviation)σ(population standard deviation) p ( sample proportion)P or π (Population proportion)

Sampling Distribution of Means

 Sampling Distribution is a frequency distribution and it has its own mean and standard deviation.  Steps: 1. Obtain a sample of n observations selected completely at random from a large population. Determine their mean and then replace the observations in the population. 2. Obtain another random sample of n observations from the population, determine their mean and again replace the observations.

1. Repeat the sampling procedure indefinitely, calculating the mean of the random sample of n each time and subsequently replacing the observations in the population. 2. The result is a series of means of samples of size n. If each mean in the series is now treated as an individual observation and arrayed in a frequency distribution, one determines the sampling distributionof means of samples of size n.

 Because the scores ( X s) in the sampling distribution of means are themselves means (of individual samples), we shall use the notation σ X for the standard deviation of the distribution.  Standard error of mean (SEM): The standard deviation of the sampling distribution of means is called the standard error of the mean.  Formula: σ x = √ Ʃ ( x i - μ)2 / N

Properties of sampling distribution 1. The mean of the sampling distribution of means is the same as the population mean, μ. 2. The SD of the sampling distribution of means is σ / √n 3. The shape of the sampling distribution of means is approximately a normal curve, regardless of the shape of the population distribution and provided n is large enough (Central limit theorem).

Confidence interval

Interval Estimation (large samples)  A point estimate does not give any indication on how far away the parameter lies. A more useful method of estimation is to compute an interval which has a high probability of containing the parameter.  An interval estimate is a statement that a population parameter has a value lying between two specified limits.

Confidence interval  Confidence interval provides an indication of how close the sample estimate is likely to be to the true population value.  Gives an estimated range of values which is likely to include the true value of the unknown population parameter with a certain confidence (probability) and the estimated range being calculated from a given set of sample data.  Consider the standard normal distribution and the statement Pr (-1.96≤ Z ≤1.96) =. 95. it means that 95% of the standard normal curve lies between and –1.96.

 Formula: Pr( X (σ /√n) ≤ μ ≤ X (σ /√n) ) =.95  The range X -1.96(σ /√n) to X (σ /√n) ) is called the 95% confidence interval;  X -1.96(σ /√n) is the lower confidence limit while X (σ /√n) is the upper confidence limit

Few things to remember  At 90%, the corresponding Z score to be used in the formula is 1.64  At 95%, the corresponding Z score to be used in the formula is 1.96  99%, the corresponding Z score is 2.58

 problem 1 :  Suppose x= 50, SD = 10 and N=100. what is the 99% confidence interval? CI lower, X –2.58 (σ /√n) = 47 CI upper, X (σ /√n) = 53

Example 1 The mean fasting blood sugar of a group of 70 individuals was found to be 115 with a SD of Find the 95% and 99% CI’s for the population mean. Solution: SE (mean) = (12.56/√70) = % CI = 115 ± 1.96 (1.51) = (112.05, )

Interpretation of 95% CI:  Probabilistic Interpretation: In repeated sampling, approximately 95 percent of the intervals constructed will include the population mean.  Practical Interpretation: One can say with 95 percent confidence that the population mean fasting blood sugar is between and

Example 2 In a study, it was found that 129 out of 150 carcinoma of lung patients were smokers. Find the 95% & 99% CI’s for the proportion of smokers among lung cancer patients. Solution: SE (Proportion) = √ (0.86)(0.14)/150) = % CI = 0.86 ± 1.96 (0.028) = (0.8, 0.91) Similarly, 99% CI is (0.932, 0.788)

Example 3 In a study to assess the effect of anabolic steroids in weight gain, the following data was observed Find the 95% CI for the difference in the mean weight gain? Solution: SE (Diff. in mean) = √( /50)+(9 2 /50)) = CI= (3.7 – 3.1) + (1.96 x 3.257) 95% CI = (-5.78, 6.98) GroupnMean weight gainSD of Weight Study Contro l

Example 4 In a study to assess the effect of BCG, the following data was observed Find the 99% CI for the difference in proportion? Solution: SE (Diff. in Proportion) = √((0.0088)(0.9912)/2500+(0.03)(0.97)/3000) = % CI = ( -0.03, 0.019) Actual difference between proportion = = 2.12 BCGnTB developedDisease rate Vaccinated % Unvaccinated %

Factors affecting the width of confidence interval  Variation in the data (standard deviation): more the SD, more the confidence interval  Sample size : as N increases, confidence interval decreases  Level of Confidence: more the confidence level (90%, 95% and 99%), more the Confidence interval

THANK YOU