Presentation is loading. Please wait.

Presentation is loading. Please wait.

Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.

Similar presentations


Presentation on theme: "Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad."— Presentation transcript:

1 Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad

2 Review 2

3 Confidence Intervals Confidence Interval: An interval of values computed from the sample, that is almost sure to cover the true population value. We make confidence intervals using values computed from the sample, not the known values from the population Interpretation: In 95% of the samples we take, the true population proportion (or mean) will be in the interval. This is also the same as saying we are 95% confident that the true population proportion (or mean) will be in the interval

4 Sample Size Estimation An investigator might have number of goals while handling samplings issue. Deciding amount of sampling error and balance the precision of estimates with the cost of survey especially in SRS. Estimating a sample size is one of the major goal of surveys. 4

5 Steps involved in sample size estimation Step-1 – What is expected of the sample and how much precision do I need? – What are the consequences of sample results. – How much error is tolerable. – A preliminary investigation, however, often needs less precision than an ongoing survey. 5

6 A wrong approach – Many people usually ask “what percentage of the population should I include in my sample”. – Ideally focus should be on precision of estimates. Precision is obtained through the absolute size of the sample, not the proportion of the population covered (except in very small populations) 6

7 Step-2 – Find an equation relating the sample size n and your expectations of the sample Step-3 – Estimate any unknown quantities and solve for n. Step-4 – If you feel that sample size (estimated) is too large to handle, go back and adjust your expectation and then try again. – Still if your sample size is large, then do think again to initiate your study. 7

8 Specify the Tolerable Error How much precision is needed (decided by investigator only). The desired precision is often expressed in absolute term, as Pr(|Estimator-Parameter|≤e)=1-α Where e is called Margin of Error Reasonable values for α and e must be decided by investigator. For many surveys of people in which a proportion is measured, e=0.03 and α=0.05 8

9 Sometimes you would like to achieve a desired relation precision, controlling coefficient of variation (CV) rather than the absolute Error. In that case, if Parameter ≠ 0, the precision may be expressed as 9

10 Find an Equation The simplest equation relating the precision and sample size comes from the confidence intervals in the previous section. To obtains absolute precision e, find a value of n that satisfies To solve this equation for n, we first find the sample n 0 that we would use for an SRSWR, that is 10

11 Then the desired sample size is Of course, if n 0 ≥N, we simply take a census with n=N 11

12 Example Suppose we want to estimate the proportion of recipes in the Better Homes and Gardens New Cook Book that do not involve animal products. We plan to take an SRS of the N=1251 test kitchen-tested recipes, and want to use a 95% CI with margin of error 0.03. then n 0 = 12

13 Randomization theory results for Simple Random Sampling In this section we will prove some unbiased estimators. No distribution assumption are made about the y i ’s in order to ascertain that is unbiased for estimating population mean µ, like normality assumptions. Let us see how the randomization theory works for deriving properties of the sample mean in SRS. As in Cornfield ( 1944) _____________________________ 13

14 14

15 15

16 16

17 17

18 18

19 19

20 When should a Simple Random Sample Be Used? Avoid SRS in such situations – Before taking an SRS, you should consider whether a survey sample is the best method for studying your research question. – You may not have a list of the observation units, or it may be expensive in terms of travel time to take an SRS. – You may have additional information that can be used to design a more cost effective sampling scheme. 20

21 SRS should be used in following situations Little extra information is available that can be used when designing the survey like sampling frame. Person using the data insist on using SRS formula, whether they are appropriate or not. The primary interest is in multivariate relationships such as regression equations that hold for the whole population, and there are no compelling reasons to take a stratified or cluster sample. 21

22 Key Terms Used in SRS Unit Cluster sample: A probability sample in which each population unit belongs to a group, or cluster, and the clusters are sampled according to the sampling design. Confidence interval (CI): An interval estimate for a population quantity, for which the probability that the random interval contains the true value of the population quantity is known. Design-based inference: Inference for finite population characteristics based on the survey design, also called randomization inference. Finite population correction (fpc): A correction factor which, when multiplied by the with-replacement variance, gives the without-replacement variance. For an SRS of size n from a population of size N, the fpc is 1 − n/N. 22

23 Inclusion probability: π i = probability that unit i is included in the sample. Margin of error: Half of the width of a 95% CI. Model-based inference: Inference for finite population characteristics based on a model for the population, also called prediction inference. Probability sampling: Method of sampling in which every subset of the population has a known probability of being included in the sample. Sampling distribution: The probability distribution of a statistic generated by the sampling design. 23

24 Sampling weight: Reciprocal of the inclusion probability; w i =1/π i. Self-weighting sample: A sample in which all probabilities of inclusion π i are equal, so that all sampling weights wi are the same. Simple random sample with replacement (SRSWR): A probability sample in which the first unit is selected from the population with probability 1/N; then the unit is replaced and the second unit is selected from the set of N units with probability 1/N, and so on until n units are selected. 24

25 Simple random sample without replacement (SRS): An SRS of size n is a probability sample in which any possible subset of n units from the population has the same probability = (n!(N −n)!/N!) of being the sample selected. Standard error (SE): The square root of the estimated variance of a statistic. Stratified sample: A probability sample in which population units are partitioned into strata, and then a probability sample of units is taken from each stratum. Systematic sample: A probability sample in which every k th unit in the population is selected to be in the sample, starting with a randomly chosen value R. Systematic sampling is a special case of cluster sampling. 25


Download ppt "Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad."

Similar presentations


Ads by Google