Shooting right Sampling methods FETP India. Competency to be gained from this lecture Select a sample from a population to generate precise and valid.

Slides:



Advertisements
Similar presentations
for epidemiological studies
Advertisements

Sample size estimation
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 10: Sampling and Sampling Distributions
© 2003 Prentice-Hall, Inc.Chap 1-1 Business Statistics: A First Course (3 rd Edition) Chapter 1 Introduction and Data Collection.
QBM117 Business Statistics Statistical Inference Sampling 1.
1 1 Slide 2009 University of Minnesota-Duluth, Econ-2030(Dr. Tadesse) Chapter 7, Part B Sampling and Sampling Distributions Other Sampling Methods Other.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Taejin Jung, Ph.D. Week 8: Sampling Messages and People
MISUNDERSTOOD AND MISUSED
Who and How And How to Mess It up
Sampling.
Sampling and Sample Size Determination
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
Why sample? Diversity in populations Practicality and cost.
Chapter 11 Sampling Design. Chapter 11 Sampling Design.
Fundamentals of Sampling Method
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
11 Populations and Samples.
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
Formalizing the Concepts: Simple Random Sampling.
Sampling Designs and Sampling Procedures
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Sample Design.
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Sampling: Theory and Methods
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
Sampling Techniques LEARNING OBJECTIVES : After studying this module, participants will be able to : 1. Identify and define the population to be studied.
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
PARAMETRIC STATISTICAL INFERENCE
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
LECTURE 3 SAMPLING THEORY EPSY 640 Texas A&M University.
Sampling “Sampling is the process of choosing sample which is a group of people, items and objects. That are taken from population for measurement and.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Sampling Techniques 19 th and 20 th. Learning Outcomes Students should be able to design the source, the type and the technique of collecting data.
Learning Objectives Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Data Collection & Sampling Dr. Guerette. Gathering Data Three ways a researcher collects data: Three ways a researcher collects data: By asking questions.
Chapter 10 Sampling: Theories, Designs and Plans.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Basic Business Statistics
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Population vs. Sample. Population: a set which includes all measurements of interest to the researcher (The collection of all responses, measurements,
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
PRESENTED BY- MEENAL SANTANI (039) SWATI LUTHRA (054)
Sampling Design and Procedure
Sampling Dr Hidayathulla Shaikh. Contents At the end of lecture student should know  Why sampling is done  Terminologies involved  Different Sampling.
Graduate School of Business Leadership
4 Sampling.
Meeting-6 SAMPLING DESIGN
Slides by JOHN LOUCKS St. Edward’s University.
Sampling.
Presentation transcript:

Shooting right Sampling methods FETP India

Competency to be gained from this lecture Select a sample from a population to generate precise and valid estimates

Key issues Sampling Sampling error Validity and precision Sample size calculation

Definition of sampling Procedure by which some members of the population are selected as representatives of the entire population Sampling

Sample Population

Study population The study population is the population to which the results of the study will be inferred Sampling

The study population depends upon the research question How many injections do people receive each year in India?  Study population: Population of India How many needle-sticks to health care workers experience each year in India?  Study population: Health care workers of India How many hospitals have a needle-stick prevention policy in India?  Study population: Hospitals of India Sampling

The sample needs to be representative of the population in terms of time Seasonality Day of the week Time of the day Sampling

The sample needs to be representative of the population in terms of place Urban Rural Sampling

The sample needs to be representative of the population in terms of persons Age Sex Other demographic characteristics Sampling

Definition of sampling terms Sampling unit (Basic sampling unit, BSU)  Elementary unit that will be sampled People Health care workers Hospitals Sampling frame  List of all sampling units in the population Sampling scheme  Method used to select sampling units from the sampling frame Sampling

Why do we sample populations? Obtain information from large populations Ensure the efficiency of a study Obtain more accurate information Sampling

Population Sample Infinite/finite size Characterized by unknown parameters Finite size Characterized by measurable parameters (e.g., mean, standard dev.) A sample is a part of the population, selected by the investigator to gather information (measures) on certain characteristics of the original population Sampling

Practical example The Ministry of Health of a country X wants to estimate the proportion of children in elementary schools who have been immunized against childhood infectious diseases The task must be completed in one month The objective is to estimate the proportion of immunized children Sampling

Planning and implementing a sample survey Sampling plan  Methodology used for selecting the sample from the population Estimation procedures  Algorithms or formulas used for obtaining estimates of population values from the sample data and for estimating the reliability of these population estimates Sampling

The various steps of a survey (1/2) 1.Describe the objective of the survey 2.Define the target population 3.Prepare a “sampling frame” 4.Analysis plan: Develop “table shells” for the results 5.Choose the sampling method 6.Calculate the sample size (s) Sampling

The various steps of a survey (2/2) 7.Develop, field test and revise data collection instrument(s) 8.Train the data collectors 9.Conduct the survey 10.Monitor the field work 11.Tabulate, analyze and interpret the results 12.Use the results Sampling

Collaborative sample design for steps 5 and 6 The statistician, the epidemiologist, and those who will use the data from the survey work together to choose the right sample design Sampling

Taking several samples If you take repeated samples out of a population, each result will be a little different Their distribution is called a sampling distribution Sampling error

Population mean A sampling distribution A sample mean 1 Standard Error Sampling error

MeanSD 95%99% The standard normal curve What is the value of mean and standard deviation?

Properties of the standard normal curve If every member of the population has an equal chance to be in the sample  We can plot our results on the normal curve by calculating a z score This tells us how likely the results are due to chance Example:  95% of the sample means will be within two standard deviations of the population mean Sampling error

Chance How likely is the result due to chance? Measured by the confidence level, example 95% or 0.95 We are 95% confident the population mean is within the confidence interval around our sample mean Sampling error

The validity and reliability of these extrapolations depend on:  How well the sample was chosen  How well the measurements were made Although hypothesis can be tested based on data collected on sample surveys, the primary objective is always estimation Sampling error

Quality of the measures: The precision If the results are precise, they do not vary if the measures are repeated Precision is expressed by the confidence we have in the results Precision is estimated by the confidence interval around the measure We can estimate the variability by computing the confidence interval True value Precision, validity

The sample size increases precision and reduces the confidence interval Lower limit Upper limit x Precision, validity

Estimating precision If every member of the target population had an equal chance of being in the sample Confidence interval provides an estimate of how closely the sample population proportion estimates the target population  Example: ± 10%. Precision, validity

Quality of the measures: The validity Absence of systematic error A valid measure reflects the true value in the population In contrast, a biased measure gives an unreliable “point of view” TrueObserved Precision, validity

Appropriate study design and sampling frame lead to valid results True Observed Precision, validity

Assuring validity Good design Interviewer training Quality assurance Precision, validity

10%5% A valid and precise measure Precision, validity

10%5% A precise measure that is not valid Precision, validity

10%5% A valid measure that is not precise Precision, validity

10%5% A measure that is not valid nor precise Precision, validity

Truth is everything It is easier to control bias and errors in a small sample than in a big one Better have:  A small sample that gives a true estimate Than:  A large sample that gives a false estimate Precision, validity

Type of samples Non-probability samples  Probability of being selected is unknown  Convenience samples Biased Best or worst scenario  Subjective samples Based on knowledge Time/resource constraints Probability samples Sampling techniques

Type of samples Non-probability samples Probability samples  Every unit in the population has a known probability of being selected  Only sampling method that allows to draw valid conclusions about population Sampling techniques

Random sampling in probability samples Removes the possibility of bias in selection of subjects Ensures that each subject has a known probability of being chosen Allows application of statistical theory Sampling techniques

Sampling error No sample is a perfect mirror image of the population Magnitude of error can be measured in probability samples Expressed by standard error of mean, proportion, differences… Function of:  Sample size  Variability in measurement Sampling techniques

Methods used in probability samples 1.Simple, random sampling 2.Systematic sampling 3.Stratified sampling 4.Cluster sampling 5.Multistage sampling Sampling techniques

1. Simple, random sampling Principle  Equal chance for each statistical unit Procedure  Number all units  Randomly draw units Advantages  Simple  Sampling error easily measured Disadvantages  Need complete list of units  Does not always achieve best representativity Sampling techniques

Example of simple, random sampling Numbers are selected at random

2. Systematic sampling Principle  A unit drawn every k units  Equal chance of being drawn for each unit Procedure  Calculate sampling interval (k = N/n)  Draw a random number (  k) for starting  Draw every k units from first unit Advantages  Ensures representativity across list  Easy to implement Disadvantage  Dangerous if list has cycles Sampling techniques

Example of systematic sampling Every eighth house is selected

3. Stratified sampling Principle  Classify population into homogeneous subgroups (strata)  Draw sample in each strata  Combine results of all strata Advantage  More precise if variable associated with strata  All subgroups represented, allowing separate conclusions about each of them Disadvantages  Sampling error difficult to measure  Loss of precision if small numbers sampled in individual strata Sampling techniques

Example of stratified sampling Estimate vaccination coverage in a country One sample drawn from each region Estimates calculated for each stratum Each strata weighted to obtain estimate for country Sampling techniques

4. Cluster sampling Principle  Random sample of groups (“clusters”) of units  All or proportion of units included in selected clusters Advantages  Simple: No list of units required  Less travel/resources required Disadvantages  Imprecise if clusters homogeneous (Large design effect)  Sampling error difficult to measure Sampling techniques

Cluster sampling The sampling unit is not a subject, but a group (cluster) of subjects. It is assumed that:  The variability among clusters is minimal  The variability within each cluster is what is observed in the general population Sampling techniques

The two stages of a cluster sample 1.First stage: Probability proportional to size Select the number of clusters to be included Compute a cumulative list of the populations in each unit with a grand total Divide the grand total by the number of clusters and obtain the sampling interval Choose a random number and identify the first cluster Add the sampling interval and identify the second cluster By repeating the same procedure, identify all the clusters 2.Second stage In each cluster select a random sample using a sampling frame of subjects (e.g. residents) or households Sampling techniques

Self-weighting in cluster samples Stage one: The larger units are more likely to be selected in the first round  Unit B twice as large as unit A will have twice the chance of being selected Stage two: Individuals in larger unit selected are less likely to be selected in the second round  Individual in unit B will have half the chance of being selected within the unit The two effects cancel each other and each person in the population has the same probability of being sampled

30 x 7 cluster sampling in the expanded programme of immunization Procedure:list of all villages (areas) with total population VillageInhabitantsCumulative ,715 Divide the cumulative total by 30 clusters we wish to select 4,715 : 30= Sampling techniques

30 x 7 cluster sampling in the expanded programme of immunization Find a random number with three digits (= Sampling interval) e.g. 123 Choose from the cumulative distribution the clusters by adding 157 (sampling interval) *1st cluster **2nd =280 In each village (area) choose 7 children Total sample 30 X 7= 210 Sampling techniques

Design effect (Use Epitable software) Global variance p(1-p) Var srs = n Cluster variance p= global proportion pi= proportion in each stratum n= number of subjects k= number of strata Σ (pi-p)² Var clus = k(k-1) Design effect = Var srs Var clust Sampling techniques

Example of cluster sampling Section 4 Section 5 Section 3 Section 2Section 1 Sampling techniques

5. Multistage sampling Principle  Several chained samples  Several statistical units Advantages  No complete listing of population required  Most feasible approach for large populations Disadvantages  Several sampling lists  Sampling error difficult to measure Sampling techniques

Steps in estimating sample size Identify major study variable Determine type of estimate (%, mean, ratio,...) Indicate expected frequency of factor of interest Decide on desired precision of the estimate Decide on acceptable risk that estimate will fall outside its real population value Adjust for estimated design effect Adjust for expected response rate (Adjust for population size) Sampling techniques

Sample size formula in descriptive survey (Use Epitable) z: alpha risk expressed in z-score p: expected prevalence q: 1 - p d: absolute precision g: design effect z² * p * q 1.96²*0.15*0.85 n = = 544 d²0.03² Cluster sampling z² * p * q 2*1.96²*0.15*0.85 n = g* = 1088 d² 0.03² Simple random / systematic sampling Sampling techniques

Remember Probability samples are the best Beware of …  Refusals  Absentees  “Do not know” Sampling techniques

Summary of methods used in probability samples 1.Simple, random sampling Draw subjects from list with random number 2.Systematic sampling Draw every xth subject 3.Stratified sampling Take one sample for each strata 4.Cluster sampling Select clusters and then select individuals 5.Multistage sampling Sample stage by stage Sampling techniques

Key issues We cannot study the whole population so we sample it Taking a sample leads to sampling error, which is measurable Good design and quality assurance ensure validity and while appropriate sample size will ensure precision Probability samples are the only ones that allow use of statistics as we know them