Calculation of Sampling Errors MICS3 Data Analysis and Report Writing Workshop.

Background The sample selected in a survey is one of the many samples that could have been selected (with the same design and size). Sampling errors are measures of the variability between all possible samples, and they can be estimated from the survey results.
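The idea of "variability between all possible samples" can be made concrete with a toy case. The sketch below uses a hypothetical population of five values and simple random samples of size 2 (not a MICS3 design). It enumerates every possible sample and measures how much the sample mean varies across them.

```python
from itertools import combinations
from statistics import mean, pstdev

# Hypothetical population of five values (illustrative only).
population = [2, 4, 6, 8, 10]

# Every possible simple random sample of size 2, drawn without replacement.
samples = list(combinations(population, 2))
sample_means = [mean(s) for s in samples]

# The sampling error is the spread of the estimate over all possible samples.
true_sampling_error = pstdev(sample_means)

print(len(samples))                    # number of possible samples
print(mean(sample_means))              # average of the estimates
print(round(true_sampling_error, 3))   # standard deviation of the estimates
```

In a real survey only one of these samples is observed, which is why the sampling error has to be estimated from that single sample.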

Background Calculation of sampling errors is very important:
- Provides information on the reliability of your results
- Tells you the ranges within which your estimates most probably fall
- Provides clues as to the sample sizes (and designs) to be selected in forthcoming surveys

Background MICS3 sample designs are complex designs, usually based on stratified, multi-stage, cluster samples. It is not possible to use straightforward formulae for the calculation of sampling errors; more sophisticated approaches have to be used. Recent versions of SPSS (13 or 14) are used for this purpose. SPSS uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. This approach is used by most other packages: WesVar, SUDAAN, Systat, Epi Info, SAS.
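As a rough illustration of what Taylor linearization does for a weighted mean or proportion, the sketch below follows the usual textbook form. It assumes a stratified design with clusters treated as sampled with replacement. The function name and record layout are hypothetical, and it is not a reproduction of the SPSS implementation.

```python
from collections import defaultdict


def linearized_se_of_mean(records):
    """Estimate a weighted mean and its linearized standard error.

    records: iterable of (stratum, cluster, weight, value) tuples.
    Assumes clusters are the primary sampling units, sampled with
    replacement within strata (an assumption of this sketch).
    """
    records = list(records)
    total_w = sum(w for _, _, w, _ in records)
    estimate = sum(w * y for _, _, w, y in records) / total_w

    # Sum the linearized values w * (y - estimate) / total_w per cluster.
    psu_totals = defaultdict(lambda: defaultdict(float))
    for stratum, cluster, w, y in records:
        psu_totals[stratum][cluster] += w * (y - estimate) / total_w

    # Between-cluster variance of those totals, added up over strata.
    variance = 0.0
    for stratum, clusters in psu_totals.items():
        n_h = len(clusters)
        if n_h < 2:
            raise ValueError(f"stratum {stratum!r} needs at least 2 clusters")
        z = list(clusters.values())
        z_bar = sum(z) / n_h
        variance += n_h / (n_h - 1) * sum((v - z_bar) ** 2 for v in z)

    return estimate, variance ** 0.5


# Hypothetical records: (stratum, cluster, weight, 0/1 indicator).
data = [
    ("urban", 1, 1.2, 1), ("urban", 1, 1.2, 0),
    ("urban", 2, 0.8, 1), ("urban", 2, 0.8, 1),
    ("rural", 3, 1.5, 0), ("rural", 3, 1.5, 1),
    ("rural", 4, 1.0, 0), ("rural", 4, 1.0, 0),
]
p, se = linearized_se_of_mean(data)
print(round(p, 4), round(se, 4))
```

The requirement of at least two clusters per stratum is what makes the between-cluster variance computable in each stratum.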

Background In MICS3, the objective is to calculate sampling errors for a selection of variables, for the national sample as well as for selected sub-populations, such as urban and rural areas, and regions. Sampling errors will be presented as part of the final report, in an appendix.

Background The value of the estimate should be the same as that in the corresponding table.

Background Standard error is the square root of the variance – a measure of the variability between all possible samples

Background Coefficient of variation (relative error) is the ratio of SE to the estimate
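A small worked example of these two definitions, using hypothetical values for the estimate and its variance:

```python
import math

# Hypothetical values: an estimated proportion and its estimated variance.
estimate = 0.40
variance = 0.0004

# Standard error: square root of the variance.
standard_error = math.sqrt(variance)

# Coefficient of variation (relative error): SE divided by the estimate.
coefficient_of_variation = standard_error / estimate

print(round(standard_error, 4))            # about 0.02
print(round(coefficient_of_variation, 4))  # about 0.05, i.e. 5 percent
```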

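Background The design effect (DEFF) is the ratio between the variance using the current design and the variance that would result if a simple random sample (SRS) of the same size was used. Its square root (DEFT) is the corresponding ratio of standard errors. A DEFT value of 1.0 indicates that the sample is as efficient as an SRS; values above 1.0 indicate a loss of precision due to the complex design.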
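A short numerical sketch of these ratios, with hypothetical standard errors. DEFT is taken here as the ratio of standard errors and DEFF as its square, which matches the DEFFSQRT and DEFF statistics that can be requested from SPSS.

```python
def design_effects(se_complex, se_srs):
    """Return (DEFF, DEFT) given the SE under the actual design and under SRS."""
    deft = se_complex / se_srs   # ratio of standard errors
    deff = deft ** 2             # ratio of variances
    return deff, deft


# Hypothetical standard errors for the same estimate.
deff, deft = design_effects(se_complex=0.03, se_srs=0.02)
print(round(deff, 2), round(deft, 2))
```

A DEFT of 1.5 would mean the standard error is 50 percent larger than it would be under a simple random sample of the same size.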

Background Upper and lower confidence limits are calculated as p ± 2·SE. They indicate the range within which the estimate would fall in 95 percent of all possible samples of identical design and size.
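The confidence limits can be worked through with hypothetical numbers. The multiplier 2 is the one given on the slide, not the exact normal quantile.

```python
def confidence_limits(p, se, multiplier=2.0):
    """Lower and upper limits as p -/+ multiplier * SE."""
    return p - multiplier * se, p + multiplier * se


# Hypothetical estimate and standard error.
lower, upper = confidence_limits(p=0.40, se=0.02)
print(round(lower, 2), round(upper, 2))
```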

How SPSS works The COMPLEX SAMPLES module can be used to select a sample, or to indicate the design of the sample from which the data set comes, so that sampling error estimates can be calculated. Calculations can be done for means and proportions, ratios, frequencies and crosstabs. It is also possible to use general linear models and logistic regression.

How SPSS works Prepare an analysis plan file to indicate the parameters that define the sample design.

    CSPLAN ANALYSIS
      /PLAN FILE='micsplan.csplan'
      /PLANVARS ANALYSISWEIGHT=hhweight
      /PRINT PLAN
      /DESIGN STRATA=strat CLUSTER=HH1
      /ESTIMATOR TYPE=WR.

Using the plan file, calculate sampling errors.

    * Complex Samples Descriptives.
    CSDESCRIPTIVES
      /PLAN FILE='micsplan.csplan'
      /SUMMARY VARIABLES=treated iodized
      /MEAN
      /STATISTICS SE CV COUNT DEFF DEFFSQRT
      /MISSING SCOPE=ANALYSIS CLASSMISSING=EXCLUDE.

Problems with using SPSS
- Need to pair clusters and create pseudo-strata.
- Cannot handle normalized weights: multiply the weights by 1,000,000 before analysis.
- Provides estimates for subpopulations only when the data file used contains only cases for the subpopulation in question.
- Provides incorrect confidence limits.
- Cannot report on sampling errors for variables coming from different data sets.
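The first two workarounds can be sketched outside SPSS. The example below assumes clusters are paired in the order they are listed (for instance, the order of selection) and that an odd leftover cluster joins the last pair. The slides do not specify the pairing rule, so both choices, along with the function and variable names, are illustrative assumptions.

```python
def pair_into_pseudo_strata(cluster_ids):
    """Map each cluster to a pseudo-stratum number, two clusters per stratum.

    Assumption of this sketch: clusters are paired in list order, and an
    odd leftover cluster is added to the last pair.
    """
    n_strata = max(len(cluster_ids) // 2, 1)
    return {
        cid: min(i // 2, n_strata - 1) + 1
        for i, cid in enumerate(cluster_ids)
    }


def scale_weights(weights, factor=1_000_000):
    """Multiply normalized weights by a constant before analysis."""
    return [w * factor for w in weights]


# Hypothetical cluster numbers and normalized household weights.
pseudo_strata = pair_into_pseudo_strata([101, 102, 103, 104, 105])
scaled = scale_weights([0.5, 1.25, 2.0])
print(pseudo_strata)
print(scaled)
```

Scaling every weight by the same constant leaves weighted means and proportions unchanged, because the constant cancels in the ratio.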

SPSS Output