Objectives and data needs

Slides:



Advertisements
Similar presentations
Basic Principles of GMP
Advertisements

Sampling: Theory and Methods
Multiple Indicator Cluster Surveys Survey Design Workshop
Multistage Sampling Module 3 Session 9.
1 Questionnaire design Module 3 Session 3. 2 Overview (of Session) This session starts by introducing some aspects that need to be considered when designing.
Introduction to Sampling : Censuses vs. Sample Surveys
Collecting data for informed decision-making
1 From the data to the report Module 2. 2 Introduction Welcome Housekeeping Introductions Name, job, district, team.
1 Session 10 Sampling Weights: an appreciation. 2 To provide you with an overview of the role of sampling weights in estimating population parameters.
Module 2 Sessions 10 & 11 Report Writing.
SADC Course in Statistics Basic summaries for demographic studies (Session 03)
Basic Sampling Concepts
SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)
Overview of Sampling Methods II
SADC Course in Statistics Further ideas concerning confidence intervals (Session 06)
Data collection for demographic & vital statistics
SADC Course in Statistics Tests for Variances (Session 11)
Estimation in Stratified Random Sampling
SADC Course in Statistics Basic principles of hypothesis tests (Session 08)
SADC Course in Statistics (Session 20)
SADC Course in Statistics Sampling weights: an appreciation (Sessions 19)
Correlation & the Coefficient of Determination
SADC Course in Statistics Samples and Populations (Session 02)
SADC Course in Statistics Sample size determinations (Session 11)
SADC Course in Statistics Sampling design using the Paddy game (Sessions 15&16)
SADC Course in Statistics Multi-stage sampling (Sessions 13&14)
SADC Course in Statistics Assessing data critically Module B1 Session 17.
SADC Course in Statistics Session 4 & 5 Producing Good Tables.
SADC Course in Statistics Exploratory Data Analysis (EDA) in the data analysis process Module B2 Session 13.
SADC Course in Statistics Graphical summaries for quantitative data Module I3: Sessions 2 and 3.
SADC Course in Statistics Choosing appropriate methods for data collection.
SADC Course in Statistics Types and Sources of Errors in Statistical Data.
SADC Course in Statistics Comparing two proportions (Session 14)
SADC Course in Statistics Review and further practice (Session 10)
SADC Course in Statistics Revision using CAST (Session 04)
SADC Course in Statistics Introduction to Statistical Inference (Session 03)
Preparing & presenting demographic information: 1
SADC Course in Statistics Hierarchies of Units and non-traditional sampling approaches (Session 18)
SADC Course in Statistics Overview of Sampling Methods I (Session 03)
SADC Course in Statistics General approaches to sample size determinations (Session 12)
SADC Course in Statistics To the Woods discussion (Sessions 10)
SADC Course in Statistics Developing a sampling strategy (Session 05)
SADC Course in Statistics Setting the scene (Session 01)
SADC Course in Statistics Producing a product portfolio Module I3 Session
The MDGs and School Enrolment: An example of administrative data
SADC Course in Statistics Objectives and analysis Module B2, Session 14.
1 Table design Module 3 Session 2. 2 Objectives of this session By the end of this session, you will be able to: appreciate the different type of objectives.
SADC Course in Statistics Comparing Means from Paired Samples (Session 13)
SADC Course in Statistics Revision on tests for means using CAST (Session 17)
Probability Distributions
1 Third Workshop on ICP Western Asia Beirut, October 2004 Design of ICP price survey Sultan Ahmad, Consultant Based on Keith.
Training in monitoring and epidemiological assessment of mass drug administration for eliminating lymphatic filariasis Module 4 Survey design.
STATISTICS FOR MANAGERS LECTURE 2: SURVEY DESIGN.
SADC Course in Statistics Introduction and Study Objectives (Session 01)
Why sample? Diversity in populations Practicality and cost.
SADC Course in Statistics Introduction to the module and the session Module I1, Session 1.
Sampling Concepts Population: Population refers to any group of people or objects that form the subject of study in a particular survey and are similar.
Lecture 30 sampling and field work
Sample Design.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
The new HBS Chisinau, 26 October Outline 1.How the HBS changed 2.Assessment of data quality 3.Data comparability 4.Conclusions.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Bangkok,
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
Copyright 2010, The World Bank Group. All Rights Reserved. Part 1 Sample Design Produced in Collaboration between World Bank Institute and the Development.
Targeting of Public Spending Menno Pradhan Senior Poverty Economist The World Bank office, Jakarta.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Basic Sampling Concepts
Presentation transcript:

Objectives and data needs SADC Course in Statistics Objectives and data needs (Session 01)

Module overview Basic concepts and definitions Sampling methods – simple random, stratified, cluster, multi-stage, etc Designing a sampling scheme for relatively simple scenarios in accordance with objectives and available resources How to produce estimates for population characteristics with measures of precision Sample size determinations An appreciation of what is meant by sampling weights

Module aims By the end of this module, you will be able to explain what is meant by sample, population, sampling frame, sampling units explain the notions of representativeness and generalisability of results design sampling schemes for simple scenarios produce population estimates and associated measures of precision discuss options for calculation of sample sizes with a good understanding of general issues involved

Aims – this session… By the end of this session, you will be able to appreciate the different type of objectives that may arise in real life surveys critically assess the type of data needed to address questions of interest explain the benefits of sampling recognise importance of utilising existing knowledge about the population sampled We begin with some general remarks, then move to survey objectives & other issues.

Some general remarks This module is on sampling ideas and issues arising from sampling procedures when conducting a survey It is not about survey methods and analysis – which are covered in Module H7 As such, there will not be coverage of the survey process except where it relates to sampling The accompanying handout gives an outline of the survey process so the context is clear – useful to read…

Survey Objectives Decisions regarding the sampling process cannot be made rationally unless we are clear about the survey objectives. Surveys conducted by national statistical offices are often done to provide information on which policy decisions can be made The stated objectives are often vague, e.g. “the objective of this survey is to collect information about…….” OK as a starting point, but need to be more specific if a sensible sampling procedure is to be used – some examples follow…

Objectives related to estimation To estimate median income of dwellings in slum areas of a city proportion of rural households that have no access to a medical facility within 3 kms maternal mortality rate, i.e. deaths per 1000 live births of mothers from puerperal causes mean yield per hectare of pigeonpea production in small-holder commercial farms

Objectives related to comparisons Questions of interest may be: does a newly introduced farming practice for managing banana plants result in higher yields compared to a standard practice? is there a difference in access to health facilities between rural & urban areas? is there evidence that children from poorer families have less opportunities for entering higher educational institutions?

Objectives related to relationships Is there a relationship between consumption expenditure (as a proxy for income) and household demographics and assets? children’s enrolment in primary school and educational level of household head? mean number of visits by household members to a health clinic and their level of access to clean water and adequate sanitary facilities?

Sampling units and data Sampling is a first step in any survey study. There is always a need, before data collection, to identify (amongst other things):- the ultimate sampling unit on which measurements are to be made actual measurements needed, plus clarity on the calculation of any derived variable(s) sampling procedure to use to select sampling units Two examples from slides 7 and 9 follow…

Initial steps – some examples Example 1. Estimating the proportion of rural households that have no access to a medical facility with qualified personnel within 3 kms of their homestead. Unit: Household Measurement: Distance to the closest medical facility (latter appropriately defined) Derived variable: Coded as 1 if above measurement > 3 kms, 0 otherwise

An example with a hierarchy Example 2. Determining if there is a relationship between number of visits by household members to a health clinic and their level of access to clean water and adequate sanitary facilities? Units (within HH): Household members Measurements: Visits made by each member in HH, how clean water is accessed, sanitary facilities Derived variable: Mean number of visits = Sum of visits by all members divided by HH size (needed as objective is at the HH level, but measurement is at person level)

Selection of households? Another aspect needed before data collection is to plan the sampling procedure, i.e. how the households can be selected. Here it is important to study the sampling setting to understand better the structure (social, geographical etc) of the “target” population from which households are to be drawn. Here “target” population refers to the group of households to which the survey results are intended to generalise.

Using existing knowledge Examining the literature should also point to existing data sources – need to avoid duplicate data collection. Use what is relevant. Also use general knowledge about the target population. Often much is on record: it would be desirable to use this information Administrative structure: e.g. districts, counties, subcounties, parishes, villages in Uganda Agro-ecological regions Rural/urban divide, etc Build knowledge about existing data sources and population into the sampling and data collection process

Why Sample? The whole population is rarely measurable An exception is the census, e.g. usually population censuses are done once every 10 years A well-designed sample enables us to extrapolate our results to the population Statistical methods enable us to measure the reliability of our conclusions Though they were covered in Module H2, we reiterate benefits below for completeness and as a reminder.

Benefits of sampling Cheaper, quicker and administratively easier than census Less prone to errors – and those that do occur are more easily identified A well-thought out sampling procedure can ensure proper coverage of major population characteristics If suitably structured, the sample survey can (i) take account of varying sizes of units, e.g. farms, and (ii) correct for under-enumeration and some sorts of non-response.

Limitations of sampling Sound sample surveys require considerable time and effort to plan and run. If tasks entailed and resources needed are under-estimated, the results will be poor Unless a pre-determined data analysis plan is in place at the start, data relevant to objectives may not be collected, or too much unnecessary data will be collected Training survey staff is crucial. Ill-phrased questions, poorly linked to objectives, can lead to non-informative results

References De Vaus, D.A. (2001) Research Design in Social Research. Sage Publications, London. ISBN 0 76195346 9 KALTON, G. (1990) Introduction to Survey Sampling. Sage Publications. ISBN 0 8039 2126 8