Sampling issues in PIES and PETS. But if you care about impact evaluation, register for Module 6 right now ! Impact Evaluation Studies Need to compare.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
1 G Lect 2a G Lecture 2a Thinking about variability Samples and variability Null hypothesis testing.
Statistical Decision Making
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Topic 6: Introduction to Hypothesis Testing
1. Estimation ESTIMATION.
Using Statistics in Research Psych 231: Research Methods in Psychology.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Clustered or Multilevel Data
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Sample Size Determination In the Context of Hypothesis Testing
S519: Evaluation of Information Systems Social Statistics Inferential Statistics Chapter 8: Significantly significant.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
PSY 307 – Statistics for the Behavioral Sciences
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.
Mitchell Hoffman UC Berkeley. Statistics: Making inferences about populations (infinitely large) using finitely large data. Crucial for Addressing Causal.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Math 227 Elementary Statistics
Modular 15 Ch 10.1 to 10.2 Part I. Ch 10.1 The Language of Hypothesis Testing Objective A : Set up a Hypothesis Testing Objective B : Type I or Type II.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
Descriptive statistics Inferential statistics
Hypothesis Testing.
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
1 1 Slide © 2005 Thomson/South-Western Chapter 9, Part B Hypothesis Tests Population Proportion Population Proportion Hypothesis Testing and Decision Making.
Sampling: What you don’t know can hurt you Juan Muñoz.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Statistics (cont.) Psych 231: Research Methods in Psychology.
IE241: Introduction to Hypothesis Testing. We said before that estimation of parameters was one of the two major areas of statistics. Now let’s turn to.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Education 793 Class Notes Decisions, Error and Power Presentation 8.
Issues concerning the interpretation of statistical significance tests.
Information Technology and Decision Making Information Technology and Decision Making Example 10.1 Experimenting with a New Pizza Style at the Pepperoni.
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
Ex St 801 Statistical Methods Inference about a Single Population Mean.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
URBDP 591 I Lecture 4: Research Question Objectives How do we define a research question? What is a testable hypothesis? How do we test an hypothesis?
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
European Patients’ Academy on Therapeutic Innovation The Purpose and Fundamentals of Statistics in Clinical Trials.
The Practice of Statistics Third Edition Chapter 11: Testing a Claim Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Hypothesis Testing and Statistical Significance
Statistics (cont.) Psych 231: Research Methods in Psychology.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Inferential Statistics Psych 231: Research Methods in Psychology.
15 Inferential Statistics.
Hypothesis Tests: One Sample
Statistical Inference about Regression
Hypothesis Testing.
STAT Z-Tests and Confidence Intervals for a
CHAPTER 6 Statistical Inference & Hypothesis Testing
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Inferential statistics Study a sample Conclude about the population Two processes: Estimation (Point or Interval) Hypothesis testing.
Statistical Power.
Presentation transcript:

Sampling issues in PIES and PETS

But if you care about impact evaluation, register for Module 6 right now ! Impact Evaluation Studies Need to compare two populations, differentiated by –Time before and after the crisis, the boom, the earthquake, the program, … –Space North/South, rural/urban, mountain/forest, … –Policy program vs. non-program treatment vs. control

The basic question Let y 1 and y 2 be the values of an indicator in populations 1 and 2 –Prevalence of malnutrition –Average income –Literacy –etc… We are interested in the difference y 2 - y 1 y 1 and y 2 are both unknown We estimate them from samples The estimations ŷ 1 and ŷ 2 are affected by sampling error What is the error of the difference?

Let e 1 and e 2 be the standard errors of ŷ 1 and ŷ 2 If the samples are independent*, the standard error of ŷ 2 - ŷ 1 is The basic answer * this is the case when the populations are different

Panel Surveys can measure change better ŷ1ŷ1 ŷ2ŷ It seems that y 2 < y1 but… …both measures are affected by sampling errors ( e1 et e2 ) The error of the difference ŷ 2 - ŷ 1 is… …√ ( e² 1 + e² 2 ) if the two samples are independent …only √( e² 1 + e² 2 – 2ρ[y 1,y 2 ] ) if the sample is the same

Advantages and disadvantages of panels Analytical advantages –Can measure changes better –Permit understanding better why things changed –Permits correlating past and present behavior Analytical disadvantages –Become progressively less representative of the population Practical disadvantages –Sample attrition –Much harder to manage –Vulnerable to manipulation –Design them prospectively rather than in afterthought Practical advantages –No sampling design needed for the second and subsequent surveys

For independent samples State of the world H 0 accepted if ŷ 2 – ŷ 1 ≤ D H 0 rejected if ŷ 2 – ŷ 1 > D OK Type I errorOK Type II error P (Type I error) = α P (correctly accepting H 0 ) = 1 - α P (correctly rejecting H 0 ) = 1 – β P (Type II error) = β Decision rule Null hypothesis H 0 true H 0 : y 2 – y 1 = 0 Alternative hypothesis H A true H 0 : y 2 – y 1 = Δ Research findings ŷ 1 ŷ 2 Significance level of the testPower of the test With Simple Random Samples, e ² 1 = σ ² 1 /n 1 and e ² 2 = σ ² 2 / n 2 But real-life samples are almost never SRS, design effects cannot be ignored ! The decision rule based on ŷ 2 – ŷ 1 intends to detect changes in one specific direction (a 1-sided test) To detect changes in any of the two directions, it should be based on | ŷ 2 – ŷ 1 | instead (a 2-sided test) But impact evaluation is almost always one-sided ! One sided: Two sided:

Public Expenditure Tracking Studies PETS need to observe different kinds of units –Large administrative areas (e.g., regions) –Medium administrative areas (e.g. districts) –Service delivery facilities (schools, hospital, clinics…) –Service providers (teachers, doctors, …) –Clients (students, patients, …) –Households Analyses need to account for their hierarchical relationships How should the samples be selected?

?

Two approaches Top-Down (A multi-stage sample) –Select regions first –Then districts in the selected regions –Then Hospitals in the selected districts Bottom-Up –Take a sample of facilities first, –This implicitly defines the sample of districts –Which in turn defines the sample of regions Disadvantages –High design effects –Poor understanding of the upper levels –Vulnerable to unscientific choices Disadvantage –Higher (slightly higher) costs

In each facility To select providers (teachers, doctors, nurses, etc.) –The samples need to be random (perhaps stratified) –The selection has to be entrusted to fieldworkers… –…but it should be repeatable (for supervision) –Can be done with Kish tables, or with random stickers To select clients in health facilities –The selection also needs to be random and has to be entrusted to fieldworkers. –It cannot be repeatable, but the procedures should try to avoid biases due to the Day of the week Time of the day