Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.

Slides:



Advertisements
Similar presentations
Inferential Statistics and t - tests
Advertisements

Designing an impact evaluation: Randomization, statistical power, and some more fun…
Inferential Statistics
Hypothesis testing Another judgment method of sampling data.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Inference Sampling distributions Hypothesis testing.
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Fundamentals of Sampling Method
Inference about a Mean Part II
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Sampling Designs and Techniques
Chapter 9 Hypothesis Testing.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Today Concepts underlying inferential statistics
Statistics for Managers Using Microsoft® Excel 5th Edition
The t Tests Independent Samples.
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 7 Sampling, Significance Levels, and Hypothesis Testing Three scientific traditions critical.
Sampling and Sample Size Part 2 Cally Ardington. Lecture Outline  Standard deviation and standard error Detecting impact  Background  Hypothesis testing.
AM Recitation 2/10/11.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Overview Definition Hypothesis
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
© 2011 Pearson Prentice Hall, Salkind. Introducing Inferential Statistics.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Estimation and Hypothesis Testing Now the real fun begins.
Chapter 8 Introduction to Hypothesis Testing
1/2555 สมศักดิ์ ศิวดำรงพงศ์
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Povertyactionlab.org Planning Sample Size for Randomized Evaluations Esther Duflo MIT and Poverty Action Lab.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Inferential Statistics 2 Maarten Buis January 11, 2006.
B AD 6243: Applied Univariate Statistics Hypothesis Testing and the T-test Professor Laku Chidambaram Price College of Business University of Oklahoma.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Confidence intervals and hypothesis testing Petter Mostad
Sampling for an Effectiveness Study or “How to reject your most hated hypothesis” Mead Over, Center for Global Development and Sergio Bautista, INSP Male.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Education 793 Class Notes Decisions, Error and Power Presentation 8.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 7 Sampling, Significance Levels, and Hypothesis Testing Three scientific traditions.
CHAPTER OVERVIEW Say Hello to Inferential Statistics The Idea of Statistical Significance Significance Versus Meaningfulness Meta-analysis.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Copyright © 2011, 2005, 1998, 1993 by Mosby, Inc., an affiliate of Elsevier Inc. Chapter 13: Boundary Setting in Experimental-Type Designs A deductive.
T tests comparing two means t tests comparing two means.
Chapter 13 Understanding research results: statistical inference.
Chapter 9: Hypothesis Tests for One Population Mean 9.2 Terms, Errors, and Hypotheses.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 8 Introducing Inferential Statistics.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Power, Sample Size, & Effect Size:
Sampling and Power Slides by Jishnu Das.
Power and Sample Size I HAVE THE POWER!!! Boulder 2006 Benjamin Neale.
Statistical Power.
Presentation transcript:

Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006

WBISARHDN 2 Sample Selection in Evaluation  Population based representative surveys: Sample representative of whole population Good for learning about the population Not always most efficient for impact evaluation  Sampling for Impact evaluation Balance between treatment and control groups Power  statistical inference for groups of interest Concentrate sample strategically  Survey budget as major consideration In practice, sample size is many times set by budget Concentrate sample on key populations to increase power

WBISARHDN 3 Purposive Sampling:  Risk: We will systematically bias our sample, so results don’t generalize to the rest of the population or other sub-groups  Trade off between power within population of interest and population representation  Results are internally valid, but not generalizable.

WBISARHDN 4 Survey - Sampling  Population: all cases of interest  Sampling frame: list of all potential cases  Sample: cases selected for analysis  Sampling method: technique for selecting cases from sampling frame  Sampling fraction: proportion of cases from population selected for sample (n/N)

WBISARHDN 5 Sampling Frame  Simple Sampling  Stratified Sampling  Cluster Sampling

WBISARHDN 6 Sampling Methods  Random Sampling  Systematic Sampling

WBISARHDN 7 The Design Effect in Clustering  Necessary to take into account when samples are clustered

WBISARHDN 8 Correlación intracluster (  )  DEFF depends on the size of the cluster and the intra-cluster correlation   is the degree of homogeneity in the cluster, and is called the “intra-cluster” correlation

WBISARHDN 9 Tamaño de muestra  The necessary sample size will increase in clustered samples  But, you have to have some idea of the intra-cluster coefficient to get at this number!

WBISARHDN 10 Power Calculations  Test significance of a null hypothesis.  For example, whether two means are different.

WBISARHDN 11 Type I and Type II errors Type II error =  Significance Level Power = 1- Type I error = 

WBISARHDN 12 Type I and type II errors  Type I error: Reject the null hypothesis when it is true Significance level  probability of rejecting the null when it is true (Type I error)  Type II error: Accept (fail to reject) the null hypothesis when it is false Power  probability of rejecting the null when an alternative null is true (1-probability of Type II)  We want to minimize both types of errors Increase sample size

WBISARHDN 13 Type I and Type II errors  Type I error =  Probability that you conclude the intervention had an effect if actually it did not  Type II error =  Probability you conclude that intervention had no effect when it actually did  Power = 1 -  Probabilty of correctly conluding that the intervention had an effect  Fix the type I error and use sample size to increase the power

WBISARHDN 14 Power Calculations for sample size  Fix the confidence level and as you increase the size of the sample: Rejection region gets larger The power increases n↑n↑

WBISARHDN 15 What we have so far  Clustering increases the required sample size  As does the need for statistical testing: if we know The estimated size of the treatment The variance of the distribution  We can start making power calculations for evaluations

WBISARHDN 16 In Practice  Many, many analytical statistical results  May be simpler to use simulations in Stata or similar package Easily accounts for complicated designs

WBISARHDN 17 In Practice: An Example  Does Information improve child performance in schools? (Pakistan)  Randomized Design Interested in villages where there are private schooling options  What Villages should we work in? Stratification: North, Central, South Random Sample: Villages chosen randomly from list of all villages with a private school

WBISARHDN 18 In Practice: An Example  How many villages should we choose?  Depends on: How many children in every village How big do we think the treatment effect will be What the overall variability in the outcome variable will be

WBISARHDN 19 In Practice: An Example  Simulation Tables Table 1 assumes very high variability in test- scores.  X,Y: X is for intervention with small effect size; Y for larger effect size N: Significant < 1% of simulations S: Significant < 10% of simulations A: Significant > 99% of simulations

WBISARHDN 20 In Practice: An Example  Simulation Tables Table 1 assumes lower variability in test- scores.  X,Y: X is for intervention with small effect size; Y for larger effect size N: Significant < 1% of simulations S: Significant < 10% of simulations A: Significant > 99% of simulations

WBISARHDN 21 A smorgasbord of topics  Probability proportional to size sampling to pick clusters  Using weights Estimating means vs. Estimating regressions  Increasing efficiency using matched randomizations  Using evaluations to say something about baseline populations Age targeted programs

WBISARHDN 22 When do we really worry about this?  IF Very small samples at unit of treatment! Suppose treatment in 20 schools and control in 20 schools  But there are 400 children in every school This is still a small sample  IF Interested in sub-groups (blocks) Sample size requirements increase exponentially  IF Using Regression Discontinuity Designs