Jeffrey S. Napierala Prof. Glenn D. Deane Department of Sociology, SUNY Albany Prof. Donald J. Hernandez Department of Sociology, Hunter College & CUNY.

Slides:



Advertisements
Similar presentations
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Advertisements

Diverse Children: Race, Ethnicity, and Immigration in America’s New Non-Majority Generation by Donald J. Hernandez, Ph.D. Hunter College, City University.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Departments of Medicine and Biostatistics
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Comparing Two Groups’ Means or Proportions Independent Samples t-tests.
Meet the Author Webcast Public Health Reports Meet the Author Webcast Socioeconomic Status and Risk of Diabetes-Related Morality in the United States With.
Data Analysis Statistics. Inferential statistics.
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Data Analysis Statistics. Inferential statistics.
Brown, Suter, and Churchill Basic Marketing Research (8 th Edition) © 2014 CENGAGE Learning Basic Marketing Research Customer Insights and Managerial Action.
Quantifying Data.
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Scot Exec Course Nov/Dec 04 Ambitious title? Confidence intervals, design effects and significance tests for surveys. How to calculate sample numbers when.
Inferential Statistics
Comparing Two Groups’ Means or Proportions
Introduction to Multilevel Modeling Using SPSS
The Long Reach of Early Math Skills Greg J. Duncan University of California, Irvine Robert Siegler Carnegie Mellon University.
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
Research Methods. Research Projects  Background Literature  Aims and Hypothesis  Methods: Study Design Data collection approach Sample Size and Power.
Chapter 12: AP Statistics
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
Links to Positive Parenting among African American and Hispanic American Low-Income Mothers Laura D. Pittman Psychology Department Northern Illinois University.
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Correlation and Regression. The test you choose depends on level of measurement: IndependentDependentTest DichotomousContinuous Independent Samples t-test.
From GLM to HLM Working with Continuous Outcomes EPSY 5245 Michael C. Rodriguez.
1 Sampling Distributions Lecture 9. 2 Background  We want to learn about the feature of a population (parameter)  In many situations, it is impossible.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Week 4: Multiple regression analysis Overview Questions from last week What is regression analysis? The mathematical model Interpreting the β coefficient.
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
ANOVA and Linear Regression ScWk 242 – Week 13 Slides.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Amanda Szekely, Senior Policy Analyst Education Division October 17, 2013 A Comprehensive State Strategy to Improve Third Grade Reading Proficiency.
Introduction to Regression Analysis. Dependent variable (response variable) Measures an outcome of a study  Income  GRE scores Dependent variable =
Building Wave Response Rates in a Longitudinal Survey: Essential for Nonsampling Error Reduction or Last In - First Out? Steven B. Cohen Fred Rohde and.
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Chapter 6 Simple Regression Introduction Fundamental questions – Is there a relationship between two random variables and how strong is it? – Can.
Bivariate Poisson regression models for automobile insurance pricing Lluís Bermúdez i Morata Universitat de Barcelona IME 2007 Piraeus, July.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 – 10.2.
Lynn Lethbridge SHRUG November, What is Bootstrapping? A method to estimate a statistic’s sampling distribution Bootstrap samples are drawn repeatedly.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Increasing Inequality in Parent Incomes and Children’s Completed Schooling Greg J. Duncan University of California- Irvine Greg J. Duncan University of.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
Regression Discontinuity Design Case Study : National Evaluation of Early Reading First Peter Z. Schochet Decision Information Resources, Inc.
Facilitators and Barriers to Service Learning in China: Perspectives of Chinese Students Felicia Wilczenski & Wenfan Yan University of Massachusetts Boston.
National Center for Chronic Disease Prevention and Health Promotion Centers for Disease Control and Prevention *The findings and conclusions in this presentation.
Appropriate use of Design Effects and Sample Weights in Complex Health Survey Data: A Review of Articles Published using Data from Add Health, MTF, and.
Review Law of averages, expected value and standard error, normal approximation, surveys and sampling.
Multiple Regression Scott Hudson January 24, 2011.
REGRESSION G&W p
Bivariate & Multivariate Regression Analysis
Lecture 6: Linear Regression and Correlation
Statistical Methods For Engineers
Linear Regression.
From GLM to HLM Working with Continuous Outcomes
SIMPLE LINEAR REGRESSION
15.1 The Role of Statistics in the Research Process
Lecture11 review for final examination
Correlation and Regression Lecture 1 Sections: 10.1 – 10.2
STEPS Site Report.
Presentation transcript:

Jeffrey S. Napierala Prof. Glenn D. Deane Department of Sociology, SUNY Albany Prof. Donald J. Hernandez Department of Sociology, Hunter College & CUNY Graduate Center Research was supported by a Grant from the Annie E. Casey Foundation Please do not cite or distribute this research without consent from the Authors

Outline The origins of this method An example using children's reading performance A “hybrid” approach Standard Errors Results Conclusions

Origins The audience for policy research often has a diverse background in statistics. To accommodate those without any proficiency, the presentation must be simple and transparent. To satisfy those with advanced proficiency, current methods must be used. We want to have our cake and eat it too…

Origins At one end of the spectrum: Sample means/proportions are simple and easy to understand …but may not clearly translate into effective policies… Towards the other end: A standard regression approach has obvious advantages, but results can be difficult to explain to general audiences.

Origins Hernandez, Donald J. Double Jeopardy: How Third-Grade Reading Skills and Poverty Influence High School Graduation. Baltimore: Annie E. Casey Foundation

An Example… Policy Question: How much would income transfers increase the percentage of (3 rd Grade) students reading at the proficient level? Data: National Longitudinal Survey of Youth (NLSY) 4,060 children in 2369 families followed across about 30 years Dependent variable: Peabody Individual Achievement (PIAT) Reading Comprehension Test. Continuous variable ranging from Methods: Weighted, linear GEE models with a correction for clustering within families.

An Example: Model Specification Two models to highlight approach: 1. A “Base” model with just income and income squared predicting PIAT Reading Comprehension. 2. A “Full” model with controls for the 1) non-linear effect of mother’s education, 2) health insurance coverage, 3) whether a child attended head start or preschool, 4) the “quality” of their neighborhood, 5) race, 6) sex, and 7) year of interview.

A “hybrid” Approach Let’s utilize the power of a regression, but keep the presentation as simple and flexible as possible. 1. Create a statistical model of the outcome. 2. “Simulate” new outcomes using the relevant parameters of the model. 3. Meaningfully summarize the distributions before and after the “simulation” using simple statistics/tabulations. 4. Compare the summary statistics/tabulations.

A “hybrid” Approach: Standard Errors Using the common formula for the standard error of a proportion (with and without a Design Effect multiplier of 1.388). A Monte-Carlo approach to incorporate error from the sample regression and sample proportion. First, the effect(s) of covariates are added into the original score (the raw DV) by sampling from a normal distribution with a mean and s.d. from the regression. Rates (of reading proficiency) are computed then additional sampling error is introduced. After 5000 iterations, the S.E. of the distribution is computed from all the rates.

Also, to compare rates from the same group of children (before and after “simulations”) a “paired proportions” t-test is used (Altman 1997). Source: Altman, Douglas G Practical Statistics for Medical Research. London: Chapman & Hall. A “hybrid” Approach: Standard Errors

Results…

Conclusions The “hybrid” approach has a few notable advantages over other methods… The independent effect of income on reading proficiency is much (much) less than might be expected from looking at bivariate or univariate results. We expect that about 4% ( ; 95% C.I.) more kids in poverty would read proficiently if their families were given additional income to move them out of povery.

Thank You!