Outline National Assessment of Educational Progress (NAEP) Multivariate Design Problem Implications for analysis Example with similar structure in Biostatistics.

Slides:



Advertisements
Similar presentations
The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,
Advertisements

Statistics for Improving the Efficiency of Public Administration Daniel Peña Universidad Carlos III Madrid, Spain NTTS 2009 Brussels.
Dennis J. Kennedy NAEP State Coordinator West Virginia Department of Education.
Handling attrition and non- response in longitudinal data Harvey Goldstein University of Bristol.
Pennsylvania Value-Added Assessment System (PVAAS) High Growth, High Achieving Schools: Is It Possible? Fall, 2011 PVAAS Webinar.
Getting to Know the Math COE Office of Superintendent of Public Instruction Scott Brittain April 24, 2012.
Comparing State Reading and Math Performance Standards Using NAEP Don McLaughlin Victor Bandeira de Mello National Conference on Large-Scale Assessment.
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
Innovation and Growth of Large Scale Assessments Irwin Kirsch Educational Testing Service February 18, 2013.
How Should We Assess the Fit of Rasch-Type Models? Approximating the Power of Goodness-of-fit Statistics in Categorical Data Analysis Alberto Maydeu-Olivares.
Split Questionnaire Designs for Consumer Expenditure Survey Trivellore Raghunathan (Raghu) University of Michigan BLS Workshop December 8-9, 2010.
Experimental Design making causal inferences. Causal and Effect The IV precedes the DV in time The IV precedes the DV in time The IV and DV are correlated.
The art and science of measuring people l Reliability l Validity l Operationalizing.
Common Problems in Writing Statistical Plan of Clinical Trial Protocol Liying XU CCTER CUHK.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy Psychiatric Biostatistics Symposium May 2009.
Thoughts on Biomarker Discovery and Validation Karla Ballman, Ph.D. Division of Biostatistics October 29, 2007.
Multivariate Analysis of Variance, Part 1 BMTRY 726.
PAT - MATHS Progressive Achievement Tests in Mathematics 3 rd Edition.
Statistical Issues in Data Collection and Study Design For Community Programs and Research October 11, 2001 Elizabeth Garrett Division of Biostatistics.
Structural Equation Modeling 3 Psy 524 Andrew Ainsworth.
Absolute error. absolute function absolute value.
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
Background to Adaptive Design Nigel Stallard Professor of Medical Statistics Director of Health Sciences Research Institute Warwick Medical School
Statistics for Health Care Biostatistics. Phases of a Full Clinical Trial Phase I – the trial takes place after the development of a therapy and is designed.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Decisions from Data: The Role of Simulation Gail Burrill Gail Burrill Michigan State University
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
CJT 765: Structural Equation Modeling Class 12: Wrap Up: Latent Growth Models, Pitfalls, Critique and Future Directions for SEM.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
The Multiple Comparisons Problem in IES Impact Evaluations: Guidelines and Applications Peter Z. Schochet and John Deke June 2009, IES Research Conference.
1 Understanding and Measuring Uncertainty Associated with the Mid-Year Population Estimates Joanne Clements Ruth Fulton Alison Whitworth.
A Statistical Linkage Between NAEP and ECLS-K Grade Eight Reading Assessments Enis Dogan Burhan Ogut Young Yee Kim Sharyn Rosenberg NAEP Education Statistics.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Multilevel and multifrailty models. Overview  Multifrailty versus multilevel Only one cluster, two frailties in cluster e.g., prognostic index (PI) analysis,
Latent regression models. Where does the probability come from? Why isn’t the model deterministic. Each item tests something unique – We are interested.
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
Armando Teixeira-Pinto AcademyHealth, Orlando ‘07 Analysis of Non-commensurate Outcomes.
SGPP: Spatial Gaussian Predictive Process Models for Neuroimaging Data Yimei Li Department of Biostatistics St. Jude Children’s Research Hospital Joint.
Obtaining International Benchmarks for States Through Statistical Linking: Presentation at the Institute of Education Sciences (IES) National Center for.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Lecture 3 (Chapter 4). Linear Models for Longitudinal Data Linear Regression Model (Review) Ordinary Least Squares (OLS) Maximum Likelihood Estimation.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
[Part 5] 1/43 Discrete Choice Modeling Ordered Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Reducing Burden on Patient- Reported Outcomes Using Multidimensional Computer Adaptive Testing Scott B. MorrisMichael Bass Mirinae LeeRichard E. Neapolitan.
Department of Curriculum and Instruction Considerations for Choosing Mathematics Progress Monitoring Measures from K-12 Anne Foegen, Ph.D. Pursuing the.
Presentation : “ Maximum Likelihood Estimation” Presented By : Jesu Kiran Spurgen Date :
Evaluation Requirements for MSP and Characteristics of Designs to Estimate Impacts with Confidence Ellen Bobronnikov March 23, 2011.
Statistical Innovations for Health and Educational Research
JMP Discovery Summit 2016 Janet Alvarado
Questions Tool and Constructed Response Items
Probability Theory and Parameter Estimation I
Inference for Proportions
Inference for the mean vector
Classical Test Theory Margaret Wu.
CJT 765: Structural Equation Modeling
Summarising and presenting data - Univariate analysis continued
Working Independence versus modeling correlation Longitudinal Example
Mark Rothmann U.S. Food and Drug Administration September 14, 2018
Common Problems in Writing Statistical Plan of Clinical Trial Protocol
Comparing Populations
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
OVERVIEW OF LINEAR MODELS
OVERVIEW OF LINEAR MODELS
(Approximately) Bivariate Normal Data and Inference Based on Hotelling’s T2 WNBA Regular Season Home Point Spread and Over/Under Differentials
CAASPP Results 2015 to 2016 Santa Clara Assessment and Accountability Network May 26, 2017 Eric E, Zilbert Administrator, Psychometrics, Evaluation.
Björn Bornkamp, Georgina Bermann
How Should We Select and Define Trial Estimands
Presentation transcript:

Outline National Assessment of Educational Progress (NAEP) Multivariate Design Problem Implications for analysis Example with similar structure in Biostatistics

NAEP On-going surveys at national and state levels 4th, 8th, and 12th grade students and their teachers math, reading, writing background demographic and educational environment questions

Excellent web site

NAEP Mathematics Mathematics –5 domains/sub-scales/traits/latent proficiencies –Algebra –Geometry Several hundred potential test questions

NAEP Objectives and Constraints Goal is population estimates Individual students (and schools) are NOT rated based on NAEP 45 minutes for cognitive questions (items) 15 minutes for background questions/administration

Matrix Sampling (1984) AlgebraGeometry

Model for Cognitive Data Longitudinal data model – : Student algebra proficiency – : Student geometry proficiency IRT (Item response theory)

Design Issue Fixed number of items per student –How many algebra items? –How many geometry items? Obtain (equally) accurate population estimates of both algebra and geometry proficiencies

Balanced Designs Give each student approximately the same number of algebra and geometry items Up to 5 or 6 sub-domains, so the number of items per sub-domain is very small Extended collections of related items may make a balanced design infeasible

Split Designs (symmetric) Some students assigned only algebra items Same number of other students assigned only geometry items Remaining students are assigned equal number of algebra and geometry items

Optimal Design and Estimation The balanced design is optimal Maximum likelihood estimation –The joint MLE for the algebra distribution and the geometry distribution is the same as the univariate MLE with the geometry and algebra proficiencies estimated separately

Balanced design –There is no gain from multivariate estimation –Estimates for individual student proficiencies are much improved by multivariate estimation Split design –Multivariate estimation is much better than univariate –Multivariate estimation for the split design approaches balanced design efficiency as the proficiency correlation approaches 1

Bivariate outcomes (Jessica Mancuso) Experimental biomarker for stroke patients –Measurement error –It can be applied to the infarct and non-infarct sides of the brain –Anticipated that the non-infarct side of the brain will be predictive of the infarct side –Evaluate an oral compound using the biomarker

Study design Placebo/drug in parallel blinded randomized groups Measurements –Baseline –On-dosing measurements (longitudinal) –Measurements on the infarct and non-infarct sides of the brain (bivariate)

Estimation The primary goal is to estimate the treatment effect on the infarct side of the brain What is the role of the measurements on the non-infarct side in the primary estimation?

Depends on other information If there is no effect (or a known effect) on the non-infarct side of the brain, the non- infarct data can improve estimation –Baseline non-infarct measurement may be very helpful –If the treatment does not effect the non-infarct side of the brain, the on-dosing measurement(s) are like covariates and may improve estimation

Depends on design Balanced design –Both sides of brain measured each time –No planned or unplanned missing measurements –On-dosing non-infarct measurements do not contribute to estimation of the drug effect on the infarct side (mostly true) –Lack of contribution despite improvement in estimation for individual patients

Split design –At some on-dosing times, the non-infarct measurement is available but the infarct is not available –The on-dosing non-infarct data may contribute substantially

Summary The use of multiple outcomes to improve inference is very complex The fact that an outcome can be used to improve the estimation/prediction of another outcome at the level of an individual person is not sufficient