Analysis of missing responses to the sexual experience question in evaluation of an adolescent HIV risk reduction intervention Yu-li Hsieh, Barbara L.

Slides:



Advertisements
Similar presentations
Handling attrition and non- response in longitudinal data Harvey Goldstein University of Bristol.
Advertisements

Handling Missing Data on ALSPAC
Non response and missing data in longitudinal surveys.
Standardized Scales.
Correcting for Common Causes of Nonresponse and Measurement Error Andy Peytchev International Total Survey Error Workshop Stowe, June 15, 2010.
Improving the quality of data through imputing missing values (Part One: Introduction to types of missing data) Saeid Shahraz MD, PhD Student Heller School.
Treatment of missing values
Missing values problem in Data Mining
1 QOL in oncology clinical trials: Now that we have the data what do we do?
Approaches for Addressing Issues of Missing Data in the Statistical Modeling of Adolescent Fertility Dudley L. Poston, Jr. Texas A&M University & Eugenia.
 Overview  Types of Missing Data  Strategies for Handling Missing Data  Software Applications and Examples.
Some birds, a cool cat and a wolf
Quasi Experiments Non-Experimental Research
Adapting to missing data
How to Handle Missing Values in Multivariate Data By Jeff McNeal & Marlen Roberts 1.
How to deal with missing data: INTRODUCTION
Modeling Achievement Trajectories When Attrition is Informative Betsy J. Feldman & Sophia Rabe- Hesketh.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Missing Data in Epidemiology: Issues & Approaches
Consumer behavior studies1 CONSUMER BEHAVIOR STUDIES STATISTICAL ISSUES Ralph B. D’Agostino, Sr. Boston University Harvard Clinical Research Institute.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 19 Process of Quantitative Data Analysis and Interpretation.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
Handling Attrition and Non- response in the 1970 British Cohort Study Tarek Mostafa Institute of Education – University of London.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 10 Missing Data Henian Chen, M.D., Ph.D.
Imputation for Multi Care Data Naren Meadem. Introduction What is certain in life? –Death –Taxes What is certain in research? –Measurement error –Missing.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Chapter 3.1.  Observational Study: involves passive data collection (observe, record or measure but don’t interfere)  Experiment: ~Involves active data.
EFFICACY OF A STAGE-BASED BEHAVIORAL INTERVENTION TO PROMOTE STI SCREENING IN YOUNG WOMEN: A RANDOMIZED CONTROLLED TRIAL Chacko MR, Wiemann CM, Kozinetz.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
1 Updates on Regulatory Requirements for Missing Data Ferran Torres, MD, PhD Hospital Clinic Barcelona Universitat Autònoma de Barcelona.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
1 G Lect 13M Why might data be missing in psychological studies? Missing data patterns Overview of statistical approaches Example G Multiple.
Missing Values Raymond Kim Pink Preechavanichwong Andrew Wendel October 27, 2015.
Tutorial I: Missing Value Analysis
Research design By Dr.Ali Almesrawi asst. professor Ph.D.
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
HANDLING MISSING DATA.
Missing data: Why you should care about it and what to do about it
Handling Attrition and Non-response in the 1970 British Cohort Study
Maximising the Value of Time Series Data:
Present: Disease Past: Exposure
MISSING DATA AND DROPOUT
Analysis of Survey Results
The Centre for Longitudinal Studies Missing Data Strategy
Ken Larimore, Ph.D., LISW-S
Maximum Likelihood & Missing data
Introduction to Survey Data Analysis
Multiple Imputation.
Multiple Imputation Using Stata
Relationships among Adolescents’ Negative Interaction Styles with Friends and Romantic Partners and Depressive Symptoms Joanna M. Chango, Erin M. Miga,
Dealing with missing data
Presenter: Ting-Ting Chung July 11, 2017
The bane of data analysis
CH2. Cleaning and Transforming Data
EM for Inference in MV Data
Dr. Matthew Keough August 8th, 2018 Summer School
Scientific Method Steps
An Introduction to Latent Class Analysis (LCA)
Non response and missing data in longitudinal surveys
EM for Inference in MV Data
Chapter 4: Missing data mechanisms
The European Statistical Training Programme (ESTP)
Rachael Bedford Mplus: Longitudinal Analysis Workshop 23/06/2015
Missing data: Is it all the same?
Considerations for the use of multiple imputation in a noninferiority trial setting Kimberly Walters, Jie Zhou, Janet Wittes, Lisa Weissfeld Joint Statistical.
Imputation Strategies When a Continuous Outcome is to be Dichotomized for Responder Analysis: A Simulation Study Lysbeth Floden, PhD1 Melanie Bell, PhD2.
Presentation transcript:

Analysis of missing responses to the sexual experience question in evaluation of an adolescent HIV risk reduction intervention Yu-li Hsieh, Barbara L. Dancy and Kathleen S. Crittenden University of Illinois at Chicago

Background What are missing data? Non-responses to data collection for quantitative research: (A) refuse to answer, (B) answer that they “don’t know”, (C) have a valid skip or (D) their responses are lost by errors. Missing discussion of Missing Data Analysis How to reconcile methodological and substantive or theoretical concerns in handling missing data empirically

Background Building Blocks of Missing Data Analysis 1. Missing Data Patterns Column: variables Row: cases A. Univariate/ multivariate pattern y1 y2 y3 y4 y5 B. Monotone pattern C. General pattern

Background Building Blocks of Missing Data Analysis 2. Missing Data Mechanisms Suppose there are two possibly related variables X and Y, we have all observations on X, and some missing values on Y, Missingness can be seen as a dummy variable (R) (A) Missing Completely At Random (MCAR) No relationships among R, X and Y. (B) Missing At Random (MAR) R is conditioned on X (C) Missing Not At Random (MNAR) R is conditioned on Y

Background Building Blocks of Missing Data Analysis 3. Implications of Missing Data Mechanisms (A) Missing Completely At Random (MCAR) No harm to the results of our research. (B) Missing At Random (MAR) Possible biases could/have to be adjusted and controlled for (C) Missing Not At Random (MNAR) Real effects are unknown and potentially problematic

Background Procedures for Missing Data Analysis 1. Traditional Approaches (A) Complete-case Analysis (listwise deletion) (B) Available-case Analysis (pairwise deletion) (C) Weight Adjustment for Complete Cases (D) Including Indicator Variables (E) Single Imputation Methods Most of them either will generate biased estimates unless the mechanism is MCAR and the pattern is univariate or monotone, or are very difficult to run and interpret.

Background Procedures for Missing Data Analysis 2. Modern Approaches (A) Maximum Likelihood estimation (ML) (B) Multiple Imputation (MI) Both require large sample, work best under MAR. Advantages of Multiple Imputation Efficiency: 5 to 10 imputations are enough. Not very sensitive to missing data mechanisms.

Empirical Application Research Design and Data (A) HIV/AIDS risk reduction intervention evaluation (B) Community-based, quasi-experimental, and longitudinal (C) Low income African American female adolescents (D) Intervention groups: Mother/Daughter Risk Reduction (MDRR) Health Experts Risk Reduction (HERR) Mother/Daughter Health Promotion (MDHP) (E) Outcome variables: Sexual behaviors etc.

Empirical Application Substantive arguments related to missing data (A) Inconsistency in self-reported risk behaviors of adolescents (B) Conservatively treating missing data in the key outcome as a failure Missing Data Problem (A) Missing responses occur disproportionately in MDHP (B) Girls who declined to answer to the key sexual behavior questions have a higher-risk profile than those who say “Yes”

Empirical Application 1. Missing Data Pattern-1 Mixed (monotone and general) Pattern with five sub-patterns a.The frequency of the first pattern refers to complete cases without any missing value. b.The frequency of the last pattern refers to cases dropped out prior to T2 or T3, respectively.

Empirical Application 1. Missing Data Pattern-2 Mixed (monotone and general) Pattern with five sub-patterns a.The frequency of the first pattern refers to complete cases without any missing value. b.The frequency of the last pattern refers to cases dropped out prior to T2 or T3, respectively.

Empirical Application 1. Missing Data Pattern-3 Mixed (monotone and general) Pattern with five sub-patterns a.The frequency of the first pattern refers to complete cases without any missing value. b.The frequency of the last pattern refers to cases dropped out prior to T2 or T3, respectively.

Empirical Application 2. Missing Data Mechanism (A) Dropped out cases: MAR but very close to MCAR General causes of dropped out may not apply. (B) Missingness in Sexual behavior variables: MNAR Inconsistent reporting, social desirability, etc. (C) Missingness in Self-efficacy variables: MAR Only very small number of cases Require more information from later waves to clarify

Empirical Application 3. Our Missing Data Treatment (A) Conservatively recode the missing values in recent sexual activity variable as affirmative response. (B) Create a categorical variable (response group) to indicate the missingness of sexual experience variable Theoretical grounding Empirical and technical rationales (C) Create another dummy variable for “at least have one missing value across three time points” (D) Exclude the cases that provided missing values in self- efficacy variables

Empirical Application 4. Comparison between MI and our treatment a. All p values are two tailed, + p<.1, *p<.05, **p<.01. b. RR refers to Risk Reduction intervention group (1=RR), reference group is MDHP. c. Missing Response is “have at least one missing value on sexual experience question at T1, T2 or T3” (1=Missing Response), reference group is those provide valid responses at all three time points.

Empirical Application 4. Comparison between MI and our treatment a. All p values are two tailed, + p<.1, *p<.05, **p<.01. b. RR refers to Risk Reduction intervention group (1=RR), reference group is MDHP. c. Missing Response is “have at least one missing value on sexual experience question at T1, T2 or T3” (1=Missing Response), reference group is those provide valid responses at all three time points.

Empirical Application 4. Comparison between MI and our treatment a. All p values are two tailed, + p<.1, *p<.05, **p<.01. b. RR refers to Risk Reduction intervention group (1=RR), reference group is MDHP. c. Missing Response is “have at least one missing value on sexual experience question at T1, T2 or T3” (1=Missing Response), reference group is those provide valid responses at all three time points.

Concluding Remarks 1. Investigating the patterns and interpreting the mechanisms can still be tricky. 2. There are no “best” missing data procedures. 3. Make decisions informed by substantive insights 4. If possible, researchers should apply both traditional and modern procedures in order to acquire more information to maximize the quality of results