Slides:



Advertisements
Similar presentations
Probit The two most common error specifications yield the logit and probit models. The probit model results if the are distributed as normal variates,
Advertisements

Controlling for Time Dependent Confounding Using Marginal Structural Models in the Case of a Continuous Treatment O Wang 1, T McMullan 2 1 Amgen, Thousand.
Lecture 11 (Chapter 9).
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
Repeated Measures, Part 3 May, 2009 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF.
Nguyen Ngoc Anh Nguyen Ha Trang
Simple Linear Regression
Multilevel Models 4 Sociology 8811, Class 26 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Regression with a Binary Dependent Variable
1 Unsupervised Learning With Non-ignorable Missing Data Machine Learning Group Talk University of Toronto Monday Oct 4, 2004 Ben Marlin Sam Roweis Rich.
Clustered or Multilevel Data
How to deal with missing data: INTRODUCTION
Business Statistics - QBM117 Statistical inference for regression.
Simple Linear Regression and Correlation
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
Generalized Linear Models
GEE and Generalized Linear Mixed Models
Lecture 9: Marginal Logistic Regression Model and GEE (Chapter 8)
Simple Linear Regression
Repeated Measures, Part 2 May, 2009 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF.
Lecture 8: Generalized Linear Models for Longitudinal Data.
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
HSRP 734: Advanced Statistical Methods June 19, 2008.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Multiple Imputation (MI) Technique Using a Sequence of Regression Models OJOC Cohort 15 Veronika N. Stiles, BSDH University of Michigan September’2012.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
Introduction to Multiple Imputation CFDR Workshop Series Spring 2008.
Chapter 7 - Modeling Issues 7.1 Heterogeneity 7.2 Comparing fixed and random effects estimators 7.3 Omitted variables –Models of omitted variables –Augmented.
1 GLM I: Introduction to Generalized Linear Models By Curtis Gary Dean Distinguished Professor of Actuarial Science Ball State University By Curtis Gary.
GEE Approach Presented by Jianghu Dong Instructor: Professor Keumhee Chough (K.C.) Carrière.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
ECON 338/ENVR 305 CLICKER QUESTIONS Statistics – Question Set #8 (from Chapter 10)
Estimation in Marginal Models (GEE and Robust Estimation)
A REVIEW By Chi-Ming Kam Surajit Ray April 23, 2001 April 23, 2001.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Recapitulation! Statistics 515. What Have We Covered? Elements Variables and Populations Parameters Samples Sample Statistics Population Distributions.
Tutorial I: Missing Value Analysis
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Logistic Regression Categorical Data Analysis.
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
Logistic Regression: Regression with a Binary Dependent Variable.
Inference about the slope parameter and correlation
Multilevel modelling: general ideas and uses
Survival-time inverse-probability- weighted regression adjustment
Missing data: Why you should care about it and what to do about it
Statistical Modelling
Simple Linear Regression
MISSING DATA AND DROPOUT
Linear Mixed Models in JMP Pro
CH 5: Multivariate Methods
The Centre for Longitudinal Studies Missing Data Strategy
Chapter Six Normal Curves and Sampling Probability Distributions
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Multiple Imputation Using Stata
How to handle missing data values
Quantitative Methods What lies beyond?.
Hidden Markov Autoregressive Models
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
From GLM to HLM Working with Continuous Outcomes
Quantitative Methods What lies beyond?.
BY: Mohammed Hussien Feb 2019 A Seminar Presentation on Longitudinal data analysis Bahir Dar University School of Public Health Post Graduate Program.
Task 6 Statistical Approaches
Clinical prediction models
Generalized Additive Model
Presentation transcript:

31 Biostatistics 540 FINAL REVIEW

32 Correlated Data Models Three formulations of regression models for correlated outcomes lead to models with distinct interpretations of parameter estimates: – Marginal (or direct) models – random effects (or mixed) models – transition models

33 Practical matters Use of the xtgee procedure in Stata – Differences between models with/without “robust” specification Use of regress, poisson and logistic procedures in Stata – Differences between models with/without “cluster” specification – Differences between models with/without “robust” specification

34 Marginal Models – Modeling assumptions – Interpretation of parameters – Linear and nonlinear models What are they?

35 Marginal Models We directly (and separately) specify – the mean model g(E[y ij ]) = b 0 +b 1 x 1ij +b 2 x 2ij +…+b p x pij – g(.) is a link function (e.g., logit, log, identity, etc.) – a working variance (and correlation) structure E.g., independence, exchangeable, AR(1), etc. We do not specify distributions of the data – Specify only the means and variances – Valid estimates of means and regression coefficients can be obtained by ignoring correlation – Statistical inference ignoring correlation is not valid

36 Random Effects Models Modeling assumptions Model Specification (of random effects) – Y | b ~ Normal, b ~ Normal – Y | b ~ Poisson, b ~ Normal (or b ~ Gamma) – Y | b ~ Binomial (Bernoulli), b ~ Normal Use xtreg or xtmixed for Normal, xtpois for Poisson, xtlogit for binomial Subject specific interpretation of regression parameters – Same as population average (PA) estimates for linear model – Same PA estimates for predictors in Poisson model – Subject specific interpretation for logistic model

37 Random Effects Models Random (or mixed) models Use “conditional” means and variances – by conditioning on the random effects – Typically one assumes “conditional” independence of the outcomes given the random effects – All modeling assumptions need to be correct for valid inference (and checked!) Useful for studying – Sources of variation (between versus within) – Magnitude of the variances – Predicting unobserved random variables (BLUPs)

38 Transition Models Transition models A type of “conditional” model – One conditions upon some function of the outcome history – Markov models are a special class of transition models – Models can be fitted using likelihood methods or GEE Useful for – Estimating transition probabilities – In binary response models, incidence probabilities

39 Missing Data Missing data types – Missing by design, incomplete data, unbalanced data – Unintended missing data Missing data mechanisms – MCAR – Missing completely at random – MAR – Missing at random – NI – Non-ignorable missingness Problems with missing data – bias, loss of efficiency, power

40 Missing Data Accomodating missing data – Understanding missing data mechanism(s) – Complete case analysis, simple imputation methods – Modeling the missingness (IPW, MI, joint modeling) – Sensitivity analysis methods

41 Questions?