Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 1: The Logic Behind Statistical Adjustment.

Slides:



Advertisements
Similar presentations
Associations between Obesity and Depression by Race/Ethnicity and Education among Women: Results from the National Health and Nutrition Examination Survey,
Advertisements

Clinical Decision Support: Using Logistic Regression to Diagnose COPD and CHF ©2012 Wayne G. Fischer, PhD 1 COPD patient inclusion criteria: Discharged.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Multiple Linear Regression
RELATIVE RISK ESTIMATION IN RANDOMISED CONTROLLED TRIALS: A COMPARISON OF METHODS FOR INDEPENDENT OBSERVATIONS Lisa N Yelland, Amy B Salter, Philip Ryan.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 5: Reporting Subgroup Results.
BA 555 Practical Business Analysis
Multiple Regression Involves the use of more than one independent variable. Multivariate analysis involves more than one dependent variable - OMS 633 Adding.
EPI 809/Spring Multiple Logistic Regression.
Vanderbilt Sports Medicine Chapter 4: Prognosis Presented by: Laurie Huston and Kurt Spindler Evidence-Based Medicine How to Practice and Teach EBM.
Chapter 7 Correlational Research Gay, Mills, and Airasian
EVIDENCE BASED MEDICINE
Association vs. Causation
12/10/02Harry Bushar1 Computerized Thermal Imaging Breast Cancer System 2100 (CTI BCS2100) Radiological Devices Advisory Panel December 10, 2002 Statistical.
Biostat Didactic Seminar Series Analyzing Binary Outcomes: Analyzing Binary Outcomes: An Introduction to Logistic Regression Robert Boudreau, PhD Co-Director.
Results Table 1: Factors associated with advanced liver fibrosis in univariable analysis among 216 chronic HCV patients Background There is a clinical.
Correlation, Regression Covariate-Adjusted Group Comparisons
Biostatistics in Practice Session 5: Associations and confounding Youngju Pak, Ph.D. Biostatisticianhttp://research.LABioMed.org/Biostat 1.
Inference for regression - Simple linear regression
Biostatistics Case Studies 2009 Peter D. Christenson Biostatistician Session 1: Classification Trees.
Topic 28: Unequal Replication in Two-Way ANOVA. Outline Two-way ANOVA with unequal numbers of observations in the cells –Data and model –Regression approach.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Biostatistics in Practice Session 5: Associations and Confounding Peter D. Christenson Biostatistician
Biostatistics in Clinical Research Peter D. Christenson Biostatistician January 12, 2005IMSD U*STAR RISE.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 4: Regression Models and Multivariate Analyses.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 5: Analysis Issues in Large Observational Studies.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 5: Classification Trees: An Alternative to Logistic.
Statistical Bootstrapping Peter D. Christenson Biostatistician January 20, 2005.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 5: Associations and Confounding.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 5: Methods for Assessing Associations.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 2: Diagnostic Classification.
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
Experimental Design All experiments have independent variables, dependent variables, and experimental units. Independent variable. An independent.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 6: Case Study.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Biostat Didactic Seminar Series Correlation and Regression Part 2 Robert Boudreau, PhD Co-Director of Methodology Core PITT-Multidisciplinary Clinical.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 2: Using the Bootstrap: Throw Out Those Messy.
1 THE ROLE OF COVARIATES IN CLINICAL TRIALS ANALYSES Ralph B. D’Agostino, Sr., PhD Boston University FDA ODAC March 13, 2006.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 2: Statistical Adjustment: How it Works.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 4: Study Size and Power.
Osteoarthritis Initiative Analytic Strategies for the OAI Data December 6, 2007 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size and Power.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 6: Discrepancies as Predictors: Discrepancy.
Chapter 13 Multiple Regression
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Case Study.
PSYC 3030 Review Session April 19, Housekeeping Exam: –April 26, 2004 (Monday) –RN 203 –Use pencil, bring calculator & eraser –Make use of your.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 4: An Alternative to Last-Observation-Carried-Forward:
META-ANALYSIS, RESEARCH SYNTHESES AND SYSTEMATIC REVIEWS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 3: Testing Hypotheses.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size for Precision or Power.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 2: Correlation of Time Courses of Simultaneous.
Ch 5 Relationships Between Quantitative Variables (pg 150) –Will use 3 tools to describe, picture, and quantify 1) scatterplot 2) correlation 3) regression.
Sample Size Determination
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 3: An Alternative to Last-Observation-Carried-Forward:
Biostatistics in Practice Session 5: Associations and confounding Youngju Pak, Ph.D. Biostatisticianhttp://research.LABioMed.org/Biostat 1.
Chapter Two Methods in the Study of Personality. Gathering Information About Personality Informal Sources of Information: Observations of Self—Introspection,
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 6: Classification Trees.
THE ROLE OF SUBGROUPS IN CLINICAL TRIALS Ralph B. D’Agostino, Sr., PhD Boston University September 13, 2005.
Biostatistics Case Studies 2009 Peter D. Christenson Biostatistician Session 4: The Logic Behind Statistical Adjustment.
Sample Size Determination
Biostatistics Case Studies 2007
Fig. 1. Correlation of TSH with TC, LDL-C, HDL-C, and TG.
Journal Club Notes.
Figure 1. Serum B-ALP levels in normal male (•) and female (○) children as a function of age. Serum B-ALP levels were increased during infancy and puberty.
Evaluating Classifiers for Disease Gene Discovery
Presentation transcript:

Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 1: The Logic Behind Statistical Adjustment

Case Study J Clin Endocrin Metab 2006 Nov; 91(11):

Eventual Goal: Establish Diagnostic Test for GH Administration Potential Approaches: Use some difference between endogenous and exogenous GH, such as isoforms. Problem is short time window. Measure proteins or peptides that respond to GH. This study. Why preferable to GH levels themselves? Could use a profile of several markers. After single marker or profile is selected, need policy on sensitivity and specificity of the test.

Study Goal: Determine Other Relevant Subject Characteristics Outline: Serum IGF and collagen markers are known to be GH-responsive. Are age, gender, BMI, ethnicity, and sport type related to the marker levels in elite athletes? Determine the relative and combined explanatory power of these factors on the markers. Suggest factors to be included in a doping test.

Not a Study Goal: Establish Reference Ranges Gender, Age SpecificGender, Stature Specific

Eventual Goal: Establish Reference Ranges Gender, Age, Other Specific Female IGF1 Mean Curves Decreasing Age Other1 Other2 Other3 Other4 Male Need separate Other3 and Other4 Curves for Males? Others could be Ethnicities Mock-Up

Eventual Goal: Use Reference Ranges for Doping Test Issues: Statistical significance of associations of subject characteristics with markers not so important. Even weak associations may be included in a formula (e.g., logistic regression) or rule (e.g., CART) for Yes/No doping using markers and other factors. Primary criteria are sensitivity and specificity of rule and their precision.

Back to Study Goals: Outcomes are IGF and Collagen Markers (Not Yes/No Doping) Determine the relative and combined explanatory power of age, gender, BMI, ethnicity, and sport type on the markers. * * for age, gender, and BMI. Figure 2. One conclusion is lack of differences between ethnic IGF1 means, after adjustment for age, gender, and BMI (Fig 2). How are these adjustments made?

Adjustment: What Specifically is It? Standardization? Normalization? Stratification? Adjustment?

Adjustment: For a Single Continuous Characteristic Problem: Groups to be compared (e.g., ethnicities) have different mean ages, and age is associated with IGF1. Solution: Make groups appear to have the same mean age. Find regression line predicting IGF1 from age. Move each subject parallel to the regression line to the mean age. This is the expected IGF1 if this subject had been at the mean age. Adjusted means are means of these adjusted individual values.

Adjustment: For a Single Continuous Characteristic We simulate data for Caucasians and Africans only for simplicity, to demonstrate attenuation of a =15 μg/L difference to a =3 μg/L difference

Adjustment: For a Single Continuous Characteristic Typical Software (SAS): proc glm; class ethnic; model igf1=ethnic age; means ethnic; *This gives unadjusted IGF1 means; lsmeans ethnic; *This gives adjusted IGF1 means; run;

Adjustment: For a Single Continuous Characteristic From the regression: Δ=slope=ΔIGF1/ΔAge. Individual Subjects: Adjusted IGF1 = IGF1 + Δ(Age - Overall Mean Age). Groups: Adjusted IGF1 Group Mean = Group Mean IGF1 + Δ(Group Mean Age - Overall Mean Age).

Adjustment: For a Single Classification Characteristic Problem: Groups to be compared (e.g., ethnicities) have different proportions of females, and gender is associated with IGF1. Solution: Make groups appear to have the same proportion of females. IGF1 means are adjusted to what is expected if each group had the same proportion of females. Different proportions of subjects in each group are adjusted up or down to accomplish this.

Adjustment: For a Single Classification Characteristic Δ = Male IGF1 Mean - Female IGF1 Mean Individual Subjects: Adjusted IGF1 = IGF1 + Δ(I F - Overall Prop Female). where I F = 1 if Female, 0 if Male Ethnic Groups: Adjusted IGF1 Group Mean = Group Mean IGF1 + Δ(Group Prop Female - Overall Prop Female).

Adjustment: For a Single Classification Characteristic Δ = Male IGF1 Mean - Female IGF1 Mean = = -78 Ethnic Groups: Adjusted IGF1 Group Mean = Group Mean IGF1 + Δ(Group Prop Female - Overall Prop Female). Caucasian: ( ) = 153 African: ( ) = 150 Caucasian: 236/590 = 0.40 Female African: 27/109 = 0.25 Combined: 263/699 = 0.38 Female

Adjustment: For a Single Classification Characteristic Stratified (Subgroup) Caucasian vs. African: Male: 123 vs. 125p=0.23 Female:203 vs. 185p= Adjusted:153 vs. 150p=0.32 Genders Differ on Ethnic Differences: Is adjustment justified? Loses a main feature of the results. The interaction, i.e., =-2 vs =17 has p= (Recall that this is simulated data, not actual paper results)

Adjustment: For Multiple Characteristics The process is sequentially applied. Two types of p-values: Type I: Adjusting only for factors earlier in the sequence applied. Type III: Adjusting for all included factors. This is most common - “independent” effect, relative to all other effects. Figure 2: Adjusts for age, BMI, gender. Unclear which removes ethnic differences.