Presentation is loading. Please wait.

Presentation is loading. Please wait.

Guidelines for Appropriate OAI Data Use December 6, 2007 Yuqing Zhang, Boston University.

Similar presentations


Presentation on theme: "Guidelines for Appropriate OAI Data Use December 6, 2007 Yuqing Zhang, Boston University."— Presentation transcript:

1 Guidelines for Appropriate OAI Data Use December 6, 2007 Yuqing Zhang, Boston University

2 OAI Design and Analysis Issues Many of the variables are clustered by knee. Many of the variables are clustered by knee. Many of the variables are collected longitudinally over time on a person and/or knee Many of the variables are collected longitudinally over time on a person and/or knee Expensive MRI measurements may not be available for all participants – may make case-control design (nested within cohort) more attractive Expensive MRI measurements may not be available for all participants – may make case-control design (nested within cohort) more attractive

3 Consequences Need to weigh the advantages and disadvantages of various design strategies. Need to weigh the advantages and disadvantages of various design strategies. Failure to properly analyze clustered or longitudinal data can cause p-values and confidence intervals to be seriously incorrect. Failure to properly analyze clustered or longitudinal data can cause p-values and confidence intervals to be seriously incorrect.

4 Agenda Yuqing Zhang to discuss design considerations with a 15 minute question and answer period Yuqing Zhang to discuss design considerations with a 15 minute question and answer period Chuck McCulloch to discuss analysis issues with a 15 minute question and answer period Chuck McCulloch to discuss analysis issues with a 15 minute question and answer period

5 Study Design

6 Types of Epidemiologic Study Cross-sectional Study Cross-sectional Study Cohort Study Cohort Study Longitudinal cohort study Longitudinal cohort study Case-control StudyCase-control Study Matched case-control studyMatched case-control study

7 Cross-sectional Study Assess presence or absence of both risk factor and disease at same point in time among study population and examine their association Assess presence or absence of both risk factor and disease at same point in time among study population and examine their association Measure of association Measure of association odds of disease among exposed subjects odds of disease among exposed subjects Prevalence odds ratio = odds of disease among non-exposed subjects odds of disease among non-exposed subjects

8 Cross-sectional Study ROA Subjects with Knee Pain No Yes Total No. Subjects Prevalence of Knee Pain (%) Prevalence odds ratio NO70010080012.51.0 YES1208020040.04.7 In a survey of 1000 subjects, 200 subjects had ROA in at least one knee. Of those with knee OA, 80 reported knee pain; and of those without knee ROA, 100 reported knee pain

9 Cohort Study Subjects are classified on the basis of presence or absence of exposure to a particular risk factor and followed for a specified period of time to determine development of disease in each group Measure of association RR = CI e / CI u

10 Cohort Study BMI (kg/m 2 ) No. subjects Developing Knee ROA Total No. Subjects Risk of ROA Over 2 years (%) Risk Ratio < 25 5010005.01.0 > 25 200200010.02.0 3000 subjects without knee ROA were followed up for 2 years. Among 2000 subjects with BMI > 25 kg/m 2, 200 developed knee ROA. Among subjects with BMI 25 kg/m 2, 200 developed knee ROA. Among subjects with BMI < 25 kg/m 2, 50 developed knee ROA.

11 Longitudinal Cohort Study Studies in which outcome variable is repeatedly measured over time (i.e., outcome variable is measured in same individual on several different occasions) Studies in which outcome variable is repeatedly measured over time (i.e., outcome variable is measured in same individual on several different occasions) Observations of one individual over time are not independent Observations of one individual over time are not independent Special statistical techniques are required to take into account correlation between repeated observations Special statistical techniques are required to take into account correlation between repeated observations It can assess effect of change of exposure over time on change in outcome variable It can assess effect of change of exposure over time on change in outcome variable

12 Longitudinal Cohort Study Both knee pain (WOMAC) and BMI were assessed repeatedly (5 times) among participants Both knee pain (WOMAC) and BMI were assessed repeatedly (5 times) among participants Is greater BMI associated with higher WOMAC score (cross-sectional association) Is greater BMI associated with higher WOMAC score (cross-sectional association) Is change in BMI associated with change in WOMAC score (longitudinal association) Is change in BMI associated with change in WOMAC score (longitudinal association) Is increasing BMI associated with increasing WOMAC score Is increasing BMI associated with increasing WOMAC score Is decreasing BMI associated with decreasing WOMAC score Is decreasing BMI associated with decreasing WOMAC score

13 Case-control Study Subjects with a disease of interest are selected as the cases, and a sample of subjects from the source population which gives rise to cases are selected as the controls Subjects with a disease of interest are selected as the cases, and a sample of subjects from the source population which gives rise to cases are selected as the controls ‘Nested case-control’ study: cases and controls are drawn from the defined sample of a cohort study ‘Nested case-control’ study: cases and controls are drawn from the defined sample of a cohort study Measure of association Measure of association OR (odds ratio) =(a*d)/(b*c) OR (odds ratio) =(a*d)/(b*c) K. Rothman, Epidemiology: an Introduction, 2002

14 Case-control Study Of 100 knees with incident knee pain at follow-up visit (cases), 40 knee had BMLs present at baseline examination; among 100 knees without pain (controls), 10 had BMLs present at baseline examination. Of 100 knees with incident knee pain at follow-up visit (cases), 40 knee had BMLs present at baseline examination; among 100 knees without pain (controls), 10 had BMLs present at baseline examination. Frequent Knee Pain Presence of BMLs at Baseline No Yes Odds Ratio Controls90101.0 Cases60406.0

15 Matched Case-control Study Subjects are selected in such a way that some potential confounders (i.e., matching variables) are distributed in an identical manner among cases and controls Subjects are selected in such a way that some potential confounders (i.e., matching variables) are distributed in an identical manner among cases and controls Explicit matching Explicit matching e.g., each case and control pair was matched by study center, sex age (+ years), etc. Implicit matching Implicit matching e.g., knee with pain (case) vs. contra-lateral knee without pain (control) e.g., knee with pain (case) vs. contra-lateral knee without pain (control)

16 Should We Analyze Knees or Persons?

17 Background In a knee OA study, data are often collected at both the person-level and the knee-level In a knee OA study, data are often collected at both the person-level and the knee-level Analysis for knee-specific outcome variable presents special issue in statistical inference owing to correlation between two knees within a person Analysis for knee-specific outcome variable presents special issue in statistical inference owing to correlation between two knees within a person

18 Observation Unit (Knees vs. Person) Observation Unit (Knees vs. Person) Research Questions Research Questions Do subjects with SxOA take longer time to complete 15-m walk than those without SxOA? Do subjects with SxOA take longer time to complete 15-m walk than those without SxOA? Is greater BMI associated with an increased risk of knee ROA? Is greater BMI associated with an increased risk of knee ROA? Is presence of synovitis associated with prevalent knee pain? Is presence of synovitis associated with prevalent knee pain? Are changes of severity of bone marrow lesions (BMLs) associated with pain fluctuation? Are changes of severity of bone marrow lesions (BMLs) associated with pain fluctuation? Cost of Additional Data Collection OutcomeOutcome ExposureExposure ConfoundingConfounding Residual Confounding Residual Confounding

19 Research Questions Do subjects with SxOA take longer time to complete 15-m walk than those without SxOA? Do subjects with SxOA take longer time to complete 15-m walk than those without SxOA? Outcome is a person-based Outcome is a person-based Is greater BMI associated with an increased risk of knee ROA? Is greater BMI associated with an increased risk of knee ROA? Outcome can be either person-based or knee-based Outcome can be either person-based or knee-based

20 Person-Based Approach Observation unit for the outcome variable is the person Observation unit for the outcome variable is the person Worst knee Worst knee None, unilateral, or bilateral None, unilateral, or bilateral Model outcome as a function of a set of risk factors Model outcome as a function of a set of risk factors

21 Pros and Cons Statistical software is readily available (logistic regression, linear regression) Statistical software is readily available (logistic regression, linear regression) Effect estimate for site-specific risk factor might be biased Effect estimate for site-specific risk factor might be biased Statistical power might be sacrificed Statistical power might be sacrificed

22 Is greater BMI associated with an increased risk of knee ROA? Is greater BMI associated with an increased risk of knee ROA? Is presence of synovitis associated with prevalent knee pain? Is presence of synovitis associated with prevalent knee pain? Research Question

23 Knee-Based Approach Observation unit for the outcome variable is the knee Observation unit for the outcome variable is the knee Model outcome in each knee as a function of a set of risk factors, while accounting for correlation between two knees within a person Model outcome in each knee as a function of a set of risk factors, while accounting for correlation between two knees within a person

24 Pros and Cons Offer improved statistical power Offer improved statistical power Provide new insights into etiology of risk factor Provide new insights into etiology of risk factor Statistical software is readily available (Generalized Estimating Equations (GEE)) Statistical software is readily available (Generalized Estimating Equations (GEE)) Increase cost of data collection in both knees Increase cost of data collection in both knees

25 Is presence of synovitis associated with prevalent knee pain? Is presence of synovitis associated with prevalent knee pain? Research Questions

26 Cost of Data Collection All subjects in OAI had knee MRI taken repeatedly All subjects in OAI had knee MRI taken repeatedly Cost of reading all MRIs is extreme Cost of reading all MRIs is extreme To reduce cost, read MRIs of cases and a sample of controls To reduce cost, read MRIs of cases and a sample of controls

27 Traditional Approach Study design: case-control study Knees with pain (cases) Knees with pain (cases) Knees without pain (controls) Knees without pain (controls) Assess presence of synovitis (or its severity) Assess presence of synovitis (or its severity) Evaluate association between synovitis and prevalent knee pain using logistic regression model adjusting for confounding factors Evaluate association between synovitis and prevalent knee pain using logistic regression model adjusting for confounding factors

28 Pros and Cons Reduce cost of data collection Reduce cost of data collection Statistical model is readily available (logistic regression model) Statistical model is readily available (logistic regression model) Residual confounding is a concern Residual confounding is a concern

29 Which is better control knee?

30 Person-Matched Case-Control Design Eligible subjects Eligible subjects Knee with pain (case) Knee with pain (case) Contra-lateral knee without pain (control) Contra-lateral knee without pain (control) Case and control are implicitly matched by person-level risk factors Case and control are implicitly matched by person-level risk factors

31 Pros and Cons Reduce cost of data collection Reduce cost of data collection Eliminate effect of person-level confounding factors Eliminate effect of person-level confounding factors Statistical model is readily available (conditional logistic regression) Statistical model is readily available (conditional logistic regression) Unable to examine effect of change in risk factor on pain fluctuation Unable to examine effect of change in risk factor on pain fluctuation Findings may not apply to persons with bilateral knee pain Findings may not apply to persons with bilateral knee pain

32 Are changes of severity of synovitis associated with pain fluctuation? Are changes of severity of synovitis associated with pain fluctuation? Research Questions

33 Potential Confounding Pain is subjective measurement Pain is subjective measurement There is a natural variability in pain sensitivity, perception and tolerance to pain stimuli There is a natural variability in pain sensitivity, perception and tolerance to pain stimuli Many factors affecting pain variability are often not collected in the study Many factors affecting pain variability are often not collected in the study genetic predispositiongenetic predisposition socio-cultural environmentsocio-cultural environment prior experience prior experience expectations expectations current mood status current mood status Use each knee as its own control to minimize potential confounding between persons Use each knee as its own control to minimize potential confounding between persons

34 Traditional Approach Study design: longitudinal study Collect data on presence of pain for each knee repeatedly Collect data on presence of pain for each knee repeatedly Collect data on presence (or severity) of synovitis for each knee repeatedly Collect data on presence (or severity) of synovitis for each knee repeatedly Analyze severity of synovitis and its change in relation to pain fluctuation adjusting for confounding factors Analyze severity of synovitis and its change in relation to pain fluctuation adjusting for confounding factors

35 Pros and Cons Allow investigators to assess relation of severity of synovitis and its change to occurrence of knee pain Allow investigators to assess relation of severity of synovitis and its change to occurrence of knee pain Statistical model is readily available (GEE or MIXED model) Statistical model is readily available (GEE or MIXED model) Cost of assessing synovitis from MRI for all subjects is too high Cost of assessing synovitis from MRI for all subjects is too high Residual confounding is still a concern Residual confounding is still a concern

36 Self-Matched Case-Control Design Each knee serves as its own control (akin to case-crossover study design) Each knee serves as its own control (akin to case-crossover study design) Included are only knees that have pain at one or more, but not all, examinations Included are only knees that have pain at one or more, but not all, examinations Synovitis, either severity or presence, is assessed for each knee repeatedly Synovitis, either severity or presence, is assessed for each knee repeatedly

37 Eligible Knees Knees which had pain in at least one, but not all, visits (i.e., exams) Knees which had pain in at least one, but not all, visits (i.e., exams) Case-visit: a knee had pain at a scheduled visit Case-visit: a knee had pain at a scheduled visit Control visit: the same knee that did not have pain at a scheduled visit Control visit: the same knee that did not have pain at a scheduled visit

38 Knees Baseline 12-month 24-month Eligible 12341234 12341234 pain -> no pain -> pain no pain -> pain -> no pain pain -> no pain -> pain no pain -> pain -> no pain NO NO

39 56785678 56785678 pain -> no pain no pain -> pain pain -> no pain no pain -> pain Knees Baseline 12-month 24-month Eligible

40 Pros and Cons Allow investigators to examine effect of change in severity of synovitis on pain fluctuation Allow investigators to examine effect of change in severity of synovitis on pain fluctuation Statistical model is readily available (conditional logistic regression model) Statistical model is readily available (conditional logistic regression model) Reduce cost of MRI reading Reduce cost of MRI reading Minimizing residual confounding Minimizing residual confounding Unable to identify risk factors for persistently painful or never painful knees Unable to identify risk factors for persistently painful or never painful knees

41 Summary No single study design can fit every research question in OAI No single study design can fit every research question in OAI When choosing study design, one must consider When choosing study design, one must consider Cost Cost Internal validity Internal validityConfoundingBias Generalizability or external validity (???) Generalizability or external validity (???) Collaboration and group effort are required Clinician, basic scientist, statistician, epidemiologist Clinician, basic scientist, statistician, epidemiologist It takes a village!!!! It takes a village!!!!

42 Thank you for your attention.


Download ppt "Guidelines for Appropriate OAI Data Use December 6, 2007 Yuqing Zhang, Boston University."

Similar presentations


Ads by Google