Classification Schemes for Error in Clinical Research Szklo and Nieto –Bias »Selection Bias »Information/Measurement Bias –Confounding –Chance Other Common Approach –Bias »Selection Bias »Information/Measurement Bias »Confounding –Chance
Confounding and Interaction I Confounding: one of the central problems in observational clinical research –What is it? What does it do? –What kind of variables act as confounders? –Which variables to consider as confounders? –Which variables not to consider as confounders? »Emphasis on specifying the research question
Smoking, Matches, and Lung Cancer A tobacco company researcher believes that exposure to matches is the cause of lung cancer He conducts a large case-control study to test this hypothesis
Smoking, Matches, and Lung Cancer Your colleague has located 1000 cases of lung cancer from a population-based registry, of whom 820 have a history of carrying matches. Among 1000 reference (control) patients (selected randomly from the population and determined to have normal chest x-rays), 340 carry matches. Quantitate the relationship between matches and lung cancer in your colleague’s data.
Matches and Lung Cancer Exposure odds ratio = (820/180) / (340/660) = disease odds ratio OR = 8.8 95% CI (7.2, 10.9)
Smoking, Matches, and Lung Cancer You decide to look at the relationship between matches and lung cancer in the smokers separately from the non-smokers. You find that among the 1000 cases, 900 are smokers and 810 (OF THE 900) carry matches. Among the 1000 control patients, 300 are smokers and 270 (OF THE 300) carry matches. Draw the necessary stratified tables and calculate the relevant measure of association
Smoking, Matches, and Lung Cancer Stratified Crude Non-SmokersSmokers OR crude OR CF+ = OR smokers OR CF- = OR non - smokers Stratification produces 2 two-by-two tables In each stratum, all subjects are homogeneous with respect to smoking We have adjusted or controlled for smoking OR crude = 8.8 (7.2, 10.9) OR smokers = 1.0 (0.6, 1.5) OR non-smoker = 1.0 (0.5, 2.0)
Confounding: Smoking, Matches, and Lung Cancer Illustrates how confounding can create an apparent effect even when there is no actual true effect –Can also be opposite: confounding can mask an effect when one is truly present Proper terminology –In the relationship between matches and lung cancer, smoking is a confounding factor or a confounder –Smoking confounds the relationship between matches and lung cancer In clinical research, confounding has a very specific meaning
Estes continues to be confounding puzzle Ray RATTO Friday, July 20, 2001 ©2001 San Francisco Chronicle Ray RATTO ©2001 San Francisco Chronicle SHAWN ESTES seemed loath to analyze his own performance last night, for fear that people would see the first three innings and use them to obscure the last four. But that's what made his outing so perfectly Estes-like -- an ongoing argument with himself that he eventually won. Well, an argument in which he held his own and his teammates won for him in the bottom of the ninth. Ramon Martinez lined a game-tying single with two outs, and Jeff Kent followed two batters later with a bases- loaded walk off Juan Acevedo to give the Giants a 2-1 victory against Colorado and move them to within 4 1/2 games of division leader Arizona. It was in many ways an eye-opening victory for a team that hadn't had one of this type for a while.
Finding: “After an initial course of post-exposure prophylactic (PEP) medication following a sexual exposure to HIV infection, gay men reported a decrease in the practice of high-risk behavior over the following year.” Reviewer: “Perhaps the men simply withheld the real amount of high-risk behavior they had in order to be eligible for future courses of PEP. How do you account for this confounding?”
Smoking, Matches, and Lung Cancer The study is not over! To be complete, you also decide to examine the relationship between smoking and lung cancer independent from the use of matches. What tables should you construct to do this?
Smoking, Matches, and Lung Cancer Stratified Crude Matches Absent Matches Present OR crude OR CF+ = OR matches OR CF+ = OR no matches OR crude = 21.0 (16.4, 26.9) OR matches = 21.0 (10.7, 41.3) OR no matches = 21.0 (13.1, 33.6)
Confounding: Smoking, Matches, and Lung Cancer Interpretation? What is the effect of matches on the relationship between smoking and lung cancer? –Illustrates one important component in the requirements of a confounder (aka a confounding factor) - Must be associated with the disease
Confounding: Examples of Magnitude and Direction Stratified (adjusted) Crude (unadjusted) Potential Confounder Absent Potential Confounder Present RR crude RR CF+ RR CF-
Nightlights Let there be light!
Nightlights and Myopia Quinn et al. Nature 1999 Prevalence Ratio =
Insert picture with nightlight off Lights are off and the stumbling around begins.
How might confounding be accounting for this finding?
Nightlights and Myopia: Two subsequent studies found no association –Zadnik et al. and Gwiazda et al. Nature, 2000
Child’s Myopia Night Light Parental Myopia X X
Insert picture with nightlight on again Let there be light (again)!
What kind of variables act as confounders? Properties of a True Confounder –A true confounder (C) must be associated with: »the exposure (E) in question and »the disease (D) under study Confounder D D ANOTHER PATHWAY TO GET TO THE DISEASE ANOTHER PATHWAY TO GET TO THE DISEASE RQ: Is E associated with D independent of C?
C C ? ? E E D D Causal Diagrams Formally called directed acyclic graphs (DAGs) Frontier of epidemiologic theory Use for identifying pitfalls of adjusting and not adjusting for certain variables (see text)
Lung Cancer Matches Smoking ? ? RQ: Are matches associated with lung cancer independent of smoking?
Properties of a True Confounder Refined Properties: Association with Exposure A confounding variable can be either: – the cause of – the result of, or, – simply associated in a non-causal manner with the exposure in question Confounder D D
C causes E ? [via cardiovascular work-out] RQ: Is sexual activity associated with survival independent of general health?
Non-causal relationship between C and E CAD Other Meds (e.g., ASA) Ca channel Blockers GI Bleeding ? RQ: Are Ca channel blockers associated with GI bleeding independent of other med use?
E causes C Poor Diet Poverty Mortality ? [access to care] RQ: Is poverty associated with survival independent of effects on diet?
Properties of a True Confounder Refined Properties: Association with Disease A confounding variable must be associated with the disease. –It can be a “cause” of disease, or –it may merely be a marker for a true cause Confounder D D
Hep B and C virus infection C causes D IDU Early Mortality ? [via bacterial infections] RQ: Is injection drug use associated with survival independent of effect on hepatitis infections?
Maternal Age Unknown biologic factor(s) C as a marker for D Birth Order Down Syndrome ? RQ: Is birth order associated with survival independent of maternal age?
General Health Unknown biologic factor(s) C as a marker for D Sexual Activity ? [via cardiovascular work-out] Mortality RQ: Is sexual activity associated with survival independent of general health?
Suicide Anti-depressant use in children ? ?
Suicide Anti-depressant use in children Depression ? ?
Head Injury Use of Helmets in Motorcyclists ? ? Serious Head Injury
Use of Helmets in Motorcyclists Safety- oriented Personality ? ? Safe Driving
Cardio- vascular Disease Anti-retroviral Drugs for HIV ? ?
Cardio- vascular Disease Anti-retroviral Drugs for HIV Aging ? ? Unknown Biological Factors
What is NOT a Confounder? A variable that is an intermediate step in the causal path under study between the exposure in question and a disease is not a confounding variable. E E D D factor I Factor I is not a confounder It is on the pathway under study. It is an intermediary variable Factor I is not a confounder It is on the pathway under study. It is an intermediary variable
CCR5 and HIV Disease Progression CCR5 (receptor) defect AIDS CCR5 is the human cellular receptor for HIV Defects in CCR5 have been described HIV viral load is a powerful predictor of time-to-AIDS and is associated with CCR5 defect status How should HIV viral load be handled in assessing the association between CCR5 defect status and progression in HIV disease to AIDS? CCR5 is the human cellular receptor for HIV Defects in CCR5 have been described HIV viral load is a powerful predictor of time-to-AIDS and is associated with CCR5 defect status How should HIV viral load be handled in assessing the association between CCR5 defect status and progression in HIV disease to AIDS? ? ?
It depends upon the research question CCR5 defect ? [Other mechanisms] ? [HIV plasma viral load] AIDS #1: Is CCR5 associated with progression to AIDS, irrespective of mechanism? CCR5 defect Low viral load AIDS Do not adjust for viral load ! High viral load HIV plasma viral load Do Adjust ! #2: Is CCR5 associated with progression to AIDS, independent of viral load?
RQ 1: What if you did adjust for viral load? CCR5 defect AIDS #1: Is CCR5 associated with progression to AIDS, irrespective of mechanism? Low viral load High viral load If “via plasma viral load” was only pathway, no effect for CCR5 would be observed ? [HIV plasma viral load]
Ioannidis et al. Ann Int Med 2001 CCR5 defect Other mechanism #2 ? ? HIV plasma viral load AIDS #1 CCR5 defect ? ? AIDS Do not adjust ! HIV plasma viral load Adjust ! Crude (unadjusted) association: - Relative hazard: 0.71 (95% CI: 0.5 to 0.9) Crude (unadjusted) association: - Relative hazard: 0.71 (95% CI: 0.5 to 0.9) Stratified (adjusted) by HIV viral load - Relative hazard: 0.82 (95% CI: 0.6 to 1.2) Stratified (adjusted) by HIV viral load - Relative hazard: 0.82 (95% CI: 0.6 to 1.2)
Exercise and CAD When evaluating the relationship between exercise and CAD, is HDL a confounder or an intermediary? Exercise CAD HDL cholesterol
It depends on the pathway under investigation If interest is in a pathway other than through HDL, then HDL is a confounder Here, HDL is extraneous to pathway under study Confounding factors are extraneous factors Exercise CAD [not yet specified mechanism] HDL ? ?
Exercise and CAD If HDL is on the pathway in question, then HDL is an intermediary variable. e..g., Does exercise influence CAD risk in a newly studied population (elderly Asians)? Hence, classification of HDL as confounder or intermediary depends upon the biological pathway under investigation Exercise CAD HDL is not a confounder here [HDL... other mechanisms]
When Planning a Study, Which Factors Should be Considered as Potential Confounders? In established research areas: –any factor for which prior evidence indicates it is a confounder »e.g., effect of diet on CAD? must deal with smoking as potential confounder and When studying new risk factors (exposures) for which little is known: –plan on measuring ALL factors associated with the disease –i.e. If you don’t, you may regret it later Confounding can be dealt with in the analysis phase of a study but NOT if the confounder is not measured
Seeking cause of high Marin cancer rates Activists canvass residents to search for trends Sunday, November 10, 2002 Thousands of volunteers scattered across Marin County under baleful skies Saturday in an unprecedented grassroots campaign against the region's soaring cancer rate. Armed with surveys, some 2,000 volunteers went door to door in every neighborhood in the county.... The volunteers hope to collect enough money to hire an epidemiologist...
Methods to reduce confounding During study design: »Randomization »Restriction »Matching During study analysis: »Stratified analysis »Multivariable regression models