Presentation is loading. Please wait.

Presentation is loading. Please wait.

Main points to be covered

Similar presentations


Presentation on theme: "Main points to be covered"— Presentation transcript:

1 Main points to be covered
Rate ratios Relationship between the type of ratio measure of association and study design How the odds ratio estimates other ratio measures depending on the type of sampling in a case-control study Strengths and weaknesses of case-control studies

2 Comparison of two person-time rates with a ratio
Ratio of two person-time rates NB: denominators of two person-time rates must be in the same units Rate NSAID use = per 1000 person-years Rate for non use = per 1000 person-years Rate ratio = 1.01 Described as “rate ratio” in the article This is the example we looked at when we were discussing forming incidence rates in a cohort with a time varying exposure, in this case use of non-steroidal anti-inflammatory medications (NSAIDS). The basic measure of association reported was the rate ratio, the event rate during the person-time of NSAID use divided by the rate during the person-time of no use. A rate ratio is simply the ratio of two rates and thus is the analogue of the risk ratio for person-time measures of incidence. It should be clear that an incidence rate should not be compared to cumulative incidence: a ratio using one of each type of incidence measure would not be interpretable. In comparing two incidence rates, the important thing to remember is that the time units of the two measures forming the ratio must be the same. In the example above, if the first rate were per 100 person-years and the second remained per 1,000 person-years, the rate ratio would jump up to 10.1, and it would appear there was a strong association when there was no association. The authors correctly reported their measure of association as a rate ratio. They could have also reported a rate difference if they had chose. It was very small, 0.16 per 1,000 person-years or 1.6 per 10,000 person-years. Ray, Lancet, 2002

3 Rate Ratio described as Relative Risk
CA mortality rate ratio by BMI group: 841.6 per 100,000 person-years (BMI 40+) 578.3 per 100,000 person-years (BMI < 40) Rate ratio = 1.46 (Calle, NEJM, April 2003) Described as “relative risk” in the article (very common practice) Although we think everyone should use the rate ratio (relative rate is a defensible alternative since the term “relative” implies a ratio), rate ratios are probably reported as relative risks about as often as they are called rate ratios or relative rates in the literature. In the recent NEJM article cited above, the abstract uses relative risk to describe rate ratios.

4 Rate ratio vs. Risk ratio
Risk must be between 0 and 1 Thus in comparing 2 groups high risk in unexposed group limits how large ratio can be Eg, risk in unexposed group = 0.7 means maximum risk ratio = 1.0/0.7 = 1.42 Rates are not restricted between 0 and 1 If exposed rate = 10/100 person-years and unexposed rate = 5/100 person-years, risk (cumulative incidence) in 2 groups after 20 years = 0.88 and 0.64. Risk ratio would be 0.88/0.64 = 1.38 but rate ratio = 10/5 = 2.0. An important difference between cumulative incidence (=risk) and incidence rates is that the constraint placed on risk of being a proportion and therefore bounded by 0 and 1 does not apply to rates which can be greater than 1. When the ratio of two proportions is formed (risk ratio), the constraint on proportions also constrains the possible values of the ratio. As the proportion rises in the group with the smaller proportion (say, the unexposed group), a maximum risk ratio value is imposed. For example, if the cumulative incidence is 0.33 in the unexposed group, the maximum risk ratio is 3 (1.0/0.33); if it is 0.50, the maximum risk ratio is 2 (1.0/0.5); and if it is greater than 0.50, as in the example above, the risk ratio must be < The rate ratio is not limited in the same way.

5 Ratio measures of association and Study Design
Cross-sectional study Prevalence ratio (Prevalence) odds ratio Cohort Study Risk ratio Rate ratio Odds ratio Case-control study The type of study limits the kind of ratio measure that can be calculated. The cohort study allows calculation of all three measures, but as we have discussed, the risk and rate ratio measures are more easily understood than the odds ratio and are therefore the measures reported in cohort studies rather than the odds ratio. The prevalence ratio looks like a risk ratio, but one should always keep in mind that it is risk based on existing – or prevalent – disease rather than incident disease. The odds ratio from a cross-sectional study is not usually described as a prevalence odds ratio, hence the parentheses above, but of course it is also based on a prevalent sample. Prevalence odds ratio is a more precise way to characterize what is being reported. Finally, the case-control study only allows calculation of the odds ratio, a major reason for the importance of the odds ratio in clinical research.

6 Measure of Association in Case-Control Studies
Odds ratio (OR) of exposure in cases and controls is the measure of association in case-control studies OR of exposure = OR of disease OR of exposure estimates different measures of association depending on the type of sampling used for the case-control designs Since the case-control design starts with the cases and controls and fixes the ratio of the two by selecting one or more controls per case, the investigator is looking within the disease and non-disease groups at the proportion in each who have the exposure of interest. So the odds ratio is the OR of exposure, but as we demonstrated in the last lecture, one of the useful properties of the OR is that the OR of exposure equals the OR of disease, the OR we are really interested in. For a long time, it was generally accepted that the OR in a case-control study could be thought of as approximating the risk ratio if “the disease was rare.” This idea stems from the properties of odds we discussed in the last lecture; i.e., the odds approximates the probability when the probability is low (say, less than 10%) and therefore a ratio of odds approximates a ratio of probabilities when the probabilities are low. Current epidemiologic thinking has demonstrated that this is not the case. Depending on the sampling design, the odds ratio can give an unbiased estimate of the ratio measures without this “rare disease” assumption.

7 1= exposed and 0 = not exposed
To understand what OR in a case-control study estimates, we return to the setting of a cohort Some abbreviations: 1= exposed and 0 = not exposed E1 = Events = diagnoses in exposed group E0 = Events = diagnoses in unexposed group N1 = number of persons in exposed group N0 = number of persons in unexposed group Some notation we will need to see how the OR estimates other ratio measures.

8 Risk ratio in a cohort study
So if you have the cases, To simplify the presentation, assume a cohort with complete and equal follow-up on everyone in both groups (i.e., the outbreak example we have talked about). In that case everyone at baseline is present at the end of the study and the risk ratio is calculated simply as shown above. In the situation of the typical cohort with censoring, it is a bit more complicated, but one can easily see that if the assumption of no informative censoring is met, that an unbiased estimate of these ratios is what is being calculated as the risk ratio (the meaning of no informative censoring is that the censoring is not related to E nor to exposure so the four groups above retain their original proportions). The algebra just rearranges the risk ratio to show that the ratio of exposed to unexposed in the whole cohort is a separate ratio of interest. Only need the ratio from a case-control study to estimate risk ratio.

9 Capturing the events with a case-control design
From a well-defined study base: All new cases in the study base will be the same cases as in a cohort study of the base (E1 and E0 ) Or a random sample of the cases will give the same ratio Thinking back to the concept of a primary study base such as participants in a cohort, residents of California, or members of Kaiser Permanente, one can see that obtaining all the cases that occur during a defined time period (or a random sample of those cases) is a goal of the case-control design and easily achievable if all, or nearly all, the cases are diagnosed and registered as in a disease registry or Kaiser’s system of record keeping. Therefore, getting an unbiased estimate of this ratio is not a problem in a case-control design.

10 Sampling in a case-control study
Given all the cases (or a random sample), goal of case-control design is to sample the study base to get an unbiased estimate of the exposure distribution in the controls The exposure distribution in the controls is given by the ratio Thus if this ratio can be obtained in a case-control study, the risk ratio is obtained with a case-control design Still thinking of the concept of the primary study base, the goal in selecting controls in a case-control study is to sample that primary study base in an unbiased way; that is, to sample it independently of exposure status. If that is achieved, then the proportion of controls who are exposed and unexposed will have the same ratio as if we had measured exposure in the entire study base and obtained that ratio.

11 Case Control Sampling and the Risk Ratio
How can the study base be sampled to get an unbiased estimate of the risk ratio? Question can be rephrased as “How can the study base be sampled to get an unbiased estimate of ?”

12 Three Case-Control Designs
1) Case-cohort design: controls sampled from baseline of a cohort or study base 2) Incidence-density controls: controls sampled from risk sets of cohort or study base each time a case is diagnosed 3) Prevalent controls: controls sampled from non-cases at one point in time during follow-up of the cohort or study base To review lecture one, we described these three types of case-control design. All three were described in relation to an explicit cohort or primary study base or a hypothetical cohort (secondary study base), so it should not be surprising that we can now turn to examine what the odds ratio in the different case-control designs estimates in relation to a cohort or primary study base.

13 Case-cohort: sample baseline of cohort
N0 / N1 is sampled randomly from baseline The case-cohort design is distinguished by taking a random sample of the baseline. Since everyone is eligible to be in this sample and it is taken independently of exposure, the sample will give an unbiased approximation of the ratio N0 / N1 . All of the cases are included in the study (although also could be a random sample), so unbiased estimates of both the ratio of exposed to unexposed cases and the ratio of exposed to unexposed in the original cohort are obtained in a case-cohort design.

14 Case-cohort Sampling Control group is random sample of cohort at baseline Estimates the odds of exposure in the study base (i.e., estimates N0 / N1) Control group can be used for >1 outcome Can use same controls later for more follow-up Relatively new design: first described by Prentice (1986); now being extended from cohorts to other primary study bases Odds ratio estimates the risk ratio The case-cohort design begins with the cases, so like other types of case-control sampling it is case-based. It differs from the other approaches to sampling controls by taking a sample of the cohort, or the study base, at time zero. This random sample of the baseline population will estimate the proportion exposed and unexposed in the entire study base. With that ratio, the risk ratio for the study base experience can be estimated without bias. This design is not very well known as it is relatively new. It becomes a good alternative to the other types of case-control sampling if the study has a primary study base which has baseline information or biological specimens on the persons who would be eligible for sampling. Among its advantages are the use of the same control group for more than one outcome or for additional follow-up at time later than the current study. Examining time-varying exposures can be a problem if only baseline data are available on the controls in contrast to incidence-density sampling that selects controls matching on time.

15 exposure odds in cases = a/c and odds in controls = b/d a = E1 b ≈ N1
How OR = Risk ratio in a Case-Cohort design Cases Controls exposure odds in cases = a/c and odds in controls = b/d So OR = E1 E0 N1 N0 a = E1 b ≈ N1 Yes Exposure c = E0 d ≈ N0 No This graphic puts the notation from a cohort of exposed and unexposed cases and the proportions of the cohort exposed and unexposed into the notational framework of the 2x2 table we have been using to illustrate the properties of the odds ratio. The critical point is that the controls in the case cohort design estimate the proportions exposed and unexposed in the whole cohort, so cells b and d are unbiased estimates of N1 and N0. When the odds ratio of exposure in the cases and controls is formed and the notation from a cohort is substituted for the cells in the 2x2 table, the resulting ratio is the same as the risk ratio as completing the algebra on the next slide shows.

16 Risk ratio in a cohort study
So if you have the cases, Returning to the earlier slide, inverting the ratio on the bottom from the previous slide and multiplying gets the right-hand term shown above, which is the same as the risk ratio on the left-hand side. Only need the ratio

17 Example of case-cohort study
In the Netherlands Cohort Study among 120,852 subjects aged 55-69 years at baseline (1986), the association between vitamins and carotenoids intake, vitamin supplement use, and bladder cancer incidence was examined After 6.3 years of follow-up, data from 569 cases and 3123 subcohort members were available for case-cohort analyses. The age-, sex-, and smoking-adjusted relative risks (RRs) for retinol, vitamin E, folate, a-carotene, b-carotene, lutein and zeaxanthin, and lycopene were 1.04, 0.98, 1.03, 0.99, 1.16, 1.11, and 1.08, respectively, comparing highest to lowest quintile of intake. Only vitamin C (RR: 0.81, 95% CI: , P-trend = 0.08), and b-cryptoxanthin intake (RR: 0.74, 95% CI: , P-trend = 0.01) were inversely associated with bladder cancer risk. Zeegers MP, Goldbohm RA, Brandt PA. Are retinol, vitamin C, vitamin E, folate and carotenoids intake associated with bladder cancer risk? Results from the Netherlands Cohort Study. Br J Cancer Sep;85(7):977-83 Here is an example of a recent study that used the case-cohort design. This is a very large study in the Netherlands of 120,852 persons followed since Since some laboratory assays of biological samples stored at baseline were needed, it was not feasible to test all 120,852 so a case-cohort sample was used instead to sample The authors report a relative risk, which if it is meant as a synonym for risk ratio, is the correct measure of association for a case-cohort design.

18 Case-cohort within Netherlands Cohort
Cohort baseline = 120,852 subjects 569 incident cases of bladder cancer This uses our text graphic to illustrate the case-cohort study from the Netherlands just described. 3123 subjects randomly sampled

19 Estimating the rate ratio in a case-control study
For calculating a rate ratio, what is analogous to estimating the proportion of exposed and unexposed persons? Answer: the proportion of exposed and unexposed person-time Moving on now to another measure of association and another type of case-control sampling, we will consider rate ratios in the setting of a case-control study. The first concept is to remember that when we talk about rates, we are talking about person-time and when we talk about measures of association we are talking about looking at incidence rates within the groups of exposed and unexposed persons. To calculate the risk ratio in a case-control setting we need to get an unbiased estimate of the proportion of persons with and without the exposure. For the rate ratio we need an analogous estimate of exposed and unexposed person-time.

20 Rate ratio in cohort where = exposed and = unexposed person-time
So analogous to estimating risk ratio, need to estimate the proportion If we can estimate that proportion in a case- control study, we can estimate the rate ratio This just does the same algebra we have seen in manipulating the form of the risk ratio for the rate ratio to isolate the crucial proportion we need to estimate in order to get the rate ratio.

21 Second type of case-control sampling
Incidence density sampling Controls are sampled from the risk set at the time each case is diagnosed Samples person-time experience of the study base Odds ratio estimates the rate ratio A case-control study with incidence density sampling gives an odds ratio that estimates the rate ratio because the sampling method is based on person-time of exposure.

22 Incidence density sampling in a fixed cohort study base
Controls are matched to cases on time at risk (same amount of follow-up time) Sampling non-cases at each time case occurs samples person-time Someone who is a control at one time can later be a case and/or a control again The key point is that with incidence density sampling in a fixed cohort study base the controls are matched on the amount of follow-up time, so you are sampling both persons and time when you pick the controls and therefore you can conceptualize it as sampling the person-time of the cohort. As with the case-cohort design, someone who is a control can later become a case but this should not be seen as a problem.

23 Incidence density sampling within a fixed cohort
The schematic illustrates how matching on follow-up time would be sampling the person-time follow-up of the cohort as time progresses. Imagine ending the study after the first event occurs. A random sample of those still at risk at that time will give a ratio of exposed and unexposed person-time that is the same as the ratio obtained using everyone in the cohort (give or take random sampling error). The same reasoning applies to each subsequent event. The sum of all those person-time samplings will approximate the total person-time follow-up of the cohort in the exposed and unexposed provided the control samples were taken independently of exposure (always the key issue!). Since controls are matched on follow-up time, sampling controls each time a case occurs samples the person-time of the cohort up to that point. So the total person-time of follow-up is sampled with this design.

24 Incidence density sampling in a dynamic cohort
(e.g., Kaiser Permanente membership) Primary Study Base New members And to further remind you that, when we discussed the concept of a primary study base, we pointed out that having persons joining the study base is no bigger a problem than it is to have persons leaving (which always happens). So the same incidence density sampling is just as valid in a dynamic as in a closed cohort. Sampling in a dynamic cohort gives unbiased estimate of person-time in the same way as sampling in a closed cohort

25 exposure odds in cases = a/c and odds in a = E1 b ≈ N1 T1
How OR = Rate ratio in a case-control study with incidence density sampling Controls Cases exposure odds in cases = a/c and odds in controls = b/d So OR = E1 E0 N1T1 N0T0 a = E1 Yes b ≈ N1 T1 Exposure c = E0 d ≈ N0 T0 So analogously to showing how the OR in a case-cohort design estimates the risk ratio, using the same graphic shows how the OR in a case-control design with incidence density sampling estimates the rate ratio because the controls are providing an estimate of the proportion of exposed to unexposed person-time. No

26 Rate ratio in cohort where = exposed and = unexposed person-time
Completing the algebra shows that this OR is equivalent to the rate ratio. So inverting and multiplying the OR on the previous slide gives the right-hand term on this slide which equals the left-hand term, the rate ratio.

27 Case-control incidence density sampling
...In a population-based case-control study in Germany, the authors determined the effect of alcohol consumption at low-to-moderate levels on breast cancer risk among women up to age 50 years. The study included 706 case women whose breast cancer had been newly diagnosed in and 1,381 residence- and age-matched controls. In multivariate conditional logistic regression analysis, the adjusted odds ratios for breast cancer were 0.71 (95% confidence interval (CI): 0.54, 0.91) for average ethanol intake of 1-5 g/day, 0.67 (95% CI: 0.50, 0.91) for intake of 6-11 g/day, 0.73 (95% CI: 0.51, 1.05) for g/day, 1.10 (95% CI: 0.73, 1.65) for g/day, and 1.94 (95% CI: 1.18, 3.20) for > or = 31 g/day. . . These data suggest that low-level consumption of alcohol does not increase breast cancer risk in premenopausal women. Kropp, S; Becher, H; Nieters, A; Chang-Claude, J. Low-to-moderate alcohol consumption and breast cancer risk by age 50 years among women in Germany. Am J Epidemiol 2001 Oct 1, 154(7): This is an example of a recent study using incidence-density sampling. The abstract does not tell you that they used incidence density sampling; you have to read the methods for that to be clear. The study was carried out in the primary study base of an area of Germany; all new cases of breast cancer during a four-year period were identified; and two controls were selected from the population at the time of each diagnosis (incidence density sampling) who were also matched on area of residence and age group. This has become a very popular design in cancer epidemiology that makes use of a cancer registry to capture nearly all of the cases in a geographic area and the area population becomes a very good primary study base.

28 Random sample of population each time breast cancer diagnosed
Incidence density sampling within a dynamic cohort (German population ) Primary Study Base New residents 706 incident cases of breast cancer 1,381 age & residence matched Two controls sampled from the population each time a case of breast cancer was diagnosed. Random sample of population each time breast cancer diagnosed

29 Third type of case-control sampling
Sampling only non-cases in a primary or secondary study base Prevalent controls because controls are sampled from those without disease with a cross-sectional sample of the study base Odds ratio approximates risk ratio only if disease occurrence is rare This is the third and last of the three methods of sampling controls for a case-control study that we have described earlier. It is the least desirable of the three but it used to be the most common by far. That may no longer be the case as researchers are becoming more sophisticated about case-control design. This sampling is the classic instance of needing the “rare disease” assumption that many text books discuss because the OR will approximate the risk ratio only if the incidence is low or rare.

30 Text example of case-control design showing
sampling prevalent controls from non-cases Study Base So even if all the cases are captured as in the schematic, the controls are drawn only from those present at the time the study is conducted. So unlike the case-cohort and the case-control with incidence density sampling designs, no case can be included in the control group. Because the cases are excluded, the control group can no longer represent the entire baseline population of the cohort. Furthermore, losses to follow-up and deaths also make this group of controls not very representative of the population that gave rise to the cases. Nor can it represent the person-time of the cohort because time is not represented throughout the study base experience in sampling the controls, only one time point is used. Only non-cases are eligible to be controls in this design

31 Inability to calculate risk ratio if controls sampled from non-cases
ratio is known in all case-control designs But sampling only non-cases cannot get unbiased estimate of The ratio of exposed to unexposed in the whole cohort can only be estimated by a sample of everyone at the beginning of follow-up, not just those who remain non-cases at the end of follow-up.

32 Inability to calculate risk ratio if controls sampled from non-cases
Example: Cohort of 200 with prevalence of exposure = 50% 100 exposed cases occur--leaves 60 non-cases 100 unexposed--10 cases occur--leaves 90 non-cases Total: 50 cases non-cases A numerical example to illustrate who is left to sample for controls using the prevalent controls design.

33 Inability to calculate risk ratio if controls sampled from non-cases
So in this example: And: Risk ratio = 4.0

34 40 60 10 90 OR using controls from prevalent non-cases cases controls
Disease No disease 40 60 Exposed time No disease Disease Unexposed 10 90 One quarter of the cohort has been diagnosed with disease during the cohort follow-up leaving only 150 of the original 200 left from which to select controls using the prevalent control case-control design. Since the original cohort was divided 50/50 by exposure and the odds of disease among exposed versus unexposed cases is 4 to 1, the remaining subjects without disease will have a ratio of 60/90 or 2/3 of exposed to unexposed. In other words, the odds of exposure in the eligible controls will be 2/3 and the odds ratio will be 4 divided by 2/3 = These numbers use everyone in the cohort and the case-control study will only use a sample of 150 remaining without disease but as they will be sampled independently of exposure status the ratio of 2/3 also applies to any random sample of controls. Thus the OR in this example is much larger than the risk ratio and cannot be considered even an approximation of it. Using all prevalent non-cases in cohort, the OR would be: OR = 40/10 over 60/90 = 6.0 A random sample from cells b (60) and d (90) will give a ratio equal to 60/90 and therefore an OR = 6.0 vs. the true risk ratio = 4.0

35 Case-Control with prevalent controls
If controls are selected among those without disease at time of study (+ prevalent cases), the OR approximates risk ratio only with the rare disease assumption Rare disease assumption: if disease incidence low in unexposed and exposed (<10%), OR  RR Exposure in controls  exposure in whole cohort The rare disease assumption says that if the incidence of the disease under study is very low the OR from a case-control study even with sampling prevalent controls approximates the risk ratio. Although there is no absolute number, less than 10% is often given as a definition of what is meant by “rare.”

36 Case-Control with prevalent controls
Sampling only non-cases means sampling from N1 – E1, so if E1 is small, N1 isn’t changed much (and similarly for N0) Older texts describe case-control design as always requiring rare disease assumption to estimate risk ratio -- not the case If the disease only removes a few persons from the original cohort, the ratio of exposure in those remaining will stay close to the original ratio at baseline. It follows that estimating N0/N1 by using prevalent controls becomes increasing more valid as the number removed by the disease gets smaller.

37 4 96 1 99 OR using controls from prevalent non-cases
when incidence low cases controls Disease No disease 4 96 Exposed time No disease Disease Unexposed 1 99 Using the same hypothetical cohort numbers as before but now assuming that the incidence of disease was 2.5% (5 out of 200 developed disease), the OR is only slightly higher than the risk ratio for the simple reason that the ratio of exposure in the remaining non-cases is close to 1.0, which is what it was in the whole cohort at baseline. The somewhat arbitrary rule of thumb of incidence below 10% is sometimes given as what is meant by a “rare disease.” You can work out for yourself how the OR = 4.57 is arrived at if the incidence were 10% (hint: there will be 16 exposed cases and 4 unexposed cases). It might also be debatable whether 4.57 is a good approximation of 4.0, but the point should be clear. Using all prevalent non-cases in cohort would be OR = 4/1 over 96/99 = 4.13 A random sample from cells b (60) and d (90) will give a ratio equal to 96/99 and therefore an OR = 4.13 close to true risk ratio = 4.0 If the incidence had been 10% overall, OR = 4.57.

38 Caveat: Sampling non-cases may introduce bias even if disease is rare
Disease may remove few from study base sampled for controls, but other sources of loss to follow-up can bias control group Losses to follow-up and deaths among potential controls from the study base giving rise to the cases affect who is available at one point in time The rare disease assumption only looks at the effect of removing potential controls who are diagnosed with the outcome disease. Returning to the setting of a study base that gave rise to those cases over time, some members of the study base population at time zero will not be in the population of non-cases sampled at the end of the time when all the cases have been ascertained. Some will have left the study base or died, and these changes in the group of non-cases who are sampled can bias the estimate of exposure in the controls. Since no information is available on who left the study base with the prevalent controls design, the nature of this bias cannot be known. Thus, even though the rare disease assumption is met, the OR from this type of case-control sampling may give a biased estimate of the risk ratio.

39 Case-Control study with prevalent sampling
of rare disease Surveillance for toxic-shock syndrome (TSS) in Wisconsin detected cases with onsets from September 1975 through June 1980… 35 patients were matched for age and menstruation to 105 controls: 34 of 35 cases (versus 80 of 105 controls) used tampons during every menstrual period (OR = 10.6, p < 0.01)… In Wisconsin the minimum incidence of TSS as defined by clinical criteria is 6.2 cases per 100,000 menstruating women per year. Davis JP et al. N Engl J Med Dec 18;303(25): With an incidence rate this low sampling prevalent controls among all the women of Wisconsin is a very reasonable control group, assuming that nearly every case occurring in Wisconsin during those years was ascertained. With an incidence of An example of an extremely rare disease: only 6 per 100,000 person-years. The odds ratio of 10.6 is a very good approximation of the risk ratio in this instance.

40 Study Base: all Wisconsin women
Sampling prevalent controls from non-cases Study Base: all Wisconsin women Controls D 35 toxic shock cases The size of the population from which the controls were selected is so large relative to the number of cases that the rare disease assumption certainly applies. Nothing was known about whether losses to follow-up among the controls introduced bias, but those losses may also be small compared to the entire population of state. On the other hand, it took 5 years to accumulate the 35 cases of toxic shock, so it is not impossible that changes in the population of women in Wisconsin over 5 years might have introduced some bias although it is unlikely to be large. 105 controls among women without toxic shock

41 Effect of control sampling on measure of association estimated by OR
Design Sampling Measure of Association Prevalent non-cases at case-control single point in time odds ratio Case-cohort entire cohort at baseline risk ratio Incidence- non-cases at time density each case diagnosed rate ratio

42 Statistical penalties for sampling the study base
Case-control design obtains a sample of the denominator rather than entire denominator Introduces some sampling error Reduces precision of the risk ratio or rate ratio or odds ratio estimate because sample N is smaller than full study base N Loss of precision offset by large gain in cost and time of study

43 OR misuse in presenting results
Language like “X times as likely to” implies a comparison of probabilities, not odds acceptable only if incidence is low Language like “7% more likely” suggests an absolute risk difference, not a ratio (also a misuse of risk ratios) With high incidence of outcome, OR misrepresents the relative probabilities, so language should not imply probability Abstracts and press releases determine how results are received by public Regardless of the actual measure of association that was calculated by the study or what measure it validly estimates, most researchers want to use the language of probability or words that suggest probabilities when they discuss their results. As a result, it is rare to hear the appropriate language: “the odds of disease among those who had ever smoked were 3.6 fold greater than among those who never smoked.” The language of probability can be justified if the rare disease assumption is met although we would hope that you will generally avoid the prevalent controls study design. Preferable is to attempt to use either incidence density sampling of controls or a case-cohort design for a case-control study and to point out in the methods that the OR from your design is an unbiased estimated of the risk ratio or the rate ratio, depending on which design you have used. It then becomes legitimate to use the language of risk ratios or rate ratios in reporting your findings. Language like “7% more likely” sounds as if an absolute risk difference is being reported but in some instances is actually describing an odds ratio of This is quite misleading and should be avoided.

44 Summary: Common misunderstandings about case-control studies
They can only study one disease outcome Inference is not as valid as from a cohort “Rare disease assumption” is required for OR from case-control to estimate risk ratio Retrospective measurement is necessary in case-control studies

45 Summary: What is true about case-control studies
There are typically more opportunities for bias and misclassification in case-control studies than in cohort studies Relative ease with which they can be done has encouraged a lot of badly designed studies Low cost and shorter time should be an incentive to better, not worse, design

46 Case-control design recommendations
Look for a primary study base that can be clearly defined and has good case ascertainment Use incidence density or case-cohort sampling whenever possible incidence density more often possible Use measurements recorded prior to the diagnosis when possible (medical records, etc.) or perform biological measurements on stored specimens

47 2 x 2 table for correctly classifying professions
White Coat Yes No Necktie Yes No Physician Administrator Lab tech Epidemiologist Andrew Moss, Journal of Unpublishable Results, 1985


Download ppt "Main points to be covered"

Similar presentations


Ads by Google