Thoughts on Simplifying the Estimation of HIV Incidence John Hargrove, Alex Welte, Paul Mostert [and others]
Estimates of incident (new) cases are important in the assessment of changes in an epidemic, identifying “hot spots” and in gauging the effects of interventions HIV incidence most accurately estimated via longitudinal studies – but these are lengthy, expensive, logistically challenging. Do provide a “gold standard” against which to judge other estimates of HIV incidence
An alternative way of estimating incidence, involving none of the disadvantages of a longitudinal study, would be to use a single chemical test that can be used to estimate the proportions of recent vs long- established HIV infections in cross- sectional surveys
Idea: identify HIV test where measured outcome not simply +/- but rather a graded response increasing steadily over a long period
One such assay is the BED-CEIA developed by CDC Graph shows result for a seroconverting client taken from the ZVITAMBO study carried out in Zimbabwe [14,110 post partum women followed up at 6-wk, 3-mo, then every 3-mo to two years]
The idea is to calibrate the BED assay to estimate the “average” time [or “window”] taken for a person’s BED optical density [OD] to increase to a given OD cutoff In cross-sectional surveys proportion of HIV positive people with BED < cut-off allows us to calculate the proportion of new infections – and thus the incidence. Estimation of the window period is thus central to the successful application of the BED
Data from commercial seroconversion panels with accurately known times of seroconversion indicate Problem 1. Delay (~25 days) between sero- conversion and the onset of then increase in BED optical density
Sero- negative Baseline OD = Extrapolated time when OD = baseline Date of seroconversion Date of infection 22 '' 1 Window period ( ) Sero- positive
Problem 2: C onsiderable variability between clients in a real population. No prospect of using BED to identify individual recent infections. Idea only to estimate population incidence
Problem 3: Often have limited follow-up: of 353 seroconverters in ZVITAMBO, 167 only produced a single HIV positive sample, Samples per client (S) Frequency
Problem 4: The available data for a given client quite often do not span the OD cut- off. The proportion that fail to do so varies with the chosen cut-off. Failure to span increases the uncertainty in estimating the time at which the OD cut-off is crossed Problem 5: There is a large variation (27 – 656 days) in the time (t 0 ) elapsing between last negative and first positive HIV tests. The degree of uncertainty in the timing of seroconversion increases with increasing t 0
We need to consider how variation in samples per client, t0, and failure to span the cut- off affect our estimate of the window period.
How to approach problem? Scatter-plot of the data? Makes no use of the information of the trend for individual clients and ignores the fact that the sequential points for that client are not independent.
Alternative which uses trend in BED OD is suggested by an approximately linear relationship between square root of OD and time-since-last- negative HIV test (t). Allows a regression approach taking out variance due to t and to difference between clients
Gives consistent results; in that results independent of whether we insist on minimum of 3, 4 or 5 samples per client; and on value of t 0 between 75 and 180 days
Are we even using the right transformation? And should we be using the time of last negative HIV test as the origin Try instead to do a preliminary estimate of the time when OD starts to increase by fitting a quadratic polynomial to the data. Then use this estimate as the origin.
Seems to suggest that the true relationship may actually be a power function. What it really were? What would we see if we plotted OD vs time since-last negative
Our problem is that we do not know when seroconversion occurred. We only know the time of the last HIV negative test. And the greater the delay between last negative and first positive tests the greater the uncertainty Examples of times when HIV -ve tests might have been taken
For zero offset the window is UNDER-estimated; for 100-day offset it is OVER-estimated
This approach to window estimation is clearly not optimal since the window estimate changes with the timing of the last HIV-negative test But can we do any better?
If OD increases as a power function fit: or equivalently where a and b are constants, t is the time since the last negative and t 0 is the estimated time of seroconversion. We use the data to estimate a, b and t 0 by non- linear regression
For the generated data [without noise] this approach gives the correct window – regardless of the time of the last negative test But for real data in 40% of 61 cases the time of seroconversion was estimated to be before the time of the last negative test or after the time of the first positive. [Work in progress]
Turnbull survival analysis different approach suggested by Paul Mostert (Stellenbosch Statistics Department). This is a slightly more sophisticated variant of the Kaplan Meier survival analysis. Works on the basis that the (unknown) times of: i) seroconversion ii) OD cut-off each lie between two known times The times of the two events are quantified using interval censoring
Turnbull window estimates Runs All data (red; 183 d) 2: Excluding max OD 0.8 (green; 210 d) 4: Excluding 2 and 3 (blue; 163 d)
The window length is estimated using a non-parametric survival technique which makes no assumptions about any parametric models and underlying distributions.. No interpolation is used to obtain the cut-off time where the BED OD reaches 0.8 or the seroconversion time point. Only time points that will define the interval boundaries were used, which means that time points more than four for a specific women were not fully utilised. However, time points as few as two per women could be used in this estimation of window length.
Conclusion There is still no general agreement on how best to estimate the window for methods like the BED. Fortunately most of those described seem to give fairly similar answers – though it’s not clear to what extent this is happening by chance.