Cervical Cancer Case Study Supervising Professor: Dr. P.D.M. Macdonald Team Members: Christine Calzonetti, Simo Goshev, Rongfang Gu, Shahidul Mohammad Islam, Amanda Lafontaine, Marcus Loreti, Maria Porco, William Volterman, Qihao Xie -McMaster University-
To determine which of the documented variables are useful for predicting recurrence of the disease To evaluate the extent to which tumor size, in particular, predicts the recurrence of the disease Objectives:
Graphical Analysis
The majority of patients observed were between the ages of 35 and 50 No significant difference between relapse and non-relapse patients MeanMedian Standard deviation Non-relapse Relapse
Similar means Dissimilar boxplots possibly due to outliers Missing values in the relapse group may have affected the outcome MeanMedian Standard deviation Non-relapse Relapse
A great disparity exists between the means and variability of relapse and non-relapse patients Relapse patients had larger tumor sizes upon diagnosis, suggesting that tumor size should be considered an important prognostic factor MeanMedian Standard deviation Non-relapse Relapse
The difference in pie charts indicates that there are more cancerous cells found in the lymph nodes of patients who relapsed The statistical significance is unclear
Relapse patients had a greater quantity of cells deemed “worse”
Recorded at the time of follow up appointment (therefore cannot be used as a diagnostic factor) Most non-relapse patients have no presence of disease at last follow up appointment In relapse patients, approx. ½ died of disease, ¼ are alive with disease, ¼ are alive with no evidence of disease
Results and Conclusions
Survival Plot of Cervical Cancer Data Survival plot of data indicates that most relapses occur during the first three years after surgery, it is highly unlikely that relapse will occur after eight years The exponential curve deviated away from the survival curve at the tail end due to the patients who will never relapse
Time Small (0-10mm) Medium (11-30mm) Large (30+mm) Recurrence time for large group considerably lower than medium Clear distinction between medium and small The patients in the different size groups had noticeably different mean times to recur
Survival Analysis yielded the following results: Significant difference between medium and small groups Significant difference between large and small groups Same results found using Weibull distribution in place of exponential distribution A survival analysis of the data on S-Plus where the exponential distribution was assumed produced the following output: Value Std. Error z p (Intercept) e+000 cutsize e-005 cutsize e-011
A step-wise regression analysis of the data on S-Plus where the exponential distribution was assumed produced the following output: Value Std. Error z p (Intercept) e-042 cutsize e-001 cutsize e-002 lymph e-002 depth e-006 grad e-002 age e-001 rad e-001 Regression Analysis yielded the following results: Initial variables: Final variables: Value Std. Error z p (Intercept) e-098 cutsize e-001 cutsize e-002 lymph e-002 depth e-006 grad e-003
Initial analysis showed that possible prognostic factors were Size, Lymph Nodes, Tumor Depth and Cell Grade Cox’s Proportional Hazard reaffirmed that Size, Depth and Cell Grade were important diagnostic factors, but Lymph Nodes are only significant at the 10% level