Clinical Trial Data Validation Using SAS PROC SQL Jie Liu BIOS 524 Project 2003, Fall
Background Toxicity data are collected for NCI Sponsored clinical trials at MCC. Data are monitored by NCI/CTEP quarterly. Toxicity (Protocol, CourseNo, Patient, Toxicity (Toxicity Code), Grade, Attribution, DLT, etc) Standard (for Toxicity Code and Grade ) : CTCAE 3.0 It lists hundreds of toxicity types and each toxicity type needs to be graded differently.CTCAE 3.0
Background For example: Dry Skin: Grade can only be 1, 2, 3. 4 or 5 Fatigue: Grade can only be 1, 2, 3, 4. 5 Errors can be made by data entry person easily. Challenge: How to detect errors? Invalid value!
Sample Toxicity Data "Protocol","CourseNo","Patient ","CourseDate ","Toxicity ","Onset Date ","CDUS Tox Type Code ","Resolved ","AER Filed ","Grade ","Attribution ","Dose Limiting Tox? ","Serious ","Action ","Therapy ","Outcome ","A/N (obsolete) ", "MCC0001 ",1,"PT0001 ", ,"DIARRHEA ", , , ,"N",1,"4","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"FATIGUE ", , , ,"N",1,"4","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"HEMOGLOBIN ", , , ,"N",1,"3","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"HYPOGLYCEMIA ", , , ,"N",1,"1","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"LEUKOCYTES_(TOTAL_WBC) ",-6, , ,"N",2,"1","N",1,"1","1","1", , "MCC0001 ",1,"PT0001 ", ,"LEUKOCYTES_(TOTAL_WBC) ", , , ,"N",1,"3","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"LEUKOCYTES_(TOTAL_WBC) ", , , ,"N",2,"3","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"LYMPHOPENIA ",-6, , ,"N",1,"1","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"LYMPHOPENIA ", , , ,"N",2,"3","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"LYMPHOPENIA ", , , ,"N",2,"3","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"NEUTROPENIA ",-6, , ,"N",2,"1","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"NEUTROPENIA ", , , ,"N",3,"3","N",1,"5","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"PAIN OTHER-ACHES ", , , ,"N",1,"4","N",1,"1","1","1", , "MCC0001 ",1,"PT0001 ", ,"PAIN OTHER-ACHES ", , , ,"N",1,"4","N",1,"1","1","1", , "MCC0001 ",1,"PT0001 ", ,"PLATELETS ",-6, , ,"N",1,"1","N",1,"1","1","1",-2.000, "MCC0001 ",1,"PT0001 ", ,"PLATELETS ", , ,-6,"N",1,"3","N",1,"1","1","2",-2.000, "MCC0001 ",2,"PT0001 ", ,"CREATININE ", , , ,"N",1,"3","N",1,"1","1","1",-2.000, "MCC0001 ",2,"PT0001 ", ,"HYPOGLYCEMIA ", , , ,"N",1,"1","N",1,"1","1","1",-2.000
CTCAE 3.0 Common Terminology Criteria for Adverse Events Adverse Events Category, AE/Supra-ordinate Term, Select AE, MedDRA Preferred Term, MedDRA Code, Grade, Description "ALLERGY/IMMUNOLOGY","Allergic reaction/hypersensitivity (including drug fever)","","Hypersensitivity NOS"," ","1","Transient flushing or rash; drug fever <38 degrees C (<100.4 degrees F)" "ALLERGY/IMMUNOLOGY","Allergic reaction/hypersensitivity (including drug fever)","","Hypersensitivity NOS"," ","2","Rash; flushing; urticaria; dyspnea; drug fever >=38 degrees C (>=100.4 degrees F)" "ALLERGY/IMMUNOLOGY","Allergic reaction/hypersensitivity (including drug fever)","","Hypersensitivity NOS"," ","3","Symptomatic bronchospasm, with or without urticaria; parenteral medication(s) indicated; allergy-related edema/angioedema; hypotension" "ALLERGY/IMMUNOLOGY","Allergic reaction/hypersensitivity (including drug fever)","","Hypersensitivity NOS"," ","4","Anaphylaxis" "ALLERGY/IMMUNOLOGY","Allergic reaction/hypersensitivity (including drug fever)","","Hypersensitivity NOS"," ","5","Death" "ALLERGY/IMMUNOLOGY","Allergic rhinitis (including sneezing, nasal stuffiness, postnasal drip)","","Rhinitis allergic NOS"," ","1","Mild, intervention not indicated" "ALLERGY/IMMUNOLOGY","Allergic rhinitis (including sneezing, nasal stuffiness, postnasal drip)","","Rhinitis allergic NOS"," ","2","Moderate, intervention indicated" "ALLERGY/IMMUNOLOGY","Allergy/Immunology - Other (Specify, __)","","Not available"," ","1","Mild" "ALLERGY/IMMUNOLOGY","Allergy/Immunology - Other (Specify, __)","","Not available"," ","2","Moderate" "ALLERGY/IMMUNOLOGY","Allergy/Immunology - Other (Specify, __)","","Not available"," ","3","Severe" "ALLERGY/IMMUNOLOGY","Allergy/Immunology - Other (Specify, __)","","Not available"," ","4","Life-threatening; disabling" "ALLERGY/IMMUNOLOGY","Allergy/Immunology - Other (Specify, __)","","Not available"," ","5","Death" "ALLERGY/IMMUNOLOGY","Autoimmune reaction","","Autoimmune disorder NOS"," ","1","Asymptomatic and serologic or other evidence of autoimmune reaction, with normal organ function and intervention not indicated" "ALLERGY/IMMUNOLOGY","Autoimmune reaction","","Autoimmune disorder NOS"," ","2","Evidence of autoimmune reaction involving a non-essential organ or function (e.g., hypothyroidism)" "ALLERGY/IMMUNOLOGY","Autoimmune reaction","","Autoimmune disorder NOS"," ","3","Reversible autoimmune reaction involving function of a major organ or other adverse event (e.g., transient colitis or anemia)" "ALLERGY/IMMUNOLOGY","Autoimmune reaction","","Autoimmune disorder NOS"," ","4","Autoimmune reaction with life- threatening consequences" "ALLERGY/IMMUNOLOGY","Autoimmune reaction","","Autoimmune disorder NOS"," ","5","Death" "ALLERGY/IMMUNOLOGY","Serum sickness","","Serum sickness"," ","3","Present" "ALLERGY/IMMUNOLOGY","Serum sickness","","Serum sickness"," ","5","Death"
Solution A SAS program is developed to detect grade errors and run summary analysis. Files needed AEGrades.txt is CTCAE 3.0 standard and used to do data validation.AEGrades TX.txt is the file need to be checked. Constraints: Toxicity Code is from CTCAE 3.0 Grade is from CTCAE 3.0 and 0 < Grade <= 5
PROC SQL Structured Query Language (SQL). PROC SQL is a powerful Base SAS PROC combining the functionality of the DATA and PROC Steps into a single procedure. PROC SQL In some cases is a more efficient alternative to traditional SAS code.
SQL procedure enables you to Extract data from Data set Create a new data set Select unique values of one or more columns Subsetting and calculating And more…
Syntax PROC SQL; /*Begin SQL*/ … QUIT;/* End */ SQL commands
SQL Example Select Statement select * All variables in the table from toxicity where patient = ‘JL’; Create TABLE Statement create table newtox as select patient, tox_code, grade, attribution from toxicity; Table name Field name New table name Table name
SQL Example select distinct '1' as ErrorID, Patient, CourseNo, CDUS_Tox_Type_Code as Tx_Code, Grade, 'Grade must not be blank' as ERRMSG from NewAE where Grade is NULL union select distinct '2' as ErrorID, Patient, CourseNo, CDUS_Tox_Type_Code as Tx_Code, Grade, 'Grade must not be greater than 5' as ERRMSG from NewAE where Grade > 5 and Grade is not NULL Combine the results of two SQL commands together
SAS Output Note: Patient, CourseNo, Tx_Code and Grade together is primary key so we are able to tell the error’s location by the primary key.
Summary Analysis proc tabulate data=NewAeTx; class Toxicity Attribution Grade; table Toxicity*Attribution, Grade; title “Toxicity Summary"; format Attribution Attrifmt.; run;
SAS Output
Questions?