Avoiding bias in RCTs David Torgerson Director, York Trials Unit

Slides:



Advertisements
Similar presentations
Postgraduate Course 7. Evidence-based management: Research designs.
Advertisements

Patient Preference and Comprehensive Cohort Designs.
Unequal Randomisation
Use of Placebos in Controlled Trials. Background The traditional ‘double-blind’ RCT uses a placebo to conceal allocation. There are a number of advantages.
A Guide to Education Research in the Era of NCLB Brian Jacob University of Michigan December 5, 2007.
Sample size issues & Trial Quality David Torgerson.
Observational Studies and RCT Libby Brewin. What are the 3 types of observational studies? Cross-sectional studies Case-control Cohort.
Experimental evaluation in education Professor Carole Torgerson School of Education, Durham University, United Kingdom International.
Adapting Designs Professor David Torgerson University of York Professor Carole Torgerson Durham University.
Critical Appraisal: Epidemiology 101 POS Lecture Series April 28, 2004.
KINE 4565: The epidemiology of injury prevention Randomized controlled trials.
Conference for EEF evaluators: Building evidence in education Hannah Ainsworth, York Trials Unit, University of York Professor David Torgerson, York Trials.
The use of administrative data in Randomised Controlled Trials (RCT’s) John Jerrim Institute of Education, University of London.
Why to Randomize a Randomized Controlled Trial? (and how to do it) John Matthews University of Newcastle upon Tyne.
Improving the Economic Efficiency of Clinical Trials.
Sources of Bias in Randomised Controlled Trials
What makes a good quality trial? Professor David Torgerson York Trials Unit.
Reading the Dental Literature
Sources of Bias in Randomised Controlled Trials. REMEMBER Randomised Trials are the BEST way of establishing effectiveness.
Recruitment to Trials. Background Recruitment of participants is a VERY important issue. The general consensus is that most trials under recuit.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Journal Club Alcohol and Health: Current Evidence May–June 2005.
Experimental Design.
Introduction to the Design and Analysis of Trials can be found on: Before and After Studies: A Reminder.
Non-Experimental designs: Developmental designs & Small-N designs
N = 1, Cross-Over Trials and Balanced Designs. N = 1 Trials Trials can be undertaken with just one participant. If the condition is a chronic relapsing.
Random and Quasi-random Allocation. Background Surprisingly many researchers do not understand the concept of random allocation. For example, a Professor.
Allocation Methods David Torgerson Director, York Trials Unit
Design Issues: Policy Trials Professor David Torgerson Director, York Trials Unit
Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2010.
Pre-randomisation consent (Zelen’s method)
Statistical Thinking Experiments in the Real World
Cohort Studies Hanna E. Bloomfield, MD, MPH Professor of Medicine Associate Chief of Staff, Research Minneapolis VA Medical Center.
Bootstrapping applied to t-tests
The World Bank Human Development Network Spanish Impact Evaluation Fund.
8.0 Assessing the Quality of the Evidence. Assessing study quality or critical appraisal ► ► Minimize Bias ► ► Weight for Quality ► ► Assess relationship.
Rigour of evaluation Dr Carole Torgerson Senior Research Fellow Institute for Effective Education University of York.
1 Developing evidence-based products using the systematic review process Session 4/Unit 10 Assessing study quality Carole Torgerson November 13 th 2007.
Building Evidence in Education: Conference for EEF Evaluators 11th July: Theory 12th July: Practice
Discussion Gitanjali Batmanabane MD PhD. Do you look like this?
Quantitative Research Designs
Randomised controlled trials Peter John. Causation in policy evaluation Outcome Intervention Other agency actions External environment.
EBD for Dental Staff Seminar 2: Core Critical Appraisal Dominic Hurst evidenced.qm.
Copyright © 2010 Pearson Education, Inc. Chapter 13 Experiments and Observational Studies.
Experiments and Observational Studies. Observational Studies In an observational study, researchers don’t assign choices; they simply observe them. look.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 13 Experiments and Observational Studies.
ARROW Trial Design Professor Greg Brooks, Sheffield University, Ed Studies Dr Jeremy Miles York University, Trials Unit Carole Torgerson, York University,
Reading and reporting evidence from trial-based evaluations Professor David Torgerson Director, York Trials Unit
H860 Reading Difficulties Week 7 Reading Interventions: How Do They Weigh Up?
Programme Information Incredible Years (IY)Triple P (TP) – Level 4 GroupPromoting Alternative Thinking Strategies (PATHS) IY consists of 12 weekly (2-hour)
Clinical Writing for Interventional Cardiologists.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Adaptive randomization
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
Critical appraisal of randomized controlled trial
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Impact of two teacher training programmes on pupils’ development of literacy and numeracy ability: a randomised trial Jack Worth National Foundation for.
Compliance Original Study Design Randomised Surgical care Medical care.
EVALUATING u After retrieving the literature, you have to evaluate or critically appraise the evidence for its validity and applicability to your patient.
Lecture 2: Evidence Level and Types of Research. Do you recommend flossing to your patients? Of course YES! Because: I have been taught to. I read textbooks.
Producing Data: Experiments BPS - 5th Ed. Chapter 9 1.
CONSORT 2010 Balakrishnan S, Pondicherry Institute of Medical Sciences.
Making Randomized Clinical Trials Seem Less Random Andrew P.J. Olson, MD Assistant Professor Departments of Medicine and Pediatrics University of Minnesota.
Unit 8.  Program improvement or appraisal  Assessing the value of a program  Measuring the efficacy of particular components of a program  Meeting.
1 Study Design Imre Janszky Faculty of Medicine, ISM NTNU.
Experiments Textbook 4.2. Observational Study vs. Experiment Observational Studies observes individuals and measures variables of interest, but does not.
Journal Club Curriculum-Study designs. Objectives  Distinguish between the main types of research designs  Randomized control trials  Cohort studies.
Randomized Trials: A Brief Overview
Research Designs, Threats to Validity and the Hierarchy of Evidence and Appraisal of Limitations (HEAL) Grading System.
Challenges of statistical analysis in surgical trials
Presentation transcript:

Avoiding bias in RCTs David Torgerson Director, York Trials Unit

A reminder Randomised trials are needed for 4 reasons: »Avoiding selection bias; »Controlling for temporal changes; »Controlling for regression to the mean; »Basis for statistical inference.

Background Randomisation, when undertaken properly, prevents selection bias. Selection bias occurs when participants are allocated in such a way that allocation correlates with outcome. Selection bias is one of the main threats to validity that randomisation seeks to avoid. However, forms of selection and other sources of bias can still undermine the validity of a RCT.

Non-random methods: Alternation Alternation is where trial participants are alternated between treatments. EXCELLENT at forming similar groups if alternation is strictly adhered to. Problems because allocation can be predicted and lead to people withholding certain participants leading to selection bias.

Non-Random Methods Quasi-Alternation Dreadful method of forming groups. This is where participants are allocated to groups by month of birth or first letter of surname or some other approach. Can lead to bias in own right as well as potentially being subverted.

Example: Quasi alternation “Before mailing, recipients were randomized by rearranging them in alphabetical order according to the first name of each person. The first 250 received one scratch ticket for a lottery conducted by the Norwegian Society for the Blind, the second 250 received two such scratch tickets, and the third 250 were promised two scratch tickets if they replied within one week.”(Finsen and Storeheier, Biomed Central 2006)

Randomisation Some text books typically suggest the use of random number tables or coin tossing to allocate participants.

What is wrong with that? Coin tossing there is no ‘audit’ trail. We have to take your word for what you did. Random number tables are ‘open’ and again we have to take the researcher’s word that they did what the said. A separate computer allocation system from a third party is best.

Simple or restricted? Simple allocation is best when the number of units to be allocated >50. Because it is always unpredictable it is difficult to subvert or sabotage. Restricted randomisation has statistical advantages for <50 (e.g., cluster trial of 20 schools/classes would benefit from using a restricted technique such as blocking or minimisation).

Will people subvert the allocation? Schulz [1] has described, anecdotally, a number of incidents of researchers subverting allocation by looking at sealed envelopes through x-ray lights. Researchers have confessed to breaking open filing cabinets to obtain the randomisation code. In a survey [2] of 25 researchers 4 admitted to keeping ‘a log’ of previous allocations to try and predict future allocations. [1] Schulz JAMA 1995;274:1456. [2] Brown et al. Stats in Medicine, 2005,24:3715.

Case Study Subversion is rarely reported for individual studies. One study where it has been reported was for a large, multicentred surgical trial. Participants were being randomised to 5+ centres using opaque, sequentially numbered, sealed envelopes.

Case-study (cont) After several hundred participants had been allocated the study statistician noticed that there was an imbalance in age. This age imbalance was occurring in 3 out of the 5 centres. Independently 3 clinical researchers were subverting the allocation.

Mean ages of groups Kennedy & Grant. 1997;Controlled Clin Trials 18,3S,77-78S

Recent Blocked Trial “This was a block randomised study (four patients to each block) with separate randomisation at each of the three centres. Blocks of four cards were produced, each containing two cards marked with "nurse" and two marked with "house officer." Each card was placed into an opaque envelope and the envelope sealed. The block was shuffled and, after shuffling, was placed in a box.” Kinley et al., BMJ :1323.

What is wrong here? SouthamptonSheffieldDoncaster DoctorNurseDoctorNurseDoctorNurse Kinley et al., BMJ 325:1323.

Problem? If block randomisation of 4 were used then each centre should not be different by more than 2 patients in terms of group sizes. Two centres had a numerical disparity of 11. Either blocks of 4 were not used or the sequence was not followed.

Randomisation summary Very important to avoid possible problems of subversion: »Who do you trust? »Need independent allocation, third party, need to be convincing.

Selection bias after randomisation Selection bias is avoided if ALL participants who are randomised are completely followed up. Often there is some attrition – after randomisation some refuse to continue to take part. Or some may refuse the intervention but can still be tracked – IMPORTANT to distinguish between these.

What is wrong here? Random allocation 160 children 8 from Each school 76 children allocated to control 76 allocated to intervention group 1 school 8 children withdrew N = 17 children replaced following discussion with teacher

Selection bias POSSIBLE selection bias has been introduced in this trial. About 10% of the sample have been non- randomly allocated to the interventions. What should have happened? The authors should have retained the 17 refusers in an intention to teach (treat) analysis.

Ascertainment bias This occurs when the person reporting the outcome can be biased. A particular problem when outcomes are not ‘objective’ and there is uncertainty as to whether an event has occurred. Example, of homeopathy study of histamine, showed an effect when researchers were not blind to the allocation but no effect when they were. Multiple sclerosis treatment appeared to be effective when clinicians unblinded but ineffective when blinded.

Avoiding ascertainment bias Post-tests should be administered by someone who is unaware of the group allocation. Pre-tests before allocation is made. Record searches (e.g., arrest records) should be done by a researcher blind to the participant’s group allocation.

Resentful Demoralisation This can occur when participants are randomised to treatment they do not want. This may lead to them reporting outcomes badly in ‘revenge’. This can lead to bias.

Resentful Demoralisation One solution is to use a participant preference design where only participants who are ‘indifferent’ to the treatment they receive are allocated. This should remove its effects.

Example of preference interaction SPRINTER was an RCT of treatments for neckpain. Two treatments: a Brief Intervention (1-2 sessions with a physio using CBT) vs usual care (5+ sessions). BEFORE randomisation we asked patients their treatment preference.

SPRINTER Preferences In SPRINTER preferences were mixed »53% did not have a preference; »16% wanted brief intervention; »31% wanted usual care. ALL patients were randomised IRRESPECTIVE of their preference.

Patient Flow and 12 month Results - SPRINTER Overall 12 month improvement Overall 12 month improvement

SPRINTER Results

Examples of educational trials As part of a collaboration between educational researchers and trial methodologists we have completed two RCTs in education. We tried to conduct them according to health care standards (i.e., CONSORT). »Computers to teach spelling to children; »Incentives to retain adult learners in evening classes.

Computer trial Since 1997 the Government has boasted that > £1.7 billion of taxpayers money has been put into equipping schools with computer technology. Very few RCTs have been undertaken to evaluate and substantiate this investment. We supported the largest RCT of computer technology ever undertaken in the UK.

Methodological challenges Contamination children in the control group may access the technology. »Deliver intervention on a lap top. Secure randomisation. »Randomisation done by York Trials Unit – series of blocked allocation to match availability of laptops (pre-test data did not inform allocation). Avoiding ascertainment bias. »Post-tests given to all children in same room by invigilators blind to allocation, test marked by marker blind to group allocation. Pre-tests done blindly. Resentful demoralisation. »Waiting list control – control children to get intervention at end of term.

Methods Based in one school. Lack of funding meant that it couldn’t be extended elsewhere. All year 7 pupils were randomised to receive literacy instruction via computer or not. Sample size (n = 157) allowed us to have > 80% power to show 0.5 of effect size between groups. Sample size fixed we couldn’t increase it, if we had wanted. Waiting list design all pupils received intervention half at the beginning of term half later in the term (normal practice was to arbitrarily assign pupils).

Participant Flow in Trial

Results VariableICT Group Means (SD) No ICT Group Means (SD) Adjusted Difference (Intervention – Control) (95% CI) P value of adjusted Difference Spelling38.57 (18.00) (18.33) (-1.83, 3.74) Reading18.97 (6.11) (6.64) (-0.96, -3.71) Brooks et al, Ed Studies 2006

Computers and Literacy In this study we found no benefit of using computers for teaching literacy skills. A larger recently completed trial in the USA (n = 512) found similar results – no benefit. > $90 million had been spent implementing Fast forword before it was found to be ineffective. Rouse et al, 2004; NBER Working Paper Series,

Incentives Poor attendance to adult literacy and numeracy classes is a major problem. Recent government policy has offered either financial incentives to attend or financial penalties (reduction in benefit) if attendance is poor. Neither approach has been tested in an RCT. With colleagues in Sheffield we undertook a RCT of incentives.

Methodological challenges Incentive may lead to resentful demoralisation if we randomised individuals. »Cluster or class allocation of 28 classes. Small number of units of allocation may result in chance bias or numerical imbalance. »Used a minimisation algorithm to achieve balance on observed variables.

Design Cluster or class based RCT. 28 classes were randomised to either receive £5 per class attended or no incentive. Outcome was number of sessions attended.

VariableControl (n = 70) Intervention (n = 82) Mean (SD) number of sessions attended 6.69 (2.71) 5.28 (2.79) Mean (SD) post-test literacy scores (8.84) (8.68) Significant reduction of about 1.5 sessions (95% confidence interval 0.28, 2.79; p = 0.019) (adjusted for clustering, cluster size and pre-test literacy scores

Incentives Small £5 incentives are counter productive and reduce attendance. Larger incentives may work but we need to evaluate these in a large RCT with educational achievement as main outcome.

Differences between health and education Education trials are much easier and quicker. Pre-test variables strongly predict post- test, which usually isn’t the case in health care trials, consequently pre-test measurement is particularly useful to gain more statistical power.

Trial designs Unequal allocation »Good to use when resources limit us to the amount of one intervention (e.g., mentoring sessions). Cluster designs »Avoiding contamination, care needs to be taken on using ITT and recruitment bias. Factorial designs »Two trials for the price of one – good value! Balanced design »Controls for Hawthorne effect (e.g., Intervention group gets maths, control group gets English and 2 trials for the price of one).

Factorial Design Computer teachingNo computers and no assistants. Computer teaching Class room assistants

Challenges Follow-up – this is crucial to maximise this. Schools are good children are there and apart from (random?) absences most will be present for post-test Adults may need encouragement (e.g., possible incentives to complete follow-up) or statutory data collection (e.g., arrests) allows ITT to be completed. Convincing other researchers that RCTs are feasible and worth doing.

A few fallacies We can’t accurately measure an important variable (e.g., social class) that really predicts outcome – our trial will be biased. »No it won’t, good measurement of confounders may be helpful but randomisation will cancel their effects out. Non compliance means randomisation is not possible. »Poor compliance will dilute the treatment effects, however, using ITT analysis will give a public policy estimate of effect and other statistical techniques (e.g., CACE) may adjust for non-compliance.

Can trials overcome strong beliefs? Often not. RCT of 25 schools to be given an enhanced sex education programme – aim to reduce abortions among teenage women. New package cost £900 per teacher compared with £20 for conventional package.

Results Termination rate among intervention schools was higher (15.7 per 1,000 women – 95% CI to 42.1, p = 0.26). Difference not statistically significant, but point estimate is a 14% increase risk of terminations.

What did the authors conclude? “High quality sex education should be continued, but to reduce unwanted pregancies complementary, longer term interventions that address socioeconomic inequalities and the influence of parents should be developed and rigorously evaluated”. How does this recommendation derive from the data?

Summary Trials have challenges – some simple mistakes can result in a trial losing its internal validity. Many of the problems of trials can be overcome with careful thought and preparation.