1 Xiaomei Yao, R. Brian Haynes, Nancy L. Wilczynski Health Information Research Unit, McMaster University Optimal sample sizes for testing search strategies.

Slides:



Advertisements
Similar presentations
Sampling techniques & sample size
Advertisements

BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Gall C, Katch A, Rice T, Jeffries HE, Kukuyeva I, and Wetzel RC
Finding the Best Evidence Literature for Evidence Based Health Care.
Department of Child and Adolescent Health and Development Identifying topics on TB treatment for literature review In collaboration with International.
Evidence-Based Medicine Week 3 - Prognosis Department of Medicine - Residency Training Program Tuesdays, 9:00 a.m. - 11:30 a.m., UW Health Sciences Library.
1 Lauren E. Finn, 2 Seth Sheffler-Collins, MPH, 2 Marcelo Fernandez-Viña, MPH, 2 Claire Newbern, PhD, 1 Dr. Alison Evans, ScD., 1 Drexel University School.
15 de Abril de A Meta-Analysis is a review in which bias has been reduced by the systematic identification, appraisal, synthesis and statistical.
Chapter 8 Estimation: Additional Topics
1 A heart fills with loving kindness is a likeable person indeed.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
Journal Club Alcohol and Health: Current Evidence September–October 2004.
Vanderbilt Sports Medicine Chapter 4: Prognosis Presented by: Laurie Huston and Kurt Spindler Evidence-Based Medicine How to Practice and Teach EBM.
Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2010.
Variables and Measurement (2.1) Variable - Characteristic that takes on varying levels among subjects –Qualitative - Levels are unordered categories (referred.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
How good are your testers? An assessment of testing ability Liang Huang, Chris Thomson and Mike Holcombe Department of Computer Science, University of.
Multiple Choice Questions for discussion
Minimal Test Collections for Retrieval Evaluation B. Carterette, J. Allan, R. Sitaraman University of Massachusetts Amherst SIGIR2006.
8.1 Inference for a Single Proportion
Statistics: Concepts and Controversies What Is a Confidence Interval?
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Developing a web-based international register.
Nursing 386. Your Assignment:  Summarize two research articles that address the clinical issue. Acquire these articles by searching various databases.
Improving search efficiency for economic evaluations in major databases using semantic technology Julie Glanville, Carol Lefebvre, Pamela Negosanti, Bill.
Keller and Ozment (1999)  Problems of driver turnover  Costs $3,000 to $12,000 per driver  Shipper effect  SCM impact  Tested solutions  Pay raise.
Basic Sampling & Review of Statistics. Basic Sampling What is a sample?  Selection of a subset of elements from a larger group of objects Why use a sample?
Population and Sampling
Mrs. Watcharasa Pitug ID The Association between Waist Circumference and Renal Insufficiency among Hypertensive Patients 20/10/58 1.
Clinical Writing for Interventional Cardiologists.
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
Nonparametric Statistics
Understanding Sampling
Wipanee Phupakdi, MD September 15, Overview  Define EBM  Learn steps in EBM process  Identify parts of a well-built clinical question  Discuss.
1 How to find literature - A very short introduction - How to start smart Students IIC/IID Medical Library, August 2013.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Statistical planning and Sample size determination.
Confidence intervals. Want to estimate parameters such as  (population mean) or p (population proportion) Obtain a SRS and use our estimators, and Even.
Chapter 7 Statistical Inference: Estimating a Population Mean.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size for Precision or Power.
Stats 4-17  Previous data indicate that 17% of trolls are bifungal (two mushroom hands). Wilfrax Margarn wants to create 95% a confidence interval for.
EBM --- Journal Reading Presenter :呂宥達 Date : 2005/10/27.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
Compliance Original Study Design Randomised Surgical care Medical care.
EVALUATING u After retrieving the literature, you have to evaluate or critically appraise the evidence for its validity and applicability to your patient.
Evidence-Based Information Retrieval and Resources GEMP 2 © Dr Glenda Myers WHSL 18 th March 2008.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
How to Conduct a Meta-Analysis Arindam Basu MD MPH About the Author Required Browsing.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Date of download: 5/28/2016 From: Benefits and Harms of Computed Tomography Lung Cancer Screening Strategies: A Comparative Modeling Study for the U.S.
Article Title Resident Name, MD SVCH6/13/2016 Journal Club.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Can't Type? press F11 Can’t Hear? Check: Speakers, Volume or Re-Enter Seminar Put ? in front of Questions so it is easier to see them. 1 Welcome to Unit.
Meta-analysis Overview
EBM R1張舜凱.
INTRODUCTION AND DEFINITIONS
A Quick Review.
45 Nursing: A Concept-Based Approach to Learning
Essential Statistics Inference about a Population Proportion
How HIRU Might Support Chronic Pain Research
Interval Estimation.
Bootstrap Confidence Intervals using Percentiles
Empirically Derived Search Filters for the Development of Kidney Disease Clinical Practice Guidelines  Pavel S. Roshanov, MD, MSc, Arthur V. Iansavichus,
on EBSCOhost research databases
SURVEY & SAMPLING DESIGN
Tuberculosis control interventions targeted to previously treated people in a high- incidence setting: a modelling study  Florian M Marx, MD, Reza Yaesoubi,
Title Goes Here Title Goes Here Title Goes Here Title Goes Here
Lack of Confidence Interval Reporting in Dermatology: A Call to Action
Essential Statistics Inference about a Population Proportion
Ovid User Training -Medline-
Presentation transcript:

1 Xiaomei Yao, R. Brian Haynes, Nancy L. Wilczynski Health Information Research Unit, McMaster University Optimal sample sizes for testing search strategies for retrieving studies of treatment, diagnosis and prognosis from MEDLINE

2 Background MEDLINE covers 4800 journals and contains over 13 million articles in life sciences. Clinicians and researchers need to retrieve the target articles quickly.

3 Background con’d Unique optimal search filters (“hedges”) have been developed, based on the 161 core clinical journals published in 2000 from MEDLINE. ( ic/clinical.html). ic/clinical.html Determining optimal sample sizes for testing search strategies for retrieving studies of treatment, diagnosis and prognosis from MEDLINE.

4 Categories TypesOvid strategiesSensitivities (%)Specificities (%) TreatmentBest sensitivityclinical trial.mp. OR clinical trial.pt. OR random:.mp. OR tu.xs Best specificityrandomized controlled trial.mp. OR randomized controlled trial.pt Optimizationrandomized controlled trial.pt. OR randomized.mp. OR placebo.mp DiagnosisBest sensitivitySensitive:.mp. or diagnos:.mp. or di.fs Best specificitySpecificity.tw OptimizationSensitive:.mp. or predictive value:.mp. or accuracy:.tw PrognosisBest sensitivityIncidence.sh. OR exp mortality OR follow-up studies.sh. OR prognos:.tw OR predict:.tw. OR course:.tw Best specificityprognos:.tw. OR first episode.tw. OR cohort.tw Optimizationprognosis.sh. OR diagnosed.tw. OR cohort:.mp. OR predictor:.tw. OR death.tw. OR exp models, statistical Table 1 - Best search strategies for 3 categories retrieval from MEDLINE

5 Method The search terms were treated as “diagnostic tests” and hand searching as a “gold standard” articles were categorized as “pass” or “fail” studies for clinical topic areas. Precision analysis was used to calculate the sample size of “pass articles” needed to estimate sensitivity with 95% confidence intervals of ±0.01, ±0.02, ±0.025, ±0.03, ±0.04, ±0.05, ±0.10 (margin of error [ME]).

6 Method con’d The lowest sensitivity among 3 strategy types was chosen. Random samples of journals for size 100, 50, 30, 20, 10, etc. were generated. The order of the 161 journals was ranked in terms of the number of “pass articles”. STATA 9.0 was used as the statistical software.

7 Results sample size calculations based on search sensitivity were shown in Table 2. Table 3 shows the numbers of “pass articles” for the 3 purpose categories in different random samples of journals from the 161 database.

8 CategoriesME = ±0.01 ME = ±0.02 ME = ±0.025 ME = ±0.03 ME = ±0.04 ME = ±0.05 ME = ±0.10 Treatment Diagnosis Prognosis Table 2 – Sample size needed for different margins of error (MEs)

9 Random journal subsets from 161N for treatment (ME)N for diagnosis (ME)N for prognosis (ME) (±0.02)136 (±0.01)159 (±0.01) (±0.02)109 (±0.01)150 (±0.01) (±0.02)94 (±0.01)153 (±0.01) (±0.02)93 (±0.01)110 (±0.01) (±0.02)86133 (±0.01) (±0.02)83128 (±0.01) (±0.02)70125 (±0.01) (±0.02) (±0.025) (±0.02) (±0.02) (±0.025) (±0.03) (±0.04) (±0.04) (±0.05) (±0.05) (±0.10) (±0.10)01 Table 3 - The Numbers of “pass articles” (N) with their achieved MEs for random journal subsets

10 Results con’d Based on table 2 and table 3, we can find the smallest journal subset for any ME. The scatterplot of various MEs against random journal sample sizes for treatment is shown in Figure. For the diagnosis and prognosis categories, we only can achieve a ME = ±0.10 because of low prevalence of “pass articles”.

11 Figure

12 Results con’d We tested the previous optimal search strategies for the treatment category in different random journal subsets (Table 4). 1, 2, and 6 top journals can achieve a ME = ±0.10, ±0.05, and ±0.025 for treatment, respectively. The performances of the strategy in 1, 2, and 6 top journal subsets were very similar to the result of the 161 journals.

13 Ovid search strategies Sensitivities (%) of 100; 45; 15; 10 journal subsets, 161 journals (95% CI) Specificities (%) of 100; 45; 15; 10 journal subsets, 161 journals (95% CI) Precisions (%) of 100; 45; 15; 10 journal subsets, 161 journals (95% CI) Accuracies (%) of 100; 45; 15; 10 journal subsets, 161 journals (95% CI) Absolute differences: 100−161, 45−161, 15−161, Absolute differences: 100−161, 45−161, 15−161, Absolute differences: 100−161, 45−161, 15−161, Absolute differences: 100−161, 45−161, 15−161, Best sensitivity clinical trial.mp. OR clinical trial.pt. OR random:.mp. OR tu.xs (98.4, 99.6), 99.0 (98.0, 100.0), 98.4 (96.7, 100.2), (100.0, 100.0), 99.2 (98.7, 99.8) 71.5 (71.0, 71.9), 66.6 (65.7, 67.5), 69.3 (68.1, 70.5), 75.1 (73.6, 76.6), 70.4 (69.8, 70.9) 9.8 (9.2, 10.4), 9.2 (8.4, 10.1), 9.4 (8.1, 10.7), 9.9 (7.9, 11.9), 9.9 (9.3, 10.5) 72.3 (71.8, 72.8), 67.7 (66.8, 68.5), 70.2 (69.0, 71.3), 75.8 (74.3, 77.3), 71.3 (70.8, 71.8) −0.2, −0.8, , −3.8, −1.1, 4.7 −0.1, −0.7, −0.5, , −3.6, −1.1, 4.5 Table 4 - Performance characteristics in random sample 100, 45, 15, 10 journal subsets, and 161 journals for treatment category

14 Best specificity randomized controlled trial.mp. OR randomized controlled trial.pt (92.3, 95.3), 94.4 (92.2, 96.7), 90.1 (85.8, 94.3), 96.5 (92.6, 100.4), 93.1 (91.5, 94.8) 97.7 (97.5, 97.9), 97.3 (97.0, 97.6), 97.6 (97.2, 98.0), 97.4 (96.8, 97.9), 97.5 (97.3, 97.6) 56.0 (53.7, 58.4), 54.5 (50.8, 58.3), 55.0 (49.4, 60.5), 50.0 (42.4, 57.7), 54.4 (52.0, 56.8) 97.6 (97.4, 97.7), 97.2 (96.9, 97.5), 97.4 (97.0, 97.8), 97.3 (96.8, 97.9), 97.3 (97.1, 97.5) 0.7, 1.3, −3.0, , −0.2, 0.1, − , 0.1, 0.6, − , −0.1, 0.1, 0.0 Best optimization of sensitivity & specificity randomized controlled trial.pt. OR randomized.mp. OR placebo.mp (95.2, 97.5), 97.0 (95.3, 98.7), 93.2 (89.6, 96.8), 98.8 (96.5, 101.1), 95.8 (94.5, 97.1) 95.9 (95.7, 96.1), 92.7 (92.2, 93.2), 96.3 (95.8, 96.8), 95.1 (94.4, 95.9), 95.0 (94.8, 95.3) 42.2 (40.2, 44.2), 31.3 (28.7, 33.9), 45.0 (40.1, 49.9), 35.7 (29.6, 41.9), 38.5 (36.5, 40.5) 95.9 (95.7, 96.1), 92.8 (92.4, 93.3), 96.2 (95.7, 96.7), 95.2 (94.5, 96.0), 95.0 (94.8, 95.3) 0.5, 1.2, −2.6, , −2.3, 1.3, , −7.2, 6.5, − , −2.2, 1.2, 0.2 Table 4 con’d

15 Conclusion Our sample size calculations suggest that search strategies calibrated in small journal subsets will be as robust as those calibrated in larger collections for some categories that have relatively much more “pass articles”, such as treatment category with 3.24% of “pass articles”.

16 Conclusion con’d The "top yielding journal" approach seems most economical. We will develop and test the new search strategies in small random and top journal subsets to see how close the estimates are.

17 Acknowledgments The Hedges Team included Angela Eady, Brian Haynes, Susan Marks, Ann McKibbon, Doug Morgan, Cindy Walker-Dilks, Stephen Walter, Stephen Werre, Nancy Wilczynski, and Sharon Wong. Chris Cotoi (our computer programmer) helped us generate a journal randomizer.