Indexing the Biomedical Literature in a Time of Increased Demand and Limited Resources BioASQ Workshop September 27, 2013 Alan R. Aronson Lister Hill Center,

Slides:



Advertisements
Similar presentations
CINAHL DATABASE FOR HINARI USERS: nursing and allied health information (Module 7.1)
Advertisements

An introduction to Medline (CMM2) Medical Subject Librarian Team.
PubMed Searching: Automatic Term Mapping (ATM) PubMed for Trainers, Spring 2014 U.S. National Library of Medicine (NLM) and NLM Training Center.
Search Strategy and Information Retrieval By Rekha Gupta, NIC
PubMed Review Medical Library Association Annual Meeting May 20 – 22, 2007 Philadelphia.
PubMed and its search options Jan Emmerich, Sonja Jacobi, Kerstin Müller (5th Semester Library Management)
NCBI/WHO PubMed/Hinari Course NCBI Literature Databases: PubMed Background.
PubMed: Outline Coverage MeSH, mapping and subheadings Simple search Limits Displaying and managing results MeSH database Single citation matcher.
What is the status of community acquired pneumonia in adults in the United States? Searching PubMed pubmed.gov.
Introduction to PubMed® (pubmed.gov)
The NLM Indexing Initiative Alan R. Aronson, PhD Lister Hill Center, National Library of Medicine American Society of Indexers Annual Meeting May 15, 2004.
Semantic indexing in PubMed CERN Workshop on Innovations in Scholarly Communication (OAI8) CERN Workshop on Innovations in Scholarly Communication (OAI8)
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
MEDLINE®/PubMed® Based on the PubMed for Trainers course, U.S. National Library of Medicine (NLM) and NLM Training Center Jane Bridges, ML, AHIP Associate.
Searching Pubmed Database استخدام قاعدة المعلومات Pubmed د. سيناء عبد المحسن العقيل قسم الصيدلة الإكلينيكية برنامج مهارات البحث العلمي.
NLM Online Users’ Meeting May 21, 2012
The National Library of Medicine online resources Salima M’seffar INH- Bibliotheque
Astrid Müller, Library of Medicine and Health Sciences PubMed and Other NLM Resources CE Course -10th European Conference for Medical and Health Libraries,
Ke Liu1, Junqiu Wu2, Shengwen Peng1,Chengxiang Zhai3, Shanfeng Zhu1
Nursing Research through the MCTC Library Use this hands-on session to learn effective searching for your Nursing research assignments. We will take a.
U. S. National Library of Medicine NLM Indexing Initiative Tools for NLP: MetaMap and the Medical Text Indexer Natural Language Processing: State of the.
NLM Medical Text Indexer (MTI) BioASQ Challenge Workshop September 27, 2013 J.G. Mork, A. Jimeno Yepes, A. R. Aronson.
PubMed/MeSH - Medical Subject Headings (Advanced Course: Module 1)
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
Medical Knowledge Watch at the Belgium Poison Centre Christophe Dupriez 26 June 2007.
Literature Searching: Theories Related to Nursing Care of the Adult Min-Lin Fang, MLIS Education and Information Consultant for Nursing and Social and.
Literature Searching: Theories of the policy Process Min-Lin Fang, MLIS Education and Information Consultant for Nursing and Social and Behavioral Sciences.
Mess ‘o MeSH …Or, What are all those funny terms anyway? MU Cataloging Workshop 24 April 2008 Amanda Sprochi.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Implementing Metadata Marjorie M K Hlava, President Access Innovations, Inc. Albuquerque, NM
MS 640: Introduction to Biomedical Information Medical Professionalism Finding Information Using Alumni Medical Library Resources.
Medical Subject Headings (MeSH)
MeSH Vocabulary.
NURSING 475 Step Five: RESEARCH APPLICATION. STEP FIVE: The Assignment: n Select a nursing intervention you performed on this patient. What are some of.
Indexing 1/2 BDK12-3 Information Retrieval William Hersh, MD Department of Medical Informatics & Clinical Epidemiology Oregon Health & Science University.
Controlled Vocabulary & Thesaurus Design Planning & Maintenance.
A Report to the Board of Scientific Counselors
NICTA Copyright 2013From imagination to impact Identifying Publication Types Using Machine Learning BioASQ Challenge Workshop A. Jimeno Yepes, J.G. Mork,
Session II: Scientific Publishing and Semantic Web W3C Semantic Web for Life Sciences Workshop October 27, 2004 Moderator: Alan R. Aronson.
1 On the Record Report of the Library of Congress Working Group on the Future of Bibliographic Control Diane Boehr Head of Cataloging, NLM
PubMed and other Online Tools Michele R. Tennant, Ph.D., M.L.I.S. Health Science Center Libraries/ U.F. Genetics Institute GMS 6014 January.
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
Searching Medline Alex Denby Regional MI Manager London Medicines Information Service (Northwick Park Hospital)
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Searching Medline Helen Rowlandson Medicines Information Manager Northwick Park Hospital, London.
CINAHL DATABASE FOR HINARI USERS: nursing and allied health information (Module 7.1)
Semi-Automatic Indexing of Full Text Biomedical Articles Washington D.C. October 25, 2005 Clifford W. Gay Lister Hill National Center for Biomedical Communications.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
Searching PubMed® NCBI, NLM Resources, Micromedex -GSBS TTUHSC Preston Smith Library presents Rev. 08/17/14.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
Doug Brutlag 2011 Bibliographic Search Doug Brutlag Professor Emeritus of Biochemistry.
Searching Medline Helen Rowlandson Principal Medicines Information Pharmacist London Medicines Information (Northwick Park) London.
Searching Medline Alex Denby Regional MI Manager London Medicines Information Service (Northwick Park Hospital)
Journal Searching Nancy B. Clark, M.Ed. Director of Medical Informatics Education FSU College of Medicine 1 All recourses are available online in Medical.
U. S. National Library of Medicine The Current State of MetaMap and MMTx UMLS Webcast Alan (Lan) R. Aronson Lister Hill Center/NLM/NIH
Searching Medline Helen Rowlandson Principal Medicines Information Pharmacist London Medicines Information (Northwick Park) London.
Medical Text Indexing Joe Thomas Unit Supervisor Index Section, NLM.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
PICO Search Using Medline Via OVID Pathfinder By Shanna Giguere Information Services III.
PubMed Searching: Automatic Term Mapping (ATM) PubMed for Trainers, Fall 2015 U.S. National Library of Medicine (NLM) and NLM Training Center.
PubMed Basics Barbara A. Wood, MLIS Calder Library University of Miami Miller School of Medicine.
MEDLINE®/PubMed® PubMed for Trainers, Fall 2015 U.S. National Library of Medicine (NLM) and NLM Training Center An introduction.
GUIDE. P UB M ED
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
CINAHL DATABASE FOR HINARI USERS
Lívia Vasas, PhD 2018 The Nation Library of Medicine and its databases Mozilla Firefox or Google Chrome Lívia Vasas, PhD.
PubMed.
PubMed/How to Search, Display, Download & (module 4.1)
PubMed/How to Search, Display, Download & (module 4.1)
Presentation transcript:

Indexing the Biomedical Literature in a Time of Increased Demand and Limited Resources BioASQ Workshop September 27, 2013 Alan R. Aronson Lister Hill Center, US National Library of Medicine

The views and opinions expressed do not necessarily state or reflect those of the U.S. Government, and they may not be used for advertising or product endorsement purposes. Alan R. Aronson James G. Mork François-Michel Lang Willie J. Rogers Antonio J. Jimeno-Yepes J. Caitlin Sticco The NLM Indexing Initiative Team 2

 MEDLINE Indexing  Indexing Initiative  Medical Text Indexer (MTI)  MTI as First-Line Indexer (MTIFL)  MetaMap  Gene Indexing Assistant (GIA)  Machine Learning Improvements  Future Work Outline 3

MEDLINE Citation Example 4

Chemical Flag Example 5 Chemical Flags for PMID: (4-Cl-benzyl)-3-Cl-4-(CF3-phenylamino)-1H-pyrrol-2,5-dione (MI-1)|Structure on page amino-4-(1,3-benzothiazol-2-yl)-1-(3-methoxyphenyl)-1,2-dihydro-3H-pyrrol-3-one (D1)|Structure on page 75.

Gene Indexing Example 6 These results demonstrate that FLNA is prone to pathogenic rearrangements

Growth in MEDLINE * MEDLINE Baseline less OLDMEDLINE and PubMed-not-MEDLINE 7

 ~$9.40 to index an article  ~$4.90 to add a GeneRIF  ~$9.40 to add a Chemical Flag  Quickly approaching 1,000,000 articles indexed per year Costs of Doing Business 8

 Demand for indexing continues to grow  Budgets were flat in the mid-1990s  The NIH budget almost doubled in five years:  Current budgets are flat or declining Indexing Demand vs. Budgets 9

 MEDLINE Indexing  Indexing Initiative  Medical Text Indexer (MTI)  MTI as First-Line Indexer (MTIFL)  MetaMap  Gene Indexing Assistant (GIA)  Machine Learning Improvements  Future Work Outline 10

The NLM Indexing Initiative  The need for MEDLINE indexing support from the 1990s:  Increasing demand/costs for indexing in light of  Flat budgets  In response, NLM initiated the Indexing Initiative in 1996 to study ways of meeting the perceived need  Cross-library and cross-discipline team assembled  A prototype indexing system matured into the Medical Text Indexer (MTI) 11

 Medical Subject Headings (MeSH)  Biomedical indexing vocabulary maintained by NLM  Main headings, subheadings, supplementary concepts  Unified Medical Language System (UMLS) Metathesaurus  A thesaurus constructed from the terms in over 150 vocabularies one of which is MeSH  Synonymous terms are grouped into concepts (~3M) and assigned semantic types (133)  Relationships among concepts are inherited from term sources or assigned by UMLS editors MEDLINE Indexing Resources 12

MTI - Overview  Summarizes input text into an ordered list of MeSH Headings  Assisted indexing of Index Section journal articles since 2002  Assisted indexing of Cataloging and History of Medicine Division records  Automatic indexing of NLM Gateway meeting abstracts ( 2002 – 2012 )  First-line indexing (MTIFL) since February 2011  Also available to the Community  45,672 requests; 16,074,511 items processed (2012)  MTI reduces cost and lets indexers focus on tasks requiring human judgement 13

MTI Usage 14

MTI - How are we doing? 15

 First experiments conducted 2010  Focused on journals MTI performed extremely well on  Involved multiple experiments with Indexers  Timing information provided critical evaluation criteria  Moved into Production February 2011  Started with 14 journals  Now have 89 journals for approximately 10,000 articles/year  Expected to expand to 372 journals by end of 2015 MTI as First-Line Indexer (MTIFL) 16

MTI as First-Line Indexer (MTIFL) 17 MTI Processes/ Recommends MeSH Indexing Displays in PubMed as Usual Reviser Reviews Selects Adjusts Approves 89 MEDLINE Journals Indexer Reviews Selects MTI Processes/ Recommends MeSH Indexer Reviews Selects Reviser Reviews Selects Adjusts Approves Indexing Displays in PubMed as Usual “Normal” MTI Processing

MTI as First-Line Indexer (MTIFL) 18 MTI Processes/ Indexes MeSH Indexing Displays in PubMed as Usual Index Section Compares MTI and Reviser Indexing Reviser Reviews Selects Adjusts Approves 89 MEDLINE Journals Indexer Reviews Selects MTI Processes/ Indexes MeSH Reviser Reviews Selects Adjusts Approves Indexing Displays in PubMed as Usual MTIFL MTI Processing

 MTIFL precision is higher than non-MTIFL (73% vs 57%)  Accepted MeSH terms indexed by MTIFL tend to be suggested by both MetaMap and PubMed Related Citations.  Removed MeSH terms indexed by MTIFL tend to be more general than what the indexer chooses (typically one tree level).  MTIFL indexing is faster than non-MTIFL indexing  Removing incorrect terms takes longer than adding missing terms  On average:  indexers remove 2.4 incorrect terms  indexers add 3.8 missing terms  MTIFL misses only 1 major term MTIFL: What We Know So Far 19

 Named-entity recognition  Identify UMLS Metathesaurus concepts in text  Important and difficult problem  MetaMap’s dual role:  Local: Critical component of NLM’s Medical Text Indexer (MTI)  Global: Biomedical concept-identification application MetaMap 20

The Gene Indexing Assistant  An automated tool to assist the indexer in identifying and creating GeneRIFs  Evaluate the article  Identify genes  Make links to Entrez Gene  Suggest GeneRIF annotation  Anticipated Benefits:  Increase in speed  Increase in comprehensiveness 21

 Word Sense Disambiguation (WSD)  Knowledge-based  Corpus method, including creation of a new WSD TC  MTI Meta learning  Check tags  Gene indexing  Automatic summarization  Publication types Machine Learning Improvements 22

 MEDLINE Indexing  Indexing Initiative  Medical Text Indexer (MTI)  MTI as First-Line Indexer (MTIFL)  MetaMap  Gene Indexing Assistant (GIA)  Machine Learning Improvements  Future Work Outline 23

Future Work  Continued collaboration with the NLM Index Section  Planned improvements to MetaMap and MTI such as  Expansion/improvement of MTIFL capability  Add species detection to MTI for disambiguation and for GIA  Further MTI research with Antonio Jimeno-Yepes and Caitlin Sticco  Incorporation of strategies used by BioASQ participants 24

Alan (Lan) R. AronsonWillie J. Rogers James G. MorkAntonio J. Jimeno-Yepes François-Michel LangJ. Caitlin Sticco Questions 25 Generated using Wordle™ (