Extracting claim sentences from biomedical documents:

Slides:



Advertisements
Similar presentations
The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Advertisements

Seyedehmehrnaz Mireslami, Mohammad Moshirpour, Behrouz H. Far Department of Electrical and Computer Engineering University of Calgary, Canada {smiresla,
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
IB THEORY OF KNOWLEDGE An Overview.
1 Question Answering in Biomedicine Student: Andreea Tutos Id: Supervisor: Diego Molla.
Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.
1 CBioC: Collaborative Bio- Curation Chitta Baral Department of Computer Science and Engineering Arizona State University.
Information Extraction from Clinical Reports Wendy W. Chapman, PhD University of Pittsburgh Department of Biomedical Informatics.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Introduction to Current Contents Connect. What is CCC? A multidisciplinary current awareness resource –Browse and search journals, books and websites.
Beyond Genes, Proteins, and Abstracts: A Framework to Capture Scientific Claims Catherine Blake School of Information and Library Science University of.
Decision Support for Quality Improvement
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Clinical Pharmacy Part 2
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
MinorThird 서울시립대학교 인공지능연구실 곽별샘
Introduction to Evidence-Based Medicine Dr Hayfaa A.A Wahbi Assistant Professor, Chair of Evidence Based Medicine and Knowledge translation.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
Mining fuzzy domain ontology based on concept Vector from wikipedia category network.
Correlating Knowledge Using NLP: Relationships between the concepts of blood cancers, stem cell transplantation, and biomarkers Katy Zou and Weizhong Zhu.
ADVANCED DB SYSTEMS BIOMEDICAL ENGINEERING. Index INTRODUCTION  BIOMEDICAL ENGINEERING  B.E. DATASETS APPLICATIONS  DATA MINING ON FDA DATABASE  ONTOLOGY-BASED.
Writing a Research Proposal 1.Label Notes: Research Proposal 2.Copy Notes In Your Notebooks 3.Come to class prepared to discuss and ask questions.
EFFECTS OF DULOXETINE ON THE PHARMACOKINETICS OF NEBIVOLOL IN HEALTHY VOLUNTEERS Corina Briciu 1, Maria Neag 2, Dana Muntean 3, Corina Bocsan 2, Anca Buzoianu.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
Quality Education for a Healthier Scotland Pharmacy Pharmaceutical Care Planning Vocational Training Scheme: Level = Stage 2 Arlene Shaw Specialist Clinical.
Distribution of information in biomedical abstracts and full- text publications M. J. Schuemie et al. Dept. of Medical Informatics, Erasmus University.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Using Text Mining and Natural Language Processing for.
Opportunities for Text Mining in Bioinformatics (CS591-CXZ Text Data Mining Seminar) Dec. 8, 2004 ChengXiang Zhai Department of Computer Science University.
Shirley M. Tsunoda Liver Transplant, Drug Metabolism/PK Research Interests Investigating the genetic and environmental factors that influence variability.
Update on Current FDA Labeling Initiatives Iris P. Masucci, PharmD, BCPS Center for Drug Evaluation and Research FDA November 16, 2006 Pediatric Advisory.
Introduction to Evidence-Based Medicine Dr Hayfaa A.A Wahbi Assistant Professor, Chair of Evidence Based Medicine and Knowledge translation.
Using Semantic Relations to Improve Information Retrieval
Methods to Adjust Doses Based on Exposure-Response Information Points to Consider Richard Lalonde Clinical Pharmacokinetics and Pharmacodynamics Pfizer.
Data Mining for Expertise: Using Scopus to Create Lists of Experts for U.S. Department of Education Discretionary Grant Programs Good afternoon, my name.
WP4 Models and Contents Quality Assessment
Showcasing work by Jonnageddala, Liaw, Ray, Kumar, Chang, and Dai on
The Claim Framework Catherine Blake
Concept Grounding to Multiple Knowledge Bases via Indirect Supervision
Sentiment analysis algorithms and applications: A survey
Introduction of Biopharmaceutics & Pharmacokinetics
Formal Conceptualization of Dental Diagnoses: Status Report
Color Marking Body Response Lab
RaJoLink: Creative Knowledge Discovery by Literature Outlier Detection
Pharmacokinetics.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Pharmacokinetics.
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Harry Hochheiser Assistant Professor
Senior Medical Director, Cardiovascular
Translational Informatics Applied to Drug Safety (TrIADS)
Social Knowledge Mining
Lixia Yao, James A. Evans, Andrey Rzhetsky  Trends in Biotechnology 
Pharmacokinetics.
iSRD Spam Review Detection with Imbalanced Data Distributions
CSE 635 Multimedia Information Retrieval
Batyr Charyyev.
Representation of biomedical sublanguages using symbolic notation
Introduction to Pharmacology
PubMed.
TJTS505: Master's Thesis Seminar
By Hossein Hematialam and Wlodek Zadrozny Presented by
The Semantic Shift Guillaume Mazieres EVP, North American Operations
Presentation transcript:

Extracting claim sentences from biomedical documents: a pilot study focusing on drug-drug interaction claims Linh Hoang1, Richard D. Boyce2, Jodi Schneider1 1 School of Information Sciences, University of Illinois at Urbana-Champaign 2 Department of Biomedical Informatics, School of Medicine, University of Pittsburgh In the biomedical domain, a large amount of new research is published every year. It is difficult to keep track of new research claims. Drug-drug interactions are a particularly important kind of claim. Only claims of interactions have been extracted with NLP, more work is needed to extract supporting evidence of drug-drug interactions. Semantic models that connect claims with their supporting evidence could support better information retrieval of biomedical literature. Given a dataset of drug-drug interaction claims and evidence, our goals are: 1) To understand what types of claims appear 2) To understand how data and methods are used as supporting evidence 3) To apply text mining and machine learning to automate the process of extracting claims and evidence 4) To apply a semantic model connecting claims and supporting evidence Motivation Pilot Study Methods Completed so far: Categorized annotated claims into groups. Identified the most common components each group of claims. 1. Analyze existing dataset Ongoing: Get full-text of the original documents that contain claims. Identify the document sections the claims were found in. Analyze the language used in the claims by comparing words in the annotated claims with the document titles, and identifying the most common words that appear in the claims. 875 drug-drug interaction claims with supporting evidence Human annotated from 143 documents (journal articles, drug labels, clinical case reports, etc.) Introduction of the Drug Interaction Knowledge Base dataset Annotated claim Drug-drug interaction Supporting evidence 2. Analyze the claims in details claimtext relationship drug1 drug2 aucvalue auctype aucdirection cmaxvalue cmaxtype cmaxdirection interact_with clopidogrel bupropion 60 Percent Increase 40 Ticlopidine, Clopidogrel: In a study in healthy male volunteers, clopidogrel 75 mg once daily or ticlopidine 250 mg twice daily increased exposures (Cmax and AUC) of bupropion by 40% and 60% for clopidogrel, by 38% and 85% for ticlopidine, respectively. The exposures of hydroxybupropion were decreased. Initial Data Analysis Results Next steps: Propose features to extract claims and evidence Build machine learning model to detect the claim and evidence automatically. Test and evaluate the models. Category 1: Claim includes phamacokinetics1 (PK) measurements “Combined administration of racemic citalopram (40 mg) and ketoconazole (200 mg), a potent CYP3A4 inhibitor, decreased the Cmax and AUC of ketoconazole by 21% and 10%, respectively, and did not significantly affect the pharmacokinetics of citalopram.” Category 2: General claim: doesn’t include PK measurements, specifically mentions participants, methods & results “A 77-year-old man with dyslipidemia treated with simvastatin was prescribed a course of clarithromycin by his general practitioner for an upper respiratory tract infection. He subsequently presented to the hospital with generalized muscle pain, weakness, and inability to walk.” Category 3: General claim: doesn’t include PK measurements or methods or results “The following report describes a case of rhabdomyolysis that occurred in an acutely ill patient who was taking atorvastatin in combination with gemfibrozil.” 3. Implement automatic extraction Next steps: Represent claims and supporting evidence (methods & data) in a semantic model. 4. Apply a semantic model In the above examples: Drug entities (mandatory component): two drugs that interact (in RED) Interaction (mandatory component): the direction of the interaction (in PURPLE) Supporting evidence (optional component): information (e.g. phamacokinetics measurements—AUC or Cmax) that indicates an interaction, or lack of interaction, between the drugs (in GREEN) Context (optional component): the context in which the drug-drug interaction occurs (in BLUE) 1: Pharmacokinetics is the study of drug absorption, distribution, metabolism, and excretion. Partially supported by NIH R01LM011838.