The Claim Framework Catherine Blake

Slides:



Advertisements
Similar presentations
PubMed.
Advertisements

NCBI/WHO PubMed/Hinari Course NCBI Literature Databases: PubMed Background.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Guided Enquiry. OBJECTIVES databases  Understand what information is available from the databases  Locate and become familiar with the Student Research.
> a patent search service supplied by Patents & Technology Surveys Ltd PROFESSIONAL ONLINE PATENT INFORMATION SERVICE.
April 2001Division of Library Services IDEAL® is a collection of full text journal titles. Includes 173 journal titles from Academic Press. Abstracts and.
15 de Abril de A Meta-Analysis is a review in which bias has been reduced by the systematic identification, appraisal, synthesis and statistical.
Dissemination and Critical Evaluation of Published Research Peg Bottjen, MPA, MT(ASCP)SC.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
DI FC UL1 Gene Function Prediction by Mining Biomedical Literature Pooja Jain Master in Bioinformatics Supervisor - Mário Jorge Costa Gaspar.
How to find a worthy research subject Sadeghi Ramin, MD Nuclear Medicine Research Center, Mashhad University of Medical Sciences.
PSYC512: Research Methods PSYC512: Research Methods Lecture 2 Brian P. Dyre University of Idaho.
A Comparison of Document, Sentence, and Term Event Spaces Catherine Blake School of Information and Library Science University of North Carolina at Chapel.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Identifying deleterious Single Nucleotide Polymorphisms using multiple sequence alignments CMSC858P Project by Maya Zuhl.
Moving beyond free text. Authors Scientist does research Scientist publishes research results in journal article Old Paradigm:
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
CHAPTER 3: DEVELOPING LITERATURE REVIEW SKILLS
Beyond Genes, Proteins, and Abstracts: A Framework to Capture Scientific Claims Catherine Blake School of Information and Library Science University of.
1 The Discovery Informatics Framework Pat Rougeau President and CEO MDL Information Systems, Inc. Delivering the Integration Promise American Chemical.
Biological Science Database Proquest WEDAD AL-HUSAINAN ISD/NSTIC Kuwait Institute for Scientific Research November/2012.
1 The BT Digital Library A case study in intelligent content management Paul Warren
 CiteGraph: A Citation Network System for MEDLINE Articles and Analysis Qing Zhang 1,2, Hong Yu 1,3 1 University of Massachusetts Medical School, Worcester,
Towards Evidence-Based Discovery Catherine Blake School of Information and Library Science University of North Carolina at Chapel Hill
Information overload –more than 12 million references already in MEDLINE –thousands more each day –well-articulated queries retrieve many relevant articles.
Incorporating Primary Literature into Science Learning Faculty Development Workshop October 8, 2012 Donna L. Pattison, PhD Instructional Professor Department.
The (Almost) Free ILL System for Medical Information DOCLINE: Northwest Interlibrary Loan and Resource Sharing Conference September 11, 2003.
BioSumm A novel summarizer oriented to biological information Elena Baralis, Alessandro Fiori, Lorenzo Montrucchio Politecnico di Torino Introduction text.
 Copyright 2007 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute Research publication & enabling.
Revised 7/19/10.  This policy states that, as of April 7, 2008, all articles resulting from U.S. National Institutes of Health (NIH) funds must be submitted.
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
CODE (Committee on Digital Environment) July 26, 2000 Rice University THE NET OF THE 21st CENTURY: Concepts across the Interspace Bruce Schatz CANIS Laboratory.
Distribution of information in biomedical abstracts and full- text publications M. J. Schuemie et al. Dept. of Medical Informatics, Erasmus University.
An overview of Bioinformatics. Cell and Central Dogma.
Doing your literature review: an overview Katy Jordan Librarian, Social & Policy Sciences Library & Learning Centre.
Iana Atanassova Research: – Information retrieval in scientific publications exploiting semantic annotations and linguistic knowledge bases – Ranking algorithms.
Opportunities for Text Mining in Bioinformatics (CS591-CXZ Text Data Mining Seminar) Dec. 8, 2004 ChengXiang Zhai Department of Computer Science University.
International Atomic Energy Agency Sources of National Literature INIS Training Seminar November 2011, Vienna, Austria Taghrid ATIEH Leader, Capacity.
Automatically Identifying Candidate Treatments from Existing Medical Literature Catherine Blake Information & Computer Science University.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Ukpmc.ac.uk As a result of the mandates Research in the open How mandates work in practice 29 th May, 2009 Paul Davey, UK PubMed Central Engagement Manager,
PubMed Basics Barbara A. Wood, MLIS Calder Library University of Miami Miller School of Medicine.
BioCreAtIvE Critical Assessment for Information Extraction in Biology Granada, Spain, March28-March 31, 2004 Task 2: Functional annotation of gene products.
LIBRARY INTRODUCTION FOR PG IT (ONLINE) Monica Crump On behalf of: Fiona Quinlan, Subject Librarian Science & Engineering James Hardiman Library.
Smart points and Kaplan online resources
Razieh Moghadam, Kowsar Corporation,
Applications of the Interspace Analysis for Community Repositories
Genomics research paper presentation
How to Read a Scientific Paper
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
Chapter 2: Where to Start
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Library tutorial for CM5173 Reaxys
سرطان الثدي Breast Cancer
Research related to Health Informatics
WISER Finding stuff: Journal Articles
Building the Literature Review
Blake & Pratt’s ‘Collaborative Information Synthesis’
Extracting claim sentences from biomedical documents:
Strategies for annotation of a genome
Exercise #4: Cell Biology Research Paper
Citation-based Extraction of Core Contents from Biomedical Articles
Chapter Two: Review of the Literature
Lecture 6: How to Read an Academic Paper
Agenda A Quick Note on Research Questions
Describing Documents Ch3 in textbook Organizing Knowledge: An
Title Goes Here Title Goes Here Title Goes Here Title Goes Here
Fig. 5 Correlation of RNA expression and protein abundance.
INIS ACTIVITIES IN GHANA A Presentation by Mr. Albert P. K. E
Presentation transcript:

The Claim Framework Catherine Blake clblake@illinois.edu School of Library and Information Science University of Illinois at Urbana-Champaign clblake@illinois.edu

Shift from Retrieval to Synthesis Motivation Relentless increase in electronically available text Life Sciences 17 millionth entry added in April 2007 5,200 journals indexed 12,000 new articles each week ! Chemistry – more than 110,000 articles in 1 year alone Consequences: Hundreds of thousands of relevant articles Implicit connections between literature go unnoticed Shift from Retrieval to Synthesis

The Claim Framework Scientists use a shared sublanguage to express claims made in an empirical study  The Claim Framework captures the key characteristics of the claim sublanguage Text mining can be used to populate the Claim Framework automatically  An automated system will identify all and only the claims that have been identified manually

Claim Definition “To assert in the face of possible contradiction” Example sentence reporting a claim “This study showed that Tamoxifen reduces the breast cancer risk” Explicit Claim in the Claim Framework Tamoxifenagent reduceschange [breast cancer risk] object

Distribution of Claim Categories Category Total (%) Pilot(%) Main(%) Explicit 2489 77.11 332 83.42 2157 76.63 Implicit 87 2.70 3 0.75 84 2.98 Observation 298 9.23 24 6.03 274 9.73 Correlation 174 5.39 12 3.02 162 5.75 Comparison 165 5.11 27 6.85 138 4.9 Total 3228 100 398 2830

Inter Annotator Agreement Information Facet Kappa Agreement Agent 0.71 substantial Object 0.77 substantial Change 0.57 moderate Change+ChangeDir 0.88 almost perfect

Location of Claims Total Sentences With % Section Claim Total section   With % Section Claim Total section claim Abstract 98 309 31.72 7.84 Introduction 357 979 36.47 28.56 Method 6 1121 0.54 0.48 Result 293 1829 16.02 23.44 Discussion 539 1406 38.34 43.12 1250 5535 22.58 100.00

Interested ? Send me an email clblake@illinois.edu To see more details on the Claim Framework and an automated approach to populate explicit claims: Blake, C. (2010) Beyond genes, proteins, and abstracts: Identifying scientific claims from full-text biomedical articles, Journal of Biomedical Informatics, 43(2), 173-189.