Improved Cancer Risk Assessment Using Text Mining Ilona Silins 1, Anna Korhonen 2, Johan Högberg 1, Lin Sun 2 and Ulla Stenius 1 1 Institute of Environmental.

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Oxford University Press Journals Collection Online.
Oxford University Press Journals Collection Online.
Jane Long, MA, MLIS Reference Services Librarian Al Harris Library.
Course KLARA bar codes In English
KNOWLEDGE FOR LIFE CABI database training CAB Abstracts Introductory demonstration.
PubMed for Trainers, Fall 2011 U.S. National Library of Medicine (NLM) and NLM Training Center PubMed Search Mechanics Tips & Tricks.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
International Agency for Research on Cancer (IARC) Classification of Radio Frequency (RF) Summary – May 2011.
1 Question Answering in Biomedicine Student: Andreea Tutos Id: Supervisor: Diego Molla.
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Automatic Classification of Accounting Literature Nineteenth Annual Strategic and Emerging Technologies Workshop Vasundhara Chakraborty, Victoria Chiu,
Disasters and Human Factors Literature Nestor L Osorio Northern Illinois University.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
OAT - The Ontology Annotation Tree browser tool The microarray technique has gained enormous popularity both among the major pharmaceutical companies and.
What Do Toxicologists Do?
Search on Journal of Dairy Science ® An Overview April
Biomedical research methods. What are biomedical research methods? An integrated approach using chemical, mathematical and computer simulations, in vitro.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Research Methods for Computer Science CSCI 6620 Spring 2014 Dr. Pettey CSCI 6620 Spring 2014 Dr. Pettey.
LITERATURE REVIEWS. What is a literature review?  “a synthesis of the literature on a topic.”  (Cottrell & McKenzie, 2011, pg 40)
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Welcome to the Web of Science tutorial By the end of this tutorial you should be able to: Do a basic search to find references Use search techniques to.
EU Framework Programme 6, Priority 5: ”Food Quality and Safety”,Topic 41: “Human health implications of exposure to chemical residues in the environment”
Biological Science Database Proquest WEDAD AL-HUSAINAN ISD/NSTIC Kuwait Institute for Scientific Research November/2012.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Understanding Science Articles (Journals, Periodicals, Primary References)
Click to edit Master title style Click to edit Master text styles – Second level Third level – Fourth level » Fifth level - Using Pollutant Release and.
“Knowing Revisited” And that’s how we can move toward really knowing something: Richard Feynman on the Scientific Method.
Qualifications Update: Environmental Science Qualifications Update: Environmental Science.
1 How to find literature - A very short introduction SMED 8004 Medicine and Health Library October 2014.
Knowledge Discovery in the Digital Library Access tools for mining science ICSTI Public Workshop Presented by: Bernard Dumouchel, Director-General February.
Text Mining Special Interest Group Stuart Murray, Wyeth Research Novartis Institute for Biomedical Research, Cambridge, MA 6-8 th October 2004.
 Remember, it is important that you should not believe everything you read.  Moreover, you should be able to reject or accept information based on the.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Learning Goals To understand the magnitude of drug information available today To understand the differences between primary, secondary, and tertiary resources.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
Immunization Safety Review: Vaccines and Autism Marie McCormick Chair, Immunization Safety Review Committee Presentation to NVAC June 2004.
Presented by Dr. S. C. Jindal Librarian Central Science Library University of Delhi Delhi Information Competency.
Document Clustering for Forensic Analysis: An Approach for Improving Computer Inspection.
What is Science?. Competency Goal 1: The learner will design and conduct investigations to demonstrate an understanding of scientific inquiry.. –1.03.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 5 Finding and Critiquing Evidence: Research Literature Reviews.
1 Automatic indexing Salton: When the assignment of content identifiers is carried out with the aid of modern computing equipment the operation becomes.
Handling Floods of Literature: BioMedical Document Management Andrew Dolbey Center for Computational Pharmacology University of Colorado, School of Medicine.
Annotating Gene List From Literature Xin He Department of Computer Science UIUC.
Purpose of a Literature Review Potential Research Sources Writing a Literature Review.
Office of Research and Development Photo image area measures 1.5” H x 7” and can be masked by a collage strip of one, two or three images. The photo image.
How to Write Literature Review ww.ePowerPoint.com
Unit 11: Evaluating Epidemiologic Literature. Unit 11 Learning Objectives: 1. Recognize uniform guidelines used in preparing manuscripts for publication.
DISCUSSION Using a Literature-based NMF Model for Discovering Gene Functional Relationships Using a Literature-based NMF Model for Discovering Gene Functional.
Machete: Charting Excursions through Bioscience Literature Shannon Bradshaw 1 and Marc Light 2 1 Department of Management Sciences 2 School of Library.
INTRODUCTION TO APPLIED LINGUISTICS
Deep Indexing in ProQuest Health and Medical Databases.
Major Issues n Information is mostly online n Information is increasing available in full-text (full-content) n There is an explosion in the amount of.
CAPE INFORMATION TECHNOLOGY
E303 Part II The Context of Language Research
Macromolecules Database Creation for Polymer Properties
TJTS505: Master's Thesis Seminar
Introduction to Data Mining
9. Introduction to signal detection
Biomedical Research.
CICC Combines Grid Computing with Chemical Informatics
Introduction of KNS55 Platform
Citation-based Extraction of Core Contents from Biomedical Articles
Publication of research
CAPE INFORMATION TECHNOLOGY
Investigative Biology Scientific Literature and Communication
Presentation transcript:

Improved Cancer Risk Assessment Using Text Mining Ilona Silins 1, Anna Korhonen 2, Johan Högberg 1, Lin Sun 2 and Ulla Stenius 1 1 Institute of Environmental Medicine, Karolinska Institutet, Sweden 2 Computer Laboratory, University of Cambridge, UK Test-chemicals used for annotation: Diethylnitrosamine, 1,3-Butadiene, Benzo(a)pyrene, Styrene, Diethylstilbestrol, Chloroform, Phenobarbital, Fumonisin B1. In total1297 Medline abstracts. Future goals and applications Further improvements, extentions and development of our tool to aid risk assessors managing the existing literature Identify chemicals’ ”Modes of Action” Identify cellular toxicity pathways that can be used to develop new in vitro tests for carcinogenic substances Identify chemicals’ interactions for risk assessment purposes Cancer Risk Assessment examines existing scientific evidence to determine the relationship between exposure to a chemical and the likelihood of developing cancer from that exposure. Text mining (TM) is a growing field of computer science. It involves the discovery (by computer) of new, previously ”unknown” information, by automatically extracting information from different written resources. It has become increasingly popular due to the need to provide access to the tremendous body of texts available in biomedical sciences. Aim: To develop a text mining tool that can aid risk assessors manage the existing literature to improve quality, consistency and effectiveness of the entire cancer risk assessment process. Why? Extensive data gathering and evaluation for cancer risk assessment is challenging and time-consuming! Data are scattered across thousands of journal articles from different areas of biomedicine. Rapid development of molecular biology techniques. Increased knowledge of mechanisms involved in cancer development. Exponentially growing volume of literature. Figure 2. A screenshot of the annotation tool (used for the development of taxonomies) and an annotated abstract. Literature Database (e.g. MEDLINE) Journal articles for cancer risk assessment Annotation of cancer risk assessment corpus Cancer risk assessment taxonomy Text Mining technology Text mining tool Identify chemicals’ ”modes of action” Further development, improvement and extension of the tool Identify ”toxicity pathways” Investigate chemicals’ interactions Figure 3. A preliminary taxonomy ”Scientific evidence for carcinogenic activity”, with example keywords. USER TEST Table 1. Results from user test. A user test examine the practical usefulness of the classification in a near real-world scenario. Experts were asked to judge if the system classified the relevant abstracts correctly in the taxonomies (as shown in figure 3 and 4). Contact: Ilona Silins, Institute of Environmental Medicine, Karolinska Institutet Nobels väg 13, Stockholm, Sweden Project’s homepage: CRAB Figure 1. A schematic description of the working process. Figure 4. A preliminary ”Mode of Action” taxonomy (with example keywords) which specifies some of the scientific evidence needed for cancer risk assessment.