Biomedical Text Mining and Its Applications

Slides:



Advertisements
Similar presentations
Introduction to Information Retrieval Introduction to Information Retrieval Lecture 7: Scoring and results assembly.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Bio-Medical Interaction Extractor Syed Toufeeq Ahmed ASU.
Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
Chapter 5: Introduction to Information Retrieval
Mining External Resources for Biomedical IE Why, How, What Malvina Nissim
Biological literature mining
Andreas Doms Biotechnology Center Technical UniversityDresden ontology-based literature search life science literature gopubmed hands-on curation tool.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
The Jikitou Biomedical Question Answering System: Using High-Performance Computing to Preprocess Possible Answers Michael A. Bauer 1,2, Daniel Berleant.
Introduction to Natural Language Processing Phenotype RCN Meeting Feb 2013.
Literature Informatics Beyond PubMed: Next Generation Literature Searching Carrie Iwema, PhD, MLS 24 th August 2011.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Database and Information- Retrieval Methods for Knowledge Discovery Database and Information- Retrieval Methods for Knowledge Discovery Gerhard Weikum,
The Informative Role of WordNet in Open-Domain Question Answering Marius Paşca and Sanda M. Harabagiu (NAACL 2001) Presented by Shauna Eggers CS 620 February.
Machine Learning for Information Extraction Li Xu.
Retrieval Evaluation: Precision and Recall. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity.
DI FC UL1 Gene Function Prediction by Mining Biomedical Literature Pooja Jain Master in Bioinformatics Supervisor - Mário Jorge Costa Gaspar.
BioText Infrastructure Ariel Schwartz Gaurav Bhalotia 10/07/2002.
WXGB6106 INFORMATION RETRIEVAL Week 3 RETRIEVAL EVALUATION.
Creating and Visualizing Document Classification J. Gelernter, D. Cao, R. Lu, E. Fink, J. Carbonell.
1 Ontology Generation Based on a User-Specified Ontology Seed Cui Tao Data Extraction Research Group Department of Computer Science Brigham Young University.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
BeeSpace: An Interactive Environment for Analyzing Nature and Nurture in Societal Roles Bruce Schatz Institute for Genomic Biology University of Illinois.
Knowledge Integration for Gene Target Selection Graciela Gonzalez, PhD Juan C. Uribe Contact:
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Grant Number: IIS Institution of PI: Arizona State University PIs: Zoé Lacroix Title: Collaborative Research: Semantic Map of Biological Data.
IProLINK – A Literature Mining Resource at PIR (integrated Protein Literature INformation and Knowledge ) Hu ZZ 1, Liu H 2, Vijay-Shanker K 3, Mani I 4,
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Flexible Text Mining using Interactive Information Extraction David Milward
25/03/2003CSCI 6405 Zheyuan Yu1 Finding Unexpected Information Taken from the paper : “Discovering Unexpected Information from your Competitor’s Web Sites”
WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.
Text Mining In InQuery Vasant Kumar, Peter Richards August 25th, 1999.
Automatically Generating Gene Summaries from Biomedical Literature (To appear in Proceedings of PSB 2006) X. LING, J. JIANG, X. He, Q.~Z. MEI, C.~X. ZHAI,
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
Lars Juhl Jensen Biomedical text mining. exponential growth.
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
Chapter 6: Information Retrieval and Web Search
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
A collaborative tool for sequence annotation. Contact:
Kevin C. Chang. About the collaboration -- Cazoodle 2 Coming next week: Vacation Rental Search.
Annotating Gene List From Literature Xin He Department of Computer Science UIUC.
Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.
Retrospective study of a gene by mining texts : The Hepcidin use-case Fouzia Moussouni-Marzolf.
Selecting Relevant Documents Assume: –we already have a corpus of documents defined. –goal is to return a subset of those documents. –Individual documents.
An Ontology-based Automatic Semantic Annotation Approach for Patent Document Retrieval in Product Innovation Design Feng Wang, Lanfen Lin, Zhou Yang College.
WHIM- Spring ‘10 By:-Enza Desai. What is HCIR? Study of IR techniques that brings human intelligence into search process. Coined by Gary Marchionini.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,
Mohammad Alqahtani, Dr. Eric Atwell
Visual Information Retrieval
Genomics research paper presentation
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Data challenges in the pharmaceutical industry
Multimedia Information Retrieval
Beyond PubMed--Next Generation Literature Searching
Text Categorization Document classification categorizes documents into one or more classes which is useful in Information Retrieval (IR). IR is the task.
CSE 635 Multimedia Information Retrieval
Citation-based Extraction of Core Contents from Biomedical Articles
Batyr Charyyev.
Presentation transcript:

Biomedical Text Mining and Its Applications By: Raul Rodriguez-Esteban Presented By: Ankita Tanwar

INTRODUCTION & MOTIVATION Tutorial for Biologists and Computational Biologists Main motivation to spread awareness Introduces to term BioNLP Introduces main concepts in Text Mining Lists multiple tools like Whatizit, GoPubMed, GoGene, …

TEXT MINING: MAIN CONCEPTS Term & Term Recognization Tools: Whatizit, Abner, GoPubMed/GoGene, … Relationships between terms Tools: MedGene, BioGene, Endeavour, G2G, … Discovering Relationships Tool: Arrowsmith Measure of output Quality F-Measure, i.e., harmonic mean between precision and recall Comprehensive text mining Sources of information: Medline and beyond.

EXAMPLES OF TEXT RECOGNITION

Focusing on Tool GoPubMed By: Ralph Delfs, Andreas Doms, Alexander Kozlenkov, and Michael Schroeder

INTRODUCTION Allows finding information needed through the use of biomedical background knowledge. It doesn't rank, the user does! It retrieves PubMed abstracts for user’s search query and sorts relevant information to the 4 top level categories: What Who Where When

MOTIVATION The biomedical literature grows at a tremendous rate and PubMed comprises over 14.000.000 abstracts Approaches such as protein interactions, pathways, and micro array data aim to improve literature search But, these approaches do not mimic human information foraging

CONTRIBUTIONS Introduction and realization of ontology-based literature search Derived a term-extraction algorithm Derived an induced ontology from the extracted terms

Structure of GENE ONTOLOGY

GoPubMed: Main Idea The main idea is to use GeneOntology to search and browse PubMed Problems to be solved: How to extract GeneOntology terms from PubMed abstract How to construct the relevant sub-ontology of GO

GoPubMed: Term Extraction Use of Regular Expression: \w matches a word \s a space the dot . any single character To repeatedly match an expression there are three operators: ? requires the preceding pattern to appear once or not at all + requires it to appear once at least once * requires it to appear any number of times (including 0)

Term Extraction: Example Keyword searched: cAMP-dependent kinase Seed term: kinase activity Seed Child: cAMP-dependent protein kinase activity Method to search for such a pattern: kinase \w+ cAMP-dependent .* kinase activity

GoPubMed: Induced Ontology Used to avoid unnecessary parts of ontology not relevant to given abstracts. Given an ontology, 𝑂 and a set of terms 𝑇′′, extracted from abstracts, construct a minimal sub-ontology of 𝑂 Find all the intermediate terms, from terms in 𝑇′′ to root

Screenshot of Initial Prototype

Screenshot of Current Application

THANK YOU!!