B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego 118026.

Slides:



Advertisements
Similar presentations
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Advertisements

Social networks, in the form of bibliographies and citations, have long been an integral part of the scientific process. We examine how to leverage the.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Annikki Roos & Turid Hedlund ELPUB 2007 Vienna Importance of access to biomedical information for researchers in research groups of molecular medicine.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
1 Question Answering in Biomedicine Student: Andreea Tutos Id: Supervisor: Diego Molla.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Automating Discovery from Biomedical Texts Marti Hearst & Barbara Rosario UC Berkeley Agyinc Visit August 16, 2000.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
DI FC UL1 Gene Function Prediction by Mining Biomedical Literature Pooja Jain Master in Bioinformatics Supervisor - Mário Jorge Costa Gaspar.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Improved Cancer Risk Assessment Using Text Mining Ilona Silins 1, Anna Korhonen 2, Johan Högberg 1, Lin Sun 2 and Ulla Stenius 1 1 Institute of Environmental.
Topics in Computational Biology (COSI 230a) Pengyu Hong 09/02/2005.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Medline Text Searching Tools – a Comparison Experiment McDermott Center for Human Growth and Development Center for Biomedical Inventions.
Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Bioinformatics and it’s methods Prepared by: Petro Rogutskyi
Text Mining: Opportunities and Barriers John McNaught Deputy Director National Centre for Text Mining
Bioinformatics and medicine: Are we meeting the challenge?
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
IProLINK – A Literature Mining Resource at PIR (integrated Protein Literature INformation and Knowledge ) Hu ZZ 1, Liu H 2, Vijay-Shanker K 3, Mani I 4,
Flexible Text Mining using Interactive Information Extraction David Milward
Automatically Generating Gene Summaries from Biomedical Literature (To appear in Proceedings of PSB 2006) X. LING, J. JIANG, X. He, Q.~Z. MEI, C.~X. ZHAI,
Discovering Gene-Disease Association using On-line Scientific Text Abstracts. Raj Adhikari Advisor: Javed Mostafa.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
Correlating Knowledge Using NLP: Relationships between the concepts of blood cancers, stem cell transplantation, and biomarkers Katy Zou and Weizhong Zhu.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
BioSumm A novel summarizer oriented to biological information Elena Baralis, Alessandro Fiori, Lorenzo Montrucchio Politecnico di Torino Introduction text.
Harbin Institute of Technology Computer Science and Bioinformatics Wang Yadong Second US-China Computer Science Leadership Summit.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
1 What is Data Mining? l Data mining is the process of automatically discovering useful information in large data repositories. l There are many other.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
A collaborative tool for sequence annotation. Contact:
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College Bio Informatics January
Handling Floods of Literature: BioMedical Document Management Andrew Dolbey Center for Computational Pharmacology University of Colorado, School of Medicine.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
1 Semantic Relations for Interpreting DNA Microarray Data and for Novel Hypotheses Generation Dimitar Hristovski, 1 PhD, Andrej Kastrin, 2 Borut Peterlin,
Opportunities for Text Mining in Bioinformatics (CS591-CXZ Text Data Mining Seminar) Dec. 8, 2004 ChengXiang Zhai Department of Computer Science University.
DeepDive Introduction Dongfang Xu Ph.D student, School of Information, University of Arizona Sept 10, 2015.
Approach to building ontologies A high-level view Chris Wroe.
Clinical research data interoperbility Shared names meeting, Boston, Bosse Andersson (AstraZeneca R&D Lund) Kerstin Forsberg (AstraZeneca R&D.
DISCUSSION Using a Literature-based NMF Model for Discovering Gene Functional Relationships Using a Literature-based NMF Model for Discovering Gene Functional.
LECTURE 2: DATA MINING. WHAT IS DATA MINING? 2 D ATA M INING AND D ATA W AREHOUSES ? It evolved in to being as the science of databases evolved Database.
Principals of Biomedical Research Guri Tzivion, PhD Extension 506 PBMR 611: Winter 2016 Windsor University School of Medicine.
A New Generation of Artificial Neural Networks.  Support Vector Machines (SVM) appeared in the early nineties in the COLT92 ACM Conference.  SVM have.
Deep Indexing in ProQuest Health and Medical Databases.
Genomic Medicine Grid Juan Pedro Sánchez Merino Instituto de Salud Carlos III
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
TDM in the Life Sciences Application to Drug Repositioning *
Biologic Medicines.
Genomics research paper presentation
Biomedical Text Mining and Its Applications
Applications of Text Mining
Citation-based Extraction of Core Contents from Biomedical Articles
Batyr Charyyev.
Web Mining Department of Computer Science and Engg.
Introduction to Basic Research Methods
Presentation transcript:

B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego

O VERVIEW  Objectives  Introduction  Biomedical text mining phases and tasks  Data sets and tools for biomedical text mining  Application of biomedical text mining in cancer research  Cancer systems biology research with text mining approach  Future work and challenges  Conclusions

O BJECTIVES After completing this presentation you should :  Know the phases and tasks in biomedical text mining.  Know the application of biomedical text mining in cancer research.  The challenges of cancer research in biomedical text mining

I NTRODUCTION Cancer is a malignant disease that has caused so many deaths. The immense body and rapid growth of biomedical text on cancer has led to the appearance of large number of text mining techniques aimed at extracting unique knowledge from scientific text. Biomedical text mining on cancer research is computationally automatic and high-throughput in nature.

B IOMEDICAL T EXT M INING P HASES AND T ASKS The goal of text mining is to derive implicit knowledge that hides in unstructured text and present it in an explicit form. There are four phases in biomedical text mining:  Information retrieval : this aims at getting desired text on a certain topic.  Information extraction : this system is used to extract predefined types of information such as relation extraction.  Knowledge extraction : this system helps to extract important knowledge from texts.  Hypothesis generation : this system infer unknown biomedical facts based on texts.

G ENERAL TASKS IN BIOMEDICAL TEXT MINING There are four general tasks of biomedical text mining and they include:  Information retrieval  Named entity recognition and relation extraction  Knowledge discovery  Hypothesis generation

CONVENTIONAL PHASES AND TASKS INVOLVED IN BIOMEDICAL TEXT MINING

D ATA SETS AND TOOLS FOR BIOMEDICAL TEXT MINING Examples of data set and tools used for biomedical text mining:  PubMed: this is one of the best known biomedical databases and it contains more than 20 million citations on biomedical articles.  Textpresso: this uses an ontology, returns searching goals for classes of biological concept (e.g., gene, cell), classes of relations of objects (e.g., association, regulation), and related description (biological process).  GoPubMed: this classifies literature abstracts according to a Gene ontology and shows the ontology terms that are related to the query words.  ETC.

A PPLICATION OF B IOMEDICAL T EXT M INING IN C ANCER R ESEARCH  As a complex disease, cancer is related to a large number of genes and proteins. Biomedical researchers are interested in mining cancer- related genes and proteins from the literature to study cancer diagnostics, treatment, and prevention.

C ANCER S YSTEMS B IOLOGY R ESEARCH WITH T EXT M INING A PPROACH  Researchers tend to understand complex biological systems from a systems biology viewpoint. Systems biology-based networks cab be constructed by aggregating previously reported associations from the literature or various databases.  Generally, the conventional flow of text mining based cancer systems biology research is text acquisition, bio- entity terms recognition, complex relation extraction, new knowledge discovery, and hypothesis generation.

A N ILLUSTRATION OF A TEXT MINING - ASSISTED CANCER STUDY WORKFLOW FROM A SYSTEMS BIOLOGY VIEWPOINT.

FUTURE WORK AND CHALLENGES Challenges:  Application of biomedical text mining technologies in the personalized medicine development.  Complex of cancer molecular mechanisms.  Application of text mining techniques in translational medicine research.  The integration of the text information at molecule, cell, tissue, organ. Individual and even population levels to understand the complex biological systems.  The de-noising and testing of text mining results.

C ONCLUSIONS  There are huge body of biomedical text and their rapid growth makes it impossible for researchers to address the information manually. Researchers can use biomedical text mining to discover new knowledge.  Text mining has been used widely in cancer research. However to fully utilize text mining, it is still necessary to develop new methods for full text mining and for highly complex text, as well as platforms for integrating other biomedical knowledge bases.