MINING FOR MEANING: Data mining & Knowledge extraction Laboratory of Parasitic Diseases, NIAID.

Slides:



Advertisements
Similar presentations
All Rights Reserved, Copyright © FUJITSU LABORATORIES LTD An approach to KNOW-WHO using RDF Nobuyuki Igata, Hiroshi Tsuda, Isamu Watanabe and Kunio.
Advertisements

A Vector Space Model for Automatic Indexing
Pathways analysis Iowa State Workshop 11 June 2009.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Contents of this Talk [Used as intro to Genome Databases Seminar, 2002] Overview of bioinformatics Motivations for genome databases Analogy of virus reverse-eng.
Machine Learning and the Semantic Web
Data Mining and Machine Learning Lab Document Clustering via Matrix Representation Xufei Wang, Jiliang Tang and Huan Liu Arizona State University.
Nitric oxide induces Mycobacterium tuberculosis stress response beyond dormancy regulon Isabel Gonzaga BIOL 368: Bioinformatics Laboratory December 10,
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
DI FC UL1 Gene Function Prediction by Mining Biomedical Literature Pooja Jain Master in Bioinformatics Supervisor - Mário Jorge Costa Gaspar.
Sai Moturu. Introduction Current approaches to microarray data analysis –Analysis of experimental data followed by a posterior process where biological.
Advance Information Retrieval Topics Hassan Bashiri.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Class Projects. Future Work and Possible Project Topic in Gene Regulatory network Learning from multiple data sources; Learning causality in Motifs; Learning.
Mining the Medical Literature Chirag Bhatt October 14 th, 2004.
The LINDI Project Linking Information for New Discoveries UIs for building and reusing hypothesis seeking strategies. Statistical language analysis techniques.
Pathway Informatics 6 th July, 2015 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University of.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Knowledge Integration for Gene Target Selection Graciela Gonzalez, PhD Juan C. Uribe Contact:
Myeloid cells matter Conclusions. Glut1+ CD14++ CD16+ Associated with inflammatory markers HIV Immune activation HIV-related diseases Joshua Anzigen:
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Implicit An Agent-Based Recommendation System for Web Search Presented by Shaun McQuaker Presentation based on paper Implicit:
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Flexible Text Mining using Interactive Information Extraction David Milward
Neonatal Immunology Kristina Abel, PhD CNPRC UC Davis How a Lymphomaniac views ( IL )lness.
Demonstration of 2 -DE gel analysis procedure Virtual proteomics laboratory IIT-Bombay.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
Mining real world data Web data. World Wide Web Hypertext documents –Text –Links Web –billions of documents –authored by millions of diverse people –edited.
In-vitro Analysis of cytokines responses of visceral leishmaniasis and pulmonary tuberculosis patients to homologous and heterologous antigen stimulation.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
1 Automatic indexing Salton: When the assignment of content identifiers is carried out with the aid of modern computing equipment the operation becomes.
Mining the Biomedical Research Literature Ken Baclawski.
Text Mining Application Programming Chapter 1 Introduction Manu Konchady, 2006.
A literature network of human genes for high-throughput analysis of gene expression Speaker : Shih-Te, YangShih-Te, Yang Advisor : Ueng-Cheng, YangUeng-Cheng,
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Deciphering the Monocyte- Macrophage Lineage Differentiation With IPA Heikki Vuorikoski University of Turku Institute of Biomedicine Department of Anatomy.
Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
Healthy ControlCase 2D gel images of Serum protein sample Set of gel images for analysis.
Chapter 6. Inference beyond the index 2007 년 1 월 30 일 부산대학교 인공지능연구실 김민호 Text : FINDING OUT ABOUT Page. 182 ~ 251.
AB C D E F G H Supplemental Fig. 1. Figure S1. MIF induces maturation of CD11b + but not CD8α + DCs in spleen from Mif -/- mice infected with T. gondii.
UNIVERSITY UTARA MALAYSIA COLLEGE OF ARTS & SCIENCES.
An Ontology-based Automatic Semantic Annotation Approach for Patent Document Retrieval in Product Innovation Design Feng Wang, Lanfen Lin, Zhou Yang College.
Major Issues n Information is mostly online n Information is increasing available in full-text (full-content) n There is an explosion in the amount of.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Research on Knowledge Element Relation and Knowledge Service for Agricultural Literature Resource Xie nengfu; Sun wei and Zhang xuefu 3rd April 2017.
Genomics research paper presentation
Biomedical Text Mining and Its Applications
STRING Large-scale data and text mining
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Word AdHoc Network: Using Google Core Distance to extract the most relevant information Presenter : Wei-Hao Huang   Authors : Ping-I Chen, Shi-Jen.
Molecular study of two types of mutations in promoters of IL-2 and IL-10 genes in Iraqi patients with Tuberculosis Mazin S.Salman Awatif.
Skin Pharmacol Physiol 2017;30: DOI: /
Text Categorization Document classification categorizes documents into one or more classes which is useful in Information Retrieval (IR). IR is the task.
ece 627 intelligent web: ontology and beyond
Batyr Charyyev.
Web Mining Department of Computer Science and Engg.
Network biology An introduction to STRING and Cytoscape
Differential Activation of Human Keratinocytes by Leishmania Species Causing Localized or Disseminated Disease  Breanna M. Scorza, Mark A. Wacker, Kelly.
Text Mining Application Programming Chapter 1 Introduction
Presentation transcript:

MINING FOR MEANING: Data mining & Knowledge extraction Laboratory of Parasitic Diseases, NIAID

Knowledge Experiment Results Data interpretation Conclusions

Experiment Results Knowledge Conclusions High publication rates Knowledge Data interpretation

Experiment Results Knowledge Conclusions High publication rates Knowledge High-throughput screening Results Data interpretation

Experiment Results Knowledge Conclusions Knowledge Results GENE EXPRESSION PROFILING Identify relevant genes Identify expression patterns Data interpretation

Experiment Results Knowledge Conclusions Knowledge GENE EXPRESSION PROFILING FUNCTIONAL PROFILING Identify relevant genes Identify expression patterns Identify functional implications Data interpretation

FUNCTIONAL PROFILING MINING FOR MEANING:

ProtozoanBacteriavs Leishmania Mycobacterium tuberculosis Leishmania majorLeishmania donovanivs Brugia malayi Toxoplasma gondii Elutriated Human Monocytes IntracellularExtracellular vs 7 donors Infection with 5 pathogens -overnight- DC IL-4 + GM-CSF Mac M-CSF RNA pools RNA pools U95

Dataset DC genesExtraction  75 Gene expression profiling  Filter 1200 Dataset Mac

DC Pathogens Genes Fold Change (log2) Lm Ld Tg Bm 50 Bm 5 Mtb Fold Change (log2) Induced by intracellular pathogens Leishmania & TB Toxoplasma & TB Toxoplasma

FUNCTIONAL PROFILING MINING FOR MEANING: WITH GENE ONTOLOGIES - GO

FUNCTIONAL PROFILING MINING FOR MEANING: WITH GENE ONTOLOGIES - GO WITH LITERATURE ONTOLOGIES - MESH

FUNCTIONAL PROFILING MINING FOR MEANING: WITH GENE ONTOLOGIES - GO WITH LITERATURE ONTOLOGIES - MESH WITH LITERATURE ABSTRACTS

Experiment Results Knowledge Conclusions Knowledge Data interpretation 12 million references

Experiment Results Knowledge Conclusions Knowledge Results Data interpretation 12 million references

FUNCTIONAL PROFILING MINING FOR MEANING: WITH LITERATURE ABSTRACTS  Co-citation Network

FUNCTIONAL PROFILING MINING FOR MEANING: WITH LITERATURE ABSTRACTS  Co-citation Network  Natural Language Processing

FUNCTIONAL PROFILING MINING FOR MEANING: WITH LITERATURE ABSTRACTS  Co-citation Network  Natural Language Processing  L LL Literature Profiling

Select relevant terms Determine term occurrence in abstracts Retrieve relevant literature for each gene Gene A Gene B Gene C… Gene X 1.Gene - Literature indexation 3.Term filtering DiscriminationCo-occurrence Analyze functional relationships 2. Analysis of abstract contents Abstracts Term occurrences in abstracts

KNOWLEDGE 12 million references Experimental system Gene List - Translate genelists into keywords - Interpret data - Identify functional relationships

FUNCTIONAL PROFILING MINING FOR MEANING: LITERATURE MINING

Experiment Results Knowledge Conclusions High publication rates Knowledge Data interpretation

DATA MINING LITERATURE MINING MINING FOR MEANING: DOCUMENT CLUSTERING

DATA MINING LITERATURE MINING MINING FOR MEANING: DOCUMENT CLUSTERING NLP

DATA MINING LITERATURE MINING MINING FOR MEANING: DOCUMENT CLUSTERING NLP LITERATURE PROFILING

Experiment Results Knowledge Conclusions Knowledge Results GENE EXPRESSION PROFILING Identify relevant genes Identify expression patterns Data interpretation