GRANITE: A Tool to Generate Gene Relational Networks Jahangheer Shaik, Ph.D. Department of Pathology and Immunology, Washington University School of Medicine.

Slides:



Advertisements
Similar presentations
EGAN Tutorial: Loading Network Data October, 2009 Jesse Paquette UCSF Helen Diller Family Comprehensive Cancer Center
Advertisements

Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
Pathways analysis Iowa State Workshop 11 June 2009.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
Correlation Aware Feature Selection Annalisa Barla Cesare Furlanello Giuseppe Jurman Stefano Merler Silvano Paoli Berlin – 8/10/2005.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
Gene regulation in cancer 11/14/07. Overview The hallmark of cancer is uncontrolled cell proliferation. Oncogenes code for proteins that help to regulate.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Data-intensive Computing: Case Study Area 1: Bioinformatics B. Ramamurthy 6/17/20151.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Literature Mining Tools for Analysis of Genomic Data Ramin Homayouni, Ph.D. Associate Professor of Biology Director of Bioinformatics UTHSC BINF April.
Class Projects. Future Work and Possible Project Topic in Gene Regulatory network Learning from multiple data sources; Learning causality in Motifs; Learning.
 MicroRNAs (miRNAs) are a class of small RNA molecules, about ~21 nucleotide (nt) long.  MicroRNA are small non coding RNAs (ncRNAs) that regulate.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Pathway Informatics 6 th July, 2015 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University of.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
1 SRI International Bioinformatics Advanced PGDB Editing: Regulation GO Terms Ingrid M. Keseler Bioinformatics Research Group SRI International
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery l Cellular Overview.
Radiogenomics in glioblastoma multiforme
Copyright OpenHelix. No use or reproduction without express written consent1.
XIP™ – the eXtensible Imaging Platform A rapid application development and deployment platform Lawrence Tarbox, Ph.D. September, 2010.
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Copyright OpenHelix. No use or reproduction without express written consent1.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Networks and Interactions Boo Virk v1.0.
Finish up array applications Move on to proteomics Protein microarrays.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
Text Mining Special Interest Group Stuart Murray, Wyeth Research Novartis Institute for Biomedical Research, Cambridge, MA 6-8 th October 2004.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
GeWorkbench Highlights caBIG ® Molecular Analysis Tools Knowledge Center AACR Annual Meeting, April 3, 2011.
1 of 38 Data Mining in Ensembl with BioMart. 2 of 38 Simple Text-based Search Engine.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Journal Club Meeting Sept 13, 2010 Tejaswini Narayanan.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
A curated database of biological pathways.
Copyright OpenHelix. No use or reproduction without express written consent1.
Cool BaRC Web Tools Prat Thiru. BaRC Web Tools We have.
A literature network of human genes for high-throughput analysis of gene expression Speaker : Shih-Te, YangShih-Te, Yang Advisor : Ueng-Cheng, YangUeng-Cheng,
1 Bioinformatics at Norwegian University of Science and Technology Professor Finn Drabløs Department of Cancer Research and Molecular Medicine Finn Drabløs.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Welcome to Gramene’s RiceCyc (Pathways) Tutorial RiceCyc allows biochemical pathways to be analyzed and visualized. This tutorial has been developed for.
CSC, Dec.15-16,2005. Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan.
Canadian Bioinformatics Workshops
GeneConnect Use Cases and Design August 3, GeneConnect Database IDs are linked by Direct Annotation, Inferred Annotation, or Sequence Alignment.
Department of Pathology UC Davis School of Medicine Jeff Gregg, M.D. The Development of an Informatics Platform for the Characterization of Clinical Samples.
Selection of Resources for the Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist.
Ingenuity Pathway Analysis Alex Pico. Description "IPA is a software application that enables researchers to analyze and understand the complex biological.
Development of an interactive pipeline for Genome wide association analysis Falola Damilare & Adigun Taiwo – Covenant University Bioinformatics research.
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
Pathway Visualization
Genomes and Their Evolution
Nicole Anguiano and Anu Varshneya
Volume 58, Issue 4, Pages (May 2015)
Interactome Networks and Human Disease
Presentation transcript:

GRANITE: A Tool to Generate Gene Relational Networks Jahangheer Shaik, Ph.D. Department of Pathology and Immunology, Washington University School of Medicine

Outline What is the need of Network analysis tool? What kind of data is available? Currently available tools Why GRANITE? GRANITE Architecture Web-based Interface Analytical Service Future Directions

Need for Network Analysis Diseases such as cancers are quite complex and involve genes and gene products operating at multiple levels of genetic program Advent of profiling technologies such as microarrays, ChIP-ChIP and ChIP-seq, genome wide analysis studies are now possible Gene Relational Networks (GRNs) offer a consolidated picture of functional relationships among genes

Available Data Pathway Databases Protein Interaction Databases Transcription factor-target interaction databases Gene Ontology PubMed Microarray Data

Available Tools TF-Target – PathBLAST, promoter analysis pipeline (PAP), Promoter Analysis and Interaction Network Generation Tool (PAINT), GeneVis Microarray Data – COEXPRESSdb, Simulated Annealing to Realize GEnetic networks (SARGE) GO Tree Machine (GOTM) Literature – GeneIndexer, PubGene Protein Interaction Data – Prediction Of INTeractome database (POINT), Agile Protein Interaction Data analyzer (APID), Protein Interaction Network Inference Tool (PIN-IT) Commercial – Ingenuity pathway analysis – GeneGO

Why GRANITE? GRANITE makes use of information from Pathway, Protein interaction, TranscriptionFactor-target, Gene Ontology, Literature and Microarray databases to draw rich gene relational networks GRANITE provides several advanced options for users to customize networks For advanced users, GRANITE also provides programmatic access to the tool AND ITS FREE!

Functional Overview Functional overview of GRANITE- Input Page accepts identifiers in several different formats such as Entrez Gene ID,Gene Symbol, Ensembl Gene ID, Unigene Cluster ID, Ensembl Transcript ID, Ensembl Peptide ID, Refseq ID and UniprotKb Primary Accession. The input identifiers are internally mapped to Entrez Gene ID to perform network analysis. Network Options page provides two different options- i) Regular Network and ii) Minimal Network. Regular Network connects all the input elements using all the databases whereas Minimal Network minimally connects the elements (see Functional Overview). Network Drawing page renders the interactions on a java applet using a Cytoscape plugin. The interaction information may be saved as an image or delimited text. 'Advanced Options' provide multiple options for user to customize networks- The user can pick databases using 'Choose Databases', set up appropriate thresholds using 'Choose Thresholds', create sub-network using 'Choose Subnetwork', select relevant pathways using 'Choose Pathways' or select appropriate GO terms using 'Choose GO Terms'. 'Other Options' provide the related information such as diseases associated with elements in the network using 'Get Phenotypes', other elements that might be interacting with elements in the network using 'Expand Network' or publications associated with input elements using 'Bibliography' features.

Granite Workflow GRANITE Work Flow showing order in which different databases from GRAND are queried. 'Regular Network' option provides the user with interactions corresponding to input elements from all the databases. When 'Minimal Network' option is chosen, GRANITE first interrogates pathway databases and protein interaction databases to establish connections among the genes. Other databases are queried in order shown until all elements are connected. Once all the elements are connected, GRANITE does not query rest of the databases. When 'Regular Network' option is chosen, GRANITE displays all interactions from all the databases and therefore, this order is not significant.

Access Granite test version at:

Analytical Service caBIG is equipped with caGrid infrastructure that leverages Globus tool kit as a grid middleware component to provide grid environment. A service may be defined as a wrapper around an analysis algorithm or a database which is exposed on the grid meeting caGrid standards. A common way to generate these services is to employ caGrid provided open source interactive development environment (IDE) tool called introduce The core logic of GRANITE is implemented in the service skeleton generated by introduce and analysis methods are exposed through the Grid to provide programmatic access to the tool.

Case Study1: Chemokine network Case Study 1- Chemokine Network a) Minimal network generated by using chemokines and their receptors as input to GRANITE. Specific interactions (highlighted by box) correspond to receptors that bind specifically to ligands whereas unshaded region shows chemokine receptors that bind to multiple ligands. GRANITE was successful in identifying chemokine receptors that bind specifically and non-specifically. b) Regular network generated by using only chemokine receptors as input. c) Network of receptors and ligands generated using 'Expand Network' feature of GRANITE. This demonstrates that GRANITE was able to identify chemokines that bind to the receptors even without the chemokines being presented as input.

Breast Cancer Network Case Study 2- Breast cancer network of 60 known breast cancer genes created using GRANITE. KRAS and PIK3CA in Sub-network 1 have somatic mutations in breast cancers. The transcription factor TP53 in sub-network2 is a tumor protein coding gene, which is a key regulator of transcription factors in sub-networks 3 and 5 respectively. CYP1A1, CYP19A1, CYP1B1 and ESR1 in sub-network 4 are associated with estrogen receptor positive breast cancers and BRCA1 is associated with breast ovarian cancer in females. The interactions between HFE and TFRC in sub-network 4 are associated with high grade neoplastic cancers including breast cancer. AR in sub-network 5 is associated with breast cancers in male. Genetic polymorphisms of NAT1 and NAT2 in sub-network 6 modulate breast cancers in the presence of GSTM1 and GSTT1 genotypes.

Human Viral Gene Network Case Study 3- Human Viral Gene Network of 43 genes from Obayashi et. al : a) Correlation network by Obayashi et. al. showing 5 sub- networks obtained by using COEXPRESSdb b) Correlation network using GRANITE showing the same 5 sub-networks c) Reference generated by finding interactions from publicly available databases and selecting 331 interactions that are confirmed by at least 2 databases. Venn Diagrams Comparing: i) Interactions from Obayashi et. al. Vs reference- 23 (7%) out of 67 interactions from Obayashi et. al. are confirmed by Ground truth, ii) Correlation Interactions from GRANITE Vs. reference- 69 (21%) out of 158 interactions from GRANITE are confirmed by reference iii) Interactions from Obayashi et. al. vs GRANITE correlation network- 19 (12%) interactions are common to both the networks. New reference is generated by selecting 644 interactions from public databases confirmed by at least one database. Iv) New reference vs. Obayashi et. al's correlation network-38(6%) interactions are confirmed by new ground truth. viii) New reference vs. GRANITE correlation network-91(41%) interactions are confirmed by new ground truth.

Work under Progress- NLP based Mining PubMed is a service of the U.S. National Library of Medicine that includes over 19 million citations from MEDLINEU.S. National Library of Medicine MedLine adds more than abstracts each month! Why PubMed? 80% of biological knowledge is only in research papers

Things to do GRANITE currently does not include micro RNA (miRNA) regulatory information. Information related to miRNA regulation is available from multiple databases (Griffiths 2006; Papadopoulos, Reczko et al. 2009; Wang 2009) and will be included in GRANITE in future GRANITE currently shows correlation information by using human chips. The correlation information extracted using mouse microarray chips will also be included in GRANITE