“Pathways” to analyze microarrays Just like the Gene Ontology, the notion of a cancer signaling pathway can also serve as an organizing framework for interpreting.

Slides:



Advertisements
Similar presentations
BioPortal: A Web Repository and Services for Biomedical Ontologies and Data Resources Natasha Noy and the BioPortal team Stanford Center for Biomedical.
Advertisements

NCBO-I2B2 Collaboration Overview and Use Cases Nigam Shah
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Molecular Systems Biology 3; Article number 140; doi: /msb
Asking translational research questions using ontology enrichment analysis Nigam Shah
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
A Systematic approach to the Large-Scale Analysis of Genotype- Phenotype correlations Paul Fisher Dr. Robert Stevens Prof. Andrew Brass.
BIOINFORMATICS Ency Lee.
Distinguishing Regulators of Biomolecular Pathways Mentor: Dr. Xiwei Wu City of Hope Sean Caonguyen SoCalBSI 8/21/08.
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
THE NATIONAL CENTER FOR BIOMEDICAL ONTOLOGY Ontology-based Tools to Enhance Data Curation Trish Whetzel, PhD Outreach Coordinator December 9, 2010.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Data-intensive Computing: Case Study Area 1: Bioinformatics B. Ramamurthy 6/17/20151.
Integrating Literature and Experimental Data Fan Meng, Ph.D. Microarray Laboratory Psychiatry Department and Molecular & Behavioral Neuroscience Institute.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Biological Interpretation of Microarray Data Helen Lockstone DTC Bioinformatics Course 9 th February 2010.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
Edinburgh,UKBNCOD21 Heterogeneous Association Rules Mining Badr Al-Daihani School of Computer Science Cardiff University.
Malignant Melanoma and CDKN2A
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
Accelerating Candidate Gene Discovery through Ontological Indexing of Large Scale Data Repositories Simon Twigger, Ph.D.
Shankar Subramaniam University of California at San Diego Data to Biology.
Structural Bioinformatics R. Sowdhamini National Centre for Biological Sciences Tata Institute of Fundamental Research Bangalore, INDIA.
Ontology-based Annotation & Query of TMA data Nigam Shah Stanford Medical Informatics
Is phosphorylation site disruption associated with cancer? Maricel G. Kann (University of Maryland, Baltimore County) Matthew E. Mort (Indiana University.
Genetics-multistep tumorigenesis genomic integrity & cancer Sections from Weinberg’s ‘the biology of Cancer’ Cancer genetics and genomics Selected.
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
Using ontologies to make sense of unstructured medical data Nigam Shah, MBBS, PhD
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
PGA Workshop August 2003 Rat Genome Database an introduction Simon N. Twigger, Ph.D. Bioinformatics Research Center Medical College of Wisconsin, Milwaukee.
Ontology based analyses methods ++ develop a grammar for making productions using mf, bp, cl: –derive a higher level grammar for next level of productions.
BIOS6660 shRNAseq Gene Set Enrichment Analysis Tzu L Phang PhD Robert Stearman PhD April 16, 2014.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Mining Biological Data. Protein Enzymatic ProteinsTransport ProteinsRegulatory Proteins Storage ProteinsHormonal ProteinsReceptor Proteins.
Association of variations in I kappa B-epsilon with Graves' disease using classical and my Grid methodologies Peter Li School of Computing Science University.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Epigenetic Modifications in Crassostrea gigas Claire H. Ellis and Steven B. Roberts School of Aquatic and Fishery Sciences, University of Washington, Seattle,
Computations using pathways and networks Nigam Shah
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
A collaborative tool for sequence annotation. Contact:
Bioinformatics and Computational Biology
A literature network of human genes for high-throughput analysis of gene expression Speaker : Shih-Te, YangShih-Te, Yang Advisor : Ueng-Cheng, YangUeng-Cheng,
GO based data analysis Iowa State Workshop 11 June 2009.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Mapping to Ontologies Nigam Shah
DAVID Bioinformatics Web Site 2012 – 2015 David Huang, MD LMS/CCR/NCI
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Supporting Collaborative Ontology Development in Protégé International Semantic Web Conference 2008 Tania Tudorache, Natalya F. Noy, Mark A. Musen Stanford.
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Practice:submit the ChIP_Streamline.pbs 1.Replace with your 2.Make sure the.fastq files are in your GMS6014 directory.
Ontology Web Services from the National Center for Biomedical Ontology Mark Musen and Nigam Shah {musen,
Using NCBO Web services
Collaborating with the National Center for Biomedical Ontology
By Michael Fraczek and Caden Boyer
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Schedule for the Afternoon
Presentation transcript:

“Pathways” to analyze microarrays Just like the Gene Ontology, the notion of a cancer signaling pathway can also serve as an organizing framework for interpreting microarray expression data. On examining a relatively small set of genes based on prior biological knowledge about a given pathway, the analysis becomes more specific.

Reactome’s sky painter (demo)

Recap: How do ontologies help? An ontology provides a organizing framework for creating “abstractions” of the high throughput (or large amount of) data The simplest ontologies (i.e. terminologies, controlled vocabularies) provide the most bang- for-the-buck Gene Ontology (GO) is the prime example More structured ontologies – such as those that represent pathways and higher order biological concepts – still have to demonstrate real utility.

Going beyond GO annotations

Different kinds of annotations ELMO1 expression is altered by mechanical stimuli : : Other experiments : : ELMO1 associated_with actin cytoskeleton organization and biogenesis Expression profiling of cultured bladder smooth muscle cells subjected to repetitive mechanical stimulation for 4 hours. Chronic overdistension results in bladder wall thickening, associated with loss of muscle contractility. Results identify genes whose expression is altered by mechanical stimuli. 7 Chronic Bladder Overdistension Low level result summary result annotation metadata Assertions Tags

Annotator: The Basic Idea Process textual metadata to automatically tag text with as many ontology terms as possible.

Annotator: Annotator: Give your text as input Select your parameters Get your results… in text or XML

Annotator: workflow “Melanoma is a malignant tumor of melanocytes which are found predominantly in skin but also in the bowel and the eye”. – NCI/C , Melanocyte in NCI Thesaurus – 39228/DOID:1909, Melanoma in Human Disease Transitive closure – 39228/DOID:191, Melanocytic neoplasm, direct parent of Melanoma in Human Disease – 39228/DOID: , cell proliferation disease, grand parent of Melanoma in Human Disease

Code Word Add-in to call the Annotator Service ? Word Add-in to call the Annotator Service ? Annotator service Multiple ways to access Specific UI Excel UIMA platform

Use-cases based on automated annotation

Tm2d1 RGD Svs4 Hbb Scgb2a1 Alb + Hbb is_expressed_in rat kidney Tm2d1 is_expressed_in rat kidney Human (U133, U133v2.), Mouse (430, U74, U95) and Rat (U34a/b/c, 230, 230v2) 62,000 samples x ca. 25,000 genes/sample = 1.5B data points Linking annotations to data (by Simon Twigger)

Ontology based annotation 20 diseases AMIA-TBI, Year in review

Mutation Profiling Matthew Mort, Uday S. Evani, … Nigam H. Shah … Sean D. Mooney In Silico Functional Profiling of Human Disease-Associated and Polymorphic Amino Acid Substitutions. Human Mutation, in press AMIA-TBI, Year in review

Resources index: The Basic Idea The index can be used for: Search Data mining

Resources index: Example

CodeResource Tab Resources annotated = 20 Total records = 1.3 million Direct annotations = 371 million After transitive closure = 5.3 Billion Custom UI (alpha)

Disease card

Data mining: Drug, Disease, Gene relationships Example: p(salmeterol | Asthma, ADRB2) = 0.07 p(salbutamol | Asthma, ADRB2) = 0.16 At best these are pointers to hypotheses: Stronger biomarker? More reported side effects? Simple recency? Many interpretations are possible!

An Ontology Neutral analysis tool Accepted at AMIA Annual Symposium 2010

Use-1: Subnetwork Analysis Schadt et al, PLoS Biology, May 2008 Mapping the Genetic Architecture of Gene Expression in Human Liver

Use-2: Patient cohort analysis Extended criteria kidney transplant Standard criteria Kidney transplant P (A | B, C …)

DIY Ontology Enrichment Analysis Live Demo

Cfl1 Cofilin is a widely distributed intracellular actin-modulating protein that binds and depolymerizes filamentous F-actin and inhibits the polymerization of monomeric G-actin in a pH- dependent manner. It is involved in the translocation of actin- cofilin complex from cytoplasm to nucleus. … The sequence variation of human CFL1 gene is a genetic modifier for spina bifida risk in California population G-n Some text … : Cfl1 spina bifida G-n Some disease condition : Cfl1 spina bifida G-n Some disease condition : /

THE END

Ontology services Accessing, browsing, searching and traversing ontologies in Your application

30

CodeSpecific UI

References 1.P Khatri, S Draghici: Ontological analysis of gene expression data: current tools, limitations, and open problems. Bioinformatics 2005, 21: NH Shah, NV Fedoroff: CLENCH: a program for calculating Cluster ENriCHment using the Gene Ontology. Bioinformatics 2004, 20: DL Gold, KR Coombes, J Wang, B Mallick: Enrichment analysis in high-throughput genomics--accounting for dependency in the NULL. Brief Bioinform P Glenisson, B Coessens, S Van Vooren, J Mathys, Y Moreau, B De Moor: TXTGate: profiling gene groups with text- based information. Genome Biol 2004, 5:R43. 5.S Myhre, H Tveit, T Mollestad, A Laegreid: Additional gene ontology structure for improved biological reasoning. Bioinformatics 2006, 22: A Subramanian, P Tamayo, VK Mootha, S Mukherjee, BL Ebert, MA Gillette, A Paulovich, SL Pomeroy, TR Golub, ES Lander, et al: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005, 102: Jonquet CM, Musen MA and Shah NH: Building a Biomedical Ontology Recommender Web Service. Journal of Biomedical Semantics, 2010 Jun 22;1 Suppl 1:S1. 8.Evani US, Krishnan VG, Kamati KK, Baenziger PH, Bagchi A, Peters BJ, Sathyesh R, Li B, Sun Y, Xue B, Shah NH, Kann MG, Cooper DN, Radivojac P and Mooney SD: In Silico Functional Profiling of Human Disease-Associated and Polymorphic Amino Acid Substitutions. Hum Mutat Jan 5;31(3): Shah NH, Bhatia N, Jonquet CM, Rubin DL, Chiang AP and Musen MA: Comparison of Concept Recognizers for building the Open Biomedical Annotator. BMC Bioinformatics 2009, 10(Suppl 9):S14 10.Noy NF, Shah NH, Whetzel PL, Dai B, Dorf M, Griffith N, Jonquet CM, Rubin DL, Storey MA, Chute CG, Musen MA: BioPortal: ontologies and integrated data resources at the click of a mouse. Nucleic Acids Res Jul 1; 37(Web Server issue):W Shah NH, Jonquet CM, Chiang AP, Butte AJ, Chen R and Musen MA: Ontology-driven Indexing of Public Datasets for Translational Bioinformatics. BMC Bioinformatics 2009, 10(Suppl 2):S1 12.Rob Tirrell, Uday Evani, Ari E. Berman, Sean D. Mooney, Mark A. Musen and Nigam H. Shah: An Ontology-Neutral Framework for Enrichment Analysis. AMIA Annu Symp Proc in press