How can you benefit from the Bioinformatics Resource?
Can (John) Bruce, Ph.D., Associate Director, Bioinformatics Resource, Keck Biotechnology Laboratory

The Bioinformatics Core
Created within the Keck Lab at the request of the Yale School of Medicine, July.
Director: Hongyu Zhao, Ph.D.; Associate Directors: Can Bruce, Ph.D., and Yong Kong, Ph.D.
The facility is located at Sterling Hall of Medicine.
Commercial software packages are provided free of charge by the Core and are available to Yale researchers 24/7.

Services
Access to a large number of widely used commercial and open source bioinformatics programs.
Fee-based consultation services for well-defined bioinformatics analyses.
Collaborative projects requiring a longer-term commitment of time and effort.

Available programs
DNA/protein sequence analysis: Lasergene and Gene Construction Kit.
Pathway analysis: Ingenuity Pathway Analysis and MetaCore.
Protein structure modeling and visualization: Sybyl.
Mass spectrometry data analysis: GPMAW.
Pipelining programs: Pipeline Pilot and VIBE.

Examples of Current Collaborations
Pathway analysis on proteomics data (Yale/NIDA Proteomics Center Project and Yale/NHLBI Proteomics Center Project investigators).
Development of an algorithm for identification of phosphorylation sites from tandem spectrometry data (E. Gulcicek in Keck Proteomics).
Molecular modeling of MAP kinase ligand interactions (B. Turk in Pharmacology).
Sequence analysis for defining an invention claim for the Office of Collaborative Research.

Microarray analysis software
GeneSpring GX provides visualization and advanced statistical analysis for gene expression data.
Partek Genomics Suite provides advanced statistics and interactive data visualization designed for gene expression analysis, exon expression analysis, promoter tiling array analysis, chromosomal copy number analysis, and SNP analysis.

Sequence Analysis Software
DNASTAR Lasergene: a comprehensive suite of programs for analysis of DNA/RNA/protein sequences, including sequence editing, sequence assembly, sequence alignment, primer design, protein structure prediction, and gene detection and annotation.
Gene Construction Kit 2.5: a tool for designing, drawing, and annotating DNA sequences, especially plasmid constructs.

PIPELINING PROGRAMS
This Pipeline Pilot pipeline takes a Swiss-Prot sequence from a Web portal and generates a results page with four tabs: summary data, a sequence feature map, chemical structures of substrates, and BLAST results.

PATHWAY ANALYSIS
MetaCore (from GeneGo) and Ingenuity Pathways Analysis 3.1 (from Ingenuity Systems).
Both are integrated software suites for functional analysis. Each is based on a proprietary, manually curated database of human protein-protein, protein-DNA, and protein-compound interactions, metabolic and signaling pathways, and the effects of bioactive molecules.
MetaCore can be integrated with other software packages such as GeneSpring, Resolver, Expressionist, Pipeline Pilot, EndNote, and Cytoscape.
Ingenuity can be integrated with GeneSpring, Partek Genomics Suite, SAS JMP Genomics, and Spotfire.

Why Pathway Analysis?

Pathway Creation Algorithms in MetaCore (1)

Direct Interactions Algorithm
Draws direct interactions between selected objects. No additional objects are added to the network.
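The direct-interactions idea can be illustrated with a minimal sketch: keep only the interactions whose two endpoints are both in the selected set, so no new objects enter the network. The function name `direct_interactions` and the toy interaction list are hypothetical and are not taken from MetaCore or its database.

```python
def direct_interactions(interactions, selected):
    """Return only the edges whose source and target are both selected."""
    selected = set(selected)
    return [(src, dst) for src, dst in interactions
            if src in selected and dst in selected]


# Hypothetical toy interactions as (source, target) pairs.
interactions = [
    ("EGF", "EGFR"),
    ("EGFR", "STAT3"),
    ("STAT3", "MYC"),
    ("EGF", "ErbB2"),
]

# MYC and ErbB2 are not selected, so edges touching them are dropped.
network = direct_interactions(interactions, ["EGF", "EGFR", "STAT3"])
print(network)
```

Because the filter never adds nodes, the resulting network can be disconnected or even empty when the selected objects do not interact directly.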

Self-regulatory Networks
Finds the shortest directed paths containing transcription factors between the genes in your gene list. (Best used for a small number of targets.)
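One possible reading of "shortest directed paths containing transcription factors" is sketched here as a breadth-first search that tracks whether a transcription factor has been visited yet. This is an assumption about the behavior, not MetaCore's documented implementation; the function `shortest_path_via_tf`, the adjacency-dict graph format, and the toy node names are all hypothetical. Strictly speaking, the search returns a shortest directed walk, which may revisit a node in order to pass through a TF.

```python
from collections import deque


def shortest_path_via_tf(graph, source, target, tfs):
    """Shortest directed walk from source to target visiting at least one TF.

    graph maps each node to a list of its downstream neighbors.
    Returns the walk as a list of nodes, or None if no such walk exists.
    """
    start = (source, source in tfs)      # state = (node, seen_a_tf_so_far)
    parent = {start: None}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        node, seen_tf = state
        if node == target and seen_tf:
            path = []
            while state is not None:     # walk back through parent links
                path.append(state[0])
                state = parent[state]
            return path[::-1]
        for nxt in graph.get(node, ()):
            nxt_state = (nxt, seen_tf or nxt in tfs)
            if nxt_state not in parent:
                parent[nxt_state] = state
                queue.append(nxt_state)
    return None


# Hypothetical toy network: A reaches B directly and also through TF1.
graph = {"A": ["B", "TF1"], "TF1": ["B"], "B": []}
path = shortest_path_via_tf(graph, "A", "B", {"TF1"})
print(path)
```

The direct edge from A to B is shorter but contains no transcription factor, so the search returns the route through TF1 instead.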

Expand by one (not suitable for large collections of targets)

Auto expand
Draws sub-networks around the selected objects, stopping the expansion when the sub-networks intersect.

Pathway Creation Algorithms in MetaCore (2)
Analyze Network: creates a list of possible networks, ranked according to how many objects in the network correspond to the user's list of genes, how many nodes are in the network, and how many nodes are in each smaller network.
Analyze Transcription Network: similar to the above, but the sub-networks created are centered on TFs.
Analyze Networks (Transcription Factors): focuses on the presence of TFs at end nodes.
Analyze Networks (Receptors): focuses on the presence of receptors at the end point of a network.

Analyze Network Algorithm
Generates sub-networks highly saturated with selected objects. Sub-networks are ranked by a P-value and G-Score and interpreted in terms of Gene Ontology.
Example: a proteomics experiment on the effect of drug infusion on plasma proteins (P<1e-18).
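The slide does not say how the P-value is computed. A common choice for this kind of "saturation" score is the hypergeometric tail probability, sketched below as an assumption rather than as MetaCore's actual formula. The function name `hypergeom_pvalue` and all the numbers are hypothetical; `math.comb` requires Python 3.8 or later.

```python
from math import comb


def hypergeom_pvalue(total, marked, drawn, hits):
    """P(X >= hits) under a hypergeometric model.

    total  : number of objects in the whole database
    marked : how many of those are in the user's list
    drawn  : number of nodes in the sub-network
    hits   : sub-network nodes that are in the user's list
    """
    upper = min(marked, drawn)
    tail = sum(comb(marked, k) * comb(total - marked, drawn - k)
               for k in range(hits, upper + 1))
    return tail / comb(total, drawn)


# Hypothetical numbers: 20,000 objects, 300 in the list, and a
# 50-node sub-network that contains 15 of the listed objects.
p = hypergeom_pvalue(20000, 300, 50, 15)
print(f"P = {p:.3g}")
```

A small value means that a sub-network this rich in listed objects is unlikely to arise from a random draw of the same size.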

Analyze Networks (Transcription Factors) Algorithm: an example
Favors network construction where the end nodes of transcriptionally regulated pathways are present in the original gene list.
Example from an mRNA expression analysis data set comparing healthy and lesional skin (P=7.2e-46).

Analyze Network (Receptors) Algorithm: an example
Favors network construction where the end point of a pathway leads to a receptor (through “receptor binding”) and the starting point of the pathway (a transcription factor, a ligand, etc.) is present in the original gene list, regardless of whether the end-point receptor is in the list.

Transcription Regulation Algorithm
Generates sub-networks centered on transcription factors. Sub-networks are ranked by a P-value and interpreted in terms of Gene Ontology.
Example: 13 targets/14 nodes, P=7.3e-31.

Immune response: Histamine H1 receptor signaling in immune response (p=1e-4)

GeneGo process networks

WNT signaling (p=1e-5)

Disease biomarker enrichment

Network-disease associations
1) Carcinoma (72% coverage, p=3.3e-10)
2) Neoplasms, connective and soft tissue (42% coverage, p=8e-10)

Use of Pathway Analysis in Candidate Gene Identification
1061 genes are located in the mapped region for the disease.
360 genes are up- or down-regulated by >2x.
17 receptor ligand genes are important “input” nodes to pathways formed by genes with changed expression: FGF2, WNT5A, Tenascin-C, EGF, ILI1RN, BDNF, TGF-beta2, FGF2, OSF-2, CSPG4(NG2), IL-8, ENA-78, GCP2, SLIT2, SLIT3, Activin beta A, Annexin I.
Diagram label: other up- or down-regulated genes.

Pathway analysis narrows down the number of candidate genes for the disease
These genes, from the mapped region of interest, are able to form interaction pathways going through the receptor ligands identified by the first analysis.
Diagram labels: ErbB2, PECAM1, DDX5, BCAS3, microRNA1, RARalpha, MUL, VHR, WIP, ErbB2, NIK, Plakoglobin, HEXIM1, Prohibitin, STAT5A, STAT3, Clathrin, PSME3, PSMC5, ErbB2; FGF2, ILI1RN, ErbB2; 360 genes up- or down-regulated by >2x; other up- or down-regulated genes.
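The narrowing-down logic can be sketched in two steps: first keep genes whose expression changed by more than 2x in either direction, then keep genes from the mapped region that interact with the receptor ligands found in the first analysis. This is a simplified illustration under stated assumptions. It considers direct interactions only, whereas the slides describe multi-step pathways, and every gene name, ratio, and function name below is hypothetical.

```python
def changed_genes(fold_changes, threshold=2.0):
    """Genes up- or down-regulated by more than `threshold`-fold.

    fold_changes maps gene -> expression ratio (treated / control, > 0).
    """
    return {gene for gene, ratio in fold_changes.items()
            if ratio > threshold or ratio < 1.0 / threshold}


def candidates_linked_to(ligands, mapped_region, interactions):
    """Mapped-region genes that directly interact with any of the ligands."""
    ligands, mapped = set(ligands), set(mapped_region)
    linked = set()
    for a, b in interactions:
        if a in ligands and b in mapped:
            linked.add(b)
        if b in ligands and a in mapped:
            linked.add(a)
    return sorted(linked)


# Hypothetical expression ratios and interactions.
fold_changes = {"LIG1": 3.5, "GENE_A": 0.4, "GENE_B": 1.2}
changed = changed_genes(fold_changes)

interactions = [("LIG1", "REGION_1"), ("REGION_2", "GENE_B")]
hits = candidates_linked_to({"LIG1"}, ["REGION_1", "REGION_2"], interactions)
print(sorted(changed), hits)
```

In this toy data only REGION_1 survives, because REGION_2 interacts with a gene that is neither changed nor a ligand.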

A caveat
Not every gene belongs to a pathway in the database…

Why Pathway Analysis Software?
A learning tool
- Study a group of gene products.
A data analysis tool
- Which pathways are particularly affected?
- What disease has similar biomarkers?
A hypothesis generation tool
- Can provide insight into the mechanism of regulation of your genes. Which is the likely causative agent for the observed changes? What is likely to happen as a result of these changes?
- Suggests effects of gene knock-ins or knock-outs.
- Suggests side effects of drugs.
- Can highlight new phenomena that need further investigation. What does the program not explain?

Thank you.