Elucidating effects of nerve injury on gene expression using

Presentation transcript:

Elucidating effects of nerve injury on gene expression using
Bio-Ontologies SIG – July 8 2016
Alison Callahan, Matthew C. Danzi, Giulia Zunino, Daniel J. Cooper, Nigam H. Shah, Ubbo Visser, John L. Bixby, and Vance P. Lemmon

Spinal cord injury is a significant burden on individuals and the U.S. healthcare system
~12,500 new SCI cases each year in the U.S.
~276,000 total individuals affected in 2014
Average expenses in the first year after injury range from $350K to >$1M, depending on severity

There are no effective therapies for spinal cord injury. Is this because experiments are not replicable?
Steward et al. 2012. Replication and reproducibility in spinal cord injury research. Experimental Neurology.
Prinz et al. 2011. Believe it or not: how much can we rely on published data on potential drug targets? Nature Reviews Drug Discovery.
Mechanisms of injury image: Ruff and Fehlings. 2010. Neural stem cells in regenerative medicine: bridging the gap. Panminerva Medica 52(2):125-147.

Aggregating and linking data across studies and experiment types is managed by individual scientists
Sejnowski et al. 2014. Putting big data to good use in neuroscience. Nature Neuroscience.

We have lots of data at our disposal

Goal: Structure and integrate data relevant to SCI research

The RegenBase ontology defines classes and properties specific to the SCI research domain
435 classes
18 object properties
8 data properties
Mappings to FMA and MPO based on lexical match of class labels
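
A minimal sketch of what a lexical match of class labels could look like. This is an illustration under assumptions, not the RegenBase mapping code: the normalization rule (lower-casing and collapsing punctuation) and the `rb:` / `ext:` identifiers are hypothetical.

```python
import re
from typing import Dict, List, Tuple


def normalize(label: str) -> str:
    """Lower-case a label and collapse punctuation and whitespace."""
    return " ".join(re.sub(r"[^a-z0-9]+", " ", label.lower()).split())


def lexical_matches(
    source: Dict[str, str], target: Dict[str, str]
) -> List[Tuple[str, str]]:
    """Pair class IDs whose normalized labels are identical."""
    # Index the target ontology by normalized label for constant-time lookup.
    index: Dict[str, List[str]] = {}
    for target_id, target_label in target.items():
        index.setdefault(normalize(target_label), []).append(target_id)

    pairs = []
    for source_id, source_label in source.items():
        for target_id in index.get(normalize(source_label), []):
            pairs.append((source_id, target_id))
    return pairs


# Hypothetical class IDs and labels, for illustration only.
regenbase_classes = {"rb:A": "Spinal Cord", "rb:B": "Dorsal root ganglion"}
external_classes = {
    "ext:1": "spinal cord",
    "ext:2": "Dorsal-root ganglion",
    "ext:3": "sciatic nerve",
}
mappings = lexical_matches(regenbase_classes, external_classes)
```

Exact matching on normalized labels is deliberately conservative: it misses synonyms, but every pair it returns can be checked by eye.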

Getting information from the literature into RegenBase
SCI + regeneration related publications
Expert curation + RDF conversion pipeline + entity identifier mapping

MIASCI Online
A tool for SCI researchers to curate publications
11 major sections: investigator, organism, surgery, perturbagen, cell transplantation, biomaterials, histology, immunohistochemistry, imaging, behavior, and data analysis and statistics

Literature-sourced data model

Getting assay data into RegenBase
Raw assay data: kinase activity assays, neurite outgrowth assays
Data processing + RDF conversion + entity identifier mapping

Assay data model

Getting gene expression data into RegenBase
Raw RNA-seq or microarray data
Data processing + RDF conversion + entity identifier mapping
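
The RDF conversion and identifier mapping step can be sketched roughly as follows. The Bio2RDF URI pattern (`http://bio2rdf.org/<namespace>:<identifier>`) is real; everything under `http://example.org/` is a hypothetical placeholder rather than the RegenBase vocabulary, and the gene identifier `12345` is made up.

```python
from typing import Dict, List

BIO2RDF = "http://bio2rdf.org/"
# Hypothetical namespace for this sketch; not the RegenBase vocabulary.
EX = "http://example.org/regenbase-sketch/"
XSD = "http://www.w3.org/2001/XMLSchema#"


def bio2rdf_uri(namespace: str, identifier: str) -> str:
    """Build a Bio2RDF-style URI of the form namespace:identifier."""
    return f"{BIO2RDF}{namespace}:{identifier}"


def to_ntriples(record: Dict[str, object]) -> List[str]:
    """Convert one processed expression measurement into N-Triples lines."""
    subject = f"<{EX}measurement/{record['id']}>"
    gene = f"<{bio2rdf_uri('ncbigene', str(record['gene_id']))}>"
    fold_change = record["fold_change"]
    p_value = record["p_value"]
    hours = record["hours"]
    return [
        f"{subject} <{EX}gene> {gene} .",
        f'{subject} <{EX}foldChange> "{fold_change}"^^<{XSD}double> .',
        f'{subject} <{EX}pValue> "{p_value}"^^<{XSD}double> .',
        f'{subject} <{EX}hoursAfterInjury> "{hours}"^^<{XSD}integer> .',
    ]


# The measurement values are taken from the use case #1 table;
# the record ID and gene ID are invented for illustration.
record = {
    "id": "m1",
    "gene_id": "12345",
    "fold_change": -0.744789,
    "p_value": 0.00178355,
    "hours": 1,
}
triples = to_ntriples(record)
```

Because the gene is identified by a shared URI pattern instead of a local label, any other dataset that uses the same pattern can be joined to these triples without a separate mapping table.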

Gene expression data model

RegenBase content
Literature-sourced data: ~20,000 statements from 42 publications
Kinase activity data: effect of ~52,000 compounds on 476 kinases
Neurite outgrowth data: effect of ~1600 compounds on neurite outgrowth
Gene expression data: changes in gene expression after injury in rats and mice for > 40,000 genes and gene probes
Callahan et al. 2016. RegenBase: a knowledge base of spinal cord injury biology for translational research. Database (Oxford) 2016: baw040.

Gene expression data model is motivated by 3 use cases
Image credits:
- Protein - Thomas Splettstoesser (www.scistyle.com) https://en.wikipedia.org/wiki/Protein_domain#/media/File:Pyruvate_kinase_protein_domains.png
- Sequence homology - Thomas Shafee https://upload.wikimedia.org/wikipedia/commons/b/b5/Histone_Alignment.png
- Gene expression - http://bmccellbiol.biomedcentral.com/articles/10.1186/1471-2121-11-7

Use case #1: What genes significantly differentially expressed in DRGs after a peripheral nerve injury have a protein product with an RNA-recognition motif?

Symbol   Fold change   P-value      Time (hours)
A1cf     -0.744789     0.00178355   1
         -0.544549     0.0147546    3
         -0.449241     0.0453885    24
         -0.532934     0.0173952    28
Acin1    -0.881353     0.0245572    72
Cirbp    1.23386       0.0115497
         1.13581       0.0184748    8
         1.40815       0.0202129    12
Cpsf6    1.03969       0.00164697
Cpsf7    -0.840796     0.00846955

Use case #2: Does the mouse gene CALM2 have any rat gene orthologues that are significantly differentially expressed in DRG neurons after a peripheral nerve injury?

Use case #3: What genes are differentially regulated at the early time points after injury, but then move back toward their homeostatic levels (or even go in the opposite direction) later?

Gene Symbol   1st Fold Change   1st P-value   2nd Fold Change   2nd P-value
Pdpk1         5.59              4.56E-10      4.39              7.57E-08
Cdh22         5.26              2.85E-06      -0.056            0.999
Tcf4          4.52              9.24E-05      -0.088
Flrt3         4.51              1.07E-13      2.86              3.17E-08
Il6           4.42              1.42E-13      3.08              4.25E-09
Cacna2d1      4.35              1.73E-08      3.80              2.14E-07
Kcna1         4.33              0.000425      0.00793
Ap3s1         4.32              2.02E-09      4.13              3.70E-09
Gnb1          4.05              0.000169      3.56              0.000608
Gda           3.98              0.000372      3.85              0.000451
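
One way to express the use case #3 selection rule, once the two time points are in hand, is sketched below. The thresholds (`alpha` and `shrink`) and the rule itself are assumptions made for illustration, not the criteria used in the talk. The rows are four complete rows taken from the table on this slide.

```python
from typing import List, NamedTuple


class Row(NamedTuple):
    symbol: str
    fc_early: float
    p_early: float
    fc_late: float
    p_late: float


def returns_toward_baseline(
    rows: List[Row], alpha: float = 0.05, shrink: float = 0.5
) -> List[str]:
    """Select genes that change significantly early, then fall back.

    A gene qualifies if its early change is significant (p < alpha) and its
    late fold change either flips sign or drops below `shrink` times the
    early magnitude. Both thresholds are illustrative assumptions.
    """
    hits = []
    for row in rows:
        if row.p_early >= alpha:
            continue
        flipped = row.fc_early * row.fc_late < 0
        shrunk = abs(row.fc_late) < shrink * abs(row.fc_early)
        if flipped or shrunk:
            hits.append(row.symbol)
    return hits


ROWS = [
    Row("Pdpk1", 5.59, 4.56e-10, 4.39, 7.57e-08),
    Row("Cdh22", 5.26, 2.85e-06, -0.056, 0.999),
    Row("Flrt3", 4.51, 1.07e-13, 2.86, 3.17e-08),
    Row("Il6", 4.42, 1.42e-13, 3.08, 4.25e-09),
]
selected = returns_toward_baseline(ROWS)
```

With the default thresholds only Cdh22 is selected from these four rows; loosening `shrink` to 0.7 also admits Flrt3 and Il6, which shows how sensitive the answer is to the chosen cutoff.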

http://regenbase.org/example-sparql-queries
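
A minimal sketch of sending a query to a SPARQL endpoint over HTTP from Python, using only the standard library. The endpoint URL is the one listed at the end of this talk and may no longer be reachable; the query is a generic placeholder that assumes nothing about the RegenBase schema. The helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

ENDPOINT = "http://regenbase.stanford.edu:8890/sparql"

# Generic placeholder query: fetch any five triples.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def build_sparql_request(endpoint: str, query: str) -> urllib.request.Request:
    """Build a SPARQL Protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def run_query(endpoint: str, query: str, timeout: float = 30.0) -> list:
    """Execute the query and return the list of result bindings."""
    request = build_sparql_request(endpoint, query)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["results"]["bindings"]


request = build_sparql_request(ENDPOINT, QUERY)
# To execute against a live endpoint:
#     for binding in run_query(ENDPOINT, QUERY):
#         print(binding["s"]["value"], binding["p"]["value"], binding["o"]["value"])
```

Building the request is kept separate from executing it so the query string and headers can be inspected without network access.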

RegenBase and the Linked Data Web enable faster, easier SCI data search and analysis
We have extended RegenBase with an important new data source, and the code we developed to do this is re-usable
Each of the 3 research use cases requires many researcher hours if executed “by hand”, each time a gene of interest is identified
RegenBase reduces this time to minutes for query formulation and seconds for query response
Using URI patterns, identifier mapping services, and Bio2RDF gives us data integration for free

What next?
A RegenBase search tool
New methods and data sources for adding content to RegenBase
Working with the broader neuroscience community to extend to new domains

Acknowledgements
Literature curators
RegenBase team: John Bixby, Vance Lemmon, Ubbo Visser
Shah Lab @ Stanford
Funding: NLM R01s HD057632 and NS080145

Thank you! Questions?
More information available online:
http://regenbase.org - project description, simple paper browser, example queries, data download
http://bioportal.bioontology.org/ontologies/RB - RegenBase ontology in BioPortal
http://regenbase.stanford.edu:8890/sparql - SPARQL endpoint