Protein Ontology to provide specificity to protein and complex annotations

Cecilia Arighi 1, Harold Drabkin 2, and PRO Consortium*
1 Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE
2 The Jackson Laboratory, Bar Harbor, ME
*PRO Consortium

Background

The Protein Ontology (PRO) defines protein and protein complex entities, representing their major forms and the relations among them. Protein entities in PRO denote single amino acid chains and are categorized by level of specificity into evolutionary family, products of a single gene in one species, products of a single transcript, and post-translationally modified forms, among others. PRO specializes in organism-specific protein complexes, their components and their modified forms. PRO's scope includes the 12 GO reference genomes.

PRO works with and complements established sequence-oriented databases such as UniProtKB, and it is interoperable with other biomedical and biological ontologies such as the Gene Ontology (GO): the PRO organism-specific complexes are subclasses of the organism-agnostic protein complex terms in the GO Cellular Component Ontology (Fig. 1).

Fig. 1. Illustration of PRO categories and relation to external resources. Categories are listed along the top, with example terms for IRF5 (interferon regulatory factor 5) shown directly below. Not all terms and relationships are shown. IRF, interferon regulatory factor; Phos, phosphorylated; m, mouse; h, human; iso, isoform; BMv, bone marrow variant. Source: Natale D A et al. Nucl. Acids Res. 2014;42:D415-D421.
Figure labels: Protein Complex: Use GO terms. Protein gene level -> UniProtKB. Relations key: is_a, has_component.

PRO enables annotation to multiple levels of granularity

Fig. 2. Examples of PRO levels of granularity with accompanying annotation. A) Different isoforms of PIP5K1C, B) individual dengue proteins, C) sequence variants of APP protein, D) TLR4 complex in human and mouse with subunit composition including PTMs, and E) family-type terms depicting the Orexin Receptor.
Panel labels:
A) Isoforms. Sumoylated form of isoform 1; unsumoylated form of isoform 1.
B) Individual processed proteins from polyproteins. d = derives from. P: GO: , suppression by virus of host STAT1 activity (EXP, PMID: ).
C) Protein variants. Agent in DOID:9246, cerebral amyloid angiopathy; agent in DOID:10652, Alzheimer's disease.
D) Complex with detailed subunit composition. GO: species-agnostic complex; PRO: species-specific complex. P: GO: , MyD88-independent toll-like receptor signaling pathway (EXP, PMID: ); P: GO: , MyD88-independent toll-like receptor signaling pathway (EXP, Reactome:REACT_6809).
E) Family-type terms. Has part PF:00001, 7 transmembrane receptor; has part PF:03827, Orexin receptor type 2.
Callout: The gene product of Irf8 appears to be involved in transcription regulation … or NOT

How do I request PRO terms?

You can request PRO terms through the SourceForge PRO tracker (Fig. 3A), or contribute directly by entering the information in the RACE-PRO annotation tool (Fig. 3B). RACE-PRO is a user-oriented interface for entering experimental information about protein forms, which enables and fosters contribution by domain experts. Submissions are subject to editorial review, and the reviewed data are the input to a program that generates PRO terms (Fig. 3C).

Fig. 3. External contributions via the PRO website. A) PRO homepage with links to the annotation resources: SourceForge tracker (term request) and RACE-PRO (annotation tool). B) RACE-PRO annotation interface, with an example annotation of the BH3-interacting domain death agonist p15 cleaved form. C) Integrated view of information (ontology, annotation, and mapping) in the PRO entry report.
RACE-PRO steps shown in panel B:
A) Enter accession or paste sequence
B) Define protein region and/or PTMs
C) Enter protein form name
D) Data source
E) Annotation

How PRO is being used?

PRO is being used in multiple ways. Some examples:
Entity tagging and semantic integration
Definition of terms in other ontologies
Description of protein/complex networks
Gene Ontology annotation

GO curators at MGI and PomBase are actively requesting PRO terms for isoforms and modified forms. The annotations can be viewed in the new AmiGO 2 interface.

PRO entry report

Fig. 4. The PRO entry report for a species-specific protein (gene level) contains a summary of the protein forms, complexes, annotations and sequences for that protein. The example shown is the report for LCK human.
Panel labels:
A) Ontology information
B) Count of LCK human-related terms at different levels
C) Visualization
D) Sequence of LCK human
E) Multiple alignment of protein forms related to LCK human, with modification sites highlighted
F) List of all protein forms related to LCK human
G) List of complexes that LCK human is a component of
H) Annotations to terms related to LCK human

Conclusion: PRO's versatility in representing proteins and complexes enables its use at multiple levels of granularity.
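The levels of granularity and the two relations named in the Fig. 1 key (is_a and has_component) can be pictured as a small data structure. The Python sketch below is only an illustration under stated assumptions: the Term class, the "EX:" identifiers, the term names and the level labels are all hypothetical, chosen to mirror the categories listed in the Background text, and are not real PRO or GO accessions or any actual PRO software.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Term:
    """A toy ontology term. All identifiers used below are made up."""
    term_id: str                          # hypothetical ID, not a real accession
    name: str
    level: str                            # descriptive label, not an official category code
    is_a: Optional["Term"] = None         # parent term (more general)
    has_component: List["Term"] = field(default_factory=list)  # complex subunits

    def lineage(self) -> List[str]:
        """Follow is_a links from this term up to its most general ancestor."""
        names = []
        node: Optional[Term] = self
        while node is not None:
            names.append(node.name)
            node = node.is_a
        return names


# Protein terms, from least to most specific (illustrative names only).
family = Term("EX:1", "interferon regulatory factor", "family")
gene = Term("EX:2", "IRF5 (human)", "gene product in one species", is_a=family)
isoform = Term("EX:3", "IRF5 isoform 1 (human)", "transcript product", is_a=gene)
modified = Term("EX:4", "phosphorylated IRF5 isoform 1 (human)",
                "modified form", is_a=isoform)

# A species-specific complex is a subclass of a species-agnostic complex term
# and lists its subunits through has_component (subunit is illustrative).
generic_complex = Term("EX:5", "example complex (species-agnostic)", "complex")
subunit = Term("EX:6", "example subunit (human)", "gene product in one species")
human_complex = Term("EX:7", "example complex (human)", "complex",
                     is_a=generic_complex, has_component=[subunit])

print(" -> ".join(modified.lineage()))
print([part.name for part in human_complex.has_component])
```

An annotation attached to the modified form applies to that form only, while walking lineage() shows which broader terms it specializes. That is the sense in which annotation can be made at multiple levels of granularity.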