1 EMBL Outstation — The European Bioinformatics Institute Mus musculus - a model organism in SWISS-PROT.



2 SWISS-PROT
- Curated protein sequence data bank established in 1986 by Amos Bairoch in Geneva and maintained collaboratively with EMBL since 1987
- Contains currently > protein sequence entries (Release 36)

3 Distinguishing features
- High level of annotation
- Minimal redundancy
- High level of integration with other databases

4 TrEMBL
- Computer-annotated supplement to SWISS-PROT
- Consists of entries in SWISS-PROT format
- Contains translations of all coding sequences in the EMBL Nucleotide Sequence Database which are not yet in SWISS-PROT
- Consists of SP-TrEMBL and REM-TrEMBL

5 Model organisms in SWISS-PROT
- Target of genome sequencing and/or mapping
- Priority annotation
  - incorporate new sequences/updates as quickly as possible
  - high level of annotation
  - cross-references to specialised databases
  - additional information in form of specific documents

6 Current status of mouse sequences
- Total of 7006 entries in SWISS-PROT and TrEMBL
- 3264 entries in SWISS-PROT
- 3742 entries in SP-TrEMBL

7 Annotation
- Citation information
- Taxonomic data
- Sequence data
- Function(s) of the protein
- Post-translational modification(s)
- Domains and sites
- Secondary and quaternary structure
- Similarities to other proteins
- Disease(s) associated with deficiency(ies) in the protein
- Sequence variants, conflicts

8 Annotation sources
- Publications reporting sequence data
- Review articles
- External experts

9 Integration with other databases
- Sequence
  - EMBL Nucleotide Sequence Database
- Structure
  - Protein Data Bank (PDB)

10 Integration
- Specialised data collections
  - e.g. ENZYME, PROSITE
  - Mouse Genome Database (MGD)
  - Index of MGD entries referenced in SWISS-PROT:

11
ID GCDH_MOUSE STANDARD; PRT; 438 AA.
AC Q60759;
DT 01-NOV-1997 (REL. 35, CREATED)
DT 01-NOV-1997 (REL. 35, LAST SEQUENCE UPDATE)
DT 01-NOV-1997 (REL. 35, LAST ANNOTATION UPDATE)
DE GLUTARYL-COA DEHYDROGENASE PRECURSOR (EC ).
GN GCDH.
OS MUS MUSCULUS (MOUSE).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA; EUTHERIA; RODENTIA.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SV; TISSUE=LIVER;
RX MEDLINE;
RA KOELLER D.M., DIGIULIO K.A., ANGELONI S.V., DOWLER L.L., FRERMAN F.E., WHITE R.A., GOODMAN S.I.;
RL GENOMICS 28: (1995).
CC -!- CATALYTIC ACTIVITY: GLUTARYL-COA + ACCEPTOR = CROTONOYL-COA + CO(2) + REDUCED ACCEPTOR.
CC -!- COFACTOR: FAD FLAVOPROTEIN.
CC -!- PATHWAY: DEGRADATIVE PATHWAY OF L-LYSINE, L-HYDROXYLYSINE, AND L-TRYPTOPHAN METABOLISM.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- SUBCELLULAR LOCATION: MITOCHONDRIAL MATRIX.
CC -!- SIMILARITY: BELONGS TO THE ACYL-COA DEHYDROGENASES FAMILY.
DR EMBL; U18992; G ; -.
DR MGD; MGI:104541; GCDH.
DR PROSITE; PS00072; ACYL_COA_DH_1; FALSE_NEG.
DR PROSITE; PS00073; ACYL_COA_DH_2; 1.
KW OXIDOREDUCTASE; FLAVOPROTEIN; FAD; MITOCHONDRION; TRANSIT PEPTIDE.
FT TRANSIT 1 44 MITOCHONDRION (POTENTIAL).
FT CHAIN GLUTARYL-COA DEHYDROGENASE.
SQ SEQUENCE 438 AA; MW; 8C9149C3 CRC32;
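The GCDH_MOUSE entry shows the flat-file layout: each line begins with a two-letter line code (ID, AC, DR, KW, ...) followed by the data for that line. As a rough illustration of how such an entry might be read by a program, here is a minimal Python sketch. The function names `parse_entry` and `cross_references` are hypothetical and not part of any SWISS-PROT tool. The sample text uses only lines that survive intact in this transcript, and the sketch assumes one record per line with no continuation handling.

```python
from collections import defaultdict

# Excerpt of the GCDH_MOUSE entry from the slide; lines with values that are
# missing in the transcript are left out rather than guessed.
ENTRY = """\
ID GCDH_MOUSE STANDARD; PRT; 438 AA.
AC Q60759;
GN GCDH.
OS MUS MUSCULUS (MOUSE).
DR MGD; MGI:104541; GCDH.
DR PROSITE; PS00072; ACYL_COA_DH_1; FALSE_NEG.
DR PROSITE; PS00073; ACYL_COA_DH_2; 1.
KW OXIDOREDUCTASE; FLAVOPROTEIN; FAD; MITOCHONDRION; TRANSIT PEPTIDE.
"""


def parse_entry(text):
    """Group the lines of one entry by their leading two-letter line code."""
    fields = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        code, _, value = line.partition(" ")
        fields[code].append(value.strip())
    return dict(fields)


def cross_references(fields):
    """Return (database, [identifiers]) pairs taken from the DR lines."""
    refs = []
    for value in fields.get("DR", []):
        parts = [p.strip() for p in value.rstrip(".").split(";")]
        refs.append((parts[0], parts[1:]))
    return refs


fields = parse_entry(ENTRY)
entry_name = fields["ID"][0].split()[0]
accessions = [a.strip() for a in fields["AC"][0].split(";") if a.strip()]
keywords = [k.strip() for k in fields["KW"][0].rstrip(".").split(";")]

print(entry_name, accessions)
for database, identifiers in cross_references(fields):
    print(database, identifiers)
```

Grouping by line code keeps the sketch short. A real parser would also need to handle the multi-line records (OC, RA, CC) and the sequence block that follows the SQ line.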

12 Summary
- Complete with minimal redundancy
- As much up-to-date information as possible on each sequence
- Priority annotation of mouse sequences
- MGD cross-references

13 SWISS-PROT at EBI
- Rolf Apweiler
- Sergio Contrino
- Wolfgang Fleischmann
- Henning Hermjakob
- Vivien Junker
- Stephanie Kappus
- Fiona Lang
- Michele Magrane
- Maria Jesus Martin
- Nicoletta Mitaritonna
- Steffen Moeller
- Claire O’Donovan

14 Some ways to access SWISS-PROT + TrEMBL
- ftp.ebi.ac.uk/pub/databases