EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Deposition, Validation, Search and Analysis Services.

Slides:



Advertisements
Similar presentations
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
Advertisements

5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
Archives and Information Retrieval
IST Computational Biology1 Information Retrieval Biological Databases 2 Pedro Fernandes Instituto Gulbenkian de Ciência, Oeiras PT.
The Protein Data Bank (PDB)
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
Comparing protein structure and sequence similarities Sumi Singh Sp 2015.
Using 3D-SURFER. Before you start 3D-Surfer can be accessed at For visualization.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
Protein Interfaces, Surfaces and Assemblies
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
Development of Bioinformatics and its application on Biotechnology
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Bringing Structure to Biology: Small Molecules and the PDBe
EMBL-EBI MSD-mine. EMBL-EBI MSD-mine overview  Web application for online data analysis and mining For the advanced MSDSD researcher Interactive ad-hoc.
CCP-EM community meeting 7 February 2013 EMDB and beyond Ardan Patwardhan and Gerard Kleywegt Protein Data Bank in Europe EMBL-EBI.
Gene Expression Omnibus (GEO)
Copyright OpenHelix. No use or reproduction without express written consent1.
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
PDBe-fold (SSM) A web-based service for protein structure comparison and structure searches Gaurav Sahni, Ph.D.
Increasing the Value of Crystallographic Databases Derived knowledge bases Knowledge-based applications programs Data mining tools for protein-ligand complexes.
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Databank in Europe (PDBe)‏ An Introduction.
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
Doug Raiford Lesson 3.  More and more sequence data is being generated every day  Useless if not made available to other researchers.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
X-ray Validation Package Present status Swanand Gore PDBe D&A meeting : 21-Oct-2010.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
EMBL-EBI MSD Search tools. EMBL-EBI MSDlite EMBL-EBI MSDlite.
Copyright OpenHelix. No use or reproduction without express written consent1.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
Data Integration and Management A PDB Perspective.
EBI is an Outstation of the European Molecular Biology Laboratory. MSDchem and the chemistry of the wwPDB EMBO 22nd-26th September 2008 EMBL-EBI Hinxton.
Protein Data Bank: An Introduction Learning to Use the RCSB PDB Portal.
Gene Expression Omnibus (GEO)
EBI is an Outstation of the European Molecular Biology Laboratory. Quaternary Structure.
EMBL-EBI MSD Search and Visualization tools Jawahar Swaminathan.
EBI is an Outstation of the European Molecular Biology Laboratory. Sanchayita Sen, Ph.D. PDB Depositions Validation & Structure Quality.
Macromolecular Structure Database Project EMSD Infra-structure Services for Europe To develop an autonomous structural database capability in Europe
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
Real World Experiences in Operating a Collaboratory: The Protein Data Bank Helen M. Berman Board of Governors Professor of Chemistry.
Worldwide Protein Data Bank wwPDB Common D&A Project November 24, 2009 November 24, 2009 Steering Committee Project Update.
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
Worldwide Protein Data Bank Common D&A Project Sequence Processing Modular Demo May 6, 2010 Project Deliverable.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
AutoDep 4.0 A data deposition and archival system Sameer Velankar.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
CPSC 203 Introduction to Computers T97 By Jie (Jeff) Gao.
Protein sequence databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen This also includes old material from my thesis
InterPro Sandra Orchard.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Search Services (PDBelite, PDBePro and BIObar) Sanchayita Sen, Ph.D. PDB Depositions.
1 Integration of data sources Patrick Lambrix Department of Computer and Information Science Linköpings universitet.
Protein Tertiary Structure Prediction Structural Bioinformatics.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Databank in Europe (PDBe)‏ An Introduction.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe-fold (SSM) A web-based service for protein structure comparison and structure searches.
The Web Web Design. 3.2 The Web Focus on Reading Main Ideas A URL is an address that identifies a specific Web page. Web browsers have varying capabilities.
Cheminformatics and Metabolism Team The EBI Enzyme Portal.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
PDBe Protein Interfaces, Surfaces and Assemblies
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
Getting the Most out of the PDBe
Introduction to Databases
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Deposition, Validation, Search and Analysis Services

worldwide Protein Data Bank (wwPDB) Consists of four sites RCSB (USA), PDB-j (Japan) BMRB (USA) and PDBe. Single repository of macromolecular structures. Started in 1971 and now ~65,000 entries, adding ~200 new entries/week. Deposited by experimentalists and contents is freely available. The format of the archive is flat-files with fixed line format, although an improved flat-file format (mmCIF) and XML are also available.

Protein Databank in Europe (PDBe) group Is one of the four sites around the world that where 3D structures may be deposited. Provides stable and clean repository of macromolecular structure data. Has services that allow users to access, search and retrieve structural data from a single web access point.

EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Tasks Deposition and Validation Database design and implementation Retrieve data Analysis tools & Services

Deposition via AutoDep4 ( Closely collaborate with the other wwPDB members for a single unified archive.. Depositions via EMDEP ( Depositions started June 2002 Depositions and Curation

Validation of Structures Authentication of source That the protein is from human and not rabbit, for example ! Authentication of structure Comparison of structure against raw data. Geometry and Stereochemistry. Provide results back to depositor. Validation of correct methodology used Whether X-Ray, NMR or EM. Conformity to standards Follows PDB format specifications Error checks Consistency checks - to identify simple typos Homo sapiens and not Homo sapien (single human?). Outlier detection - to identify suspect records

EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Tasks Deposition site Database design and implementation Retrieve data Analysis tools & Services

Disadvantages of Flat files… Macromolecular structures are very complex. Existing PDB format is incapable of fully describing few existing structures also. Format is not readily extensible, to cope, for example, with structural genomics data. Historical archive is non-uniform and poorly populated. Search and retrieval of flat files is difficult and/or inaccurate.

Uniform Data Improved Query Functionality Time Effort Usefulness Usage CrystallographersBiologists ProgrammersBioinformaticians PDBe Relational Database

EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Tasks Deposition site Database design and implementation Retrieve data Analysis tools & Services

Some Implementation Issues  The PDBe database is large and complex:  ~61,000 PDB entries  Cross-referenced against SwissProt, PubMed etc.  Making data accessible without adding additional complexity.  Tools for different categories of end-user  Simple – biobar  Intermediate - PDBelite  Advanced – PDBepro  New - PDBeView

biobar A toolbar search application for Mozilla/Netscape or firefox browsers Simple and quick retrieval of data from PDBe and 45 other Databases

PDBelite A simple form-based query system to search the PDBe Databases

PDBelite Search Results

Features of Search Interface Strengths: simple, easy to use form allows multiple search fields to be combined relatively fast, despite performing quite complex SQL queries Weaknesses: not exposing the power of a relational database limited logical operators between search fields: "name" AND "title" AND "keyword“  "name" OR "title" OR "keyword“  ( "name" OR "title" ) AND NOT "keyword" the search form is defined by the authors of the search system, not the author of a query

PDBeView

Search result: The Atlas page

EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Tasks Deposition site Database design and implementation Retrieve data Analysis tools & Services

AstexViewer™: View structures as wireframe, backbone or ribbons Built-in sequence viewer Calculate and display surfaces Various display options: Ramachandran plots Distance matrix B-factors Based on the AstexViewer™ from Astex Technology Limited and modified under licence by the PDBe group

PDBeChem Ligand Database

What is the environment around alpha-D-mannose and beta-D-mannose? PDBeMotif

What binds ASP ASP HIS LYS ? PDBeMotif

How does ATP generally interact with LYS in all structures ? PDBeMotif

Assess Quality of a Structure Ramachandran Plot Bond Distances Bond Angles PDBeAnalysis

PDBePisa What assembly can my structure have ?

PDBeFold Discover unknown relationships… Are there any structures in the PDB that are similar to mine? What SCOP and/or CATH family could my structure belong to ? Can I get some idea about the possible function of my protein based on similarity with others based on structural similarity ? Mutiple alignment of many of my structures ?

ChemSearch Sub-structure based search of a million chemicals

PDBeAnalysis/PDBeValidate Online PDB validation

PDBeStatus PDB Deposition status search

PDBe provides… Clean biological data Integrated data A single web access point Query interfaces for different users (Beginner, Occasional or expert). Interconnected views of the data relating structure, sequence, text & experimental details.

PISA biological assemblies PDBechem ligand data Electron Density Visualisation AstexViewer PDBePro, PDBelite Fold matching Surface Matching Active sites Linking to Domain data, eFamily Sequence Mapping, SIFTS