EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT- 2001-00015 under the RTD programme.

Slides:



Advertisements
Similar presentations
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Advertisements

1 Miklós Vargyas, Judit Papp May, 2005 MarvinSpace – live demo.
Protein Structure.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
Protein Structure and Physics. What I will talk about today… -Outline protein synthesis and explain the basic steps involved. -Go over the Chemistry of.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Web Resources for Bioinformatics Vadim Alexandrov and Mark Gerstein.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
EMBL-EBI MSD motif PDB: 1gci Structure Annotation and Function Assignment with MSDmotif.
Nucleic Acid Database By Pooja Awatramani. Database Utilities Provides structural references in the form of base pair annotation for DNA, RNA, and some.
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Protein Threading Optimization Using Consensus Homology Modeling Maliha Sarwat ( ), Tasmin Tamanna Haque ( ) Department of Computer Science.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
Using 3D-SURFER. Before you start 3D-Surfer can be accessed at For visualization.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Protein Interfaces, Surfaces and Assemblies
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Bringing Structure to Biology: Small Molecules and the PDBe
EMBL-EBI MSD-mine. EMBL-EBI MSD-mine overview  Web application for online data analysis and mining For the advanced MSDSD researcher Interactive ad-hoc.
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
Increasing the Value of Crystallographic Databases Derived knowledge bases Knowledge-based applications programs Data mining tools for protein-ligand complexes.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
ProteinShop: A Tool for Protein Structure Prediction and Modeling Silvia Crivelli Computational Research Division Lawrence Berkeley National Laboratory.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
3D structure -Swiss Pdb Viewer
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
MSDmotif 1 Adel Golovin Protein Site and Motif search Biosapiense network of excellence.
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
Ch. 2: “Chemistry of Life”
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Copyright OpenHelix. No use or reproduction without express written consent1.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
EBI is an Outstation of the European Molecular Biology Laboratory. MSDchem and the chemistry of the wwPDB EMBO 22nd-26th September 2008 EMBL-EBI Hinxton.
Protein Data Bank: An Introduction Learning to Use the RCSB PDB Portal.
What our bodies are made of Chemistry of Cells. Nature of Matter All matter is made of atoms. Atoms consist of electrons, protons and neutrons. Molecules.
EMBL-EBI MSD Search and Visualization tools Jawahar Swaminathan.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Deposition, Validation, Search and Analysis Services.
Macromolecular Structure Database Project EMSD Infra-structure Services for Europe To develop an autonomous structural database capability in Europe
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.
Bioinformatics Project BB201 Metabolism A.Nasser
EMBL-EBI Chemistry & the PDB MSDchem Primary Developer: Dimitris Dimitropoulos.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBeChem The Ligand Database.
EMBL-EBI Representative sets and Clustering.. EMBL-EBI Representative sets A subset of data that provides a statistically valid sample set for the complete.
Copyright OpenHelix. No use or reproduction without express written consent1.
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Examining Protein Folding Process Simulation and.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Copyright OpenHelix. No use or reproduction without express written consent1.
EMBL-EBI Dimitris Dimitropoulos MSD-mine. EMBL-EBI MSD-mine overview  Web application for online data analysis and mining  For the advanced MSDSD researcher.
2.3 notes Carbon Compounds. Organic chemistry- study of compounds that contain bonds between C atoms Carbon: -can bond with 4 e- to another atom -can.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Search Services (PDBelite, PDBePro and BIObar) Sanchayita Sen, Ph.D. PDB Depositions.
EMBL-EBI Eugene Krissinel SSM - MSDfold. EMBL-EBI MSDfold (SSM)
An Efficient Index-based Protein Structure Database Searching Method 陳冠宇.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
Structure Visualization
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
PDBe Protein Interfaces, Surfaces and Assemblies
Bio/Chem-informatics
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
Getting the Most out of the PDBe
Dimitris Dimitropoulos
Protein Structure Prediction and Protein Homology modeling
Predicting Active Site Residue Annotations in the Pfam Database
Computational Analysis
Protein structure prediction.
Chemistry of life review
Presentation transcript:

EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme "Quality of Life and Management of Living Resources"

EMBL-EBI MSDsite

EMBL-EBI  Many protein (and some nucleic acids) structures have evolved as a framework for an active site – where chemistry takes place.  The active site is critical for the understanding the action of proteins and their role within the cell mechanism.  The function, of these macromolecules can best be studied by detailed 3-dimensional knowledge of these active sites.  Because the active site is a local 3D environment. Introduction

EMBL-EBI  The 3D / 1D position that defines the protein function  disparate amino acids on the 1D sequence  A localized volume in 3D  Where chemistry/binding happens. Ligand environment

EMBL-EBI  The MSD site database contains local structural information around all the bound ligands within the PDB  All proteins deposited to the PDB are checked for ligand(s). These are HETATM & non-protein/non-nucleic- acid/non-water residues.  Interactions are stored within the DB  The bound ligand coordinates & associated information  The ligand environment (i.e. any residue that interacts)  The interaction types between the ligand and environment.  All sequences are checked for the registered ProSite patterns and matches are stored in the DB  Fast algorithm for a pattern search is embedded into the DB MSD site database

EMBL-EBI  Sites are defined as a ligand environment when a ligand is presented.  Sequence motifs matching is applied for site definition independently from ligands.  ProSite sites  Catalytic sites atlas  Merops active sites Site definitions

EMBL-EBI  Covalent Bonds  Coordinate bonds  Hydrogen bonds  Planes  Non-bonding  Electrostatics  Di-Sulphide bonds  Geometry (distance, angles..) PHE O N S ASPASP VALVAL Ligand environment analysis Site DB Structure DB Environment

EMBL-EBI  Define search by ligand  Define search by sequence motif (pattern)  Define search by metal site geometry  Define search by environment  has same environment  has similar environment Possible site searches

EMBL-EBI Structure Protein Ligand Define target: equal/like Filter by PDB header: equal/like Environment exact/sub set Search criteria Sequence motif Metal site geometry Search options

EMBL-EBI Statistics search Interactive Charts Ligand binding Environment binding Atomic bonds Sequence pattern binding

EMBL-EBI Search method Search front form Statistics search Result list Detail views

EMBL-EBI MSDsite interface Filters Query editor

EMBL-EBI HIS|SER:S/H>C2.0 HIS.NE2:S/S>C2.0 HIS.[N]/T>C2.0 Drawing Complex queries secondary structure Distance cut-offs Bond type Side/main chain Query generation

EMBL-EBI Search service aims  Form a site search criteria  Provide a hit list Visualize the found sites  XML technology (xpath,..) to search an XML corresponding to a PDB entry from the hit list.

EMBL-EBI Example Pattern is defined as : HIS-any-HIS-GLU

EMBL-EBI Hit List Help

EMBL-EBI AstexViewer : Default PDB viewer: chemical/x-pdb. RasMol script viewer: application/x-rasmol. Hit List Visualization tools

EMBL-EBI Searched site visualization AstexViewer[tm] Groups menu

EMBL-EBI Goto Site details Hit List Link to Details Bound molecules form Sequence from Atomic bonds

EMBL-EBI Detail forms header Ligand binding statistics Links to the other views of the same PDB entry Switch on/off undefined bonds

EMBL-EBI Show site Sequence form search for PDB entries with the same hetero ID and similar environment

EMBL-EBI Bound molecules form Ligands that match a search creteria Environment statistics

EMBL-EBI Environment binding statistics Filters HIS HIS HIS Trnasformed to HIS>C HIS>C HIS>I

EMBL-EBI Bound molecules form Bonds details on atomic level

EMBL-EBI Atomic bonds view Atomic bonds statistics

EMBL-EBI Atomic bonds statistics

EMBL-EBI Detail forms header Ligand binding statistics

EMBL-EBI Ligand binding statistics Distribution type Filters

EMBL-EBI Ligand binding statistics – distribution by whole environment

EMBL-EBI Pattern binding statistics Pattern: H-x-H-E

EMBL-EBI 3D aligned views  Superimposed by ligand  Superimposed by environment  Superimposed by pattern  Superimposed by active site

EMBL-EBI 3D aligned views

EMBL-EBI 3D aligned views – EBI- AstexViewer[tm]

EMBL-EBI

 DB of active sites has been generated  Application server has been developed  The DB can be searched  Based on properties of the ligand environment  Based on properties of the ligand  Based on properties of structure and sequence of the parent protein Summary

EMBL-EBI Data generated by data mining of common packing of chemical groups is being added. (Proteins S.F.G, 49: , T.J.Oldfield). This information contains about 2000 sites, 1/3 with ligands. Future sites