EMBL-EBI MSD motif www.ebi.ac.uk/msd-srv/msdmotif PDB: 1gci Structure Annotation and Function Assignment with MSDmotif.

EMBL-EBI MSD motif PDB: 1gci Structure Annotation and Function Assignment with MSDmotif

EMBL-EBI MSDmotif front options

EMBL-EBI Small motifs: Alpha-Beta Motif, Nest, ST staple; 11 motifs in total (Prof James Milner-White)

EMBL-EBI Small motif stats: Occurrence of amino acids; Correlation of side chain charge

EMBL-EBI Small motif stats

EMBL-EBI Small motifs – hit list from stats

EMBL-EBI Small motifs – 3D alignment ST-staple

EMBL-EBI Small motifs viewing: Ligands, Catalytic sites, ProSite, Small motifs. The “Group” menu item lists the sites and motifs present in a PDB entry

EMBL-EBI Φ, Ψ search: ideal for searching short loops. PDB:1gci

EMBL-EBI Φ, Ψ search - continue

EMBL-EBI Φ, Ψ search - continue. PDB:1gci, Subtilases family; PDB:1f5p, Globins family

EMBL-EBI Φ, Ψ search vs sequence search. Sequences found by the Φ, Ψ search: Subtilases family: NNSIGVL, DNTTGVL, DNSIGVL; Globins family: IVDTGSV. Blast search for NNSIGVL: Diphtheria toxin family: SDSIGVL
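The Φ, Ψ search above relies on backbone torsion angles. As a minimal sketch of how such an angle can be computed from atomic coordinates (not the MSDmotif implementation, which the slides do not show), the function below returns the signed dihedral defined by four 3D points. For Φ of residue i the four atoms are C(i-1), N(i), CA(i), C(i); for Ψ they are N(i), CA(i), C(i), N(i+1). The function name and the test coordinates are illustrative assumptions.

```python
import math

def dihedral(p0, p1, p2, p3):
    """Signed torsion angle in degrees defined by four 3D points."""
    def sub(a, b):
        return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

    def dot(a, b):
        return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

    def cross(a, b):
        return (a[1] * b[2] - a[2] * b[1],
                a[2] * b[0] - a[0] * b[2],
                a[0] * b[1] - a[1] * b[0])

    b0 = sub(p1, p0)          # first bond vector
    b1 = sub(p2, p1)          # central bond (rotation axis)
    b2 = sub(p3, p2)          # last bond vector
    n1 = cross(b0, b1)        # normal of the plane p0-p1-p2
    n2 = cross(b1, b2)        # normal of the plane p1-p2-p3
    x = dot(n1, n2)
    y = math.sqrt(dot(b1, b1)) * dot(b0, n2)
    return math.degrees(math.atan2(y, x))

# Hypothetical coordinates: a cis arrangement gives about 0 degrees,
# a perpendicular one about 90 degrees.
cis = dihedral((0, 1, 0), (0, 0, 0), (1, 0, 0), (1, 1, 0))
perpendicular = dihedral((0, 1, 0), (0, 0, 0), (1, 0, 0), (1, 0, 1))
```

Only the standard library is used, so the sketch runs without a structure-parsing package; reading the atoms from a PDB file is left out on purpose.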

EMBL-EBI 3D motif research: PDB 1e4m has a small motif, an Asx turn formed by residues ASP-HIS-GLY, which is found in a loop between two helices. Use the phi/psi parameters of these three residues for a search

EMBL-EBI 3D motif research - continue. The sequences from the hit list: LKY QGF DRF AGV RGV DHG MGK DNL HGV ANN TGA QCY LGA NSY. Most of these hits were found in loops between helices
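Searching by the phi/psi values of three residues, as described above, can be pictured as sliding a short window of angle pairs along each chain. The sketch below is one possible way to do that, assuming per-residue (phi, psi) pairs are already available and that a hit means every angle lies within a fixed tolerance of the query. The tolerance of 30 degrees, the function names and all angle values are hypothetical; the slides do not state how MSDmotif scores a match.

```python
def angle_diff(a, b):
    """Smallest absolute difference between two angles in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)

def find_phi_psi_matches(chain, query, tol=30.0):
    """Return start indices where each (phi, psi) pair of `query`
    matches the corresponding pair in `chain` within `tol` degrees."""
    hits = []
    n = len(query)
    for start in range(len(chain) - n + 1):
        window = chain[start:start + n]
        if all(angle_diff(w_phi, q_phi) <= tol and angle_diff(w_psi, q_psi) <= tol
               for (w_phi, w_psi), (q_phi, q_psi) in zip(window, query)):
            hits.append(start)
    return hits

# Hypothetical backbone angles for a six-residue chain.
chain = [(-60, -45), (-60, -45), (-90, 0), (60, 30), (-175, 178), (-60, -45)]
turn_query = [(-95, 5), (65, 25)]
turn_hits = find_phi_psi_matches(chain, turn_query)
```

The `angle_diff` helper handles wrap-around, so 179 degrees and -179 degrees count as 2 degrees apart rather than 358.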

EMBL-EBI Pattern search: Zn binding pattern: CXXCXXXFXXXXXLXXHXXXH
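A pattern written this way, with X standing for any residue, maps directly onto a regular expression. The sketch below shows that translation and reports every start position, including overlapping ones. The helper names and the test sequence are made up for illustration, and treating X as a wildcard is an assumption based on the usual reading of such patterns.

```python
import re

def x_pattern_to_regex(pattern):
    """Translate a pattern where X means 'any residue' into a compiled regex.
    A lookahead is used so that overlapping matches are also reported."""
    body = "".join("." if ch.upper() == "X" else re.escape(ch) for ch in pattern)
    return re.compile("(?=(" + body + "))")

def find_pattern(sequence, pattern):
    """Return (start, matched_text) for every occurrence of the pattern."""
    regex = x_pattern_to_regex(pattern)
    return [(m.start(), m.group(1)) for m in regex.finditer(sequence)]

# Invented sequence that contains one occurrence of the Zn binding pattern.
zn_pattern = "CXXCXXXFXXXXXLXXHXXXH"
sequence = "AACKQCGKSFSQKSNLQKHQRTHTG"
zn_hits = find_pattern(sequence, zn_pattern)
```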

EMBL-EBI Pattern hits 3D alignment

EMBL-EBI Sequence search: Pseudo multiple sequence alignment (based on Blast output); Ligand binding residues are marked; Normalization of hit list to 50% sequence identity
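The slide does not say how the hit list is normalized to 50% sequence identity. One common way to reduce redundancy, shown here purely as an assumed illustration, is a greedy filter: walk the hits in order and keep a hit only if it is at most 50% identical to every hit already kept. The function names and hit labels are hypothetical; the sequences are the heptapeptides quoted earlier in the slides, treated as already aligned.

```python
def percent_identity(a, b):
    """Percent identity of two pre-aligned, equal-length sequences.
    Columns that are gaps ('-') in both sequences are ignored."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    same = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * same / len(columns)

def normalize_hits(hits, max_identity=50.0):
    """Greedy redundancy filter over (name, sequence) pairs."""
    kept = []
    for name, seq in hits:
        if all(percent_identity(seq, kept_seq) <= max_identity
               for _, kept_seq in kept):
            kept.append((name, seq))
    return kept

hits = [("hit1", "NNSIGVL"), ("hit2", "DNTTGVL"),
        ("hit3", "DNSIGVL"), ("hit4", "IVDTGSV")]
representatives = normalize_hits(hits)
```

With these inputs the two hits that share more than half of their residues with the first one are dropped, leaving one representative per group.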

EMBL-EBI Helper application for multiple sequence alignment: Blixem as a browser helper application for multiple sequence alignment based on Blast output. Mime-type: application/x-blast

EMBL-EBI Helper application for multiple sequence alignment: Jalview as a browser helper application for sequence alignment. Mime-type: application/x-align. Alignment of Zinc binding PROSITE pattern hits

EMBL-EBI Sequence hits 3D alignment: 2 hits with less than 25% sequence identity. Fragments alignment; Chains alignment

EMBL-EBI Secondary structure patterns: Glycosylation pattern N{P}[ST]{P}, where N binds sugar (Man or Nag); Strand – turn – Strand; 2-3 residue gap
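The glycosylation pattern uses PROSITE notation: square brackets list allowed residues and curly braces list forbidden ones. A minimal sketch of converting such a pattern into a regular expression follows. It covers only the common syntax elements (x, [..], {..}, (n) and (n,m) repeats, and the < and > anchors), and the function names and test sequence are assumptions, not part of MSDmotif.

```python
import re

def prosite_to_regex(pattern):
    """Convert a PROSITE-style pattern string into a regex string."""
    p = pattern.replace("-", "").rstrip(".")
    out = []
    i = 0
    while i < len(p):
        ch = p[i]
        if ch == "{":                      # forbidden residues
            j = p.index("}", i)
            out.append("[^" + p[i + 1:j] + "]")
            i = j + 1
        elif ch == "[":                    # allowed residues
            j = p.index("]", i)
            out.append(p[i:j + 1])
            i = j + 1
        elif ch == "(":                    # repeat count, e.g. x(2,4)
            j = p.index(")", i)
            out.append("{" + p[i + 1:j] + "}")
            i = j + 1
        elif ch == "x":                    # any residue
            out.append(".")
            i += 1
        elif ch == "<":                    # N-terminal anchor
            out.append("^")
            i += 1
        elif ch == ">":                    # C-terminal anchor
            out.append("$")
            i += 1
        else:                              # literal residue
            out.append(ch)
            i += 1
    return "".join(out)

def find_sites(sequence, pattern):
    """Start positions of all (possibly overlapping) pattern matches."""
    regex = re.compile("(?=" + prosite_to_regex(pattern) + ")")
    return [m.start() for m in regex.finditer(sequence)]

glyco_regex = prosite_to_regex("N{P}[ST]{P}")
glyco_sites = find_sites("AANGTAANPSA", "N{P}[ST]{P}")  # invented sequence
```

In the invented sequence the first N is followed by G-T-A and matches, while the second N is followed by P and is rejected by the {P} element.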

EMBL-EBI Secondary structure patterns - continue

EMBL-EBI Secondary structure patterns - continue

EMBL-EBI MSD motif: Small 3D motifs from J. Milner-White search/view; Secondary structure patterns (HTH) search/view; Φ, Ξ, Ψ based search/view; Ligands and their environment search/view; Catalytic sites search/view; Blast sequence search/view; PROSITE format compliant patterns search/view; 3D and sequence multiple alignment; Hit list and statistics normalization by SCOP, CATH

EMBL-EBI MSDmotif as web service: XML query to the MSDmotif server, XML response as eFamily XML. Hydrogen bonds; Φ, Ξ, Ψ angles; Secondary structures + small motifs; Secondary structure patterns; Sequence patterns; Blast sequences; Prosite, Catalytic sites, Merops; Ligands, fragments, SMILES; Ligand interactions; Arithmetic operations

EMBL-EBI XML query example Task: Find PDB entries where a ligand is capping a helix and at the same time binds its N-termini. - x l H h l.[O] h.first.N=H l.[O] h.last.N

EMBL-EBI Extending eFamily XML. Standard ways of extending an XML schema: extension, restriction, substitution group. segment: Helix, Strand, Beta-turn, Gamma-turn, Alpha-beta-motif, Asx-turn, ST-turn, ST-staple, …; entry: pdb-entry; entity: Chain, Bound-molecule, Water-group

EMBL-EBI MSD motif: PDBs, 25 GB of disk space on Oracle DB, growing linearly at ~0.8 MB per PDB; Web application server with J2EE servlet engine; NCBI Blast for sequence search

EMBL-EBI MSDmotif future development: 3D alignment extension; Water interactions; Statistical analysis and presentation (Secondary structure patterns; Statistics from MSDmotif hit list; Correlation of different parameters such as residue weight and csi/phi/psi angles); Neural network for recognition of complex structure fragments/motifs in a PDB and their function association