Current Status of Homology Modeling Using MCSG Structures 319 MCSG structures in PDB have over 400,000 sequence homologues. These structures represent.

Slides:



Advertisements
Similar presentations
Refinement of a pdb-structure and Convert A. Search for a pdb with the closest sequence to your protein of interest. B. Choose the most suitable entry.
Advertisements

Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Review: Amino Acid Side Chains Aliphatic- Ala, Val, Leu, Ile, Gly Polar- Ser, Thr, Cys, Met, [Tyr, Trp] Acidic (and conjugate amide)- Asp, Asn, Glu, Gln.
Protein Structure Database Introduction Database of Comparative Protein Structure Models ModBase 生資所 g 詹濠先.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Protein Tertiary Structure Prediction
Structural bioinformatics
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
1 Levels of Protein Structure Primary to Quaternary Structure.
Protein structure (Part 2 of 2).
MCSG Site Visit, Argonne, January 30, 2003 Genome Analysis to Select Targets which Probe Fold and Function Space  How many protein superfamilies and families.
Thomas Blicher Center for Biological Sequence Analysis
Workshop on Biological Macromolecular Structure Models RCSB PDB Piscataway, NJ November 19-20, 2005 Topic 3: Structural Genomics and Models Contributors:
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
The Protein Data Bank (PDB)
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Protein Structure Prediction II
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Protein Classification A comparison of function inference techniques.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Structure and Function of Proteins Lecturer: Dr. Ora Furman Oct 2009 Winter 2009/10 Teaching Assistants: Miraim Oxsman Sivan Pearl.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Structural Bioinformatics R. Sowdhamini National Centre for Biological Sciences Tata Institute of Fundamental Research Bangalore, INDIA.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Modelling binding site with 3DLigandSite Mark Wass
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Secondary structure prediction
Modelling Genome Structure and Function Ram Samudrala University of Washington.
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
Protein Secondary Structure Prediction G P S Raghava.
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Homology modeling with SWISS-MODEL
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Protein Homologue Clustering and Molecular Modeling L. Wang.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Modelling genome structure and function Ram Samudrala University of Washington.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Detecting Protein Function and Protein-Protein Interactions from Genome Sequences TuyetLinh Nguyen.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
3.3b1 Protein Structure Threading (Fold recognition) Boris Steipe University of Toronto (Slides evolved from original material.
Virginia Commonwealth University
Protein Structure Visualisation
Bio/Chem-informatics
Crystal Structure of Transcription Factor MalT Domain III
Protein Structure Prediction and Protein Homology modeling
Volume 19, Issue 8, Pages (August 2011)
Classification: understanding the diversity and principles of
Homology Modeling.
Protein structure prediction.
Solution and Crystal Structures of a Sugar Binding Site Mutant of Cyanovirin-N: No Evidence of Domain Swapping  Elena Matei, William Furey, Angela M.
Moosa Mohammadi, Joseph Schlessinger, Stevan R Hubbard  Cell 
Crystal Structure of Transcription Factor MalT Domain III
Volume 19, Issue 8, Pages (August 2011)
Crystal Structure of the N-Terminal Domain of Sialoadhesin in Complex with 3′ Sialyllactose at 1.85 Å Resolution  A.P. May, R.C. Robinson, M. Vinson,
A Duplicated Fold Is the Structural Basis for Polynucleotide Phosphorylase Catalytic Activity, Processivity, and Regulation  Martyn F. Symmons, George.
Presentation transcript:

Current Status of Homology Modeling Using MCSG Structures 319 MCSG structures in PDB have over 400,000 sequence homologues. These structures represent ~350 domains. Models are built by MODELLER (Sali) and quality is assessed using PROSA (Sippl). High-quality models can be generated for ~80,000 proteins. Web site has been established that allows automated modeling of sequence homologues and evaluate the quality of the models. Gly140/141 Phe97 Asp96 Tyr95 Gly154/155 Phe103 Trp102 Gly140/141 Phe97 Asp96 Tyr95 Gly154/155 Phe103 Trp102 1t5b domain template Q92LV5 domain model Gly140/141 Phe97 Asp96 Tyr95 Gly154/155 Phe103 Trp102 Gly140/141 Phe97 Asp96 Tyr95 Gly154/155 Phe103 Trp102 1t5b domain template Q92LV5 domain model

Protein Structure Initiative - the Need for Large-Scale Homology Modeling In the next five years PSI can determine approximately 3,000-4,000 protein structures, mainly at course granularity. Reality check: novel structures in PDB will represent very small fraction of sequences in GenBank – reliable homology modeling is critical for obtaining 3D models and extending experimental work. In PSI2 targets for structure determination are selected from large families, therefore determined structures have a large number of sequence homologues at wide range of sequence similarity. Protein often display different function. Homology modeling must provide tools and 3D proteins models that can be used for high-confidence, reliable interpretation of specific structural features in distant (15-25%) sequence homologues, protein function assignment and evolution. Models should provide guide for increasing number of more sophisticated experiments including: (i) aid mutagenesis and biochemical studies, (ii) predicting ligand binding, (iii) predicting oligomerization state, (iv) predicting cellular interactions (protein/protein/DNA/RNA). We need to consider how PSI target selection of protein sequences and subsequent structure determination can improve homology modeling and the quality of the models.

Major Issues with Large-Scale Homology Modeling for Structural Genomics 3D proteins models for distant (15-25%) sequence homologues are often not suitable. Because of sequence divergence for very large families only small fraction of sequences can be reliably modeled (10-20%). Homology modeling must provide input to target selection in fine coverage of protein families. Domain parsing needs improvement. We should be able to model multi-domain proteins from structures of individual domains. We should be able to model neighbouring side chains and important structural and functional features that currently are difficult to assigned and predict correctly. We need methods to predict unusual features and departures from the structure that is used for modelling. Modelling loop and high B factor regions needs improvement.

Structure of P5CR Exemplifies Challenges for Homology Modeling Two structures of P5CR were determined. The proteins share 22% sequence identity and 47% sequence similarity. Structures of monomer are very similar but show individual features. Problems: Protein has two domains and forms oligomers, one domain shows major swapping and protein forms different oligomeric forms in different species

Human Aldose Reductase – SeMet MAD at 0.9 Å Comparison – Experimental vs. Refined Map Refined 0.9 Å, sigmaA (2mF o -DF c ), contour level: 1 sigma Experimental 0.9 Å, F o, contour level: 1 sigma

MAD Map at 3.2 Å, 1.8 Å, 1.6 Å and 1.1 Å

Inhibitor Head Existing in Double Conformation Hard to Interpret at RT (1.45 Å ), Clear at 100 K (0.8 Å ) Tyr 48 His 110