Modeling the Structures of Proteins and Macromolecular Assemblies Depts. Of Biopharmaceutical Sciences and Pharmaceutical Chemistry California Institute.

Slides:



Advertisements
Similar presentations
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Advertisements

Common Evolutionary Origin for Coated Vesicles and Nuclear Pore Complexes Devos D, Dokudovskaya S, Alber F, Marti-Renom MA, Chait BT, Sali A, Rout MP UCSF.
Structure and composition of nuclear pore complexes (NPCs)
05/27/2006 Modeling and Determining the Structures of Proteins and Macromolecular Assemblies Depts. of Biopharmaceutical Sciences and Pharmaceutical Chemistry.
Protein Structure Database Introduction Database of Comparative Protein Structure Models ModBase 生資所 g 詹濠先.
Structural bioinformatics
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Protein structure (Part 2 of 2).
Thomas Blicher Center for Biological Sequence Analysis
Workshop on Biological Macromolecular Structure Models RCSB PDB Piscataway, NJ November 19-20, 2005 Topic 3: Structural Genomics and Models Contributors:
The Protein Data Bank (PDB)
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Structural Bioinformatics R. Sowdhamini National Centre for Biological Sciences Tata Institute of Fundamental Research Bangalore, INDIA.
Comparative Protein Structure Modeling Lecture 4.1
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
A computational study of protein folding pathways Reducing the computational complexity of the folding process using the building block folding model.
Lecture 10 – protein structure prediction. A protein sequence.
Comparative modeling with MODELLER Ben Webb, Andrej Sali Lab UC San Francisco Maya Topf, Birkbeck College, London.
Representations of Molecular Structure: Bonds Only.
Modeling the Structures of Macromolecular Assemblies
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
NIGMS Protein Structure Initiative: Target Selection Workshop ADDA and remote homologue detection Liisa Holm Institute of Biotechnology University of Helsinki.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
JM - 1 Introduction to Bioinformatics: Lecture XI Computational Protein Structure Prediction Jarek Meller Jarek Meller Division.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Homologue Clustering and Molecular Modeling L. Wang.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Modelling genome structure and function Ram Samudrala University of Washington.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Homology 3D modeling Miguel Andrade Mainz, Germany Faculty of Biology,
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Homology 3D modeling and effect of mutations
Prediction of Protein Structure and Function on a Proteomic Scale
Protein dynamics Folding/unfolding dynamics
Protein Structures.
Modeling the Structures of Proteins and Macromolecular Assemblies
Protein structure prediction.
Frank Alber, Michael F. Kim, Andrej Sali  Structure 
Volume 107, Issue 3, Pages (November 2001)
Volume 22, Issue 8, Pages (August 2014)
The ribosome – a macromolecular machine par excellence
Volume 26, Issue 6, Pages e2 (June 2018)
Gydo C.P. van Zundert, Adrien S.J. Melquiond, Alexandre M.J.J. Bonvin 
Architecture of a Coat for the Nuclear Pore Membrane
Yang Zhang, Jeffrey Skolnick  Biophysical Journal 
Presentation transcript:

Modeling the Structures of Proteins and Macromolecular Assemblies Depts. Of Biopharmaceutical Sciences and Pharmaceutical Chemistry California Institute for Quantitative Biomedical Research University of California at San Francisco Andrej Š ali 3/25/03

1. 1.Yeast and E. coli ribosomes: Electron microscopy, comparative modeling, and structural genomics Yeast Nuclear Pore Complex: Low-resolution modeling of large assemblies; bridging the gaps between structural biology, proteomics, and system biology Comments. 4/6/03

S. cerevisiae ribosome C. Spahn, R. Beckmann, N. Eswar, P. Penczek, A. Sali, G. Blobel, J. Frank. Cell 107, , Fitting of comparative models into 15Å cryo- electron density map. 43 proteins could be modeled on 20-56% seq.id. to a known structure. The modeled fraction of the proteins ranges from 34-99%. 3/25/03

E. coli ribosome H.Gao, J.Sengupta, M.Valle, A. Korostelev, N.Eswar, S.Stagg, P.Van Roey, R.Agrawal, S.Harvey, A.Sali, M.Chapman, and J.Frank. Cell, in press. Upon EF-G binding, the ribosome becomes less compact. In contrast to mRNA, many protein contacts undergo large conformational changes, suggesting ribosomal proteins facilitate the dynamics of translation. 4/6/03

START Get profile for sequence (NR) Scan sequence profile against representative PDB chains Scan PDB chain profiles against sequence PSI-BLAST M OD P IPE : Large-Scale Comparative Protein Structure Modeling Select templates using permissive E-value cutoff 1 Expand match to cover complete domains 1 Build model for target segment by satisfaction of spatial restraints Evaluate model Align matched parts of sequence and structure MODELLER R. Sánchez & A. Šali, Proc. Natl. Acad. Sci. USA 95, 13597, N. Eswar, M. Marti-Renom, M.S. Madhusudhan, B. John, A. Fiser, R. Sánchez, F. Melo, N. Mirkovic, A. Šali. For each target sequence For each template structure 3/25/03 END

Pieper et al., Nucl. Acids Res /25/03

Comparative modeling of the TrEMBL database Unique sequences processed: 733,239 Sequences with fold assignments or models: 415,937 (57%) 4/03/02 ~4 weeks on 500 Pentium III CPUs 70% of models based on <30% sequence identity to template. On average, only a domain per protein is modeled (an “average” protein has 2.5 domains of 175 aa). 3/25/03

Model Accuracy Marti-Renom et al. Annu.Rev.Biophys.Biomol.Struct. 29, , MEDIUM ACCURACY LOW ACCURACYHIGH ACCURACY NM23 Seq id 77% CRABP Seq id 41% EDN Seq id 33% X-RAY/ MODEL Sidechains Core backbone Loops C  equiv 147/148 RMSD 0.41Å Sidechains Core backbone Loops Alignment C  equiv 122/137 RMSD 1.34Å Sidechains Core backbone Loops Alignment Fold assignment C  equiv 90/134 RMSD 1.17Å 4/6/03

Future directions Make sure we have building blocks (structural genomics). Develop methods for simultaneous fitting of proteins into EM density and conformational modeling (induced fit, comparative modeling, ab initio). Need large computing (eg, cluster of hundreds of nodes with >1GB memories). 4/6/03

Rockefeller University, New York *UCSF F. Alber*, T. Suprapto, J. Kipper, W. Zhang, L. Veenhoff, S. Dokudovskaya, M. Rout, B. Chait, A. Šali* Modeling of the yeast nuclear pore complex by satisfaction of spatial restraints (MODELLER) 4/6/03

Modeling macromolecular assemblies by satisfaction of spatial restraints 1)Representation of a system. 2)Scoring function (spatial restraints). 3)Optimization. Sali, Ernst, Glaeser, Baumeister. From words to literature in structural proteomics. Nature 422, , /25/03 There is nothing but points and restraints on them.

Modeling of NPC Stochiometry. Parse proteins into domains. Protein and “sub-complex’ shapes from Stokes radii. Excluded volume of proteins. Symmetry of NPC (EM). Radial and axial localization of proteins (IEM). Protein-protein proximity (immuno-purification). Binary protein-protein contacts from “overlay” experiments. Modeling in the context of the nuclear envelope. Comparative models for some domains. Structural genomics of nucleoporins. 4/6/03

spokehalf-spoke cytosolic side nuclear side Schematic structure of yeast NPC nuclear membrane TOP VIEWSIDE VIEW half-spoke contains ~30 nucleoporin proteins (NUPs). ~480 NUPs in NPC. C.W. Akey, M. Rout 3/25/03

[sum of residues] complex dimension f(res) [A] Protein-protein “contacts” For each protein pair within a half-spoke, the upper bound on the center-center distance is the estimated maximal complex diameter: Complications: - multiple protein copies in half-spoke - inter-spoke interactions. 3/25/03

Successful optimization 11/18/023/25/03

Scan of figure NUP120 NUP84 NUP85 NUP145C SEH1 Cdc31 Experiment Rappsilber J, Siniossoglou S, Hurt EC, Mann M. Anal Chem 2000 Jan 15;72(2): Model NUP84-complex 4/6/03

Structural proteomics aims to characterize structures of most macromolecular complexes, in space and time. On average, a domain may interact with a few other domains. The function of a complex is determined by its structure and dynamics. There are too many complexes to be determined directly by high- resolution experimental structure determination. Thus, just like in structural genomics, an efficient combination of experiment and computation is required. 4/6/03

Structural Proteomics versus Structural Genomics Potential targets not clear clear Target selectionnot clearclear Scopenot clearclear Structure determinationhybridX-ray or NMR Functional Annotationessentialnot a major focus There are additional sciences and technologies in structural proteomics, relative to structural genomics. 4/6/03

Structural Genomics Characterize most protein sequences based on related known structures. There are ~16,000 30% seq id families (90%) (Vitkup et al. Nat. Struct. Biol. 8, 559, 2001) Sali. Nat. Struct. Biol. 5, 1029, Sali et al. Nat. Struct. Biol., 7, 986, Sali. Nat. Struct. Biol. 7, 484, Baker & Sali. Science 294, 93, The number of “families” is much smaller than the number of proteins. Any one of the members of a family is fine. Characterize most protein sequences based on related known structures: 3/25/03

Target selection for structural proteomics (scope) Comprehensive coverage. What targets: Stable assemblies? Transient complexes? Pairs of domains? Can they be organized into groups? How many such groups are there? 4/6/03

Target selection for structural proteomics? There are ~3,000 folds containing ~90% of all sequences. A target: Binary domain-domain interface. There may be a finite number of domain-domain interfaces, largely defined by the domain fold types. Given pairwise interactions, a large assembly may be reconstructed. 4/6/03

Protein and assembly structure by experiment and computation 3/25/3 Sali, Ernst, Glaeser, Baumeister. From words to literature in structural proteomics. Nature 422, , /25/03

Modeling macromolecular assemblies by satisfaction of spatial restraints 1)Representation of a system. 2)Scoring function (spatial restraints). 3)Optimization. Sali, Ernst, Glaeser, Baumeister. From words to literature in structural proteomics. Nature 422, , /25/03 There is nothing but points and restraints on them.

Future directions Development of general/flexible/hierarchical representation of assemblies. Quantifying spatial information from experiments. Optimization of structure. Toy models. Space, time. 4/6/03

Role of NIH Structural proteomics is timely and feasible. Humongous integration of concepts, sciences, methods, tools, people. Thus, need research centers, but also “R01” research. Significant computing. Bridging the gaps between structural biology, proteomics, and system biology. 4/6/03