Protein Structure. Why study protein structure? Studying the structure model allows better understanding of the structure-function relationship, and is.

Slides:



Advertisements
Similar presentations
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Advertisements

The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
1 September, 2004 Chapter 5 Macromolecular Structure.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Structural Bioinformatics Workshop Max Shatsky Workshop home page:
Protein Structure, Databases and Structural Alignment
RasMol workshop Protein Structure and Function A.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
Structural Bioinformatics Workshop Max Shatsky Workshop home page:
Appendix: Automated Methods for Structure Comparison Basic problem: how are any two given structures to be automatically compared in a meaningful way?
The Protein Data Bank (PDB)
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
Structures and Structure Descriptions Chapter 8 Protein Bioinformatics.
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
Dali: A Protein Structural Comparison Algorithm Using 2D Distance Matrices.
A unified statistical framework for sequence comparison and structure comparison Michael Levitt Mark Gerstein.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
Multiple Sequence Alignment May 12, 2009 Announcements Quiz #2 return (average 30) Hand in homework #7 Learning objectives-Understand ClustalW Homework#8-Due.
ClusPro: an automated docking and discrimination method for the prediction of protein complexes Stephen R. Comeau, David W.Gatchell, Sandor Vajda, and.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Introduction to Protein Structure
Basic Computations with 3D Structures
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY Basics of …….. NMR phenomenonNMR phenomenon Chemical shiftChemical shift Spin-spin splittingSpin-spin splitting.
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
A data-mining approach for multiple structural alignment of proteins WY Siu, N Mamoulis, SM Yiu, HL Chan The University of Hong Kong Sep 9, 2009.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Pair-wise Structural Comparison using DALILite Software of DALI Rajalekshmy Usha.
X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Jürgen Sühnel Supplementary Material: 3D Structures of Biological Macromolecules Exercise 1:
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Lecture 11 CS5661 Structural Bioinformatics – Structure Comparison Motivation Concepts Structure Comparison.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
Local Flexibility Aids Protein Multiple Structure Alignment Matt Menke Bonnie Berger Lenore Cowen.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Ultra-High Resolution X-ray Diffraction
Protein Structures.
Crystal structure of human mitochondrial NAD(P)+-dependent malic enzyme: a new class of oxidative decarboxylases  Yingwu Xu, Girija Bhargava, Hao Wu,
Yeast RNA Polymerase II at 5 Å Resolution
Volume 13, Issue 9, Pages (December 2015)
Crystal Structure of the MHC Class I Homolog MIC-A, a γδ T Cell Ligand
Structure of Bax  Motoshi Suzuki, Richard J. Youle, Nico Tjandra  Cell 
Structural Analysis of Ligand Stimulation of the Histidine Kinase NarX
Volume 21, Issue 6, Pages (June 2013)
Crystal Structure of the p53 Core Domain Bound to a Full Consensus Site as a Self- Assembled Tetramer  Yongheng Chen, Raja Dey, Lin Chen  Structure  Volume.
Protein Structure Alignment
Structure and Dynamics of Zymogen Human Blood Coagulation Factor X
Volume 26, Issue 1, Pages e3 (January 2018)
Volume 12, Issue 11, Pages (November 2004)
Presentation transcript:

Protein Structure

Why study protein structure? Studying the structure model allows better understanding of the structure-function relationship, and is an important starting point for many kinds of research

Structure determination Crystallography: A solution of protein molecules is assembled into a periodic lattice A solution of protein molecules is assembled into a periodic lattice The crystal is bombarded with X-ray beams The crystal is bombarded with X-ray beams The collision of the beams with the atom electrons creates a diffraction pattern The collision of the beams with the atom electrons creates a diffraction pattern The diffraction pattern is transformed into an electron density map of the protein from which the 3D locations of the atoms can be deduced The diffraction pattern is transformed into an electron density map of the protein from which the 3D locations of the atoms can be deduced F F

Structure determination Nuclear magnetic resonance (NMR): A solution of the protein is placed in a magnetic field A solution of the protein is placed in a magnetic field Spins align parallel or Spins align parallel or anti-parallel to the field RF pulses of electromagnetic energy shift spins from their alignment RF pulses of electromagnetic energy shift spins from their alignment Upon radiation termination spins re- align while emitting the energy they absorbed Upon radiation termination spins re- align while emitting the energy they absorbed The emission spectrum contains information about the identity of the nuclei and their immediate environment The emission spectrum contains information about the identity of the nuclei and their immediate environment The result is an ensemble of models rather than a single one The result is an ensemble of models rather than a single one

PDB: Protein Data Bank

PDB model  Defines the 3D coordinates (x,y,z) of each of the atoms in one or more molecules (i.e., complex)  There are models of proteins, protein complexes, proteins and DNA, protein segments, etc  The models also include the positions of ligand molecules, solvent molecules, metal ions, etc  PDB ID: integer + 3 integers/characters (e.g., 1a14)

The PDB file format

ATOM Records: Usually protein or DNA HETATM Records: Usually Ligand, ion, water chain Residue identity Residue number Atom number Atom identity atom coordinates Temperature factor XYZ Occupancy

Background and motivation  DNA methylation at DNA CpG sites has a central role in imprinting (plants and mammals), but it is not clear how the imprinting machinery recognizes its target genes  The Dnmt3 protein family (a,b,l) are de novo methyltransferases  Dnmt3a and Dnmt3l KO mice show altered sex- specific de novo methylation in germ cells, indicating that these proteins are both required for the methylation of most imprinted loci in germ cells Goal: Conduct a structural and biochemical study of a homogeneous complex of Dnmt3L and Dnmt3a

Methods  Dnmt3a2, the shorter isoform of Dnmt3a that is the predominant form in embryonic stem cells was selected  For crystallographic reasons, a stable complex of the C-terminal domains from both proteins (Dnmt3a2-C and Dnmt3L-C) that retains substantial methyltransferase activity was focused on

Results  The complex is a tetramer: Dnmt3L–Dnmt3a–Dnmt3a– Dnmt3L  Mutagenesis at positions in both interfaces (a-a and a-L) indicate that these interfaces are essential for catalysis  Dnmt3a-Dnmt3a dimerization brings two active sites together  Dimeric Dnmt3a could methylate two CpGs separated by one helical turn in one binding event Dnmt3a Dnmt3l

“We observed a highly significant correlation of methylation status at distances of eight to ten base pairs between two CpG sites” Distribution of CpG sites among 12 known maternally imprinted genes, indicated to be Dnmt3a-Dnmt3l targets

Protein visualization Visualization tools (working on PC): RasMol / RasTop SwissPDBviewer (sPDBv) Protein Explorer (via the web) And many more…

Rastop / Rasmol

RasTop- main menu פתיחת קובץ סגירת קובץ

RasTop - display Wireframe קווים בין אטומים

Sphere VDW מנפח כל אטום לפי רדיוס ה- שלוVDW

Command editor

More on RasTop  RasMol manual RasMol manual RasMol manual  Using RasTop Using RasTop Using RasTop  Commands Commands

Structure alignment Essential for: Protein classification Detection of conserved protein folding cores Detection of similarities between domains Detection of similarities in functional binding sites Evolutionary conservation Construction of nonredundant databases

Pairwise structure alignment Outline: Given two proteins structures, find the transformation that produces the best superimposition of one protein onto the other

Computationally Find the rotations and translations of one of the points set (atoms of protein A) which produce “large” superimpositions on the other points set (atoms of protein B) ? X Y Z X Y Z

RMSD Root Mean Square Deviation Average distance between the matched superimposed atoms usually between backbones Cα atoms

 Matches C-alpha atoms  Rigid pairwise alignement  Sequence order independent  Input: two PDB files or PDB IDs with specific chains  Output: a set of high scoring conformations The superimposed structures may be viewed in a PDB viewer The superimposed structures may be viewed in a PDB viewer

BOBWHITE QUAIL LYSOZYME HEN EGG WHITE LYSOZYME

Results Ranking criteria: 1.Match size 2.RMSD

C-alpha correspondence

Aligned PDB file

 Flexible structural alignment  The first structure is assumed to be rigid, while in the second structure potential flexible regions - hinges, are automatically detected  Input: two PDB IDs (specific chain)  Output: list of alignments ranked according to the number of hinges

Results

Result with 0 hinges:

Result with one hinge:

 Multiple structural alignments of protein structures  Finds the common geometrical cores between the input molecules  Does not require that all the input molecules participate in the alignment  Actually, it efficiently detects high scoring partial multiple alignments for all possible number of molecules from the input  The final structural alignment can either preserve the sequence order (like sequence alignment), or be sequence order independent

Results

DALI - Distance matrix ALIgnment Concept: “Similar 3D structures have similar inter- residue distances”

DALI Algorithm  Generates an inter-residue distance matrix for each protein  The distance matrix contains all pairwise distances (symmetrical)  Dij = distance between C-alpha i and C-alpha j in the same protein  Compares the two distance matrices for a pair of proteins to be aligned

DALI Services DALI sever Used by crystallographers to compare a newly solved structure against structures in the PDB DALI database Contains all-vs.-all PDB 3D structure comparisons and thus enables to find structural neighbors of structures that are already in the PDB Pairwise server Pairwise comparison of two structures Dalilite A standalone version of DALI

DALI Database

Non-redundant chains - no two chains are more than 90 % sequence identical DALI Database Number of structurally aligned residues Number of residues in the protein Sequence identity of aligned positions

Supplementary

אלמנטים של מבנה שניוני, קו המחבר בין C-alpha RasTop - Display

מחזק מבנה שניוני RasTol - Display

Remove rendering Clears the view from previous actions performed

לכל חומצה אמינית צבע משלה תצוגה מסוימת בו לכל אטום צבע מסוים עד כמה מיקום האטום קבוע במבנה: אדום- חופש גדול, כחול-קבוע מאוד מבנה שניוני – צהוב- beta sheet, כחול – loops and turns, אדום – alpha helix, לבן – random coil. RasTop - Color

RasTop - Labels סימון כל המולקולה או קטעים נבחרים בשם והמספר של חומצת האמינו.

The Select Command Select – מגדירה את האזור שעליו נפעל במולקולה - סט האטומים שיעבור מודיפיקציות ע"י שרשרת הפקודות הבאות. הפרמטר של פקודת ה- select הוא “atom expression”. ה - atom expression מגדיר באופן ייחודי קבוצה שרירותית של אטומים בתוך מולקולה. הוא יכול להיות: Primitive expressions, Predefined sets, Within expressions, or logical combination of all above mentioned. In order to display only what we selected, use the command: Edit => restrict

The primitive expressions allow to select by:  Atom number - select atomno=102  Residue – select Val52 (select resno=52 or select 52)  Chain id – select :a  List of residue numbers – select 14,92,46  Range of atom numbers – select atomno=>35  A wildcard can be used to specify a whole field: * Any number of characters Atom or residue type – select *.sg (this will select all Sulphur atoms in Cysteine’s side chain) ? Single character wildcard – select ser.c? – will select all carbons in all serine residues.

The predefined sets are groups of atoms given the definite names: select helix select hoh (water molecules) select protein There is a list with the predefined sets in the Rasmol reference card (google it)

Boolean Expressions And – המשותף לשני תנאים Or – חיבור בין שני תנאים (גם את זה וגם את זה) Not – מה לא לכלול דוגמאות: select tyr and :a → all tyr in ‘a’ chain select tyr or :a → all tyr in the molecule and all ‘a’ chain select not (tyr,:a) → all the molecule beside tyr or ‘a’ chain

Using Select Expression

…or via the command (edit  command)

1.Spacefill 2.Color picker 3.Make sure you’re on atoms!