Homology 3D modeling and effect of mutations. X-ray crystallography (70,714 in PDB) need crystals Nuclear Magnetic Resonance (NMR) (9,312) proteins in.

Slides:



Advertisements
Similar presentations
Functional Site Prediction Selects Correct Protein Models Vijayalakshmi Chelliah Division of Mathematical Biology National Institute.
Advertisements

Protein Structure Prediction using ROSETTA
Chimera. Chimera 1/3 Starting Chimera Open Chimera from desktop (ZDV app) (If there is an update it will take a minute or two) Open a 3D structure by.
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Secondary structure prediction. Amino acid sequence -> Secondary structure Alpha helix Beta strand Disordered/coil 70% accuracy 1991, 81% accuracy in.
Sequence alignment & Substitution matrices By Thomas Nordahl & Morten Nielsen.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Protein structure determination. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography,
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Thomas Blicher Center for Biological Sequence Analysis
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein structure determination & prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray.
IV. Protein Structure Prediction and Determination Methods of protein structure determination Critical assessment of structure prediction Homology modelling.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
PolyPhen and SIFT: Tools for predicting functional effects of SNPs Epi 244 Spring 2009 Sam S. Oh.
Genetic Threading By J.Yadgari and A.Amir Published: special issue on Bioinformatics in Journal of Constraints, June 2001 Alexandre Tchourbanov University.
1 Computational Biology, Part 11 Retrieving and Displaying Macromolecular Structures Robert F. Murphy Copyright  1996, 1999, All rights reserved.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Part II : Introduction To Protein Structure Kong Lesheng Victor Tong Joo Chuan National University of Singapore.
Protein domains. Protein domains are structural units (average 160 aa) that share: Function Folding Evolution Proteins normally are multidomain (average.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
COMPARATIVE or HOMOLOGY MODELING
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
1 PyMOL Evolutionary Trace Viewer 1.1 Lichtarge Lab Sept. 13, 2010.
Representations of Molecular Structure: Bonds Only.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Manually Adjusting Multiple Alignments Chris Wilton.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Tutorial 3 BLAST 1. BLAST tutorial How to use BLAST Score vs. E-value Exercise Cool story of the day: How Alzheimer is studied in yeast 2.
X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Copyright OpenHelix. No use or reproduction without express written consent1.
Homology 3D modeling and effect of mutations. X-ray crystallography (70,714 in PDB) need crystals Nuclear Magnetic Resonance (NMR) (9,312) proteins in.
Challenges and accomplishments in molecular prediction Yanay Ofran.
Homology 3D modeling and effect of mutations Miguel Andrade Faculty of Biology, Johannes Gutenberg University Institute of Molecular Biology Mainz, Germany.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Homology 3D modeling Miguel Andrade Mainz, Germany Faculty of Biology,
Protein 3D representation
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Protein 3D representation
Secondary structure prediction
Protein Structure Prediction and Protein Homology modeling
Protein 3D representation
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Homology 3D modeling and effect of mutations
Prediction of Protein Structure and Function on a Proteomic Scale
Molecular Modeling By Rashmi Shrivastava Lecturer
Protein structure prediction.
Solution and Crystal Structures of a Sugar Binding Site Mutant of Cyanovirin-N: No Evidence of Domain Swapping  Elena Matei, William Furey, Angela M.
Homology 3D modeling Miguel Andrade Mainz, Germany Faculty of Biology,
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Protein 3D representation
Volume 21, Issue 6, Pages (June 2013)
Presentation transcript:

Homology 3D modeling and effect of mutations

X-ray crystallography (70,714 in PDB) need crystals Nuclear Magnetic Resonance (NMR) (9,312) proteins in solution lower size limit (600 aa) Electron microscopy (422) Low resolution (>5A) Determination of protein structure

resolution 2.4 A

Determination of protein structure resolution 2.4 A

Structural genomics Currently: 106K 3D structures from around 35K seqs 90M sequences in UniProt only 0.04%!

Structural genomics 50% sequences covered (25% in 1995) Currently: 106K 3D structures from around 35K seqs 90M sequences in UniProt only 0.04%!

3D structure prediction Applications: target design Query sequence catalytic center known 3D Leu Gly model 3D by homology Gly Lys + similar to LG GK

3D structure prediction Applications: fit to low res 3D Query sequence 1 low resolution 3D (electron microscopy) Query sequence 2

Protein domains are structural units (average 160 aa) that share: Function Folding Evolution Proteins normally are multidomain (average 300 aa) Domains

Protein domains are structural units (average 160 aa) that share: Function Folding Evolution Proteins normally are multidomain (average 300 aa) Domains

Query Sequence Yes 3D Modeling by homology No 2D Prediction 3D Ab initio 3D Threading Similar to PDB sequence? Predict domains Cut

3D structure prediction Ab initio Explore conformational space Limit the number of atoms Break the problem into fragments of sequence Optimize hydrophobic residue burial and pairing of beta-strands Limited success

3D structure prediction Threading I-Tasser: Jeffrey Skolnick & Yang Zhang Lee and Skolnick (2008) Biophysical Journal Roy et al (2010) Nature Methods Yang et al (2015) Nature Methods Fold 66% sequences <200 aa long of low homology to PDB Just submit your sequence and wait… (some days) Output are predicted structures (PDB format)

3D structure prediction I-Tasser Roy et al (2010) Nature Methods

3D structure prediction I-Tasser

3D structure prediction I-Tasser

3D structure prediction QUARK

3D structure prediction GenTHREADER David Joneshttp://bioinf.cs.ucl.ac.uk/psipred/ Input sequence or MSA Typically 30 minutes, up to two hours GenTHREADER Jones (1999) J Mol Biol

Output GenTHREADER 3D structure prediction GenTHREADER

3D structure prediction Phyre Kelley et al (2000) J Mol Biol Kelley and Sternberg (2009) Nature Protocols Processing time can be hours

3D structure prediction Static solutions Datasets of precomputed models / computations Not flexible Variable coverage But you don’t have to wait

3D structure prediction MODbase Andrej Salihttp://modbase.compbio.ucsf.edu/ Pieper et al (2011) Nucleic Acids Research

3D structure prediction MODbase

Protein Model Portal Haas et al. (2013) Database Torsten Schwede 3D structure prediction

Aquaria Sean O’Donoghuehttp://aquaria.ws/ O’Donoghue et al (2015) Nature Methods

Aquaria

Starting aquaria (May require a Java update) Works best in Firefox (in Chrome with reduced functionality) Open Firefox mit JRE (from ZDV) Go to Run an example. If JAVA blocked unblock it at the plugin icon Exercise 1/5

Starting aquaria Note that aquaria.ws requires that two java plug-ins that need to be allowed to run Exercise 1/5

Exercise 2/5 You can load a protein by its UniProt ID Try Myosin X: Zoom in and out using the mouse wheel (or with shift and drag up and down). Rotate by click and drag Click on a residue to select. Shift + Click selects a range. Esc clears the selection. Double click on a residue centers the molecule on it. Right click and drag moves the molecule laterally Compare the different hits with domain annotations using the feature view Comparing different matches in Myosin X

Exercise 3/5 Type NR3C2 in protein name (human mineralocorticoid receptor) Note and compare the multiple hits. Which proteins are those? What do they match in the human mineralocorticoid receptor? (Use the Features view) Comparing different matches in the human MR The further down the less similar are the proteins compared. This is represented by a darker color.

Exercise 4/5 Load the human protein CTNNB1 (Catenin beta-1) Click on the 'Features' tab (bottom of the window) Double click on the feature lane titled 'Amino acid modification’ (post-translational modification). This will highlight the residues in the structure. Then you can click on the residues to see their position and amino acid. Which two amino acid modifications are close in structure, but not in sequence? Which type of modifications are those? Change representation to ball and stick to see the side chains. Do the side chains of the modified residues look like they could interact? Post-translational modifications in CTNNB1

Effect of mutations

Polyphen2

Polyphen2 Training Human protein MSDFGASDFG... Human protein MSDFGASDFG... 6,321 mutations versus mammalian homologs Mouse protein MSDFGATDFG... 8,946 mutations not causing disease Human variant MSDFGAADFG... mildly deleterious 3,155 mutations causing Mendelian disease Disease protein MSDFGARDFG... 13,032 mutations causing disease (UniProt) Disease protein MSDFGARDFG...

Polyphen2 PSIC Score Likelihood of an amino acid to occupy a specific position in the protein sequence given the pattern of amino acid substitutions observed in the multiple sequence alignment EGKLQVQQGTGRFISR DGNLHVNQGMGRFIPR DGNLHVNKGMGRFIPR DGNISVSKGMGRFIPR EGTLHTTEGSGRFISR EGTLHATEGSGRYIPR DGNLHVTEGSGRYIPR DGTLHVTEGSGRYIPR DGNLHVSQGSGRFVPR DGNLFVTEGSGRFVPR DGKMFVTPGAGRFVPR DGNLLVTPGAGRFIPR DGTLSVMEGSGRFIPR DGNLHATSGTGRFIPC Reference Homologs Low scoreHigh score

Polyphen2 Usage

Polyphen2

Exercise 5/5 Let’s see if you can design a damaging and a benign mutation for human myosin X (open in chimera PDB 3PZD to view and select candidate mutations). Go to the Polyphen2 home page: Type the UniProt id of the protein sequence “Q9HD67” in the Protein Identifier window. Type the position of your candidate for a damaging mutation. Select in AA1 the type of amino acid at that position. Now, select an amino acid to mutate to. May be try one with a large side chain, or if the wild type one was hydrophobic, try a hydrophilic one. Be nasty! Then hit Submit Query. What result did you get? Is it close to one? Try your benign mutation in the same way. This time may be choose to mutate to a similar residue to the wild type one. Be gentle! Then hit Submit Query. What result did you get? Is it close to zero? Study the effect of mutants with Polyphen2