Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.

Slides:



Advertisements
Similar presentations
Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Advertisements

Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Protein Tertiary Structure Prediction
Structural bioinformatics
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Protein Tertiary Structure Prediction. Protein Structure Prediction & Alignment Protein structure Secondary structure Tertiary structure Structure prediction.
Energetics and kinetics of protein folding. Comparison to other self-assembling systems?
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Protein Structure Prediction. Historical Perspective Protein Folding: From the Levinthal Paradox to Structure Prediction, Barry Honig, 1999 A personal.
Introduction to Protein Folding and Molecular Simulation Background of protein folding Molecular Dynamics (MD) Brownian Dynamics (BD) September, 2006 Tokyo.
Representations of Molecular Structure: Bonds Only.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Introduction to Proteomics 1. What is Proteomics? Proteomics - A newly emerging field of life science research that uses High Throughput (HT) technologies.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Homology modeling with SWISS-MODEL
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
BMC Bioinformatics 2005, 6(Suppl 4):S3 Protein Structure Prediction not a trivial matter Strict relation between protein function and structure Gap between.
Comparative methods Basic logics: The 3D structure of the protein is deduced from: 1.Similarities between the protein and other proteins 2.Statistical.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
What is Protein Folding? Implications of Misfolding Computational Techniques Background image: Staphylococcal protein A, Z Domain (
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
PROTEIN MODELLING Presented by Sadhana S.
Computational Structure Prediction
University of Washington
Protein Structure Prediction and Protein Homology modeling
Protein dynamics Folding/unfolding dynamics
Introduction to Proteomics
Protein dynamics Folding/unfolding dynamics
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Protein structure prediction.
Homology modeling in short…
Presentation transcript:

Applied Bioinformatics Week 12

Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with another? How to include functional/activity/pathway information in databases? How to extract functional motifs from sequence data? How to predict phenotype from proteotype?

Bioinformatics & Expressional Proteomics How to correlate changes in protein expression with disease? How to distinguish important from unimportant changes in expression? How to compare, archive, retrieve gel data? How to rapidly, accurately identify proteins from MS and 2D gel data? How to include expression info in databases?

Bioinformatics & Structural Proteomics How to predict 3D structure from 1D sequence? How to determine function from structure? How to classify proteins on basis of structure? How to recognize 3D motifs and patterns? How to use bioinformatics databases to help in 3D structure determination? How to predict which proteins will express well or produce stable, folded molecules?

Protein Folding Problem “ Predict a three-dimensional structure of a protein from its amino acid sequence. ” “ How does a protein fold into the structure? ” This question has not been solved for more than half a century.

Proteins Can Fold into 3D Structures Spontaneously The three-dimensional structure of a protein is self-organized in solution. If we can calculate the energy of the system precisely, it is possible to predict the structure of the protein! The structure corresponds to the state with the lowest free energy of the protein-solvent system. (Anfinsen’s dogma)

Levinthal Paradox We assume that there are three conformations for each amino acid (ex. α-helix, β-sheet and random coil). If a protein is made up of 100 amino acid residues, a total number of conformations is = ≒ 5 x If 100 psec ( sec) were required to convert from a conformation to another one, a random search of all conformations would require 5 x x sec ≒ 1.6 x years. However, folding of proteins takes place in msec to sec order. Therefore, proteins fold not via a random search but a more sophisticated search process. We want to watch the folding process of a protein using molecular simulation techniques.

Why is the “ Protein Folding ” so Important? Proteins play important roles in living organisms. Some proteins are deeply related with diseases. And structural information of a protein is necessary to explain and predict its gene function as well as to design molecules that bind to the protein in drug design. Today, whole genome sequences (the complete set of genes) of various organisms have been deciphered and we realize that functions of many genes are unknown and some are related with diseases. Therefore, understanding of protein folding helps us to investigate the functions of these genes and to design useful drugs against the diseases efficiently. In addition to that, the understanding opens the door to designing of proteins having novel functions as new nano machines.

Forces Involved in the Protein Folding Electrostatic interactions van der Waals interactions Hydrogen bonds Hydrophobic interactions

The Energy Function Calculate energies for each particle Since long range interactions important for each pair of particles the pair-wise interactions should be calculated

System for Folding Simulations Without water molecules With water molecules # of atoms: 304 # of atoms: ,377 = 7,681

Much Faster, Much Larger! Special-purpose computer –Calculation of non-bonded interactions is performed using the special chip that is developed only for this purpose. –For example; MDM (Molecular Dynamics Machine) or MD-Grape: RIKEN MD Engine: Taisho Pharmaceutical Co., and Fuji Xerox Co. Parallelization –A single job is divided into several smaller ones and they are calculated on multi CPUs simultaneously. –Today, almost all MD programs for biomolecular simulations (ex. AMBER, CHARMm, GROMOS, NAMD, MARBLE, etc) can run on parallel computers.

Homology Modeling Template Selection and Fold Assignment Target – Template Alignment Model Building –Loop Modeling –Sidechain Modeling Model Evaluation

Fold Assignment and Template Selection Identify all protein structures with sequences related to the target, then select templates 3 main classes of comparison methods –Compare the target sequence with each database sequence independently, pair-wise sequence – sequence comparison, BLAST and FASTA –Multiple sequence comparisons to improve sensitivity, PSI-BLAST –Threading or 3-D template matching methods

Target – Template Alignment Most important step in Homology Modeling A specialized method should be used for alignment –Over 40% identity the alignment is likely to be correct. –Regions of low local sequence similarity become common when overall sequence identity is under 40%. (Saqi et al., Protein Eng. 1999) –The alignment becomes difficult below 30% sequence identity. (Rost, Protein Eng. 1999)

Model Building Construct a 3-D model of the target sequence based on its alignment on template structures Three different model building approaches –Modeling by rigid body assembly –Modeling by segment matching –Modeling by satisfaction of spatial restraints Accuracies of these models are similar Template selection and alignment have larger impact on the model

Swiss-MOD Web Server Screenshots from the Homology Modeling Server Swiss-Model Construct a framework using known protein structures Generate the location of the target amino acids on the framework If loop regions not determined, additional database search or short simulations

Procedure of the MODELLER program After obtaining restraints run a geometry optimization or real- space optimization to satisfy them

Errors in Homology Models a.Errors in side chain packing b.Distortions and shifts in correctly aligned regions c.Errors in regions without a template

d. Errors due to misalignment e. Incorrect templates

Model Building Programs COMPOSERPwww-cryst.bioc.cam.ac.uk CONGENPwww.congenomics.com/congen/congen.html CPH modelsSwww.cbs.dtu.dk/services/CPHmodels/ DRAGONPwww.nimr.mrc.ac.uk/~mathbio/a-aszodi/dragon.html ICMPwww.molsoft.com InsightIIPwww.msi.com MODELLERPguitar.rockefeller.edu/modeller/modeller.html LOOKPwww.mag.com QUANTAPwww.msi.com SYBYLPwww.tripos.com SCWRLPwww.cmpharm.ucsf.edu/~bower/scrwl/scrwl.html SWISS-MODSwww.expasy.ch/swissmod WHAT IFPwww.sander.embl-heidelberg.de/whatif/

Applications

End Theory Mind mapping 10 min break

Practice

3D Structure Prediction? Get a protein sequence Go to: –Use threading Got to: –Find known structure –Ab inito prediction FoldIt ( Crystal structure of a monomeric retroviral protease solved by protein folding game players. Increased Diels-Alderase activity through backbone remodeling guided by Foldit players.