Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.

Slides:



Advertisements
Similar presentations
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Advertisements

Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Protein Tertiary Structure Prediction
Structural bioinformatics
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Strict Regularities in Structure-Sequence Relationship
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
Protein Structure, Databases and Structural Alignment
Protein structure (Part 2 of 2).
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Thomas Blicher Center for Biological Sequence Analysis
©CMBI 2002 Homology modelling ? X-ray ? NMR ? Intro Proteins Modelling 8 Steps Detect Threading Alignment Template Side chain Indels Optimize Validate.
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
Protein Tertiary Structure Prediction Structural Bioinformatics.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Computing for Bioinformatics Lecture 8: protein folding.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Protein Structure Prediction and Analysis
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
Computer-Assisted Drug Design (1) i)Random Screening ii)Lead Development and Optimization using Multivariate Statistical Analyses. iii)Lead Generation.
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
HOMOLOGY MODELLING Chris Wilton. Homology Modelling   What is it and why do we need it? principles of modelling, applications available   Using Swiss-Model.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
PROTEIN MODELLING Presented by Sadhana S.
Protein Structure Visualisation
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Protein dynamics Folding/unfolding dynamics
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
3-Dimensional Structure
Homology Modeling.
Protein structure prediction.
Protein structure prediction
Homology modeling in short…
Presentation transcript:

Tutorial Homology Modelling

A Brief Introduction to Homology Modeling

Sequence-Structure-Function Relationships ● Proteins of similar sequences fold into similar structures and perform similar biological functions. ● The protein sequence has the intrinsic information to encode the protein structure.

The Noble Prize in Chemistry 1972 Christian B Anfinsen "for his work on ribonuclease, especially concerning the connection between the amino acid sequence and the biologically active conformation"

From Nobel Lecture, December 11, 1972, by Christian Anfinsen The protein sequence is sufficient to specify its 3D structure

Sequence->Structure->Function ● Widespread Automated DNA sequencing => more sequence data than structure data ● Semi-Automated pipeline of structure determination is still not widespread. ● Nevertheless, structure is more conserved than sequence. ● Sequence homologs => structural homologs ● See Chapter 9, Baxevanis and Ouellette 3 rd edn.

Protein Structure Prediction vs Experimental Determination From Chapter 9, Bryan Bergeron, Bioinformatics Computing, 2003 Pearson Education, Inc.

Structure Prediction from sequence 1.Homology (or comparative) modelling 2.Threading 3.Ab initio calculations Homology modelling is most accurate and powerful

What is Homology Modeling? ● Homology modeling also known as comparative modeling uses homologous sequences with known 3D structures for the modelling and prediction of the structure of a target sequence. ● Homology modeling is one of the most best performing prediction methods that gives “accurate” predicted models.

How is Homology Modeling done ● Multistep process involves many steps such as: – Sequence alignment of target/query/unknown protein sequence to homologous sequence with a known structure – structure modification of backbone – side chain replacements – Energy minimisation for refinement of structural model – Validation of model with visual inspection and etc

Why Homology Modeling? ● The number of protein structures solved so far are fewer than the number of genes known. ● Proteins of biological interest with their orthologous proteins solved by X-ray crystallography or NMR can be modeled. ● Homology modeling is an important method used to predict the structures of membrane proteins, ion channels, transporters that are large and difficult to crystallize. ● Examples: GPCR (G Protein-coupled receptor), cytochrome P450 etc.

Overview of the process of Homology Modeling ● A target sequence (the structure to be predicted) ● Identify the homologous sequence with known 3D as template ● Using homology modeling software such as Modeller for structure prediction (from the Sali Lab) ● Model evaluation and refinement

Pre-Modeling Stage: Template Identification ● Target sequence in FASTA format as input ● Blastp against PDB ● Identify proteins with “good” hit ● Pairwise or multiple sequence alignment ● Further editing the alignment results ● Realign and identify the “good” structural template

Pre-Modeling Stage: Preparing the Input Files for Modeller ● PDB files for structural templates is required ● The PIR file from the alignment results ● The script file model.top to execute the Modeller program (latest versions use Python scripts)

In the Heart of Modeller From the Modeller manual

Evaluation of Predicted Model Garbage in-Garbage out ● The predicted model can be superimposed with known structure determined by experiment ● The predicted model is normally evaluated by root mean square deviation (RMSD)

From

Calculating RMSD N = number of atoms, d = the distance in Angstrom between corresponding atoms in the experimental and predicted protein structures. From Chapter 9, Bryan Bergeron, Bioinformatics Computing, 2003 Pearson Education, Inc.

● Some Rule of Thumb for Structural Modelling ● Proteins that share 35 to 50% sequence identity with their templates, will generally deviate by 1.0 to 1.5 Å from their experimental counter parts. ● Crystallographic structures of identical proteins can vary not only because of experimental errors and differences in data collection conditions and refinement, but also because of different crystal lattice contacts and the presence or absence of ligands.

Quality of Model ● The correctness of a model is essentially determined by the quality of the sequence alignment used to identify the template. ● If the sequence alignment is wrong in some regions, then the spatial arrangement of the residues in this portion of the model will be incorrect.

Viewing the Model ● The predicted model is saved in PDB format that can be viewed by molecular visualizing software such as Rasmol, PyMol, MolMol, Sybyl etc. ● Viewing is an essential step to validate the quality of the predicted model. ● In this practical, Rasmol is used to view the predicted structure.

Model Refinement ● Gaps in sequence alignment represent insertion/deletion regions of target. Loop modeling is used to refine these regions (not cover in this practical) ● The predicted model can be further refined by energy minimization to remove unfavourable non-bonded contacts with force fields such as CHARMM, AMBER or GROMOS etc (not covered in this practical)

Web-Based Homology Modeling: The SWISS-MODEL Server ● The aim of the Internet-based SWISS-MODEL server is to provide a comparative protein modelling tool independent from expensive computer hardware and software.

Steps involved in SwissModel Take target sequence of unknown structure 2.Using BLAST to select closest homolog with known structure as structural template 3.Insert target sequence and homologous sequence to Web service 4.Results will be ed back to you. 5.Warning: Structure needs to be analysed and validated

Simple Homology Modelling using Modeller 1.Take target sequence of unknown structure 2.Using BLAST to select closest homolog with known structure. 3.Using Clustalx or Jalview to do pairwise alignment between target sequence and structural homolog and manual adjustment 4.Inspection of missing structural features in structural homolog 5.Preparation of alignment file align.pir 6.Use Modeller7v7 software ( to do the homology modellinghttp://salilab.org/modeller/

Structure Validation ● Visual inspection – Minimise torsion angles in disallowed regions of Ramachandran plots – Maximised hydrogen bonding – Minimised exposed hydrophobic residues – Packing etc. ● Analysis – e.g. run Procheck ( ), VADAR, Verify3D etc