Protein structure prediction (May 30, 2002; Quiz #4 on June 4)

Protein structure prediction
May 30, 2002. Quiz #4 on June 4.
Learning objectives:
Understand the difference between primary, secondary, and tertiary structure.
Learn how to display and manipulate protein structures with Deep View.
Learn the steps of protein structure prediction with SIMS and VAST.
Understand the MMDB database.
Workshop: manipulation of the hen lysozyme protein structure with Deep View.

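Primary, secondary, supersecondary, and tertiary structure
[Figure: four panels labeled Primary, Secondary, Supersecondary, and Tertiary. The recoverable text shows the sequence ACFTYPL … and the same sequence annotated per residue as sssccss.]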

Protein structure viewers: RasMol, Deep View, Cn3D, WebLabViewer

Steps to tertiary structure prediction
Comparative protein modeling extrapolates a new structure from related family members.
Steps:
1. Identification of modeling templates
2. Alignment
3. Model building

Identification of modeling templates
One chooses a cutoff value for the FastA or BLAST search.
Up to ten templates can be used, but the one with the highest sequence similarity is the reference template.
Cα atoms are selected for superimposition.
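The template selection rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Hit` record, the use of percent identity as the cutoff quantity, and the default cutoff of 30.0 are hypothetical and do not come from the slides, which only say that a cutoff is chosen and that at most ten templates are used.

```python
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One search hit (hypothetical record, not a FastA or BLAST output format)."""
    template_id: str
    identity: float  # percent sequence identity to the target (assumed measure)


def choose_templates(hits, cutoff=30.0, max_templates=10):
    """Keep hits at or above the cutoff, best first, and cap the list.

    Returns (reference, templates); the reference is the most similar hit,
    or None when no hit passes the cutoff.
    """
    passing = sorted(
        (h for h in hits if h.identity >= cutoff),
        key=lambda h: h.identity,
        reverse=True,
    )
    templates = passing[:max_templates]
    reference: Optional[Hit] = templates[0] if templates else None
    return reference, templates
```

Calling `choose_templates` on a list of hits returns the reference template together with the ranked templates that would go on to the superimposition step.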

Alignment
The superimposition of the templates is optimized.
The “common core” and conserved loops of the target sequence are threaded onto the template structure.

Building the model
Framework construction: average the position of each atom in the target, based on the corresponding atoms in the templates. Areas that do not match the template are constructed with a “spare part” algorithm.
Completing the backbone: a library of PDB entries is consulted.
Side chains are added.
Model refinement: minimization of energy.
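The averaging step of framework construction can be illustrated as follows. This is a minimal sketch, assuming the templates are already superimposed and that each template supplies coordinates for the same atoms in the same order. It uses a plain unweighted mean, which is an assumption; the slides do not say whether templates are weighted.

```python
def average_framework(template_coords):
    """Average equivalent atom positions across superimposed templates.

    template_coords: list of templates, each a list of (x, y, z) tuples
    for the same atoms in the same order (an assumption of this sketch).
    Returns one (x, y, z) tuple per atom.
    """
    if not template_coords:
        raise ValueError("at least one template is required")
    n_atoms = len(template_coords[0])
    if any(len(t) != n_atoms for t in template_coords):
        raise ValueError("all templates must list the same atoms")

    n_templates = len(template_coords)
    framework = []
    # zip(*...) groups the i-th atom of every template together.
    for equivalent_atoms in zip(*template_coords):
        framework.append(
            tuple(
                sum(atom[axis] for atom in equivalent_atoms) / n_templates
                for axis in range(3)
            )
        )
    return framework
```

Regions with no equivalent template atoms are outside the scope of this sketch; the slide handles those with the “spare part” algorithm.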

Framework construction

Molecular Modeling Database (MMDB)
MMDB relies on PDB for its data.
The MMDB data format is based on Abstract Syntax Notation One (ASN.1), a data description language, and describes the three-dimensional structure of biological macromolecules.
A piece of software called the PDB file parser translates PDB files into ASN.1 MMDB files. Its major feature is that it detects ambiguities in the PDB data format and, if necessary, automatically modifies the sequence data so that they comply with the 3D coordinates.
Cn3D uses the descriptions of atoms and bonds exactly as they appear in MMDB records and does not need to validate them, a step that viewers of PDB files must perform. As a result, MMDB data files are interpreted consistently and structures are displayed better.
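To make the idea of sequence data that must "comply with the 3D coordinates" concrete, the sketch below compares the residues declared in the SEQRES records of a PDB file with the residues that actually have ATOM coordinates. It is not the MMDB parser and does not modify anything; the function names are hypothetical, and the consistency rule (observed residues must appear in the declared order, with gaps allowed) is an assumption chosen for illustration. Column positions follow the fixed-column PDB format.

```python
def seqres_residues(pdb_lines, chain):
    """Residue names declared for one chain in SEQRES records."""
    residues = []
    for line in pdb_lines:
        # Column 12 holds the chain ID; residue names start at column 20.
        if line.startswith("SEQRES") and line[11] == chain:
            residues.extend(line[19:70].split())
    return residues


def atom_residues(pdb_lines, chain):
    """Residue names that have coordinates, in file order, one per residue."""
    observed = []
    last_key = None
    for line in pdb_lines:
        # Column 22 holds the chain ID.
        if line.startswith("ATOM") and line[21] == chain:
            key = line[22:27]  # residue number plus insertion code
            if key != last_key:
                observed.append(line[17:20])  # residue name, columns 18-20
                last_key = key
    return observed


def is_subsequence(observed, declared):
    """True if the observed residues occur in the declared order, gaps allowed."""
    remaining = iter(declared)
    return all(residue in remaining for residue in observed)
```

A chain for which `is_subsequence(atom_residues(...), seqres_residues(...))` is false is one where the declared sequence and the coordinates disagree, which is the kind of case the slide says the parser resolves.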

VAST (Vector Alignment Search Tool)
MMDB maintains a pre-computed n × n record of "neighboring structures". All of the stored protein structures have been compared to each other with VAST to identify similar three-dimensional substructures. These neighbors often identify distant homologs.
Steps of the VAST algorithm:
1. Based on coordinate data (x, y, z), all of the alpha helices and beta sheets of the protein are identified.
2. Vectors are calculated based on the position of these secondary structures.
3. The program creates packets of two vectors within a protein. These are called secondary structure elements (SSEs). An example is a coiled coil.

VAST (Vector Alignment Search Tool) (cont. 1)
4. The program attempts to align SSEs between two proteins based on type (alpha or beta), relative orientation, and connectivity.
5. The alignment is refined with Monte Carlo methods, optimizing at each residue.
Scoring: the program assigns the highest score to the alignment in which the superposition of the vectors is greatest. For a score to count as high, the superposition must also be unlikely to occur by chance, so that likelihood is determined as well.
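The vector reduction and the orientation comparison in steps 2 to 4 can be sketched as below. This is not the VAST implementation: approximating an element by the vector from its first to its last Cα atom, comparing packets by the angle between their two vectors, and the 20-degree tolerance are all assumptions made for illustration. Connectivity, Monte Carlo refinement, and the chance-based scoring are left out.

```python
import math


def sse_vector(ca_coords):
    """Approximate one helix or strand by the vector from its first to last Cα."""
    (x0, y0, z0), (x1, y1, z1) = ca_coords[0], ca_coords[-1]
    return (x1 - x0, y1 - y0, z1 - z0)


def angle_between(u, v):
    """Angle between two vectors, in degrees."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    if norm == 0:
        raise ValueError("zero-length vector")
    # Clamp to guard against rounding slightly outside [-1, 1].
    cosine = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cosine))


def packets_match(packet_a, packet_b, tolerance_deg=20.0):
    """Compare two packets of two vectors by type and relative orientation.

    Each packet is ((type1, vector1), (type2, vector2)), with types such as
    "alpha" or "beta". The tolerance is a hypothetical parameter.
    """
    (type_a1, vec_a1), (type_a2, vec_a2) = packet_a
    (type_b1, vec_b1), (type_b2, vec_b2) = packet_b
    if (type_a1, type_a2) != (type_b1, type_b2):
        return False
    difference = abs(angle_between(vec_a1, vec_a2) - angle_between(vec_b1, vec_b2))
    return difference <= tolerance_deg
```

Working with a handful of vectors per protein rather than every atom is what makes this kind of comparison fast, at the cost of discarding detail within each element.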

VAST (Vector Alignment Search Tool) (cont. 2)
Note: a tertiary unit is defined as an SSE.
Note: VAST is not the best method for determining structural similarities, because reducing substructures to vectors loses some information. However, it is one of the fastest methods.