Hidden Markov Models That Use Predicted Local Structure for Fold Recognition: Alphabets of Backbone Geometry R Karchin, M Cline, Y Mandel- Gutfreund, K.

Slides:



Advertisements
Similar presentations
1 Profile Hidden Markov Models For Protein Structure Prediction Colin Cherry
Advertisements

Hidden Markov models for detecting remote protein homologies Kevin Karplus, Christian Barrett, Richard Hughey Georgia Hadjicharalambous.
Profiles for Sequences
Hidden Markov Models Theory By Johan Walters (SR 2003)
Local Statistical Dependencies in Protein Structure: Discovery, Evaluation, Prediction and Applications Advancement to Candidacy Computer Science Department.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
© Wiley Publishing All Rights Reserved. Analyzing Protein Sequences.
Structural bioinformatics
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Protein Secondary Structures
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
HIDDEN MARKOV MODELS IN MULTIPLE ALIGNMENT. 2 HMM Architecture Markov Chains What is a Hidden Markov Model(HMM)? Components of HMM Problems of HMMs.
Profile-profile alignment using hidden Markov models Wing Wong.
16 March Identification of RNAi-Related Genes in Archaea David M. Ng BME 230.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
HIDDEN MARKOV MODELS IN MULTIPLE ALIGNMENT
Bio 465 Summary. Overview Conserved DNA Conserved DNA Drug Targets, TreeSAAP Drug Targets, TreeSAAP Next Generation Sequencing Next Generation Sequencing.
Lecture 9 Hidden Markov Models BioE 480 Sept 21, 2004.
Protein Secondary Structures Assignment and prediction.
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
Protein Secondary Structures Assignment and prediction Pernille Haste Andersen
Structure Prediction in 1D
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU.
Hidden Markov Models That Use Predicted Local Structure for Fold Recognition: Alphabets of Backbone Geometry R Karchin, M Cline, Y Mandel- Gutfreund, K.
Class 7: Protein Secondary Structure
“Multiple indexes and multiple alignments” Presenting:Siddharth Jonathan Scribing:Susan Tang DFLW:Neda Nategh Upcoming: 10/24:“Evolution of Multidomain.
Protein Structures.
Protein Structure Prediction and Analysis
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
COMPARATIVE or HOMOLOGY MODELING
Proteins Secondary Structure Predictions Structural Bioinformatics.
Matlab Bioinformatics Toolkit Evaluation Kanishka Bhutani.
Protein Secondary Structure Prediction Some of the slides are adapted from Dr. Dong Xu’s lecture notes.
Scoring Matrices Scoring matrices, PSSMs, and HMMs BIO520 BioinformaticsJim Lund Reading: Ch 6.1.
ProteinShop: A Tool for Protein Structure Prediction and Modeling Silvia Crivelli Computational Research Division Lawrence Berkeley National Laboratory.
Protein Secondary Structure Prediction Based on Position-specific Scoring Matrices Yan Liu Sep 29, 2003.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
Eric C. Rouchka, University of Louisville SATCHMO: sequence alignment and tree construction using hidden Markov models Edgar, R.C. and Sjolander, K. Bioinformatics.
Protein Secondary Structure Prediction
2 o structure, TM regions, and solvent accessibility Topic 13 Chapter 29, Du and Bourne “Structural Bioinformatics”
P ROTEIN SEONDARY & SUPER-SECONDARY STRUCTURE PREDICTION WITH HMM By En-Shiun Annie Lee CS 882 Protein Folding Instructed by Professor Ming Li.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Protein Secondary Structure Prediction G P S Raghava.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
Sequence Based Analysis Tutorial March 26, 2004 NIH Proteomics Workshop Lai-Su L. Yeh, Ph.D. Protein Science Team Lead Protein Information Resource at.
Prediction of Protein Binding Sites in Protein Structures Using Hidden Markov Support Vector Machine.
Exercises Pairwise alignment Homology search (BLAST) Multiple alignment (CLUSTAL W) Iterative Profile Search: Profile Search –Pfam –Prosite –PSI-BLAST.
Query sequence MTYKLILNGKTKGETTTEAVDAATAEKVFQYANDN GVDGEWTYTE Structure-Sequence alignment “Structure is better preserved than sequence” Me! Non-redundant.
Hidden Markov Model and Its Application in Bioinformatics Liqing Department of Computer Science.
Protein Prediction with Neural Networks! Chris Alvino CS152 Fall ’06 Prof. Keller.
Proteins Structure Predictions Structural Bioinformatics.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
3.3b1 Protein Structure Threading (Fold recognition) Boris Steipe University of Toronto (Slides evolved from original material.
Predicting Structural Features Chapter 12. Structural Features Phosphorylation sites Transmembrane helices Protein flexibility.
Computational Structure Prediction
Hidden Markov Models (HMM)
Pfam: multiple sequence alignments and HMM-profiles of protein domains
Introduction to Bioinformatics II
Sequence Based Analysis Tutorial
Protein Structures.
Homology Modeling.
Protein structure prediction.
Gene Structure Prediction Using Neural Networks and Hidden Markov Models June 18, 권동섭 신수용 조동연.
Neural Networks for Protein Structure Prediction Dr. B Bhunia.
Presentation transcript:

Hidden Markov Models That Use Predicted Local Structure for Fold Recognition: Alphabets of Backbone Geometry R Karchin, M Cline, Y Mandel- Gutfreund, K Karplus

Problem Parent fold may have low sequence similarity Most often, query is only sequence

Proposed Solution Try to predict local structure of target Use local structure prediction in fold recognition

Simple SAM Overview Profile HMM Database Search Multiple Alignment Query Sequence

Profile HMM Start Stop

Two-Track Profile HMM Start Stop AA Str AA Str AA Str AA Str AA Str AA Str AA Str

SAM with Local Structure Profile HMM Database Search Multiple Alignment Query Sequence Predicted local structure Start Stop AA Str AA Str AA Str AA Str AA Str AA Str AA Str Two-track HMM

Scoring Profile HMM P(residue|state) = θ(AA,state) Two-Track Profile HMM P(residue|state) = θ(AA,state)Φ(local,state) SAM Two-Track Profile HMM P(residue|state) = θ(AA,state)Φ 0.3 (local,state)

Good Structure Alphabets Intuitively, we want –Predictability –Conservation –Better fold recognition –Better alignment

Local Structure Alphabets DSSP Hydrogen Bonding DSSP-EHL STR DSSP with 6 different β-strands STRIDE H-Bond and Φ,φ angles STRIDE-EHL Protein Blocks Designed from SOM of Φ,φ angles ANG Partition of Ramachandran ALPHA Dihedral angle between Calphas TCO Dihedral angle between carbonyls

Predicting Local Structure of the Query Alphabets very dissimilar Use a neural network –Input: window of multiple alignment –Output: structure probabilities for a single residue

Predictability Evaluated on –Correct predictions (QN) –Overlap of structure segments (SOV) –Information gain Winners: –*-EHL for precision –STR and PB for most information

Conservation Used FSSP structural alignments Calculated mutual information between proteins Used only alignments with low sequence similarity Winners: –STR, PB

Fold Recognition Winners: All except PB (STRIDE-EHL leading, STR lagging)

Alignment Compared to DALI and CE alignments Evaluated using a shift score –1.0 perfect match –-0.2 is the worst (adjustable) Winners –STR, and STRIDE gets an odd one. –All alphabets improve alignment

Conclusions Local structure improves: –Fold recognition –Alignment when there’s little sequence similarity Alignment and fold recognition are very different problems