Small protein modules with similar 3D structure but different amino acid sequence Institute of Evolution, University of Haifa, ISRAEL Genome Diversity.

Slides:



Advertisements
Similar presentations
Statistics in Bioinformatics May 2, 2002 Quiz-15 min Learning objectives-Understand equally likely outcomes, Counting techniques (Example, genetic code,
Advertisements

PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Protein Tertiary Structure Prediction
Strict Regularities in Structure-Sequence Relationship
Protein Structure, Databases and Structural Alignment
Protein structure (Part 2 of 2).
Appendix: Automated Methods for Structure Comparison Basic problem: how are any two given structures to be automatically compared in a meaningful way?
The Protein Data Bank (PDB)
Methods for Improving Protein Disorder Prediction Slobodan Vucetic1, Predrag Radivojac3, Zoran Obradovic3, Celeste J. Brown2, Keith Dunker2 1 School of.
Introduction to Educational Statistics
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
Introduction to Bioinformatics - Tutorial no. 8 Protein Prediction: - PROSITE - Pfam - SCOP - TOPITS - genThreader.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Protein Structure Prediction and Analysis
 C. C. Hung, H. Ijaz, E. Jung, and B.-C. Kuo # School of Computing and Software Engineering Southern Polytechnic State University, Marietta, Georgia USA.
Protein Tertiary Structure Prediction
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Protein Secondary Structure Prediction with inclusion of Hydrophobicity information Tzu-Cheng Chuang, Okan K. Ersoy and Saul B. Gelfand School of Electrical.
The Origin of Eukaryotic Cells  With lots of perplexities and guesses, researchers did many experiments to bring it to light.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
PROTEIN STRUCTURE CLASSIFICATION SUMI SINGH (sxs5729)
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Endogenous Retroviral promoter of the Human gene Kim Tae Hyung Oct 02,2004 MPL.
Calculating branch lengths from distances. ABC A B C----- a b c.
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
Protein Strucure Comparison Chapter 6,7 Orengo. Helices α-helix4-turn helix, min. 4 residues helix3-turn helix, min. 3 residues π-helix5-turn helix,
Design of Novel Two-Level Quantizer with Extended Huffman Coding for Laplacian Source Lazar Velimirović, Miomir Stanković, Zoran Perić, Jelena Nikolić,
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Structure Prediction
Similar Figures, Day 2. Warm Up Two figures are similar if one can be obtained from the other using a sequence of transformations in the plane.
Codon Bias and its Relationship to Gene Expression Presented through a virtual grant by the Virtual Student Union.
Pair-wise Structural Comparison using DALILite Software of DALI Rajalekshmy Usha.
Feature Extraction Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and.
Bioinformatics Dipl. Ing. (FH) Patrick Grossmann
PROTEIN STRUCTURE (Donaldson, March 10,2003) What are we trying to learn about genes and their proteins: Predict function for unknown protein by comparison.
Lecture 11 CS5661 Structural Bioinformatics – Structure Comparison Motivation Concepts Structure Comparison.
An Efficient Index-based Protein Structure Database Searching Method 陳冠宇.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Using BLAST To Teach ‘E-value-tionary’ Concepts Cheryl A. Kerfeld 1, 2 and Kathleen M. Scott 3 1.Department of Energy-Joint Genome Institute, Walnut Creek,
General Microbiology (Micr300)
Lecture 14 CS5661 Neighbor Joining Generates unrooted tree, allowing for unequal branches Given: Distance matrix for sequences Steps: Repeat 1-3 till all.
From: The Pfam protein families database
‘Protein sequencing’: Determining protein sequences
Fig. 9. Scatter plots of 500 inferred rates versus their simulated values with a model tree with six sequences and d = 0.1 for (a) ML and (b) EB-EXP. The.
Fig. 2 System flowchart. Each of the four iterations contains two models (SS, and ASA/HSE/CN/ANGLES), for a total of eight LSTM-BRNN based models. The.
Volume 9, Issue 10, Pages (October 2016)
Bayesian Refinement of Protein Functional Site Matching
Topic 5.4: Cladistics.
Prediction of Protein Structure and Function on a Proteomic Scale
N-Terminal Domains in Two-Domain Proteins Are Biased to Be Shorter and Predicted to Fold Faster Than Their C-Terminal Counterparts  Etai Jacob, Ron Unger,
Sequence and comparison of the SDS protein.
Finding Functionally Significant Structural Motifs in Proteins
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Generalizations of Markov model to characterize biological sequences
Protein structure prediction.
Structural biology of allergens
The C. elegans SYS-1 Protein Is a Bona Fide β-Catenin
A Comparison of Genotype-Phenotype Maps for RNA and Proteins
謝孫源 (Sun-Yuan Hsieh) 成功大學 電機資訊學院 資訊工程系
Volume 84, Issue 2, Pages (February 2003)
by T. M. Jacobs, B. Williams, T. Williams, X. Xu, A. Eletsky, J. F
Volume 3, Issue 6, Pages (November 1998)
Volume 109, Issue 6, Pages (September 2015)
Universal microbial diagnostics using random DNA probes
Presentation transcript:

Small protein modules with similar 3D structure but different amino acid sequence Institute of Evolution, University of Haifa, ISRAEL Genome Diversity Center, Institute of Evolution, University of Haifa, Haifa 31905, ISRAEL * Z.M. Frenkel*, E. N. Trifonov Acknowledgements This work was supported by an ISF grant 710/ and by an EU grant QLG2-CT Z.M.Frenkel is a Post-Doctoral Fellow of the Center for Complexity Science and supported by the Ministry of Absorption. 1qap A (218) LDELDDALKA GADIIMLDNF [94] 1dv1 A (64) PAIISAAEIT GAVAIHPGYG [23] 1tre A (218) SNAAELFAQP DIDGALVGGA 1tre 1l0o C (134) TVTEIADHLG ISPEDVVLAQ [52] 1fse A (28) TTKEIASELF ISEKTVRNHI [48] 1bl0 A (29) TLYDVAEYAG VSYQTVSRVV 1bl0 1ecr A (13) TFRQMEQELA IFAAHLEQHK [33] 1set A (79) EAKRLEEALR EKEARLEALL [30] 1eiy A (64) ALEAREKALE EAALKEALER 1eiy Abstract It is well known, that often protein fragments with almost identical 3D structure have quite different amino acid sequences. Such effect can be observed both for entire protein domain (fold) and for much smaller fragments present in different folds, and makes difficult structure prediction and classification. We proposed a new method that allows to reveal relationship between protein fragments with similar 3D structure but different amino acid sequences by identifying 'intermediate' sequences in proteomic database. The aim of this paper is to define proper parameters of the protein structure/sequence comparison and to illustrate the new approach. All calculations were carried out for fragments of 20 aa. It was shown (for selected protein fragments), that for such length it is practically impossible to find two fragments with identity more or equal 12 aa such that RMSD (root mean square deviation) of their structures is lager 3Å. With these thresholds of sequence and structure similarity, for a fragment from PDB (protein data bank) all similar fragments in database of 113 prokaryotic proteomes were found. The same was done for every such similar fragment, repeatedly. For every new sequence all similar fragments were located in PDB, if found. If the respective structures in PDB were different from the structure of the initial fragment, the sequence was excluded from further searches in proteomes. As a result an 'evolutionary tree' is obtained originating from our initial fragment of PDB. The tree contains elements similar in structure to initial fragment yet with different sequences. These elements are connected to origin in chain like fashion with adjacent elements having similar sequences. Three such evolutionary trees for different protein structures are described. This technique opens new possibilities for protein evolution studies and structure prediction. בס"ד Figure 1 Figure 2Figure 3 Table II Table I Table III Table IV Figure and Table description Fragments of the 'evolutionary trees' are shown in Fig. 1 – 3. The numbers mark the nodes with known structure similar to initial fragment. Description of such structures for Fig. 1 is given in Table I (only more similar structures are described). The thresholds of similarity are 3Å for Fig. 1 – 2 and 1.5Å for Fig. 3. The thin line marks examples of branches described in Tables II – IV. In these tables on the left the sequences from bacterial proteomes are given, that correspond to the nodes of the thin lines. Each sequence differs less (or equal) than 8 amino acids of 20 from its immediate neighbors. Corresponding structures from PDB (if found) are listed in the right. Some of them are shown near the respective tree superimposed with initial element structure. Conclusions Protein fragments with similar 3D structure can be selected, such that the sequences of the group's members can be related to one another in chain-like fashion with incremental changes in the sequences, and no similarity between the ends of the chain.