Classification: understanding the diversity and principles of

Presentation transcript:

Classification: understanding the diversity and principles of protein structure and function
[Figure: MCSG 2001 structures]

Protein structure classification
- Main reference: Robert B. Russell (2002) Classification of Protein Folds. Molecular Biotechnology 20:17-28.
- Importance: central to studies of protein structure, function, and evolution
- Philosophy: phyletic vs. phenetic
- Method: structure comparison + human knowledge

Approaches
- Hierarchical
- Based on the types and arrangements of secondary structures
- Unit (level): domain
- Domain assignment
  - structural vs. functional (fold or function in isolation)
  - automated assignment methods (structure vs. sequence)

Assignment of Class
- All α or all β (could be subjective)
- α/β (βαβ unit) or α+β
- Other classes
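
The class rules above can be sketched as a small function. This is only an illustration of why the assignment can be subjective: the 15% cutoff (`minimum`) and the `has_bab_units` flag are hypothetical choices made for this example, not values taken from SCOP, CATH, or the reference.

```python
def assign_class(helix_fraction, strand_fraction, has_bab_units=False, minimum=0.15):
    """Return a coarse structural class from secondary-structure content.

    helix_fraction / strand_fraction: fraction of residues in helices / strands.
    has_bab_units: True if the domain contains beta-alpha-beta units.
    minimum: hypothetical cutoff below which a type is treated as absent.
    """
    has_helix = helix_fraction >= minimum
    has_strand = strand_fraction >= minimum
    if has_helix and not has_strand:
        return "all-alpha"
    if has_strand and not has_helix:
        return "all-beta"
    if has_helix and has_strand:
        # alpha/beta: helices and strands alternate (beta-alpha-beta units);
        # alpha+beta: helices and strands are largely segregated.
        return "alpha/beta" if has_bab_units else "alpha+beta"
    return "other"


# A borderline domain changes class when the cutoff moves.
print(assign_class(0.14, 0.50))                # all-beta
print(assign_class(0.14, 0.50, minimum=0.10))  # alpha+beta
```

The two calls differ only in the cutoff, which is the sense in which the class of a borderline domain depends on the curator's choice.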

Subjective class assignment

Assignment of Fold
- Defined by the number, type, and arrangement of secondary structure elements (SSEs)
- Connectivity (e.g. circular permutation, scrambled proteins)

Circular Permutation (CP)
[Figure: segments A, B, C, D with the N- and C-termini labeled]
Segment order: ..A..B..C..D.. vs. ..C..D..A..B..
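
As a minimal sketch of the idea, two segment orders are circular permutations of each other when one is a rotation of the other. The function names below are made up for this example, and segments are represented simply as letters.

```python
def circular_offset(order_a, order_b):
    """Return the rotation that turns order_a into order_b, or None.

    order_a, order_b: sequences of segment labels, e.g. "ABCD" and "CDAB".
    """
    a, b = list(order_a), list(order_b)
    if len(a) != len(b):
        return None
    if not a:
        return 0
    doubled = a + a  # every rotation of a is a window of a + a
    for start in range(len(a)):
        if doubled[start:start + len(b)] == b:
            return start
    return None


def is_circular_permutation(order_a, order_b):
    """True if order_b is a rotation of order_a."""
    return circular_offset(order_a, order_b) is not None


print(is_circular_permutation("ABCD", "CDAB"))  # True
print(is_circular_permutation("ABCD", "ACBD"))  # False
```

The second call shows an order that keeps the same segments but is not a rotation, which corresponds to a scrambled arrangement rather than a circular permutation.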

Circular permutation example
- 1nls (Concanavalin)
- 1led (Lectin)
[Figure: N- and C-termini labeled on both structures]

Scrambled protein pairs
- 1au1a_ (Interferon-β)
- 2occc_ (Cytochrome c oxidase)

Assignment of Superfamily
Homologous even in the absence of significant sequence similarity
- certain level of structural similarity
- unusual structural features
- low but significant sequence similarity from structural alignment
- key active site residues
- sequence similarity bridges
Divergence vs. convergence

Assignment of Family
- significant sequence similarity
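
Sequence similarity is often summarized as percent identity over an alignment. The sketch below assumes the two sequences are already aligned to equal length with "-" for gaps, and it divides by the number of columns where both sequences have a residue; other denominators are in use, so treat this as one convention rather than the definition.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over columns with no gap in either sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only columns where both sequences have a residue.
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


print(percent_identity("ACD-F", "ACE-F"))  # 75.0
```

What counts as "significant" is a separate decision that depends on alignment length and the statistics used; no threshold is assumed here.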

Superfolds, superfamilies, supersites
- TIM barrel
- Rossmann-like
- ferredoxin-like
- β-propellers
- 4-helix bundle
- Ig-like
- β-jelly rolls
- Oligonucleotide/oligosaccharide binding (OB) fold
- SH3-like
Structure -> function (only 50% correct)

Structure implicates function?

Classification databases
- SCOP - careful assignment of evolutionary relationships; homologous vs. analogous
- CATH - A: architecture
- FSSP - a list of structural neighbors
- VAST - NCBI’s Entrez

CATH
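
CATH identifies a domain's position in its hierarchy (Class, Architecture, Topology, Homologous superfamily) with a dotted numeric code. The sketch below parses such a code and reports how many levels two codes share; the function names and the code strings used as examples are purely illustrative.

```python
CATH_LEVELS = ("class", "architecture", "topology", "homologous_superfamily")


def parse_cath_code(code):
    """Split a dotted code such as '3.40.50.720' into named hierarchy levels."""
    parts = code.split(".")
    if not 1 <= len(parts) <= len(CATH_LEVELS) or not all(p.isdigit() for p in parts):
        raise ValueError(f"not a C.A.T.H-style code: {code!r}")
    return dict(zip(CATH_LEVELS, (int(p) for p in parts)))


def shared_depth(code_a, code_b):
    """Number of leading hierarchy levels the two codes have in common."""
    depth = 0
    for x, y in zip(code_a.split("."), code_b.split(".")):
        if x != y:
            break
        depth += 1
    return depth


print(parse_cath_code("3.40.50.720"))
print(shared_depth("3.40.50.720", "3.40.50.300"))  # 3: same class, architecture, topology
```

A shared depth of 3 means the two domains agree down to the topology (fold) level but sit in different homologous superfamilies.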

Common Folds

Unique Folds

Structure Comparison
[Figure: growth factor and cytokine; evolutionary relationship and structure classification]

9% sequence identity (Shapiro & Harris, 2000)

Structure Comparison Tools
- DALI (http://www2.ebi.ac.uk/dali/)
- CE (http://cl.sdsc.edu/)
- VAST (http://www.ncbi.nlm.nih.gov/Structure/VAST/vast.html)
- Prosup (http://www.came.sbg.ac.at)
- FLASH (http://thr.ibms.sinica.edu.tw/flash/)
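
One idea behind distance-matrix methods such as DALI is that the matrix of intramolecular distances does not change when a structure is rotated or translated, so two structures can be compared without superimposing them. The sketch below is not DALI's scoring or alignment procedure. It assumes the residue correspondence is already known and both coordinate lists have equal length, and the coordinates are made-up values.

```python
import math


def distance_matrix(coords):
    """All-against-all distances between points, e.g. C-alpha positions."""
    return [[math.dist(p, q) for q in coords] for p in coords]


def mean_distance_difference(coords_a, coords_b):
    """Mean absolute difference between equivalent intramolecular distances."""
    if len(coords_a) != len(coords_b):
        raise ValueError("structures must have the same number of residues")
    n = len(coords_a)
    if n < 2:
        return 0.0
    da, db = distance_matrix(coords_a), distance_matrix(coords_b)
    total = sum(abs(da[i][j] - db[i][j]) for i in range(n) for j in range(i + 1, n))
    return total / (n * (n - 1) / 2)


bent = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0)]
moved = [(x + 10.0, y - 5.0, z + 2.0) for x, y, z in bent]  # same shape, translated
straight = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]

print(mean_distance_difference(bent, moved))     # close to 0
print(mean_distance_difference(bent, straight))  # greater than 0
```

The tools listed above additionally have to find which residues correspond, which is the harder part of the problem and is not attempted here.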