Recursive domains in proteins

Slides:



Advertisements
Similar presentations
PROTEINS Proteins are the most complex and most diverse group of biological compounds. If you weigh about 70 kg: About 50 of your 70 kg is water. Many.
Advertisements

Protein Structure C483 Spring 2013.
Protein Structure 2 Higher Order Protein Structures.
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
©CMBI 2001 The amino acids in their natural habitat.
The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
1 September, 2004 Chapter 5 Macromolecular Structure.
Strict Regularities in Structure-Sequence Relationship
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
Proteins Dr Una Fairbrother. Dipeptides u Two amino acids are combined as in the diagram, to form a dipeptide. u Water is the other product.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Protein Secondary Structure : Kendrew Solves the Structure of Myoglobin “Perhaps the most remarkable features of the molecule are its complexity.
The Protein Data Bank (PDB)
Structures and Structure Descriptions Chapter 8 Protein Bioinformatics.
微生物應用工業 Ch5. 微生物催化劑 阮雪芬 Nov 19, 2002 NTUT
Computing for Bioinformatics Lecture 8: protein folding.
Protein structures in the PDB
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
Protein Structures.
(Foundation Block) Dr. Ahmed Mujamammi Dr. Sumbul Fatma
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Lecture 3. α domain structures Coiled-coil, knobs and hole packing Four-helix bundle Donut ring large structure Globin fold Ridges and grooves model CS882,
Lecture 19 Representation and description II
7.5: PROTEINS Proteins Function Structure. Function 7.5.4: State four functions of proteins, giving a named example of each. [Obj. 1] Proteins are the.
Supersecondary structures. Supersecondary structures motifs motifs or folds, are particularly stable arrangements of several elements of the secondary.
Motifs of Protein Structure. Adapted from “Introduction to Protein Structure” by Branden & Tooze.
Lecture 10: Protein structure
Introduction to Protein Structure
Directions in Protein Contact Map Mining Mohammed J. Zaki Computer Science Dept. joint work with Jingjing Hu & Xiaolan Shen, CS Dept. Yu Shao & Prof. Chris.
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
STRUCTURAL ORGANIZATION
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Study of Loop Length & Residue Composition of β-Hairpin Motif
CATH – a hierarchic classification of protein domain structures Rui Kuang.
Operone lac Principles of protein structure and function Function is derived from structure Structure is derived from amino acid sequence Different.
Protein Secondary Structure, Bioinformatics Tools, and Multiple Sequence Alignments Finding Similar Sequences Predicting Secondary Structures Predicting.
Web Servers for Predicting Protein Secondary Structure (Regular and Irregular) Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure 1 Primary and Secondary Structure.
Protein Structure (Foundation Block) What are proteins? Four levels of structure (primary, secondary, tertiary, quaternary) Protein folding and stability.
Protein Strucure Comparison Chapter 6,7 Orengo. Helices α-helix4-turn helix, min. 4 residues helix3-turn helix, min. 3 residues π-helix5-turn helix,
Last Tuesday and Beyond Common 2° structural elements: influenced by 1° structure –alpha helices –beta strands –beta turns Structure vs. function –Fibrous.
Using Spanners to Describe Protein Structure Leonidas Guibas, Daniel Russel Stanford University.
Chapter 3. Protein structure and function. Proteins are the most versatile macromolecules in living systems. serve crucial functions in essentially all.
3-D Structure of Proteins
Structure of proteins by X-ray crystallography
Events in protein folding. Introduction Many proteins take at least a few seconds to fold, but almost all proteins undergo major structural transitions.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Protein Structure  The structure of proteins can be described at 4 levels – primary, secondary, tertiary and quaternary.  Primary structure  The sequence.
Protein backbone Biochemical view:
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
PROTEINS Characteristics of Proteins Contain carbon, hydrogen, oxygen, nitrogen, and sulfur Serve as structural components of animals Serve as control.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
DEVELOPMENT OF E-COURSES FOR B.F.Sc COURSES TOPIC TOPIC: Introduction COURSE ID: Proteins are an important class of biological macromolecules present in.
Structural organization of proteins
Prepared by: Pritesh radadiya ( ) Hardik amipara ( ) Kaushik padsala ( ) Nishant dobariya( ) Guided by: Dr.
Mir Ishruna Muniyat. Primary structure (Amino acid sequence) ↓ Secondary structure ( α -helix, β -sheet ) ↓ Tertiary structure ( Three-dimensional.
Protein Structure BL
Chapter 14 Protein Structure Classification
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
Proteins What do we need proteins for?
Protein structure (Foundation Block) Dr. Sumbul Fatma
The heroic times of crystallography
Protein Structure September 7,
Beta sheets come in two flavors: parallel (shown on this slide) and anti parallel. The geometry of the individual beta strandis are almost identical in.
Primary, secondary, tertiary and quaternary structures.
Protein Structures.
Protein structure (Foundation Block).
Four Levels of Protein Structure
Protein structure (Foundation Block).
Presentation transcript:

Recursive domains in proteins Teresa Przytycka NCBI, NIH Joint work with G.Rose & Raj Srinivasan; JHU

Domain: “Polypeptide chain (or a part of it) that can independently fold into stable tertiary structure” (Baranden & Tooze; Introduction to Protein Structure) Two-domain protein.

The 3D structure of a protein domain can be described as a compact arrangement of secondary structures Alpha helix Beta strand

These arrangements are far from random:

There are not so many of them : PDB contains about 17000 structures and less than 1000 different folds. Proportion of "new folds" (light blue) and "old folds" (orange) for a given year. (fold = fold domain)

Possible sources of restricted number of folds: Evolutionary history. Given enough time would domains look “more random”? Existence of general restrictions/rules which render some (compact) arrangements of secondary structures non-feasible. Can real protein domains be seen as sentences in a language, which can be generated by an underlying grammar?

Can protein domains be described using a set of folding rules? We restrict our attention to all beta domains: they admit variety of topologies they are difficult to predict from sequence

Understanding b-folds Patterns in b-sheets Richardson 1977 folding rules for b-sheets Zhang and Kim 2000 Hydrogen bonding pattern Polypeptide chain seems to avoid “complications” Properties of b-sandwiches Woolfson D. N., Evans P. A., Hutchinson E. G., and Thornton J. M. 1993 Parallel anti-parallel mixed “forbidden” crossed conformation

Expectations for good folding rules We need to look at fold properties that occur in non-homologous proteins. Preferably: The provide a model for the folding process.

Super-secondary structures as precursors of folding rules Super-secondary structure – frequently occurring arrangements of a small number of secondary structures The occurrences of super-secondary structures in unrelated families supports possibility of their independent formation.

Example 1: Hairpin

b-b-b-unit

Example 2: Greek key and suggested folding pathway for it for Greek key proposed by Ptitsyn. Pattern from a Greek vase

Two level of folding rules: Primitive folding rules – based on super secondary structures Closure operation – allows for hierarchical application of the primitive rules

supersecondary structures -primitive folding rules hairpin Hairpin rule Bridge Greek key

Direct wind Indirect wind

Closure-composite rules Super-secondary structures are composed of secondary structures that are neighboring in the chain sequence However from the presence of a super-secondary structure, like a hairpin, in a protein structure follows that residues that are non consecutive become neighboring in space. Closure - “short cut” in the sequence due to a folding rule

Example 1 applying folding rules to jelly roll

Recursive domains Recursive domain is a part of a protein fold that can be generated using folding rules supported with the closure operation. A protein that can be fully generated using folding rules has one recursive domain.

Examples Example 1 Example 2 Example 3 Example 4

Recursive domains Recursive domain is a part of a protein fold that can be generated using folding rules supported with the closure operation. A protein that can be fully generated using folding rules has one recursive domain.

Graph theoretical tools and recursive domains Fold graph: Vertices – strands Edges – two types: Neighbor edges: directed edges between strands that are neighbors in chain or vie the closure operation. Domain edge: edges between stands used in the same folding rule Recursive domains = connected component of the fold graph without neighbor edges.

Can the rules generate all known folds? Comparison with the partition for computer generated set of all possible 8-strand sandwiches Partition into recursive components for small (<=10 strands) proteins Control Protein data One recursive fold

Offenders Hedhehog intein domain

Given a fold, is there a unique sequence of folding steps leading to it? Usually no. Usually there alternative sequences of folding steps leading to a construction of the same domain. Do such alternative folding sequences correspond to alternative folding pathways?

Are the rules complete? Probably not. e.g.: For propeller, each blade is in one recursive domain but we do not have a rule that will put the blades together.

It is so nice outside. It would be nice to take the dog for a walk! Conclusions: We are getting some idea how things work... It is so nice outside. It would be nice to take the dog for a walk! Nice… dog… walk

Conclusions Protein folds can be described by simple folding rules. The folding rules capture at least some aspects of fold simplicity and regularity. The sequence of folding steps leading to a given fold is usually not unique. The folding rules generate protein-like structures.

Future directions Can folding rules guide fold prediction? Would hierarchical description of a fold provided by folding rules be useful for fold classification / comparison ? Adding statistical evaluation of a recursive domain.

Acknowledgments George Rose Raj Srinivasen Rohit Pappu Venk Murthy NIH, K01 grant