Protein Structure Prediction

Slides:



Advertisements
Similar presentations
Secondary structure prediction from amino acid sequence.
Advertisements

Pp 50 – 51 & Pp 15 & Proteins Proteins are polymers of amino acids Each has a unique 3D shape Amino acid sequences vary Proteins are.
Protein Structure C483 Spring 2013.
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific sequence 2.A protein’s function depends.
Protein 3-Dimensional Structure and Function
1 September, 2004 Chapter 5 Macromolecular Structure.
Protein structure Friday, 10 February 2006 Introduction to Bioinformatics Brigham Young University DA McClellan
(Foundation Block) Dr. Ahmed Mujamammi Dr. Sumbul Fatma
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Protein Structural Prediction. Protein Structure is Hierarchical.
Proteins Dr. Sumbul Fatma Clinical Chemistry Unit
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
CS 177 Proteins part 1: Structure-function relationships Review of protein structures Need for analyses of protein structures Sources of protein structure.
Housekeeping Your performance on the exam has caused me to re-evaluate how homework will be handled I will now be picking up every problem assigned on.
7.5: PROTEINS Proteins Function Structure. Function 7.5.4: State four functions of proteins, giving a named example of each. [Obj. 1] Proteins are the.
Supersecondary structures. Supersecondary structures motifs motifs or folds, are particularly stable arrangements of several elements of the secondary.
Lecture 10: Protein structure
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
STRUCTURAL ORGANIZATION
Chapter 4 The Three-Dimensional Structure of Proteins.
CS 177 Proteins, part 2 (Computational modeling) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory.
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
Protein Structure Prediction and Structural Genomics Computer Science Department North Dakota State University Fargo, ND.
Monday, October 25, 5:56:17 PM What are gene families?  A gene family is a group of genes that share important characteristics.  In many cases, genes.
Operone lac Principles of protein structure and function Function is derived from structure Structure is derived from amino acid sequence Different.
Protein 3-Dimensional Structure and Function. Terminology Conformation – spatial arrangement of atoms in a protein Native conformation – conformation.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
The α-helix forms within a continuous strech of the polypeptide chain 5.4 Å rise, 3.6 aa/turn  1.5 Å/aa N-term C-term prototypical  = -57  ψ = -47 
Protein Structure (Foundation Block) What are proteins? Four levels of structure (primary, secondary, tertiary, quaternary) Protein folding and stability.
Protein structure and function Part - I
Proteins are instrumental in about everything that an organism does. These functions include structural support, storage, transport of other substances,
THE STRUCTURE AND FUNCTION OF MACROMOLECULES Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Proteins Dr. Sumbul Fatma Clinical Chemistry Unit Department of Pathology Tel
CS 177 Proteins I (Structure-function relationships) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory.
Objective 7: TSWBAT recognize and give examples of four levels of protein conformation and relate them to denaturation.
5.4: Proteins Introduction
Protein- Secondary, Tertiary, and Quaternary Structure.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
PROTEINS Characteristics of Proteins Contain carbon, hydrogen, oxygen, nitrogen, and sulfur Serve as structural components of animals Serve as control.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section D: Proteins -
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
Polypeptide Chains Can Change Direction by Making Reverse Turns and Loops.
Structure and Function
Structural organization of proteins
CHM 708: MEDICINAL CHEMISTRY
Protein Structure BL
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Chemical agents PROTEINS: The Molecular Tools of the Cell
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
Lecture 5 Protein Structure.
Conformationally changed Stability
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
Protein Structure Prediction
Protein Structures.
CS 177 Proteins, part 2 (Computational modeling)
Conformationally changed Stability
Homology Modeling.
Protein structure prediction.
The Three-Dimensional Structure of Proteins
Presentation transcript:

Protein Structure Prediction

Why do we want to know protein structure? Classification Functional Prediction

What is protein structure? Primary - chains of amino acids Secondary - interaction between groups of amino acids Tertiary - the organization in three dimensions of all the atoms in a polypeptide Quaternary - the conformation assumed by a multimeric protein

Primary Structure Proteins are chains of amino acids joined by peptide bonds Polypeptide chain The structure of two amid acids DNA/RNA overview The N-C-C sequence is repeated throughout the protein, forming the backbone The bonds on each side of the C atom are free to rotate within spatial constrains, the angles of these bonds determine the conformation of the protein backbone The R side chains also play an important structural role

Secondary Structure Interactions that occur between the C=O and N-H groups on amino acids Much of the protein core comprises  helices and  sheets, folded into a three-dimensional configuration: - regular patterns of H bonds are formed between neighboring amino acids - the amino acids have similar angles - the formation of these structures neutralizes the polar groups on each amino acid - the secondary structures are tightly packed in a hydrophobic environment - Each R side group has a limited volume to occupy and a limited number of interactions with other R side groups DNA/RNA overview  helix  sheet

Secondary Structure  helix  sheet DNA/RNA overview

Secondary Structure Other Secondary structure elements (no standardized classification) - random coil - loop - others (e.g. 310 helix, -hairpin, paperclip) DNA/RNA overview Super-secondary structure - In addition to secondary structure elements that apply to all proteins (e.g. helix, sheet) there are some simple structural motifs in some proteins - These super-secondary structures (e.g. transmembrane domains, coiled coils, helix-turn-helix, signal peptides) can give important hints about protein function

Classification Structural classification of proteins (SCOP) Class 1: mainly alpha Class 2: mainly beta DNA/RNA overview Class 3: alpha/beta Class 4: few secondary structures

More Classification Alternative SCOP Class  : only  helices Class  : antiparallel  sheets Class / : mainly  sheets with intervening  helices DNA/RNA overview Class + : mainly segregated  helices with antiparallel  sheets Membrane structure: hydrophobic  helices with membrane bilayers Multidomain: contain more than one class

Protein Structure Review Q: If we have all the Psi and Phi angles in a protein, do we then have enough information to describe the 3-D structure? Tertiary structure A: No, because the detailed packing of the amino acid side chains is not revealed from this information. However, the Psi and Phi angles do determine the entire secondary structure of a protein DNA/RNA overview

Secondary-Structure Prediction Programs * PSI-pred * JPRED Consensus prediction (includes many of the methods given below) * DSC * PREDATOR * PHD * ZPRED * nnPredict * BMERC PSA * SSP

Tertiary Structure The tertiary structure describes the organization in three dimensions of all the atoms in the polypeptide The tertiary structure is determined by a combination of different types of bonding (covalent bonds, ionic bonds, h-bonding, hydrophobic interactions, Van der Waal’s forces) between the side chains Many of these bonds are very week and easy to break, but hundreds or thousands working together give the protein structure great stability If a protein consists of only one polypeptide chain, this level then describes the complete structure DNA/RNA overview

Tertiary Structure Proteins can be divided into two general classes based on their tertiary structure: - Fibrous proteins have elongated structure with the polypeptide chains arranged in long strands. This class of proteins serves as major structural component of cells Examples: silk, keratin, collagen DNA/RNA overview - Globular proteins have more compact, often irregular structures. This class of proteins includes most enzymes and most proteins involved in gene expression and regulation

Quaternary Structures The quaternary structure defines the conformation assumed by a multimeric protein. The individual polypeptide chains that make up a multimeric protein are often referred to as protein subunits. Subunits are joined by ionic, H and hydrophobic interactions Example: Haemoglobin (4 subunits) DNA/RNA overview

Structure Displays Common displays are (among others) cartoon, spacefill, and backbone DNA/RNA overview cartoon spacefill backbone

Software RasMol Cn3D Jmol (Chime)

Classic Approach to Determining Structure? Infer function, mechanism of action Experimentally determine 3D structure Determine biochemical and cellular role of protein Purify protein Clone cDNA encoding protein Obtain protein By expression

Structural Genomics Approach? Experimentally determine 3D structure Obtain protein by expression predict protein- coding genes genomic DNA sequences Determine biochemical and cellular role of protein Obtain protein In silico Predict 3D structure homology searches (PSI-BLAST)

Sources of Protein Structure Information? 3-D macromolecular structures stored in databases The most important database: the Protein Data Bank (PDB) The PDB is maintained by the Research Collaboratory for Structural Bioinformatics (RCSB) and can be accessed at three different sites (plus a number of mirror sites outside the USA): - http://rcsb.rutgers.edu/pdb (Rutgers University) - http://www.rcsb.org/pdb/ (San Diego Supercomputer Center) - http://tcsb.nist.gov/pdb/ (National Institute for Standards and Technology) It is the very first “bioinformatics” database ever build DNA/RNA overview

Structural Prediction Computational Modeling Researches have been working for decades to develop procedures for predicting protein structure that are not so time consuming and not hindered by size and solubility constrains. As protein sequences are encoded in DNA, in principle, it should therefore be possible to translate a gene sequence into an amino acid sequence, and to predict the three-dimensional structure of the resulting chain from this amino acid sequence DNA/RNA overview

Computational Modeling How to predict the protein structure? Ab initio prediction of protein structure from sequence: not yet. Problem: the information contained in protein structures lies essentially in the conformational torsion angles. Even if we only assume that every amino-acid residue has three such torsion angles, and that each of these three can only assume one of three "ideal" values (e.g., 60, 180 and -60 degrees), this still leaves us with 27 possible conformations per residue. For a typical 200-amino acid protein, this would give 27200 (roughly 1.87 x 10286) possible conformations! DNA/RNA overview Q: Can’t we just generate all these conformations, calculate their energy and see which conformation has the lowest energy? If we were able to evaluate 109 conformations per second, this would still keep us busy 4 x 10259 times the current age of the universe There are optimized ab initio prediction algorithms available as well as fold recognition algorithms that use threading (compares protein folds with know fold structures from databases), but the results are still very poor

Homology Modeling Homology (comparative) modeling attempts to predict structure on the strength of a protein’s sequence similarity to another protein of known structure DNA/RNA overview Basic idea: a significant alignment of the query sequence with a target sequence from PDB is evidence that the query sequence has a similar 3-D structure (current threshold ~ 40% sequence identity). Then multiple sequence alignment and pattern analysis can be used to predict the structure of the protein

How do we do this? Computational modeling: summary DNA/RNA overview Partial or full sequences predicted through gene finding Similarity search against proteins in PDB Find structures that have a significant level of structural similarity (but not necessarily significant sequence similarity) Alignment can be used to position the amino acids of the query sequence in the same approximate 3-D structure If member of a family with a predicted structural fold, multiple alignment can be used for structural modeling DNA/RNA overview How do we do this? Structural analyses in the lab (X-ray crystallography, NMR) Infer structural information (e.g. presence of small amino acid motifs; spacing and arrangement of amino acids; certain typical amino acid combinations associated with certain types of secondary structure) can provide clues as to the presence of active sites and regions of secondary structure

3D Comparative Modeling Profile Methods - match sequences to folds by describing each fold in terms of the environment of each residue in the structure Threading Methods - match sequences to structure by considering pairwise interactions for each residue, rather than averaging them into an environmental class HMM Methods - the equivalent state corresponds to one structurally aligned position in a structural fold, including gaps

Structural HMM