Proteins Proteins are large biological molecules, or macromolecules, consisting of one or more chains of amino acid residues. Proteins perform a vast.

Slides:



Advertisements
Similar presentations
Chemistry 2100 Lecture 10.
Advertisements

Biomacromolecules Pt IV: Proteins.
Amino Acids and Proteins B.2. there are about 20 amino acids that occur naturally they are the basic “building blocks” of life/proteins.
Structural hierarchy in proteins. Color conventions.
Pp 50 – 51 & Pp 15 & Proteins Proteins are polymers of amino acids Each has a unique 3D shape Amino acid sequences vary Proteins are.
Review of Basic Principles of Chemistry, Amino Acids and Proteins Brian Kuhlman: The material presented here is available on the.
Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific sequence 2.A protein’s function depends.
Ch.5 Proteins: Primary structure Polypeptide diversity Protein purification and analysis Protein sequencing Protein evolution.
Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Concept 5.4: Proteins have many structures, resulting in a wide range of functions.
Proteins include a diversity of structures, resulting in a wide range of functions Protein functions include structural support, storage, transport, enzymes,
Proteins account for more than 50% of the dry mass of most cells
Protein Evolution and Analysis February Protein Assays An assay is a method of detection Specific Sensitive Convenient to use.
Proteins account for more than 50% of the dry mass of most cells
Protein Structure. Protein Structure I Primary Structure.
Amino Acids and Proteins B.2. Properties of 2-amino acids (B.2.2) Zwitterion (dipolar) – amino acids contain both acidic and basic groups in the same.
BIOCHEMISTRY REVIEW Overview of Biomolecules Chapter 4 Protein Sequence.
Chapter 19 “extra stuff”. Amino Acids as Acids and Bases… Revisited Think of an acid dissociation as a reaction that has an associated equilibrium: HAH.
BIOL 200 (Section 921) Lecture # 2, June 20, 2006 Reading for lecture 2: Essential Cell Biology (ECB) 2nd edition. Chap 2 pp 55-56, 58-64, 74-75; Chap.
The Organic Chemistry of Amino Acids,
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
Proteins: Evolution, and Analysis Lecture 7 9/15/2009.
AP Biology Discuss the following with your group and be prepared to discuss with the class 1. Why is the shape of a molecule important? 2. How is a covalent.
THE STRUCTURE AND FUNCTION OF MACROMOLECULES Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific.
The Structure and Function of Macromolecules Proteins & Nucleic Acids.
Introduction to Proteins
PROTEINS. Proteins Proteins do the nitty-gritty jobs of every living cell. Proteins are made of long strings of individual building blocks known as amino.
5.4: Proteins Introduction
Proteins.
Proteins Polypeptide chains in specific conformations Protein Graphic Design video.
Proteins: Primary Structure Lecture 6 Chapters 4 & 5 9/10/09.
PROTEINS L3 BIOLOGY. FACTS ABOUT PROTEINS: Contain the elements Carbon, Hydrogen, Oxygen, and NITROGEN Polymer is formed using 20 different amino acids.
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section D: Proteins -
Peptides. Structure and functions of proteins Department of General Chemistry Poznań University of Medical Sciences MD 2015/16.
Proteins Fibrous 1o structure amino acid sequence 2o structure
Chapter 22 The Organic Chemistry of Amino Acids, Peptides, and Proteins Paula Yurkanis Bruice University of California, Santa Barbara.
Notes: Chapter 2.3 Carbon Compounds
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
Lecture 13 February 16, 2016 Biotech 3.
Amino Acid & Basic Classification
Dr. Jagdish Kaur, P.G.G.C.,Sector 11 Chandigarh
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES
Proteins account for more than 50% of the dry mass of most cells
Chemical agents PROTEINS: The Molecular Tools of the Cell
Protein Structure.
Chpt. 5 The Structure and Function of Macromolecules
Proteins.
Proteins.
The Structure and Function of Large Biological Molecules
Transport proteins Transport protein Cell membrane
Standards Distinguish among proteins, carbohydrates, lipids, and nucleic acids. Identify the major structural components and functions of the four major.
Biomacromolecules Part IV: Proteins.
Proteins account for more than 50% of the dry mass of most cells
Chapter 3 Proteins.
Amino acids R-groups non-polar polar acidic basic proteins
Fig. 5-UN1  carbon Amino group Carboxyl group.
Packet #9 Supplement.
The Structure and Function of Macromolecules
Packet #9 Supplement.
Amino acids R-groups non-polar polar acidic basic proteins
Proteins are involved in
Amino acids R-groups non-polar polar acidic basic proteins
Proteins account for more than 50% of the dry mass of most cells
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
List a Carbohydrate Monomer
Chapter Three Amino Acids and Peptides
Levels of Protein Structure
Proteins.
Proteins Fibrous 1o structure amino acid sequence 2o structure
Proteins Proteins have many structures, resulting in a wide range of functions Proteins do most of the work in cells and act as enzymes 2. Proteins are.
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES
Presentation transcript:

Proteins Proteins are large biological molecules, or macromolecules, consisting of one or more chains of amino acid residues. Proteins perform a vast array of functions within living organisms, including catalyzing metabolic reactions, replicating DNA, responding to stimuli, and transporting molecules from one location to another. Proteins differ from one another primarily in their sequence of amino acids, which is dictated by the nucleotide sequence of their genes, and which usually results in folding of the protein into a specific three-dimensional structure that determines its act

Proteins A polypeptide is a single linear polymer chain derived from the condensation of amino acids. The individual amino acid residues are bonded together by peptide bonds and adjacent amino acid residues. The sequence of amino acid residues in a protein is defined by the sequence of a gene, which is encoded in the genetic code. In general, the genetic code specifies 20 standard amino acids; however, in certain organisms the genetic code can include selenocysteine and—in certain archaea—pyrrolysine. Shortly after or even during synthesis, the residues in a protein are often chemically modified by posttranslational modification, which alters the physical and chemical properties, folding, stability, activity, and ultimately, the function of the proteins. Sometimes proteins have non-peptide groups attached, which can be called prosthetic groups or cofactors. Proteins can also work together to achieve a particular function, and they often associate to form stable protein complexes.

Structure of proteins

Color conventions

CORN LAW amino acid with L configuration Protein Geometry CORN LAW amino acid with L configuration

Greek alphabet

The Polypeptide Chain

Covalent structures of proteins Chapter 5 Covalent structures of proteins Proteins function as: 1. Enzymes:biological catalysts 2. Regulators of catalysis-hormones 3. Transport and store i.e. O2, metal ions sugars, lipids, etc. 4. Contractile assemblies Muscle fibers Separation of chromosomes etc. 5. Sensory Rhodopsin nerve proteins

6. Cellular defense immuoglobulins Antibodies Killer T cell Receptors 7. Structural Collagen Silk, etc. Function is dictated by protein structure!!

There are four levels of protein structure 1. Primary structure 1 = Amino acid sequence, the linear order of AA’s. Remember from the N-terminus to the C-terminus Above all else this dictates the structure and function of the protein.

There are four levels of protein structure 2. Secondary structure 2 = Local spatial alignment of amino acids without regard to side chains. Usually repeated structures Examples: a helix, b sheets, random coil, or b turns

3. Tertiary Structure 3 = the 3 dimensional structure of an entire peptide. Great in detail but vague to generalize. Can reveal the detailed chemical mechanisms of an enzyme.

Chapter 5.3 is how to determine a protein’s primary structure. 4. Quaternary Structure 4 two or more peptide chains associated with a protein. Spatial arrangements of subunits. Chapter 5.3 is how to determine a protein’s primary structure. “Protein Chemistry”

Example of each level of protein structure

Amino Acids

The amino-acid side chains have different tendencies to participate in interactions Hydrophobic residues: van der Waals interactions – tendency to avoid contact with water and pack against each other  hydrophobic effect - Ala & Leu are strong helix-favoring residues, Proline are not because its backbone nitrogen isn’t available for H-bond - Aromatic side chain of Phe participates in weakly polar interactions Hydrophilic residues : Hydrogen bonds to one another, to peptide backbone, to polar organic molecules, and to water. - pKa shift: Asp & Glu (57 in hydrophobic interior or nearby (-) charge), Lys (106 in ?) - His: most versatile, most often found in enzyme active sites, pKa is 6, neutral, proton donator and acceptor

Arg: completely protonated at neural, compared to Lys? - Cys: common in enzyme active site, most powerful nucleophile. Compared to Ser? Amphipatic residues : both polar and nonpolar character - Lys: hydrophic charged region, long hydrophobic region (methylene) involved in van der Waals interactions with hydrophobic side chains - Tyr: pKa is 9, in some enzyme active site, hydroxyl group can be donor and acceptor of H-bond. Aromatic ring can form weakly polar interactions - Trp: similar to Tyr, indole amide hydrogen don’t ionize. - Met: least polar among amphipatic residues, thioether sulfur is excellent ligand for metal ions.

Genes and Proteins The genetic code is degenerate The genetic code

Insulin was the first protein to be sequenced F. Sanger won the Nobel prize for protein sequencing. It took 10 years, many people, and it took 100 g of protein! Today it takes one person several days to sequence the same insulin. 1021 AA b- glactosidase 1978

Steps towards protein sequencing Above all else, purify it first!! Chapter 5.3 then 5.1 and 5.2 1. Prepare protein for sequencing a. Determine number of chemically different polypeptides. b. Cleave the protein’s disulfide bonds. c. Separate and purify each subunit. d. Determine amino acid composition for each peptide.

Bovine insulin: note the intra- and inter- chain disulfide linkages

2. Sequencing the peptide chains: a. Fragment subunits into smaller peptides  50 AA’s in length. b. Separate and purify the fragments c. Determine the sequence of each fragment. d. Repeat step 2 with different fragmentation system.

3. Organize the completed structure. a. Span cleavage points between sets of peptides determined by each peptide sequence. b. Elucidate disulfide bonds and modified amino acids. At best, the automated instruments can sequence about 50 amino acids in one run! Proteins must be cleaved into smaller pieces to obtain a complete sequence.

End Group Analysis How many peptides in protein? Bovine insulin should give 2 N-terminii and 2 C-terminii N-terminus 1-Dimethylamino - naphthalene-5-sulfonyl chloride Dansyl chloride Reacts with amines: N-terminus + Lys (K) side chains

Edman degradation with Phenyl isothiocyanate, PITC Disadvantage with the Dansyl-chloride method is that you must use 6M HCl to cleave off the derivatized amino acid, this also cleaves all other amide bonds (residues) as well. Edman degradation with Phenyl isothiocyanate, PITC

Edman degradation has been automated as a method to sequence proteins Edman degradation has been automated as a method to sequence proteins. The PTH-amino acid is soluble in solvents that the protein is not. This fact is used to separate the tagged amino acid from the remaining protein, allowing the cycle of labeling, degradation, and separation to continue. Even with the best chemistry, the reaction is about 98% efficient. After sufficient cycles more than one amino acid is identified, making the sequence determination error-prone at longer reads.

Demonstration of Edman degradation Use your CD disk- install it and run chapter 5 Edman degradation.

Carboxypeptidase cleavage at the C-terminus Carboxypeptidase A Rn R, K, P Rn-1 P If the Tyr-Ser bond is more resistant to cleavage than the Leu-Tyr, the Ser and the Tyr will appear simultaneously and the C-terminus would still be in doubt.

Cleavage of disulfide bonds Permits separation of polypeptide chains Prevents refolding back to native structure Performic acid oxidation Changes cystine or cysteine to Cystic acid Methionine to Methionine sulfone 2-Mercaptoethanol, dithiothreitol, or dithioerythritol Keeps the equilibrium towards the reduced form -S-S- 2SH

Amino acid composition The amino acid composition of a peptide chain is determined by its complete hydrolysis followed by the quantitative analysis of the liberated amino acids. Acid hydrolysis (6 N HCl) at 120 oC for 10 to 100 h destroys Trp and partially destroys Ser, Thr, and Tyr. Also Gln and Asn yield Glu and Asp Base hydrolysis 2 to 4 N NaOH at 100 oC for 4 - 8 h. Is problematic, destroys Cys Ser, Thr, Arg but does not harm Trp.

Amino acid analyzer In order to quantitate the amino acid residues after hydrolysis, each must be derivatized at about 100% efficiency to a compound that is colored. Pre or post column derivatization can be done. These can be separated using HPLC in an automated setup

Amino acid compositions are indicative of protein structures Leu, Ala,Gly, Ser, Val, Glu, and Ile are the most common amino acids His, Met, Cys, and Trp are the least common. Ratios of polar to non-polar amino acids are indicative of globular or membrane proteins. Certain structural proteins are made of repeating peptide structures i.e. collagen.

Long peptides have to be broken to shorter ones to be sequenced Endopeptidases cleave proteins at specific sites within the chain. Trypsin Rn-1 = positively charged residues R, K; Rn  P Chymotrypsin Rn-1 = bulky hydrophobic residues F, W, T; Rn  P Thermolysin Rn = I, M, F, W, T, V; Rn-1  P Endopeptidase V8 Rn-1 = E

Specific chemical cleavage reagents Cyanogen Bromide Rn-1 = M Cleave the large protein using i.e trypsin, separate fragments and sequence all of them. (We do not know the order of the fragments!!) Cleave with a different reagent i.e. Cyanogen Bromide, separate the fragments and sequence all of them. Align the fragments with overlapping sequence to get the overall sequence.

How to assemble a protein sequence 1. Write a blank line for each amino acid in the sequence starting with the N-terminus. 2. Follow logically each clue and fill in the blanks. 3. Identify overlapping fragments and place in sequence blanks accordingly. 4. Make sure logically all your amino acids fit into the logical design of the experiment. 5. Double check your work.

H3N-_-_-_-_-_-_-_-_-_-_-_-_-_-_-COO 1 2 3 4 5 6 7 8 9 10 11 12 13 14 H3N-_-_-_-_-_-_-_-_-_-_-_-_-_-_-COO K F - A - M - K K - F - A - M Q - M - K D - I - K - Q - M G - M - D - I - K Y - R - G - M Y - R Trypsin cleaves after K or R (positively charged amino acids) Q - M - K G - M - D - I - K F - A - M - K Y - R Cyanogen Bromide (CN Br) Cleaves after Met i.e M - X D - I - K - Q - M K K - F - A - M Y - R - G - M

There are a variety of ways to purify peptides All are based on the physical or chemical properties of the protein. Size Charge Solubility Chemical specificity Hydrophobicity/ Hydrophylicity Reverse Phase High Pressure Liquid Chromatography is used to separate peptide fragments.

Peptide mapping: digest protein with an appropriate agent, then separate using two dimensional paper chromatography Digested Peptide from normal (HbA) and Sickle cell anemia (Hbs) hemoglobins HbA V - H - L - T - P - E - E - K HbS V - H - L - T - P - V - E - K  1 2 3 4 5 6 7 8 Beta chain position 6 contains altered amino acid

Red blood cells : (a) normal (b) sickle cell Electrophoretic separation of hemoglobins

Deoxyhemoglobin aggregates and deforms cell Deoxyhemoglobin aggregates and deforms cell. Primary structure changes dictate quaternary structure. Why did the problem not die out? Homozygotic Heterzyatic Homozygotic normal sickle cell trait sickle cell gets malaria resistant gets sickle cell to malaria dies dies

Species variation in homologous proteins The primary structures of a given protein from related species closely resemble one another. If one assumes, according to evolutionary theory, that related species have evolved from a common ancestor, it follows that each of their proteins must have likewise evolved from the corresponding ancestor. A protein that is well adapted to its function, that is, one that is not subject to significant physiological improvement, nevertheless continues to evolve. Neutral drift: changes not effecting function

(evolutionarily related proteins) Homologous proteins (evolutionarily related proteins) Compare protein sequences: Conserved residues, i.e invariant residues reflect chemical necessities. Conserved substitutions, substitutions with similar chemical properties Asp for Glu, Lys for Arg, Ile for Val Variable regions, no requirement for chemical reactions etc.

Amino acid difference matrix for 26 species of cytochrome c Man,chimp 0 Rh. monkey 1 0 Average differences Horse 12 11 0 Donkey 11 10 1 0 10.0 cow,sheep 10 9 3 2 0 dog 11 10 6 5 3 0 gray whale 10 9 5 4 2 3 0 5.1 rabbit 9 8 6 5 4 5 2 0 kangaroo 10 11 7 8 6 7 6 6 0 Chicken 13 12 11 10 9 10 9 8 12 0 penguin 13 12 12 11 10 10 9 8 10 2 0 9.9 Duck 11 10 10 9 8 8 7 6 10 3 3 0 14.3 Rattlesnake 14 15 22 21 20 21 19 18 21 19 20 17 0 12.6 turtle 15 14 11 10 9 9 8 9 11 8 8 7 22 0 Bullfrog 18 17 14 13 11 12 11 11 13 11 12 11 24 10 0 Tuna fish 21 21 19 18 17 18 17 17 18 17 18 17 26 18 15 0 18.5 worm fly 27 26 22 22 22 21 22 21 24 23 24 22 29 24 22 24 0 silk moth 31 30 29 28 27 25 27 26 28 28 27 27 31 28 29 32 14 0 25.9 Wheat 43 43 46 45 45 44 44 44 47 46 46 46 46 46 48 49 45 45 0 Bread mold 48 47 46 46 46 46 46 46 49 47 48 46 47 49 49 48 41 47 54 0 47.0 Yeast 45 45 46 45 45 45 45 45 46 46 45 46 47 49 47 47 45 47 47 41 0 Candida k. 51 51 51 50 50 49 50 50 51 51 50 51 51 53 51 48 47 47 50 42 27 0 Rattlesnake Horse gray whale Bread mold Man,chimp cow,sheep Chicken, Tuna fish silkworm kangaroo Bullfrog worm fly penguin Candida monkey Donkey rabbit turtle Yeast Wheat Duck dog

Phylogenetic tree Indicates the ancestral relationships among the organisms that produced the protein. Each branch point indicates a common ancestor. Relative evolutionary distances between neighboring branch points are expressed as the number of amino acid differences per 100 residues of the protein. PAM units or Percentage of Accepted Mutations

PAM values differ for different proteins. Although DNA mutates at an assumed constant rate. Some proteins cannot accept mutations because the mutations kill the function of the protein and thus are not viable.

Mutation rates appear constant in time Although insects have shorter generation times than mammals and many more rounds of replication, the number of mutations appear to be independent of the number of generations but dependent upon time Cytochrome c amino acid differences between mammals, insects and plants note the similar distances

Evolution through gene duplication Many proteins within an organism have sequence similarities with other proteins. These are called gene or protein families. The relatedness among members of a family can vary greatly. These families arise by gene duplication. Once duplicated, individual genes can mutate into separate genes. Duplicated genes may vary in their chemical properties due to mutations. These duplicate genes evolve with different properties. Example the globin family.

Hemoglobin: Myoglobin: is an oxygen transport protein it must bind and release oxygen as the cells require oxygen Myoglobin: is an oxygen storage protein it binds oxygen tightly and releases it when oxygen concentrations are very low

The globin family history 1. Primordial globin gene acted as an Oxygen-storage protein. 2. Duplication occurred 1.1 billion years ago. lower oxygen-binding affinity, monomeric protein. 3. Developed a tetrameric structure two a and two b chains increased oxygen transport capabilities. 4. Mammals have fetal hemoglobin with a variant b chain i.e. g (a2g2). 5. Human embryos contain another hemoglobin 2e2. 6. Primates also have a d chain with no known unique function.

Protein Evolution is not organismal evolution Chimpanzee  human are about 99% the same amino acid sequences in proteins! However: Rapid divergence with few mutational changes suggest altered control of gene expression. Controlling the amount, where, and when a protein is made.