Protein Folding Bioinformatics Ch 7 (with a little of Ch 8)

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

1 Applied Physics And Chemistry Covalent bonding.
Simplifications of Context-Free Grammars
Variations of the Turing Machine
Angstrom Care 培苗社 Quadratic Equation II
Chapter 7 Chemical Quantities
AP STUDY SESSION 2.
1
& dding ubtracting ractions.
1) a) H = 2.06%, S = 32.69%, O = 65.25% b) Ca = 54.09%, O = 43.18%, H = 2.73% 2) 24 x x x.112 = ) a) g/mol, b) g/mol.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Multiplication X 1 1 x 1 = 1 2 x 1 = 2 3 x 1 = 3 4 x 1 = 4 5 x 1 = 5 6 x 1 = 6 7 x 1 = 7 8 x 1 = 8 9 x 1 = 9 10 x 1 = x 1 = x 1 = 12 X 2 1.
Division ÷ 1 1 ÷ 1 = 1 2 ÷ 1 = 2 3 ÷ 1 = 3 4 ÷ 1 = 4 5 ÷ 1 = 5 6 ÷ 1 = 6 7 ÷ 1 = 7 8 ÷ 1 = 8 9 ÷ 1 = 9 10 ÷ 1 = ÷ 1 = ÷ 1 = 12 ÷ 2 2 ÷ 2 =
David Burdett May 11, 2004 Package Binding for WS CDL.
1 Covalent bonds l Nonmetals hold onto their valence electrons. l They cant give away electrons to bond. l Still want noble gas configuration. l Get it.
We need a common denominator to add these fractions.
Process a Customer Chapter 2. Process a Customer 2-2 Objectives Understand what defines a Customer Learn how to check for an existing Customer Learn how.
CALENDAR.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
The 5S numbers game..
Break Time Remaining 10:00.
The basics for simulations
Factoring Quadratics — ax² + bx + c Topic
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
Bright Futures Guidelines Priorities and Screening Tables
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Bellwork Do the following problem on a ½ sheet of paper and turn in.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
Chapter 1: Expressions, Equations, & Inequalities
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
The Chemicals of Living Cells ©The Wellcome Trust.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
LO: Count up to 100 objects by grouping them and counting in 5s 10s and 2s. Mrs Criddle: Westfield Middle School.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
Before Between After.
Subtraction: Adding UP
: 3 00.
5 minutes.
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Essential Cell Biology
Converting a Fraction to %
ANSC644 Bioinformatics-Database Mining 1 ANSC644 Bioinformatics §Carl J. Schmidt §051 Townsend Hall §
Clock will move after 1 minute
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Select a time to count down from the clock above
9. Two Functions of Two Random Variables
1 Dr. Scott Schaefer Least Squares Curves, Rational Representations, Splines and Continuity.
1 Decidability continued…. 2 Theorem: For a recursively enumerable language it is undecidable to determine whether is finite Proof: We will reduce the.
From Sequences to Structure
Disulfide Bonds Two cyteines in close proximity will form a covalent bond Disulfide bond, disulfide bridge, or dicysteine bond. Significantly stabilizes.
Intro to Bioinformatics Computational Approaches to Receptor Structure Prediction Uğur Sezerman Biological Sciences and Bioengineering Program Sabancı.
Roadmap The topics: basic concepts of molecular biology more on Perl
CAP5510 – Bioinformatics Protein Structures
Operone lac Principles of protein structure and function Function is derived from structure Structure is derived from amino acid sequence Different.
CS790 – BioinformaticsProtein Structure and Function1 Disulfide Bonds  Two cyteines in close proximity will form a covalent bond  Disulfide bond, disulfide.
PROTEIN FOLDING: H-P Lattice Model 1. Outline: Introduction: What is Protein? Protein Folding Native State Mechanism of Folding Energy Landscape Kinetic.
Objective 7: TSWBAT recognize and give examples of four levels of protein conformation and relate them to denaturation.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein Folding & Biospectroscopy Lecture 4 F14PFB David Robinson.
PROTEINS Characteristics of Proteins Contain carbon, hydrogen, oxygen, nitrogen, and sulfur Serve as structural components of animals Serve as control.
Protein Structure and Function. Primary Structure  Made of amino acids covalently bonded together.
Chapter 5 The Structure and Function of Macromolecules
Presentation transcript:

Protein Folding Bioinformatics Ch 7 (with a little of Ch 8)

The Protein Folding Problem Given a particular sequence of amino acid residues (primary structure), what will the tertiary/quaternary structure of the resulting protein be?Central question of molecular biology:Given a particular sequence of amino acid residues (primary structure), what will the tertiary/quaternary structure of the resulting protein be? Input: AAVIKYGCAL… Output: 1 1, 2 2 … = backbone conformation: (no side chains yet)

Disulfide Bonds Two cyteines in close proximity will form a covalent bond Disulfide bond, disulfide bridge, or dicysteine bond. Significantly stabilizes tertiary structure.

Protein Folding – Biological perspective Central dogma: Sequence specifies structureCentral dogma: Sequence specifies structure Denature – to unfold a protein back to random coil configuration – -mercaptoethanol – breaks disulfide bonds –Urea or guanidine hydrochloride – denaturant –Also heat or pH Anfinsens experiments –Denatured ribonuclease –Spontaneously regained enzymatic activity –Evidence that it re-folded to native conformation

Folding intermediates Levinthals paradox – Consider a 100 residue protein. If each residue can take only 3 positions, there are = possible conformations. –If it takes s to convert from 1 structure to another, exhaustive search would take years! Folding must proceed by progressive stabilization of intermediates –Molten globules – most secondary structure formed, but much less compact than native conformation.

Forces driving protein folding It is believed that hydrophobic collapse is a key driving force for protein folding –Hydrophobic core –Polar surface interacting with solvent Minimum volume (no cavities) Disulfide bond formation stabilizes Hydrogen bonds Polar and electrostatic interactions

Folding help Proteins are, in fact, only marginally stable –Native state is typically only 5 to 10 kcal/mole more stable than the unfolded form Many proteins help in folding –Protein disulfide isomerase – catalyzes shuffling of disulfide bonds –Chaperones – break up aggregates and (in theory) unfold misfolded proteins

The Hydrophobic Core Hemoglobin A is the protein in red blood cells (erythrocytes) responsible for binding oxygen. The mutation E6 V in the chain places a hydrophobic Val on the surface of hemoglobin The resulting sticky patch causes hemoglobin S to agglutinate (stick together) and form fibers which deform the red blood cell and do not carry oxygen efficiently Sickle cell anemia was the first identified molecular disease

Sickle Cell Anemia Sequestering hydrophobic residues in the protein core protects proteins from hydrophobic agglutination.

Computational Problems in Protein Folding Two key questions: –Evaluation – how can we tell a correctly-folded protein from an incorrectly folded protein? H-bonds, electrostatics, hydrophobic effect, etc. Derive a function, see how well it does on real proteins –Optimization – once we get an evaluation function, can we optimize it? Simulated annealing/monte carlo EC Heuristics

Fold Optimization Simple lattice models (HP-models) –Two types of residues: hydrophobic and polar –2-D or 3-D lattice –The only force is hydrophobic collapse –Score = number of H H contacts

H/P model scoring: count noncovalent hydrophobic interactions. Sometimes: –Penalize for buried polar or surface hydrophobic residues Scoring Lattice Models

What can we do with lattice models? For smaller polypeptides, exhaustive search can be used –Looking at the best fold, even in such a simple model, can teach us interesting things about the protein folding process For larger chains, other optimization and search methods must be used –Greedy, branch and bound –Evolutionary computing, simulated annealing –Graph theoretical methods

The hydrophobic zipper effect: Learning from Lattice Models Ken Dill ~ 1997

Absolute directions –UURRDLDRRU Relative directions –LFRFRRLLFL –Advantage, we cant have UD or RL in absolute –Only three directions: LRF What about bumps? LFRRR –Bad score –Use a better representation Representing a lattice model

Preference-order representation Each position has two preferences –If it cant have either of the two, it will take the least favorite path if possible Example: {LR},{FL},{RL}, {FR},{RL},{RL},{FR},{RF} Can still cause bumps: {LF},{FR},{RL},{FL}, {RL},{FL},{RF},{RL}, {FL}

Decoding the representation The optimizer works on the representation, but to score, we have to decode into a structure that lets us check for bumps and score. Example: How many bumps in: URDDLLDRURU? We can do it on graph paper –Start at 0,0 –Fill in the graph

More realistic models Higher resolution lattices (45° lattice, etc.) Off-lattice models –Local moves –Optimization/search methods and / representations Greedy search Branch and bound EC, Monte Carlo, simulated annealing, etc.

Threading: Fold recognition Given: –Sequence: IVACIVSTEYDVMKAAR… –A database of molecular coordinates Map the sequence onto each fold Evaluate –Objective 1: improve scoring function –Objective 2: folding

X-Ray Crystallography ~0.5mm The crystal is a mosaic of millions of copies of the protein. As much as 70% is solvent (water)! May take months (and a green thumb) to grow.

X-Ray diffraction Image is averaged over: –Space (many copies) –Time (of the diffraction experiment)

The Protein Data Bank ATOM 1 N ALA E APR 213 ATOM 2 CA ALA E APR 214 ATOM 3 C ALA E APR 215 ATOM 4 O ALA E APR 216 ATOM 5 CB ALA E APR 217 ATOM 6 N GLY E APR 218 ATOM 7 CA GLY E APR 219 ATOM 8 C GLY E APR 220 ATOM 9 O GLY E APR 221 ATOM 10 N VAL E APR 222 ATOM 11 CA VAL E APR 223 ATOM 12 C VAL E APR 224 ATOM 13 O VAL E APR 225 ATOM 14 CB VAL E APR 226 ATOM 15 CG1 VAL E APR 227 ATOM 16 CG2 VAL E APR 228