CS273 Algorithms for Structure and Motion in Biology Instructors: Serafim Batzoglou and Jean-Claude Latombe Teaching Assistant: Sam Gross | serafim | latombe.

Slides:



Advertisements
Similar presentations
Protein Structure and Physics. What I will talk about today… -Outline protein synthesis and explain the basic steps involved. -Go over the Chemistry of.
Advertisements

Introduction to Bioinformatics. What is Bioinformatics Easy Answer Using computers to solve molecular biology problems; Intersection of molecular biology.
Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Concept 5.4: Proteins have many structures, resulting in a wide range of functions.
Proteins Function and Structure.
The Chemical Building Blocks of Life Chapter 3. 2 Biological Molecules Biological molecules consist primarily of -carbon bonded to carbon, or -carbon.
© 2012 Pearson Education, Inc. Lecture by Edward J. Zalisko PowerPoint Lectures for Campbell Biology: Concepts & Connections, Seventh Edition Reece, Taylor,
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Biology 107 Macromolecules II September 9, Macromolecules II Student Objectives:As a result of this lecture and the assigned reading, you should.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Biology 107 Macromolecules II September 5, Macromolecules II Student Objectives:As a result of this lecture and the assigned reading, you should.
Biology 107 Macromolecules II September 8, 2003.
Stochastic roadmap simulation for the study of ligand-protein interactions Mehmet Serkan Apaydin, Carlos E. Guestrin, Chris Varma, Douglas L. Brutlag and.
The Central Dogma of Molecular Biology (Things are not really this simple) Genetic information is stored in our DNA (~ 3 billion bp) The DNA of a.
Structural Bioinformatics Dr. Avraham Samson Course no.: Credit points: 1.5 Final grade is based on 10 assignments Course homepage:
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Macromolecules: proteins & nucleic acids Building Blocks of Life
Proteins account for more than 50% of the dry mass of most cells
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Large Biomolecules. All Organisms Contain the Same Four Classes of Large Biomolecules lipids - hydrophobic =>macromolecules - chains of subunits polysaccharides.
Proteins account for more than 50% of the dry mass of most cells
Diverse Macromolecules. V. proteins are macromolecules that are polymers formed from amino acids monomers A. proteins have great structural diversity.
Protein structure. BIOMEDICAL IMPORTANCE Protein function – Catalyze metabolic reactions – Power cellular motion – Provide structural integrity Defect.
Chapter 3 Protein Structure and Function. Key Concepts Most cell functions depend on proteins. Amino acids are the building blocks of proteins. Amino.
3.2 Proteins Mini Lecture Radjewski. Major functions of proteins: Enzymes—catalytic proteins Defensive proteins (e.g., antibodies) Hormonal and regulatory.
ANIMAL NUTRITION. MECHANISMS TO INGEST FOOD Suspension Feeders: sift small food particles Substrate Feeders: live on or in their food source Fluid Feeders:
PROTEINS. A protein is: A. Polymer B. macromolecule C. Biomolecule D. Organic molecule E. All of the above.
Biology The Molecules of Cells. Carbon and Functional Groups I.Why is Carbon Important? A. What is Organic Chemistry? The study of carbon compounds is.
 Phospholipid bilayer  “Mosaic” of proteins The fluid-mosaic model.
Molecules of Life II CHAPTER 3 Proteins Amino Acid Monomers Polypeptide (protein) Polymers Levels of Protein Structure Importance of Structure to Function.
NOTES: Ch 5, part 2 - Proteins & Nucleic Acids Proteins have many structures, resulting in a wide range of functions ● Proteins account for more.
AP Biology Proteins Multipurpose molecules.
Chapter 5 Section 4 Proteins Mrs. Kerstetter Biology.
AP Biology Chemical Building Blocks  3.4 Proteins.
AP Biology Proteins AP Biology Proteins Multipurpose molecules.
AP Biology Proteins. AP Biology Proteins  Most structurally & functionally diverse group of biomolecules  Function:  involved in.
Proteins & Nucleic Acids Proteins make up around 50% of the bodies dry mass and serve many functions in the body including: – Enzymes – Biological catalysts.
Protein Evolution: Introduction to Protein Structure and Function protEvolEllsEmblSept2009 Please open the.
10/3/2003 Molecular and Cellular Modeling 10/3/2003 Introduction Objective: to construct a comprehensive simulation software system for the computational.
1 Proteins Protein functions include: 1. enzyme catalysts 2. defense 3. transport 4. support 5. motion 6. regulation 7. storage Chapter 3- part 2.
AP Biology Discuss the following with your group and be prepared to discuss with the class 1. Why is the shape of a molecule important? 2. How is a covalent.
THE STRUCTURE AND FUNCTION OF MACROMOLECULES Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific.
Chapter 3 The Molecules of Cells By Dr. Par Mohammadian Overview: -Carbon atom -Functional Groups -Major Biomolecules.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
The Chemical Building Blocks of Life Chapter 3. 2 Biological Molecules Biological molecules consist primarily of -carbon bonded to carbon, or -carbon.
Regents Biology Proteins Regents Biology Proteins: Multipurpose molecules.
5.4: Proteins Introduction
BIOLOGICALLY IMPORTANT MACROMOLECULES PROTEINS. A very diverse group of macromolecules characterized by their functions: - Catalysts - Structural Support.
Chapter 3 Proteins.
Proteins: multipurpose molecules
PROTEINS L3 BIOLOGY. FACTS ABOUT PROTEINS: Contain the elements Carbon, Hydrogen, Oxygen, and NITROGEN Polymer is formed using 20 different amino acids.
AP Biology Proteins AP Biology Proteins Multipurpose molecules.
3.8 Fats are lipids that are mostly energy-storage molecules  Some fatty acids contain double bonds –This causes kinks or bends in the carbon chain because.
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section D: Proteins -
CARBON AND MOLECULAR DIVERSITY The structure and function of macromolecules: Proteins and Nucleic Acids Chapter 5.
PROTEINS Proteins Composed mainly of –Carbon –Hydrogen –Nitrogen.
The Structure and Function of Macromolecules. II. Classes of Organic Molecules: What are the four classes of organic molecules?
Enzymes: A Molecular Perspective
CHM 708: MEDICINAL CHEMISTRY
Chapter 5 Proteins.
The Chemical Building Blocks of Life
Chemical agents PROTEINS: The Molecular Tools of the Cell
AIM: How are Proteins important to our Body?
Proteins Section 3.4.
Proteins.
Protein Structure and Function
Multipurpose molecules
Diverse Macromolecules
Proteins.
Ligand Docking to MHC Class I Molecules
Proteins.
Presentation transcript:

CS273 Algorithms for Structure and Motion in Biology Instructors: Serafim Batzoglou and Jean-Claude Latombe Teaching Assistant: Sam Gross | serafim | latombe | ssgross cs.stanford.edu Spring 2006 –

Need a Scribe!!

Range of Bio-CS Interaction Gene Molecules Tissue/Organs Body system Robotic surgery Molecular structures, similarities and motions Soft-tissue simulation and surgical training Cells Simulation of cell interaction CS273 Sequence alignment Enormous range over space and time

Focus on Proteins  Proteins are the workhorses of all living organisms  They perform many vital functions, e.g: Catalysis of reactions Transport of molecules Building blocks of muscles Storage of energy Transmission of signals Defense against intruders

Proteins are also of great interest from a computational viewpoint  They are large molecules (few 100s to several 1000s of atoms)  They are made of building blocks (amino acids) drawn from a small “library” of 20 amino-acids  They have an unusual kinematic structure: long serial linkage (backbone) with short side-chains

Proteins are associated with many challenging problems  Predict folded structures and motion pathways  Understand why some proteins misfold or partially fold, causing such diseases as: cystic fibrosis, Parkinson, Creutzfeldt-Jakob (mad cow)  Find structural similarities among proteins and classify proteins  Find functional structural motifs in proteins  Predict how proteins bind against other proteins and smaller molecules  Design new drugs  Engineer and design proteins and protein-like structures (polymers)

Central Dogma of Molecular Biology

transcription translation

Protein Sequence O N N N N OO O  Long sequence of amino-acids (dozens to thousands), also called residues  Dictionary of 20 amino-acids (several billion years old) (residue i-1)

O N N N N OO O Protein Sequence Peptide bond (partial double bond character) T

Central Dogma of Molecular Biology Physiological conditions: aqueous solution, 37°C, pH 7, atmospheric pressure

Levels of Protein Structures hemoglobin (4 polypeptide chains) Quaternary

Mostly  -helices Mostly  -sheets Mixed

Intermediate states Folding Unfolded (denatured) state Folded (native) state Many pathways

How (we think) a protein folds...  G =  H - T  S

How (we think) a protein folds...  G =  H - T  S

How (we think) a protein folds...  G =  H - T  S

How (we think) a protein folds...  G =  H - T  S

How (we think) a protein folds...  G =  H - T  S

Motion of Proteins in Folded State HIV-1 protease

Structural variability of the overall ensemble of native ubiquitin structures [Shehu, Kavraki, Clementi, 2005]

Amylosucrase Flexible Loop Loop 7

Central Dogma of Molecular Biology

Binding Inhibitor binding to HIV protease Protein-protein binding Ligand-protein binding

Binding of Pyruvate to LDH (reduction of pyruvate to lactase ) ASP-195 HIS- 193 ASP-166 ARG THR-245 C C O O O CH 3 NADH GLN-101 ARG-106 Loop Lactate dehydrogenase environment Pyruvate Nicotinamide adenine dinucleotide (coenzyme)

What is CS273 about?  Algorithms and computational schemes for molecular biology problems  Molecular biology seen by computer scientists

 y = f(x)  Biologists like experiments, specifics and classifications They like it better to know many (x i,y i ) – i.e., facts – and classify them, than to know f  Computer scientists like simulation, abstractions, and general algorithms They want to know f – the explanation of the facts – and efficient ways to compute it, but rarely care for any (x i,y i )  One challenge of Computational Biology is to fuse these two cultures The Shock of Two Cultures

 Two Views of a BioComputation Class  Where are IT resources for biology available and how to use them  How to design efficient data structures and algorithms for biology

Main Ideas Behind CS273 1.The information is in the sequence  Sequence  Structure (shape)  Function  Sequence similarity  Structural/functional similarity  Sequences are related by evolution

Main Ideas Behind CS273 1.The information is in the sequence  Sequence  Structure (shape)  Function  Sequence similarity  Structural/functional similarity  Sequences are related by evolution 2.Biomolecules move and bind to achieve their functions  Deformation  folded structures of proteins  Motion + deformation  multi-molecule complexes  One cannot just “jump” from sequence to function Protein folding Ligand protein binding

SequenceStructureFunction sequence similarity structure similarity

Main Ideas Behind CS273 1.The information is in the sequence  Sequence  Structure (shape)  Function  Sequence similarity  Structural/functional similarity  Sequences are related by evolution 2.Biomolecules move and bind to achieve their functions  Deformation  folded structures of proteins  Motion + deformation  multi-molecule complexes  One cannot just “jump” from sequence to function  CS273 is about algorithms for sequence, structure and motion - Finding sequence and shape similarities - Relating structure to function - Extracting structure from experimental data - Computing and analyzing motion pathways

Vision Underlying CS273  Goal of computational biology: Low-cost high-bandwidth in-silico biology  Requirements: Reliable models  Efficient algorithms  Algorithmic efficiency by exploiting properties of molecules and processes: Proteins are long kinematic chains Atoms cannot bunch up together Forces have relatively short ranges  Computational Biology is more than using computers to biological problems or mimicking nature (e.g., performing MD simulation)

Tentative Schedule 1April 5Introduction 2April 10Protein geometric and kinematic models 3April 12Conformational space 4April 17Inverse kinematics and applications 5April 19Sequence similarity 6April 24Sequence similarity 7April 26Sequence similarity 8May 1Structure comparison 9May 3Structure comparison 10May 8Protein phylogeny, clustering, and classification 11May 10Protein phylogeny, clustering, and classification 12May 15Energy maintenance 13May 17Energy maintenance 14May 22Structure prediction 15May 24Roadmap methods 16May 31Structure prediction 17June 5Structure prediction 18June 7TBA 19June 12Project presentations (2 hours)

Instructors and TAs  Instructors: –Serafim Batzoglou –Jean-Claude Latombe  TA: –Sam Gross  s: | serafim | latombe | ssgross cs.stanford.edu  Class website:

Expected Work  Regular attendance to lectures and active participation  Class scribing (assignments will depend on # of students)  Exciting programming project: Structure prediction - Clustering and distance metrics - Protein design - Something else

Questions?