Motivational Lecture: UNIX and computer-aided design of new medicines. Alexey Onufriev.

Slides:



Advertisements
Similar presentations
Chemistry for Life Science. Matter Anything that: Anything that: Has mass. Has mass. Takes up space. Takes up space.
Advertisements

Genetics and Genetic Engineering
Interest Grabber Order! Order! Genes are made of DNA, a large, complex molecule. DNA is composed of individual units called nucleotides. Three of these.
Learning Objectives: To understand what is meant by the term, ‘relative molecular mass’
Recombinant DNA technology
Frontiers of Genetics.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
The branch that breaks Is called rotten, but Wasn’t there snow on it? Bartolt Brecht Haiti after a hurricane.
Lecture 3/4 Rob Phillips California Institute of Technology (Block et al.) (Wuite et al.)
DNA -> RNA -> Proteins Proteins are central to life Cellular structure, communications, etc. Medical, drug development Failure -> disease (ex: missing,
Using Metacomputing Tools to Facilitate Large Scale Analyses of Biological Databases Vinay D. Shet CMSC 838 Presentation Authors: Allison Waugh, Glenn.
Introduction to Structural Bioinformatics Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
1 Computational Biology, Part 13 Retrieving and Displaying Macromolecular Structures Robert F. Murphy Copyright  1996, 1999, All rights reserved.
1 Computational Biology, Part 11 Retrieving and Displaying Macromolecular Structures Robert F. Murphy Copyright  1996, 1999, All rights reserved.
TEAM COMPETITION A Review for the TEST over Unit 3a: The Structures and Functions of DNA and RNA.
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
Structure and Function of Proteins Lecturer: Dr. Ora Furman Oct 2009 Winter 2009/10 Teaching Assistants: Miraim Oxsman Sivan Pearl.
Genetics and Genetic Engineering terms clones b organisms or cells of nearly identical genetic makeup derived from a single source.
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
Part II : Introduction To Protein Structure Kong Lesheng Victor Tong Joo Chuan National University of Singapore.
Protein Synthesis: Transcription
The.pdb file format, and other resources for structural information Topic 5 Chapter 10 & 11, Du and Bourne “Structural Bioinformatics”
Ch. 13 Genetic Engineering
Restriction Enzymes Enzymes that CUT
CS 790 – Bioinformatics Introduction and overview.
Doug Raiford Lesson 3.  More and more sequence data is being generated every day  Useless if not made available to other researchers.
Chemistry 2100 Chapter 26. The Central Dogma The central dogma of molecular biology: –Information contained in DNA molecules is expressed in the structure.
4 Molecular biology in medicine Figure 4.1 The influenza virus has two surface proteins: neuraminidase and haemagglutinin. These proteins change continually,
Macromolecular Visualization or… Where to go when ChemDraw just isn’t enough Martin Case Chem
Biology 2.3 Carbon Compounds
DNA Technology. 1.Isolation – of the DNA containing the required gene 2.Insertion – of the DNA into a vector 3.Transformation – Transfer of DNA into a.
The Strategy of Atomic Resolution Structural Biology Break down complexity so that the system can be understood at a fundamental level Build up a picture.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
The Molecular Basis of Inheritance
Polymer Molecule made of many monomers bonded together
Inorganic and Organic Compounds. Chemistry of Life UEQ: How do the properties and structures of materials determine their uses? What determines the type.
The digestive system, thermodynamics, enzymes, and transport across membranes May 12, 2003 Learning objectives- Be capable of manipulating protein structures.
What is it and why is it useful? EXAMPLES: a. Biomolecular surface story. b. Improving enzyme’s function. c. Folding proteins. Research in Structural.
STRUCTURAL BIOLOGY Martina Mijušković ETH Zürich, Switzerland.
PHARMACOBIOTECHNOLOGY.  Recombinant DNA (rDNA) is constructed outside the living cell using enzymes called “restriction enzymes” to cut DNA at specific.
BIOTECHNOLOGY.
© 2012 Pearson Education, Inc. Lecture by Edward J. Zalisko PowerPoint Lectures for Campbell Biology: Concepts & Connections, Seventh Edition Reece, Taylor,
Biotechnology -- Chap. 16. The use of biological systems for the production of materials (most work is in the field of Genetic Engineering)
GENETIC RECOMBINATION By Dr. Nessrin Ghazi AL-Abdallat Lecturer of Microbiology.
Rational Drug Design Solving the influenza problem.
X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.
X-ray detection xray/facilities.html.
Plasmids and Vectors Aims:
Topic 1 Roland Dunbrack. Modeling of Biological Units Model data files of single proteins may require –sequence alignment(s) to templates (entry and chain)
 What is different between these 2 sequences? GGAATTCCTAGCAAT CCTTAAGGATCGTTA CTACGTGAGGAATTC GATGCACTCCTTAAG.
Translation- taking the message of DNA and converting it into an amino acid sequence.
No New Virus Produced No New Virus Produced New Virus Produced New Virus Produced Ligand Inserted Into Exosite Ligand Inserted Into Exosite HIV Protease.
Useful shell commands head/tail, cut, sort, uniq Virginie Orgogozo March 2011.
Biotechnology Made up of 3 technologies: Bioprocess technology: when microorganisms are provided with nutrients and advantageous conditions, they perform.
Useful shell commands head/tail, cut, sort, uniq Virginie Orgogozo March 2011.
4/26/2010 BIOTECHNOLOGY.
CHM 708: MEDICINAL CHEMISTRY
Protein Synthesis.
Number of released entries
Answer in your packets!!! 
Week 2 Vocab Definitions
Characteristics and Classification of Life
1. Molecules – Made up of atoms
The Importance of Metal Aqueous Speciation
WHY IS EVERYONE CRAZY FOR CRISPR?
CHAPTER 10 Molecular Biology of the Gene
Macromolecules Biology ch. 2-3.
Gene transfer © 2016 Paul Billiet ODWS.
A Drug-Drug Interaction Crystallizes a New Entry Point into the UPR
Presentation transcript:

Motivational Lecture: UNIX and computer-aided design of new medicines. Alexey Onufriev

Why bother?Example: rational drug design. Drug agent e.g:viral endonuclease (cuts DNA, RNA) If you block the enzyme's function – you kill the virus.

Molecular surface of acetyl choline esterase molecule (structure by Sussman et al.) color coded by electrostatic potential. Challenge: Find a small, non-toxic molecule that will selectively fit right into the cavity. And only this cavity, no other one!

Example of successful computer-aided (rational) drug design: One of the drugs that stopped AIDS epidemic in the US (part of anti-retro viral cocktail). The drug blocks the function of a key viral protein. To design the drug, one needs a precise 3D structure of that protein.

Biological function = f( 3D molecular structure ) Workhorses of biological functions = Proteins. Proteins = Protein Data Bank

Protein Data Bank ~ 30,000 structures

HEADER OXYGEN TRANSPORT 13-DEC M TITLE SPERM WHALE MYOGLOBIN F46V N-BUTYL ISOCYANIDE AT PH 9.0 COMPND MOL_ID: 1; COMPND 2 MOLECULE: MYOGLOBIN; COMPND 3 CHAIN: NULL; COMPND 4 ENGINEERED: SYNTHETIC GENE; COMPND 5 MUTATION: INS(M0), F46V, D122N SOURCE MOL_ID: 1; SOURCE 2 ORGANISM_SCIENTIFIC: PHYSETER CATODON; SOURCE 3 ORGANISM_COMMON: SPERM WHALE; SOURCE 4 TISSUE: SKELETAL MUSCLE; SOURCE 5 CELLULAR_LOCATION: CYTOPLASM; SOURCE 6 EXPRESSION_SYSTEM: ESCHERICHIA COLI; SOURCE 7 EXPRESSION_SYSTEM_STRAIN: PHAGE RESISTANT SOURCE 8 EXPRESSION_SYSTEM_CELLULAR_LOCATION: SOURCE 9 EXPRESSION_SYSTEM_VECTOR_TYPE: PLASMID; SOURCE 10 EXPRESSION_SYSTEM_PLASMID: PEMBL 19+ KEYWDS LIGAND BINDING, OXYGEN STORAGE, OXYGEN BINDING, HEME, KEYWDS 2 OXYGEN TRANSPORT EXPDTA X-RAY DIFFRACTION AUTHOR R.D.SMITH,J.S.OLSON,G.N.PHILLIPS JUNIOR A typical PDB entry (header) myoglobin

PDB Key Part: atomic coordiantes (x,y,z) ATOM 1 N MET N ATOM 2 CA MET C ATOM 3 C MET C ATOM 4 O MET O ATOM 5 CB MET C ATOM 6 CG MET C ATOM 7 SD MET S ATOM 8 CE MET C ATOM 9 N VAL N ATOM 10 CA VAL C ATOM 11 C VAL C ATOM 12 O VAL O ATOM 13 CB VAL C ATOM 14 CG1 VAL C ATOM 15 CG2 VAL C ATOM 16 N LEU N ATOM 17 CA LEU C ATOM 18 C LEU C ATOM 19 O LEU O ATOM 20 CB LEU C ATOM 21 CG LEU C ATOM 22 CD1 LEU C Problem: No Hydrogens! What's missing?

So, why are PDB files missing hydrogens? Experiment: H-atoms are so small, X-ray does not see them.

Approach: use a combination of physics, chemistry and computer science to create a web-server that will add hydrogens to 3D protein structure, based on a set of algorithms. Challenge: A lot of heterogeneous steps, performed by many different pieces of scientific software that have to talk to each other. Solution: use UNIX to orchestrate the process. Why UNIX? Because UNIX applications talk to each other See

Placing hydrogens flow chart: Pdb entry without H Initial (dumb) placing of hydrogens (easy) Pdb pre-processing. Clean-up, Building missing parts, etc. Use AWK, PERL, BASH scripts Smart placing of H. Involves trying all different positions ( and minimizing the total energy. Use various scientific cods developed for UNIX platforms) “Completed” PDB entry + pKs

Conclusions: UNIX tools have been used to build a useful web-based tools for drug design and molecular modeling. Virtually impossible to duplicate under a WINDOWS environment due to incompatibility of various proprietary codes that one would have to use if one tried (well, no one did...)