Structural Bioinformatics Workshop Max Shatsky Workshop home page:

Slides:



Advertisements
Similar presentations
Transcription and Translation
Advertisements

Protein Synthesis Let’s make some protein!. Protein Synthesis: An Overview Genetic information is contained within the nucleus of a cell DNA in the nucleus.
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
Cell Division, Genetics, Molecular Biology
Structural bioinformatics
RNA and Protein Synthesis
Cell Division, Genetics, Molecular Biology
Review: The flow of genetic information in the cell is DNA  RNA  protein  The sequence of codons in DNA spells out the primary structure of a polypeptide.
Protein Structure Alignment Human Myoglobin pdb:2mm1 Human Hemoglobin alpha-chain pdb:1jebA Sequence id: 27% Structural id: 90% Another example: G-Proteins:
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Protein Structure, Databases and Structural Alignment
Agenda A brief introduction The MASS algorithm The pairwise case Extension to the multiple case Experimental results.
Structural Bioinformatics Workshop Max Shatsky Workshop home page:
13.2 Ribosomes and Protein Synthesis
Object Recognition. Geometric Task : find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding.
Structural Bioinformatics Seminar Dina Schneidman
Model Database. Scene Recognition Lamdan, Schwartz, Wolfson, “Geometric Hashing”,1988.
Protein Structure Alignment
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 3 Cell Structures and Their Functions Dividing Cells.
2.7 DNA Replication, transcription and translation
 Assemble the DNA  Follow base pair rules  Blue—Guanine  Red—Cytosine  Purple—Thymine  Green--Adenine.
Protein Synthesis Mrs. Harlin.
From gene to protein. DNA:nucleotides are the monomers Proteins: amino acids are the monomers DNA:in the nucleus Proteins:synthesized in cytoplasm.
How Genes Work. Transcription The information contained in DNA is stored in blocks called genes  the genes code for proteins  the proteins determine.
DNA and Protein Synthesis. DNA Does 2 Important Things in a Cell: 1)DNA is capable of replicating itself. Every time a cell divides, each DNA strand makes.
Chapter 13.2 (Pgs ): Ribosomes and Protein Synthesis
Transcription Transcription is the synthesis of mRNA from a section of DNA. Transcription of a gene starts from a region of DNA known as the promoter.
Transcription and Translation
13.2 Ribosomes and Protein Synthesis
Chapter 13: RNA and Protein Synthesis
What is the job of p53? What does a cell need to build p53? Or any other protein?
The Genetic Code.
CFE Higher Biology DNA and the Genome Translation.
Protein Synthesis: DNA CONTAINS THE GENETIC INFORMATION TO PRODUCE PROTEINS BUT MUST FIRST BE CONVERTED TO RND TO DO SO.
SC.912.L.16.5 Protein Synthesis: Transcription and Translation.
Notes: Protein Synthesis
GENE EXPRESSION TRANSCRIPTION, TRANSLATION AND MUTATIONS.
Protein Synthesis Transcription. DNA vs. RNA Single stranded Ribose sugar Uracil Anywhere Double stranded Deoxyribose sugar Thymine Nucleus.
12-3 RNA and Protein Synthesis
 DNA is the blueprint for life – it contains your genetic information  The order of the bases in a segment of DNA (GENE) codes for a particular protein;
Protein Synthesis The process of putting together amino acids to form proteins in the cell. The process of putting together amino acids to form proteins.
12.3 DNA, RNA, and Protein Objective: 6(C) Explain the purpose and process of transcription and translation using models of DNA and RNA.
Core Transcription and Translation
Introduction to Bioinformatics Algorithms Algorithms for Molecular Biology CSCI Elizabeth White
Leaving Cert Biology Genetics – section 2.5 Genetics ( RNA), 2.5.5,
Protein Synthesis: Transcription & Translation.
YouTube - "The Gene Scene". The Structure of RNA There are three main differences between RNA and DNA. 1. The sugar in RNA is ribose instead of deoxyribose.
Functions of RNA mRNA (messenger)- instructions protein
Microbial Genetics.  DNA replication is semi- conservative:  What does it mean? During cell division, each daughter cell inherits 2 DNA strands, One.
Protein Synthesis Transcription. DNA vs. RNA Single stranded Ribose sugar Uracil Anywhere Double stranded Deoxyribose sugar Thymine Nucleus.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Protein Synthesis. Review…  DNA:  Found in the nucleus  Double stranded  Contains the instructions for controlling the cell (including instructions.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
I.Structure and Function of RNA A) Why is RNA needed? 1) proteins are made by ribosomes outside the nucleus (on the rough Endoplasmic Reticulum)
CHAPTER 10 DNA REPLICATION & PROTEIN SYNTHESIS. DNA and RNA are polymers of nucleotides – The monomer unit of DNA and RNA is the nucleotide, containing.
Gene Expression DNA, RNA, and Protein Synthesis. Gene Expression Genes contain messages that determine traits. The process of expressing those genes includes.
Lesson 4- Gene Expression PART 2 - TRANSLATION. Warm-Up Name 10 differences between DNA replication and transcription.
Find the optimal alignment ? +. Optimal Alignment Find the highest number of atoms aligned with the lowest RMSD (Root Mean Squared Deviation) Find a balance.
12-3 RNA and Protein Synthesis Page 300. A. Introduction 1. Chromosomes are a threadlike structure of nucleic acids and protein found in the nucleus of.
Protein Synthesis The Making of Proteins Using the Genetic Information Stored in DNA.
DNA Transcription and Translation Review. There are 3 types of RNA: Messenger RNA (mRNA) Ribosomal RNA (rRNA) Transfer RNA (tRNA)
(3) Gene Expression Gene Expression (A) What is Gene Expression?
RNA, Protein Synthesis, Mutations, & Gene Expression
Protein Synthesis – The Key Steps
Central Dogma Central Dogma categorized by: DNA Replication Transcription Translation From that, we find the flow of.
RNA carries DNA’s instructions.
Steps of Translation.
Protein Structure Alignment
Genes and Protein Synthesis Review
A C G C C T T G A T C T G T C G C A T T T A G C
Presentation transcript:

Structural Bioinformatics Workshop Max Shatsky Workshop home page:

Schedule Introduction to protein structure. Introduction to pattern matching. Protein structure alignment (comparison). Protein Docking GAMB++ library.

 Presentation and Design Review  Final Project –Software Engineering –Efficiency of Solution –Working Examples and Test Cases –Documentation –Knowledge of all project aspects Grade Ingredients

Bioinformatics - Computational Genomics DNA mapping. Protein or DNA sequence comparisons. Exploration of huge textual databases. In essence one- dimensional methods and intuition.

Structural Bioinformatics - Structural Genomics Elucidation of the 3D structures of biomolecules. Analysis and comparison of biomolecular structures. Prediction of biomolecular recognition. Handles three-dimensional (3-D) structures. Geometric Computing. (a methodology shared by Computational Geometry, Computer Vision, Computer Graphics, Pattern Recognition etc.)

Protein Structural Comparison ApoAmicyanin - 1aaj Pseudoazurin - 1pmy

Algorithmic Solution About 1 sec. Fischer, Nussinov, Wolfson ~ 1990.

Introduction to Protein Structure

The central dogma DNA ---> mRNA ---> Protein {A,C,G,T} {A,C,G,U} {A,D,..Y} Guanine-Cytosine T->U Thymine-Adenine 4 letter alphabets 20 letter alphabet Sequence of nucleic acids seq of amino acids

When genes are expressed, the genetic information (base sequence) on DNA is first transcribed (copied) to a molecule of messenger RNA in a process similar to DNA replication. The mRNA molecules then leave the cell nucleus and enter the cytoplasm, where triplets of bases ((codons) forming the genetic code specify the particular amino acids that make up an individual protein. This process, called translation, is accomplished by ribosomes (cellular components composed of proteins and another class of RNA) that read the genetic code from the mRNA, and transfer RNAs (tRNAs) that transport amino acids to the ribosomes for attachment to the growing protein. (From )

Amino acids and the peptide bond C  – first side chain carbon (except for glycine ). Cα atoms

Wire-frame or ribbons display

Geometric Representation 3-D Curve {v i }, i=1…n

Secondary structure

Hydrogen bonds.  strands and sheets

The Holy Grail - Protein Folding From Sequence to Structure. Relatively primitive computational folding models have proved to be NP hard even in the 2-D case.

Determination of protein structures X-ray Crystallography NMR (Nuclear Magnetic Resonance) EM (Electron microscopy)

An NMR result is an ensemble of models Cystatin (1a67)

The Protein Data Bank (PDB) International repository of 3D molecular data. Contains x-y-z coordinates of all atoms of the molecule and additional data.

Why bother with structures when we have sequences ? In evolutionary related proteins structure is much better preserved than sequence. Structural motifs may predict similar biological function. Getting insight into protein folding. Recovering the limited (?) number of protein folds.

Applications Classification of protein databases by structure. Search of partial and disconnected structural patterns in large databases. Extracting Structure information is difficult, we want to extract “new” folds.

Applications (continued) Speed up of drug discovery. Detection of structural pharmacophores in an ensemble of drugs (similar substructures in drugs acting on a given receptor – pharmacophore). Comparison and detection of drug receptor active sites (structurally similar receptor cavities could bind similar drugs).

Object Recognition

Model Database

Scene

Recognition Lamdan, Schwartz, Wolfson, “Geometric Hashing”,1988.

Protein Alignment = Geometric Pattern Discovery

Protein Alignment The superimposition pattern is not known a- priori – pattern discovery. The matching recovered can be inexact. We are looking not necessarily for the largest superimposition, since other matchings may have biological meaning.

Geometric Task : find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding 3-D points. Given two configurations of points in the three dimensional space, T

Geometric Task (continued) Aspects: Object representation (points, vectors, segments) Object resemblance (distance function) Transformation (translations, rotations, scaling) -> Optimization technique

Transformations Translation Translation and Rotation Rigid Motion (Euclidian Trans.) Translation, Rotation + Scaling

Inexact Alignment. Simple case – two closely related proteins with the same number of amino acids. T Question: how to measure alignment error?

Superposition - best least squares (RMSD – Root Mean Square Deviation) Given two sets of 3-D points : P={p i }, Q={q i }, i=1,…,n; rmsd(P,Q) = √  i |p i - q i | 2 /n Find a 3-D rigid transformation T * such that: rmsd( T * (P), Q ) = min T √  i |T * p i - q i | 2 /n A closed form solution exists for this task. It can be computed in O(n) time.

Problem statement with RMSD metric. find the largest alignment, a set of matched elements and transformation, with RMSD less than ε. (belong to NP, is it in NPC?) Given two configurations of points in the three dimensional space, and ε threshold T

Distance Functions Two point sets: A={a i } i=1…n B={b j } j=1…m Pairwise Correspondence: (a k 1,b t 1 ) (a k 2,b t 2 )… (a k N,b t N ) (1) Exact Matching: ||a k i – b t i ||=0 (2) RMSD (Root Mean Square Distance) Sqrt( Σ||a k i – b t i || 2 /N) < ε (3) Bottleneck max ||a k i – b t i || Hausdorff distance: h(A,B)=max aєA min bєB ||a– b|| H(A,B)=max( h(A,B), h(B,A))