Cyclopeptide Sequencing

Slides:



Advertisements
Similar presentations
From Genome to Proteome Juang RH (2004) BCbasics Systems Biology, Integrated Biology.
Advertisements

Pressure in Gases (Ideal Gases) Objectives (a) state Boyle’s law.
T h e G a s L a w s. T H E G A S L A W S z B o y l e ‘ s L a w z D a l t o n ‘ s L a w z C h a r l e s ‘ L a w z T h e C o m b i n e d G a s L a w z B.
1 Codes, Ciphers, and Cryptography-Ch 2.3 Michael A. Karls Ball State University.
Collision Detection CSE 191A: Seminar on Video Game Programming Lecture 3: Collision Detection UCSD, Spring, 2003 Instructor: Steve Rotenberg.
Hidden Markov Models: an Introduction by Rachel Karchin.
Chapter 2 Simple Thermodynamics Systems
Dynamic Programming Code
11-1 Matrix-chain Multiplication Suppose we have a sequence or chain A 1, A 2, …, A n of n matrices to be multiplied –That is, we want to compute the product.
P Fig. 6-1, p. 193 Fig. 6-2, p. 193 Fig. 6-3, p. 195.
Lecture 10 – Introduction to Probability Topics Events, sample space, random variables Examples Probability distribution function Conditional probabilities.
Others can edit Wiki—no central gate keeper Google: not always ordered by quality but by popularity and sometimes by payment.
6.3 Conditional Probability.  Calculate Conditional Probabilities  Determine if events are independent.
20 B Week II Chapters 9 -10) Macroscopic Pressure Microscopic pressure( the kinetic theory of gases: no potential energy) Real Gases: van der Waals Equation.
Others can edit Wiki—no central gate keeper Google: not always ordered by quality but by popularity and sometimes by payment.
A Comprehensive Comparison of the de novo Sequencing Accuracies of PEAKS, BioAnalyst and PLGS Bin Ma 1 ; Amanda Doherty-Kirby 1 ; Aaron Booy 2 ; Bob Olafson.
2G1516 Formal Methods2005 Mads Dam IMIT, KTH 1 CCS: Operational Semantics And Process Algebra Mads Dam Reading: Peled 8.3, 8.4, 8.6 – rest of ch. 8.
Lesson Addition Rule and Complements. Objectives Use the Addition Rule for disjoint events Use the General Addition Rule Compute the probability.
Sequence Search and Analysis SPE 1653 (703)
GENOME: an organism’s complete set of genetic material Humans ~3 billion base pairs CHROMOSOME: Part of the genome; structure that holds tightly wound.
A Different Type of Monte Carlo Simulation Jake Blanchard Spring 2010 Uncertainty Analysis for Engineers1.
PEAKS: De Novo Sequencing using Tandem Mass Spectrometry Bin Ma Dept. of Computer Science University of Western Ontario.
1 MINISTRY OF SCIENCE AND TECHNOLOGY DEPARTMENT OF TECHNICAL AND VOCATIONAL EDUCATION ME A.G.T.I (First Year) MECHANICAL ENGINEERING FUNDAMENTAL.
Faster, more sensitive peptide identification from tandem mass spectra by sequence database compression Nathan J. Edwards Center for Bioinformatics & Computational.
Types and Programming Languages Lecture 3 Simon Gay Department of Computing Science University of Glasgow 2006/07.
Addition Rules for Probability Mutually Exclusive Events When do you add/ when do you subtract probabilities?
1 Chapter 8 Recursion. 2 Recursive Function Call a recursion function is a function that either directly or indirectly makes a call to itself. but we.
Charmless Two-Body B Decays Involving a Tensor Meson Kwei-Chou Yang Chung-Yuan Christian University, Taiwan The 36th International Conference on High Energy.
Gases & Kinetic Molecular Theory Kinetic molecular theory Gases Physical properties Temperature Pressure Boyles Law Charles Law Gay Lussacs Law Combined.
Peptide de novo sequencing Peptide de novo sequencing is the analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum.
Vector 2.
WRKY transcription factors in potato genome factors in potato genome
Group.
How to order – Metal bowl
Prüfer code algorithm Algorithm (Prüfer code)
6.2 - Dot Product of Vectors
Good afternoon! August 9, 2017.
13.2 Complements and Unions of Events
10.8 The Power Theorems After studying this section, you will be able to apply the power theorems.
Applied Algorithmics - week7
What is it? Where is it found? What does it do?
Approximate Inference
Chapter 5 Induction and Recursion
Lecture 10 – Introduction to Probability
RNA and Protein Synthesis
پروفیل طولی یا نیمرخ طولی (Longitudinal Profile)
Accurate gene phylogeny across multiple complete genomes
Planning José Luis Ambite.
Discrete Mathematics and its Applications
WRKY transcription factors in potato genome factors in potato genome
Cyclopeptide Sequencing Problem
8.5 Key Concept: Translation converts an mRNA message into a polypeptide, or protein.
Assembly of BAC ends on P250I21
Proteins are made of amino acids
CMPS 3130/6130: Computational Geometry Spring 2017
Sequencing Cyclic Peptides
RNA and Protein Synthesis
Inference III: Approximate Inference
Addition and Multiplication Rules of Probability
Thermodynamics tutorhour 1
CMPS 3120: Computational Geometry Spring 2013
Factorization & Independence
Factorization & Independence
Amino acids are coded by mRNA base sequences.
Assignment 5 Example of multivariate regression
Note 10:Conditional Probability
Addition and Multiplication Rules of Probability
1. Cyclomatic complexity
Lipid mass spectrometry analysis of WT, tgl3Δ and vps4Δ mutant cells.
Mass spectrometry identification of biomarker candidates in the prediagnostic plasmas in the Women's Health Initiative cohort. Mass spectrometry identification.
Presentation transcript:

Cyclopeptide Sequencing Obyanamide, a non-ribosomally produced antitumor. From Norine database. Tayla Isensee

Problem: Antibiotics are not in the genome Solution: Mass Spectrometry Coding Problem: Brute force takes too long Solution: Branch and bound Candidates: ['PG', 'PA', 'PS', 'PP', 'PV', 'PT', 'PC', 'PI', 'PL', 'PN', 'PD', 'PK', 'PQ', 'PE', 'PM', 'PH', 'PF', 'PR', 'PY', 'PW', 'PG', 'PA', 'PS', 'PP', 'PV', 'PT', 'PC', 'PI', 'PL', 'PN', 'PD', 'PK', 'PQ', 'PE', 'PM', 'PH', 'PF', 'PR', 'PY', 'PW', 'VG', 'VA', 'VS', 'VP', 'VV', 'VT‘...] Peptides Kept: ['PV', 'PT', 'PC', 'VP', 'VV', 'VT', 'VC', 'TP', 'TV', 'TT', 'CP', 'CV']

Data Spectrum = [0, 97, 97, 99, 101, 103, 196, 198, 198, 200, 202, 295, 297, 299, 299, 301, 394, 396, 398, 400, 400, 497]

High Level Steps CyclopeptideSequencing(Spectrum)         Peptides ← a list         while Final Peptides is empty:             Peptides ← Expand(Peptides)             for each peptide Peptide in Peptides                 if Mass(Peptide) is in Spectrum KeptPeps = KeptPeps + peptide                     if Cyclospectrum(Peptide) = Spectrum Final Peptides + Peptide                         output Final Peptide Peptides = KeptPeps                     clear KeptPeps CyclopeptideSequencing(Spectrum)         Peptides ← a set containing only the empty peptide         while Peptides is nonempty             Peptides ← Expand(Peptides)             for each peptide Peptide in Peptides                 if Mass(Peptide) = ParentMass(Spectrum)                     if Cyclospectrum(Peptide) = Spectrum                         output Peptide                     remove Peptide from Peptides                 else if Peptide is not consistent with Spectrum                     remove Peptide from Peptides

Results Program can sequence cyclic peptides! But…runtime limits the length of peptide (no leaderboard) And…LOTS of assumptions about the fidelity of the data Spectrum = [0, 97, 97, 99, 101, 103, 196, 198, 198, 200, 202, 295, 297, 299, 299, 301, 394, 396, 398, 400, 400, 497] Possible Amino Acid Sequence = PVCPT Mass Sequence = 97-99-103-97-101

Conclusion Program works on highly improbable, perfect datasets Improving runtime, either with more expansive bounding or implementing a leaderboard But… NRPs can contain more than 20 aa’s (100s), It isn’t science unless you find a cool database =