1 InCoB 2009, Singapore Ren é Hussong et al. Highly accelerated feature detection in mass spectrometry data using modern graphics processing units Bioinformatics.

Slides:



Advertisements
Similar presentations
Protein Quantitation II: Multiple Reaction Monitoring
Advertisements

Improvements in Mass Spectrometry for Life Science Research – Does Agilent Have the Answer? Ashley Sage PhD.
(t,x) domain, pattern-based ground roll removal Morgan P. Brown* and Robert G. Clapp Stanford Exploration Project Stanford University.
Advanced Higher Unit 3 Mass Spectrometry. Mass spectrometry can be used to determine the accurate molecular mass and structural features of an organic.
Proposal for a Standard Representation of the Results of GC-MS Analysis: A Module for ArMet Helen Fuell 1, Manfred Beckmann 2, John Draper 2, Oliver Fiehn.
3. Introduction to Digital Image Analysis
Peptide Identification by Tandem Mass Spectrometry Behshad Behzadi April 2005.
Smart Templates for Chemical Identification in GCxGC-MS QingPing Tao 1, Stephen E. Reichenbach 2, Mingtian Ni 3, Arvind Visvanathan 2, Michael Kok 2, Luke.
Image Sampling Moire patterns
CUDA Programming Lei Zhou, Yafeng Yin, Yanzhi Ren, Hong Man, Yingying Chen.
Theodore Alexandrov, Michael Becker, Sören Deininger, Günther Ernst, Liane Wehder, Markus Grasmair, Ferdinand von Eggeling, Herbert Thiele, and Peter Maass.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
Previous Lecture: Regression and Correlation
Mass Spectrometry. What are mass spectrometers? They are analytical tools used to measure the molecular weight of a sample. Accuracy – 0.01 % of the total.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Introduction to high-throughput analysis of proteins and metabolites by Mass Spectrometry The basic principle Brief introduction of techniques Computational.
Fa 05CSE182 CSE182-L9 Mass Spectrometry Quantitation and other applications.
Proteomics Informatics – Overview of Mass spectrometry (Week 2)
This work is licensed under a Creative Commons Attribution 4.0 International License. Oliver Kohlbacher, Sven Nahnsen, Knut Reinert COMPUTATIONAL PROTEOMICS.
Chapter 9 Mass Spectrometry (MS) -Microbial Functional Genomics 조광평 CBBL.
CSE182 L14 Mass Spec Quantitation MS applications Microarray analysis.
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
By Arun Bhandari Course: HPC Date: 01/28/12. GPU (Graphics Processing Unit) High performance many core processors Only used to accelerate certain parts.
INF380 - Proteomics-61 INF380 – Proteomics Chapter 6 – Mass Spectrometry – MALDI TOF The MALDI-TOF instruments are the simplest MS instruments suitable.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
Programming Concepts in GPU Computing Dušan Gajić, University of Niš Programming Concepts in GPU Computing Dušan B. Gajić CIITLab, Dept. of Computer Science.
Accelerating a Software Radio Astronomy Correlator By Andrew Woods Supervisor: Prof. Inggs & Dr Langman.
GPU Architecture and Programming
Mass spectroscopy – learning objectives Outline the early developments in mass spectrometry. Outline the use of mass spectrometry in the determination.
Quantification of Membrane and Membrane- Bound Proteins in Normal and Malignant Breast Cancer Cells Isolated from the Same Patient with Primary Breast.
Temple University MASS SPECTROMETRY FURTHER INVESTIGATIONS Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Experiences Accelerating MATLAB Systems Biology Applications Heart Wall Tracking Lukasz Szafaryn, Kevin Skadron University of Virginia.
Temple University MASS SPECTROMETRY INTRODUCTION Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
JPEG-GPU: A GPGPU IMPLEMENTATION OF JPEG CORE CODING SYSTEMS Ang Li University of Wisconsin-Madison.
Peak Detection with Chemical Noise Removal Using Short-Time FFT for a Kind of MALDI Data Xiaobo Zhou HCNR-CBI, Harvard Medical School and Brigham & Women’s.
High throughput Protein Measurement Techniques Harin Kanani.
自動化蛋白質定量系統 Automatic Protein Quantitation System 生物資訊實驗室 計畫主持人 許聞廉 特聘研究員 宋定懿 研究員 Relative quantitative proteomics Labeling Label-free MS MS/MS 14 N / 15.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
Innovative Paths to Better Medicines Design Considerations in Molecular Biomarker Discovery Studies Doris Damian and Robert McBurney June 6, 2007.
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
1 CH908 Structural Analysis by Mass Spectrometry revision lecture. Prof. Peter O’Connor.
Isotope Labeled Internal Standards in Skyline
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
© David Kirk/NVIDIA and Wen-mei W. Hwu, ECE408/CS483, University of Illinois, Urbana-Champaign 1 Graphic Processing Processors (GPUs) Parallel.
Metabolomics MS and Data Analysis PCB 5530 Tom Niehaus Fall 2015.
Introduction to high-throughput analysis of proteins and metabolites by Mass Spectrometry The basic principle Brief introduction of techniques Computational.
3/12/2013Computer Engg, IIT(BHU)1 CUDA-3. GPGPU ● General Purpose computation using GPU in applications other than 3D graphics – GPU accelerates critical.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
RANIA MOHAMED EL-SHARKAWY Lecturer of clinical chemistry Medical Research Institute, Alexandria University MEDICAL RESEARCH INSTITUTE– ALEXANDRIA UNIVERSITY.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
VGOS GPU Based Software Correlator Design Igor Surkis, Voytsekh Ken, Vladimir Mishin, Nadezhda Mishina, Yana Kurdubova, Violet Shantyr, Vladimir Zimovsky.
Metabolomics Part 2 Mass Spectrometry
Bottom-Up Proteomics Data collection
MATLAB Distributed, and Other Toolboxes
Fig. 1. proFIA approach for peak detection and quantification
Parallel Objects: Virtualization & In-Process Components
Metabolomics Part 2 Mass Spectrometry
Proteomics Informatics David Fenyő
Filtering Part 2: Image Sampling
Softberry Mass Spectra (SMS) processing tools
Schematic of MS1 filtering.
Skyline MS1 filtering graphical user interface.
Feature extraction and alignment for LC/MS data
Example of MS/MS spectrum of peptide FPTLTGFNR (hypothetical protein with signal peptide EAK88888; N77) from a protein digestion mixture prepared by labeling.
Processing of fragment ion information in DTA files to remove isotope ions and noise. Processing of fragment ion information in DTA files to remove isotope.
The Coming Age of Complete, Accurate, and Ubiquitous Proteomes
Proteomics Informatics David Fenyő
Presentation transcript:

1 InCoB 2009, Singapore Ren é Hussong et al. Highly accelerated feature detection in mass spectrometry data using modern graphics processing units Bioinformatics 25 (2009). Junior Research Group for Protein-Protein-Interactions and Computational Proteomics Saarland University, Saarbruecken, Germany

2 Outline ∙ Introduction & Motivation - The Differential Proteomics Pipeline ∙ Computational Proteomics - Signal Processing and Feature Detection - The Isotope Wavelet Transform ∙ Parallelization via GPUs ∙ Results & Discussion

3 The Differential Proteomics Pipeline Two probes: e.g. sick vs. healthy Mass Spectrometer List of differentially expressed proteins Applications range from basic pharmaceutical research over medical diagnostics and therapy to biotechnology and engineering.

4 Principle of Biological Mass Spectrometry digest intensity mass Fingerprint ProteinsPeptides Peptides are ionized and accelerated

5 Principle of Biological Mass Spectrometry digest intensity mass Fingerprint mass of a single neutron

6 Principle of Biological Mass Spectrometry digest intensity mass Fingerprint mass of a single neutron

7 (Simple) Feature Finding Typically done by simple thresholding : Needs additional preprocessing steps, like e.g.: - Baseline elimination (e.g. by morphological filters) - Noise reduction and/or smoothing  (Mostly) needs resampling Needs additional postprocessing steps, like e.g.: - Peak clustering (so-called “deconvolution”) - Model fitting, charge prediction

8 The Isotope Wavelet Transform Convolution with a kernel function - by construction robust against noise and baseline artifacts - also acts as a filter for chemical noise - predicts simultaneously the charge state - needs no explicit resampling - only a single parameter (threshold)

9 Results – Myoglobin PMF

10 Parallelization via CUDA

11 Parallelization via CUDA

12 Parallelization via CUDA b-th data point

13 Parallelization via CUDA b-th data point

14 Parallelization via CUDA b-th data point

15 Parallelization via CUDA b-th data point

16 Parallelization via CUDA T0 b-th data point Tn

17 Parallelization via CUDA and TBB 2x NVIDIA Tesla C870 via Intel Threading Building Blocks 1x NVIDIA Tesla C870 1x CPU 2.3 GHz >200x speedup

18 Open Issues – Future Work ∙ Solutions for machine-specific ‘ artifacts ’, e.g. - Tailing effects in TOF-Analyzers - Severe mass discretization in high resolution data ∙Separating overlapping patterns ∙Tests for MS n spectra - Refined averagine model GPU solutions

19 Availability: OpenMS ∙An open source C++ library for mass spectrometry ∙Designed for “users” as well as for “developers” ∙ TOPP - “The OpenMS proteomics pipeline” - suite of independent software tools - include file handling / conversion - peak picking and feature detection - includes visualizer TOPPView …

20 References Hussong, R, Gregorius, B, Tholey, A, and Hildebrandt, A (2009). Highly accelerated feature detection in proteomics data sets using modern graphics processing units. Bioinformatics 25. Schulz-Trieglaff, O, Hussong, R, Gr ö pl, C, Leinenbach, A, Hildebrandt, A, Huber, C, and Reinert, K (2008). Computational Quantification of Peptides from LC-MS Data. Journal of Computational Biology 15 (7). Sturm, M, Bertsch, A, Gr ö pl, C, Hildebrandt, A, Hussong, R, Lange, E, Pfeifer, N, Schulz- Trieglaff, O, Zerck, A, Reinert, K, and Kohlbacher, O (2008). OpenMS - An open-source software framework for mass spectrometry, BMC Bioinformatics 9 (163). Hussong, R, Tholey, A, and Hildebrandt, A (2007). Efficient Analysis of Mass Spectrometry Data Using the Isotope Wavelet In: COMPLIFE 2007: The Third International Symposium on Computational Life Science. American Institute of Physics (AIP) 940. Schulz-Trieglaff, O, Hussong, R, Gr ö pl, C, Hildebrandt, A, and Reinert, K (2007). A Fast and Accurate Algorithm for the Quantification of Peptides from Mass Spectrometry Data, In: Proceedings of the Eleventh Annual International Conference on Research in Computational Molecular Biology (RECOMB). Lecture Notes in Bioinformatics (LNBI) 4453.

21 The Isotope Wavelet Transform Kernel function charge state 1, mass 1000D Kernel function charge state 1, mass 2000D - by construction robust against noise and baseline artifacts - also acts as a filter for chemical noise - predicts simultaneously the charge state - needs no explicit resampling - only a single parameter (threshold) Convolution with a kernel function

22 The Isotope Wavelet Transform MS spectrum (charge state 3) charge-1-transform charge-2-transform charge-3-transform

23 The Sweep Line Idea m/z [Th] RT [s] 2 additional parameters: RT_cutoff RT_interleave 2 additional parameters: RT_cutoff RT_interleave

24 digest intensity mass/charge Fingerprint charge state 1 Open Issues – Future Work Fragment Fingerprint

25 Open Issues – Future Work ∙Separating overlapping patterns

26 The Retention Time

27 Results – 2D noisy data

28 The Adaptive Isotope Wavelet Kernel -  denotes the Heaviside step function - λ (m) is a linear function fit to the averagine model