PeakErazor Pre-processing of mass lists prior to peptide mass searching Karin Hjernø, Peter Roepstorff and Peter Højrup Department of Biochemistry and.

Slides:



Advertisements
Similar presentations
Protein Quantitation II: Multiple Reaction Monitoring
Advertisements

Mass spectroscopy. In a typical MS procedure:  1- a sample is loaded onto the MS instrument, and undergoes vaporization.  2- the components of the sample.
1336 SW Bertha Blvd, Portland OR 97219
Proposal for a Standard Representation of the Results of GC-MS Analysis: A Module for ArMet Helen Fuell 1, Manfred Beckmann 2, John Draper 2, Oliver Fiehn.
PROTEIN IDENTIFICATION BY MASS SPECTROMETRY. OBJECTIVES To become familiar with matrix assisted laser desorption ionization-time of flight mass spectrometry.
Data Handling l Classification of Errors v Systematic v Random.
The Calibration Process
MiniSIMS Secondary Ion Mass Spectrometer Dr Clive Jones Millbrook Instruments Limited Blackburn Technology Centre, England
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Previous Lecture: Regression and Correlation
Scaffold Download free viewer:
Proteomics Informatics Workshop Part III: Protein Quantitation
CALIBRATION METHODS.
Calibration & Curve Fitting
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Objectives of Multiple Regression
Quality Assessment 2 Quality Control.
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
Introduction Recent research has proposed rapid and robust identification of intact microorganisms using matrix assisted laser desorption/ ionization time-of-flight.
Organic Mass Spectrometry
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
Simpson Rule For Integration.
W  eν The W->eν analysis is a phi uniformity calibration, and only yields relative calibration constants. This means that all of the α’s in a given eta.
Fundamentals of Data Analysis Lecture 10 Management of data sets and improving the precision of measurement pt. 2.
Mass Spectrometry I Basic Data Processing. Mass spectrometry A mass spectrometer measures molecular masses. The mass unit is called dalton, which is 1/12.
INF380 - Proteomics-61 INF380 – Proteomics Chapter 6 – Mass Spectrometry – MALDI TOF The MALDI-TOF instruments are the simplest MS instruments suitable.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
1 Atomic Absorption Spectroscopy Lecture Emission in Flames There can be significant amounts of emission produced in flames due to presence of flame.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Organic Mass Spectrometry
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
1 A method of successive elimination of spurious arguments for effective solution the search- based modelling tasks Oleksandr Samoilenko, Volodymyr Stepashko.
Laxman Yetukuri T : Modeling of Proteomics Data
Uncertainties Using & Calculating Uncertainties for Electrical Measurement.
MASS SPECTROMETRY. CONTENTS Prior knowledge Background information The basic parts of a mass spectrometer The four stages of obtaining a spectrum How.
Novel Algorithms for the Quantification Confidence in Quantitative Proteomics with Stable Isotope Labeling* Novel Algorithms for the Quantification Confidence.
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
PeptideProphet Explained Brian C. Searle Proteome Software Inc SW Bertha Blvd, Portland OR (503) An explanation.
MS Calibration for Protein Profiles We need calibration for –Accurate mass value Mass error: (Measured Mass – Theoretical Mass) X 10 6 ppm Theoretical.
Quality Assurance How do you know your results are correct? How confident are you?
Chemistry Topic: Atomic theory Subtopic : Mass Spectrometer.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
CALIBRATION METHODS. For many analytical techniques, we need to evaluate the response of the unknown sample against the responses of a set of standards.
Data Analysis: Quantitative Statements about Instrument and Method Performance.
Advanced Analytical Chemistry – CHM 6157® Y. CAIFlorida International University Updated on 9/26/2006Chapter 3ICPMS Interference equations Isobaric.
F LORIDA I NTERNATIONAL U NIVERSITY Advanced Mass Spectrometry Piero R. Gardinali/Yong Cai/ Bruce McCord Revised on August 23, 2009.
INF380 - Proteomics-71 INF380 – Proteomics Chap 7 –Protein Identification and Characterization by MS Protein identification in our context means that we.
Innovative Paths to Better Medicines Design Considerations in Molecular Biomarker Discovery Studies Doris Damian and Robert McBurney June 6, 2007.
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
10-1 MGMG 522 : Session #10 Simultaneous Equations (Ch. 14 & the Appendix 14.6)
Lecture 6 Comparative analysis Oct 2011 SDMBT.
Deducing protein composition from complex protein preparations by MALDI without peptide separation.. TP #419 Kenneth C. Parker SimulTof Corporation, Sudbury,
Analysis of IMS Data By Ed Sittler CAPS Team Meeting #28 Marseille, France July 26-28, 2004.
2014 생화학 실험 (1) 6주차 실험조교 : 류 지 연 Yonsei Proteome Research Center 산학협동관 421호
Quantitation using Pseudo-Isobaric Tags (QuPIT) and Quantitation using Pseudo-isobaric Amino acids in Cell culture (QuPAC) Parimal Samir Andrew J. Link.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
이 장 우. 1. Introduction  HPLC-MS/MS methodology achieved its preferred status -Highly selective and effectively eliminated interference -Without.
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
The Calibration Process
Strategies for Eliminating Interferences in Optical Emission Spectroscopy Best practices to optimize your method and correct for interferences to produce.
Time-of-Flight Mass Analyzers
Bioinformatics Solutions Inc.
Chapter 5 Quality Assurance and Calibration Methods
NanoBPM Status and Multibunch Mark Slater, Cambridge University
Metabolomics: Preanalytical Variables
Softberry Mass Spectra (SMS) processing tools
Quality Assessment The goal of laboratory analysis is to provide the accurate, reliable and timeliness result Quality assurance The overall program that.
A handbook on validation methodology. Metrics.
Operation manual of AI SIDA
Presentation transcript:

PeakErazor Pre-processing of mass lists prior to peptide mass searching Karin Hjernø, Peter Roepstorff and Peter Højrup Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, Denmark. Problem Very advanced peptide mass search programs are available today, making identification of proteins by peptide mass search (PMS) using tryptic MALDI-TOF MS data more and more trustworthy. However, the procedure still faces a number of practical problems: Usually two-point calibration (either external or internal) of the mass spectra are used and in most cases the obtained accuracy is sufficient for identification of the protein in question. to be excluded from the peptide mass list before using the list as input in a protein identification In addition, identification of the project specific contaminants and determination of the exact mass of unknown contaminants can be tricky. Filtering and Precision check. Elimination of known contaminants, e.g tryptic autolytic peptides and keratin derived peptides, to get a higher precision in the following analysis. Using the built-in graph and ppm-list, you can check the presicion of your data and use this in combination with your final results to verify an identification or to determine the actual precision of your data. Calibration. Multi-point calibration on all recognized peaks (trypsin, keratin and other contaminants). This can in favorable cases increase your precision by a factor of two. Contamination detection and evaluation. By using the built-in database you can detect system specific contaminant. These can then easily be added to the contamination list and the exact mass values of unknown contaminants can be fine-tuned. Background subtraction. From a limited number of spectra you can extract background peaks (e.g. experiment specific contaminants) for more precise analysis. FUNCTIONS PERFORMED BY PEAK-ERAZOR Solution To address these problems we have created a small, user-friendly Windows-based program, PeakErazor, using a simple concept: By comparing all mass values from a tryptic digest against a list of known contaminants, contaminating peaks can easily be removed. At the same time you can perform a multipoint calibration leading to greater precision and the program helps you detect new contaminants as you go on. These features leaves you with a much larger precision in the actual PMS search. Four of the features of the program are illustrated here. Below is a small discussion of the practical problems of calibration and the future improvements of the program. PeakErazor is freely available at CONTAMINATION DETECTION AND EVALUATION m/z Intensity FILTERING AND PRECISION CHECK One of the best ways to beat your enemies is to know them! The Peak-Erazor program contains an Erazor-list containing already known contaminants (= the enemies!). Therefore it can immediately identify peaks that are non-specific for the protein of interest. FILTERING AND PRECISION CHECK When contaminating peaks are removed from your mass spectrum, you will have larger score in the actual PMS search, as the contaminations will not give rise to spurious ‘hits’, and the statistics on the correct ‘hit’ will be better reducing the risk of false positive hits.. However, in many spectra the peaks used for calibration can be missing, of poor quality or the isotope envelope may overlap with other peaks. Another problem is peptide contamination. Most people working with proteins separated by 2D-gel come across peaks that appear in many spectra and does not seem to be specific for the protein of interest. Including these peaks in the peptide mass search will obviously make the search less precise and consequently they have Contamination peak automatically erazed from the mass list Graph illustrating the mass precision of the contaminants identified BLUE: Accepted peaks in all analysed spectra from a specific project RED: Rejected peaks in all analysed spectra from the same project

The list of contaminants can be evaluated using the information saved by the program itself. Above is an example showing the number of accepted (blue) and rejected (red) masses in a total of 843 spectra. Here otherwise accepted peaks can be recognized as contaminants and hence included in the erazor-list (blue). Another feature of the evaluation part of the program is to make the precision of the unknown contamination masses more and more accurate as the project is going on. This is done by integrating a small area around the mass of interest and then read out the most abundant value (red). Calibration In a linear time-of-flight (TOF) instrument the mass-to-charge (m/z) value is proportional to the square of the ion flight time (t 2 ). However, linear TOF instruments are not sufficient to get a high enough mass accuracy when working with peptides. Therefore, time-delayed ion extraction and reflectrons are built into the instrument to give a very high mass resolution, a low signal-to-noise ratio and high accuracy on the mass determination. Unfortunately, these improvements disturb the simple correlation between m/z and t (1). Instead the correlation is depending on different parameters like initial ion velocity, specific conditions and instruments used. In order to take these effects into consideration, one could do multipoint calibration with calibration points distributed uniformly over the whole mass range. This can be done using polymers as an external calibration (2). However, a slight difference in the ion flight time is found depending on the location on the target, which means that external calibration is less accurate than internal calibration. Mixing the sample with the polymer is not the way to solve this problem due to ions suppression effects, which will result in polymer suppressing signals from the peptides and vica versa. Instead internal correction of the sample position-dependent errors can be used (2). Other groups have developed different more or less complicated calibration algorithms (e.g. using polynomial fitting procedure or quadratic correction methods (all resulting in more accurate mass values (3,4). At the moment, a simple linear fit is used for multipoint calibration in PeakErazor. In most cases this enhance the overall precision of the mass values by a factor of 1.5-2x. In the future the goal is to incorporate a higher-order calibration function based on the known peaks already found in the spectrum (trypsin, keratin etc.) or by feeding the program with knowledge from previous protein identifications and in this way build up a useful higher-order function which can be used for a more accurate calibration than the two-point calibration used in most laboratories today. Where to find PeakErazor! The program will be avaliable for free at : Please contact for further information References: 1. Hack, C.A and Benner, W.H., Rapid Commun. Mass Spectrom. 2002;16; Gobom, J., Mueller, M., Egelhofer, V., Theiss, D., Lehrach, H. and Nordhoff, E., Anal. Chem. 2002;74; Christian, N.P., Arnold, R.J. and Reilly, J.P., Anal. Chem. 2000;72; Whittal, R.M., Schriemer, D.C. And Li, L., Anal. Chem. 1997;69;2734 CALIBRATION All calibrations, whether two-point or multipoint, can be done IN the program using any contaminant peak. The calibration used in the program is a simple linear regression algorithm.. In spectra analysis programs recalibration are possible (e.g. useful before running a second path search) but in some programs any manual peak peaking performed are erased when recalibrating the spectra. This is avoided using Peak-Erazor. Before multipoint calibration After multipoint calibration BACKGROUND SUBSTRACTION When running a large amount of spectra one often stumble upon peaks that are common for many of the spectra and do not seem to be specific for the proteins of interest. These peaks can easily be identified and subtracted in the project using the “Background” part of the Peak- Erazor Program. These masses are added to the erazor- list and given the name “Blank”. 15 masses found in at least four out of 6 mass lists within a precision of 100 ppm In this example the background is subtracted as ”blank”