Mass Spectrometry in Life Science: Technology and Data-Evaluation H. Thiele Bruker Daltonik, Germany.

Slides:



Advertisements
Similar presentations
Improvements in Mass Spectrometry for Life Science Research – Does Agilent Have the Answer? Ashley Sage PhD.
Advertisements

Introduction to MALDI-TOF MS
MALDI-TOF Mass Spectrometry and Introduction to Proteomics Dr. Steve Hartson Oklahoma State University Dept. Biochemistry and Molecular Biology Recombinant.
Lecture Three: "MALDI-TOF MS, Up Close, and Personal Review of principles Delayed Extraction Reflector / reflectron Post-source Decay (PSD)
Primary Ionization Techniques for Molecules
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
Peptide Mass Fingerprinting
Mass Analyzers Double Focusing Magnetic Sector Quadrupole Mass Filter Quadrupole Ion Trap Linear Time-of-Flight (TOF) Reflectron TOF Fourier Transform.
Proteomics The proteome is larger than the genome due to alternative splicing and protein modification. As we have said before we need to know All protein-protein.
PROTEIN IDENTIFICATION BY MASS SPECTROMETRY. OBJECTIVES To become familiar with matrix assisted laser desorption ionization-time of flight mass spectrometry.
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
Scaffold Download free viewer:
PeakErazor Pre-processing of mass lists prior to peptide mass searching Karin Hjernø, Peter Roepstorff and Peter Højrup Department of Biochemistry and.
My contact details and information about submitting samples for MS
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
MALDI-TOF-TOF with High Resolution Precursor Selection and Multiplexed MS-MS Poster Number ThP 618 Kevin Hayden, Stephen C. Gabeler, Mark Dahl and Marvin.
Evaluated Reference MS/MS Spectra Libraries Current and Future NIST Programs.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
The ProteomeWorks System is the global alliance of Micromass and Bio-Rad Laboratories dedicated to furthering proteomics research.
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
Molecular mass spectrometry Chapter 20 The study of “molecular ions” M + e -  M e -
Mass Spectrometry I Basic Data Processing. Mass spectrometry A mass spectrometer measures molecular masses. The mass unit is called dalton, which is 1/12.
INF380 - Proteomics-61 INF380 – Proteomics Chapter 6 – Mass Spectrometry – MALDI TOF The MALDI-TOF instruments are the simplest MS instruments suitable.
Common parameters At the beginning one need to set up the parameters.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
Laxman Yetukuri T : Modeling of Proteomics Data
Marvin Vestal and Kevin Hayden SimulTOF Systems
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
MS Calibration for Protein Profiles We need calibration for –Accurate mass value Mass error: (Measured Mass – Theoretical Mass) X 10 6 ppm Theoretical.
MS Intro. MS requires gas-phase ions, why? MS uses magnetic and electric fields to control the path of a compound based on mass to charge ratio (m/z)
Temple University MASS SPECTROMETRY FURTHER INVESTIGATIONS Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
INF380 - Proteomics-51 INF380 – Proteomics Chapter 5 – Fundamentals of Mass Spectrometry Mass spectrometry (MS) is used for measuring the mass-to-charge.
Chemistry 367L/392N Macromolecular Chemistry Lecture 8.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
INF380 - Proteomics-71 INF380 – Proteomics Chap 7 –Protein Identification and Characterization by MS Protein identification in our context means that we.
Separates charged atoms or molecules according to their mass-to-charge ratio Mass Spectrometry Frequently.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Deducing protein composition from complex protein preparations by MALDI without peptide separation.. TP #419 Kenneth C. Parker SimulTof Corporation, Sudbury,
Chem. 133 – 4/26 Lecture. Announcements Return graded quiz and additional problem Lab – Lab report deadlines (2:4 – Thursday) Today’s Lecture – Mass Spectrometry.
2014 생화학 실험 (1) 6주차 실험조교 : 류 지 연 Yonsei Proteome Research Center 산학협동관 421호
Constructing high resolution consensus spectra for a peptide library
Chapter 29 Mass Spectrometry. 29 A Principles of mass spectrometry In the mass spectrometer, analyte molecules are converted to ions by applying energy.
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
RANIA MOHAMED EL-SHARKAWY Lecturer of clinical chemistry Medical Research Institute, Alexandria University MEDICAL RESEARCH INSTITUTE– ALEXANDRIA UNIVERSITY.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
An introduction to Mass spectrometry. What is mass spectrometry? Analytical tool measuring molecular weight (MW) of sample Only picomolar concentrations.
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
Metabolomics Part 2 Mass Spectrometry
Peptide Mass Finger-Printing Part II. MALDI-TOF
Mass Spectrometry 101 (continued) Hackert - CH 370 / 387D
‘Protein sequencing’: Determining protein sequences
Acknowledgements Slides and animations were made by Dr. Jon Karty Mass Spectrometry Facility Indiana University, Bloomington.
LC-MS/MS Identification of Impurities Present in Synthetic Peptide Drugs Dr Anna Meljon*, Dr Alan Thompson, Dr Osama Chahrour, and Dr John Malone Almac.
The Syllabus. The Syllabus Safety First !!! Students will not be allowed into the lab without proper attire. Proper attire is designed for your protection.
Mass Spectrometry 101 Hackert - CH 370 / 387D
Tandem MS.
Starter Complete the quiz and hand in..
Metabolomics Part 2 Mass Spectrometry
Proteomics Informatics David Fenyő
Proteomics Informatics –
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
Processing of fragment ion information in DTA files to remove isotope ions and noise. Processing of fragment ion information in DTA files to remove isotope.
Investice do rozvoje vzdělávání
Proteomics Informatics David Fenyő
Presentation transcript:

Mass Spectrometry in Life Science: Technology and Data-Evaluation H. Thiele Bruker Daltonik, Germany

MALDI-TOF Mass Spectrometry Proteomics Functional Genomics Proteome Analysis Investigation of protein diversity Genomics SNP Genotyping Search for genetic variations MALDI-TOF MS Identification No a priori knowledge about analyte Screening Analyte of known MW Bridging Proteomics & Genomics

Mass Spectrometer for Biopolymer Research The Technology

Laser Sample plate Analyte molecules in matrix Acceleration grids Drift region Ion detector Mass spectrum Vacuum system Vacuum lock 20 to 200 spectra have to be added; total duration 2 to 20 seconds with 50 (200) Hertz Laser Principle of MALDI-TOF-MS Linear flight tube Flight time m/z all ions with E kin = 1/2mv 2 space/energy uncertainty

HiRes mass spectrum Ion detector MALDI ion source Ion reflector The reflector focuses ions of same mass but different E kin (velocity) on detector; high resolution is obtained Laser High resolution TOF-MS with Reflector Flight time m/z 0V+ kV

MS/MS by PSD MS/MS = fragment ion or tandem mass spectromentry PSD = Post Source Decay

Electr. potential ion energy Metastable decay of molecular ions, energy is reduced according to mass ratio SourceReflector PSD by Reflectron TOF (Scheme) Adjustment of voltages Segment 1 Segment 2 Segment 3 Segment 4 E = 1/2 mv 2 v=const. eg. if M + = 1000, m = 500 has 4 keV m = 100 has 0.8 keV m =25 has 100 eV

Daughter ion mass spectrum Ion detector Parent ion selector MALDI ion source Laser The daughter ion spectrum can only be measured in segments which have to be pasted together segments are necessary. Strong fieldWeaker field Weak field Adjustment of voltages Ion reflector Manual operation: 20 – 40 minutes; automatic operation: 5 – 10 minutes per daughter ion spectrum (100 acquisitions in each segment) TOF-MS/MS by PSD

In proteomics, many proteins have to be separated and analysed fast to avoid degradation Regarding structure information, MALDI MS/MS appears to be optimal, but PSD is much too slow ! Consequence: Development of a fast MALDI MS/MS instrument !

MALDI TOF/TOF with post-acceleration by potential LIFT

Electr. potential ion energy Decaying ions, energy reduced, low speed SourceReflector LIFT All fragment ions can be analyzed simultaneously, no segmenting necessary 1. TOF 2. TOF Even low mass ions have high energy, good for detection Potential is switched when ions are in LIFT TOF/TOF with LIFT (Scheme)

Daughter ion mass spectrum Ion detector Ion reflector Parent ion selector MALDI ion source Potential LIFT for post acceleration MS/MS spectrum of daughter ions is measured in a single acquisition; no pasting of segments; low sample consumption, high speed, high sensitivity 1 to 200 spectra needed; 1 to 10 seconds only with 20 Hertz laser Laser Parent ion supressor TOF -MS/MS with post-acceleration by LIFT LID Collision Cell (CID)

Identification of Proteins (sequence of amino acids) and Protein modifications Data Evaluation Goal : – – Fragmentation of proteins / peptides resulting in PMF / PFF spectra – – Detection (annotation) of the masses of the fragments – – Identification by database searches Method :

- Detection of peaks with low signal/noise ratio - Identification (mass, area, intensity) of (overlapping) isotopic patterns - Score the results - Detection of multiple charges (TOF spectra z = 1,2) average mass monoisotopic mass Problems to be solved by Bioinformatics Detection of protonated molecular ion [M+H]+ Isotopic resolution nominal mass

12 C 93 1 H N O S + : monoisotopic Isotopic pattern of peptides 12 C 93 1 H H 14 N O S + : 1.4%, m= C C 1 H N O S + : 88.9%, m= C 93 1 H N N 16 O S + : 8.1%, m= C 93 1 H N O S + : 0.7%, m= C 93 1 H N O O 1 32 S + : 0.9%, m=

Deisotoping: Assigning monoisotopic masses SNAP approach: Peak selection - -Damping of chemical noise using FFT filtering - -Baseline correction - -noise calculation - -peak search Iterative search for isotopic patterns – –Analysing the largest peaks first – –Alignment of patterns using peak list heuristic and FFT deconvolution – –Nonlinear fit using asymmetric line shape – –Subtraction of analysed patterns Reevaluation – –Fit of intensities of overlapping patterns, optional addition of ICAT masses – –Calculation of Quality Factor

SNAP : Regularized FFT Deconvolution Uncertainty of mean peptide isotopic distribution

SNAP : Nonlinear Fit -  2 Exponentially modified gaussians for asymmetric line shapes: Local optima for least square fit:

SNAP : Quality Factor Idea: Get a value for the quality of a pattern which can be used in favor of S/N or intensity for selecting the “best” peaks Fuzzy Scoring Basic Scoring 22 Area/Width Mean deviation ,  for all patterns Kind of Spectrum/ Instrument Quality factor

From overlapping peak groups SNAP : Use Case To monoisotopic masses

Wavelet Methods for Denoising Proteomics Spectra Denoising by Hard Thresholding Scale - adaptive Thresholds Preservation of Position, Shape and Amplitude of major Peaks Wavelet Transform Hard Thresholding Inverse Wavelet Transform

Denoising by Hard Thresholding Further Developments   Baseline Correction " " Deconvolution of Isotopic Patterns " " Scale-Energy Parameters for enhanced Clustering

Charge Deconvolution : Without Isotopic Resolution m/z Different m/z peaks of Equine Apomyoglobin Protein MW is calculated from m/z differences between adjacant peaks by deconvolution software (result see inlet). Protein Z = Peptide Z = 1,2,3,4 Small molecules Z = 1 Charge states for ESI Related Ion Deconvolution Peak Picking m/z ; intensity Deconvolution envelope; distances Result Z + MW [M+zH] z+ /z M

Charge Deconvolution: Isotopic Resolution d (m/z) =0.25 u d (m/z) =0.2 u (M+4H) 4+ (M+5H) 5+ For isotopically resolved patterns the charge state and the mass can be determined from a single pattern.

Calibration Get more accurate data Problems to be solved by Bioinformatics

Automatic „Smart“ Calibration Automatic Control based on external and internal data Resulting Accuracy <10 ppm High Precision Correction improves stability & accuracy Contaminants, self digestion Mass distribution of peptides External calibration spots Statistical ReferencesInternal CalibrantsExternal Calibration Automatic “Smart” Calibration Tof(m/z) = c 0 +c 1 (m/z) 1/2 +c 2 (m/z) + fixed high precision correction

Statistical Calibration for Proteomics Peaklist Statistical Reference Masses Assign Masses (dM < dErr) Calibrate dErr := Max(50, 0.5*dErr) dErr>=50 Stop Initial Error dErr<500 ppm Using modified Mann’s clustering Resulting Accuracy <20ppm NoYes

Details of the Calibration Routine: Internal Multipoint Calibration – an Example Matching with contaminants Exclusion limit 800ppm Final calibration calibration, reject in- accurate masses average error: 13.4ppm calibration, reject un- matched masses 1.Calibration round Exclusion limit 150ppm average error: 66.7 ppm 2.Calibration round Exclusion limit 40ppm calibration, reject in- accurate masses average error: 16.3ppm error [ppm] measured mass [Da]

Iterative Generation of internal calibrant list Calibration PMFSearch Generation of an improved calibrant list Start of PMF identification with a default calibrant list usually 2 repeats are sufficient The default calibrant list usually consists of three typical trypsin peptides Improved calibrant lists typically contain of masses – averagely of these can be found in a spectra

MS based Identity Search Search Engines Problems to be solved by Bioinformatics

MS Protein Identification is Probability based How closely is a given protein or peptide sequence matching to the measured masses ? There are several strategies for a matching “ score“ : For example: - -Probability based MOWSE score (Mascot) - -Bayesian probability (ProFound) - -Cross correlation (MS-Fit) Masses determined by MS are not unique Identification is probability based Problem of assigning true probabilities to a given identification

Part 2 Successively changing various search parameters to test their influence. Optimisation of search parameters. Part 1 Comparison of the performance of the search engines using a typical set of search parameters. Dataset: 168 MALDI PMF spectra About 10,000 searches have been performed to establish a statistical basis the data was acquired in the environment of a typical proteome project Evaluation of PMF and Search Engines

% of searches ProFound Z score 5% significance level ProFound % of searches Mascot score 5% significance level Mascot % of searches log (MS-Fit MOWSE Score) MS-Fit Comparision of PMF Search Engines – Score Distribution

random matches % of searches ProFound Z score 5% significance level range of uncertainty correct identifications ProFound - scoring distribution Idea: Integration of search results from different engines could improve significance and confidence! An effective ranking of results can be assessed by individual search score distributions Converting the Scoring Distribution to a MetaScore

- Effective sorting of reported results of several search engines - - More correct Proteins are on rank number one - - Elimination of false positives - - drawback: MetaScore does not reflect true probabilities Ranking of Search Results of different PMF algorithms by MetaScore

Automated validation of Search Results Search Engines Problems to be solved by Bioinformatics

m/z PMF Result judgement m/z MS/MS m/z List of precursor masses Fuzzy Engine MetaScoring MTP-Viewer Result visualization Identified ? No Auto MS/MS definition Search result driven Queries Yes From Automation to High Throughput

Fuzzy Engine for Protein Identifikation from PMF spectra FL Probability Score Score Ratio to unrelated Sequence Sequence Coverage Correlation Coefficient Peak Quality Factor Identified Undefined Uncertain (unique) Uncertain (multiple) Identified (multiple) Bad data

Automated MS/MS Precursor Ion Selection Automation & High Throughput Problems to be solved by Bioinformatics

Strategies for automated MS/MS acquisition

Acknowledgement Jens Decker, Michael Kuhn Bruker Daltonik Martin Blüggel, Daniel Chamrad Peter Maaß Kristian Bredies