Mass Spectrometry I Basic Data Processing
Mass spectrometry A mass spectrometer measures molecular masses. The mass unit is called dalton, which is 1/12 of the mass of a carbon atom, and is about the mass of one hydrogen atom. If there is a mixture of different molecules in a sample, all the masses are measured simultaneously. So you get a spectrum.
Some Pictures MALDI-RQ-Tof Micro FT-ICR LTQ-Orbitrap
Each peak corresponds to a different type of molecule in your sample... … … peak list
Three Components of an MS A typical mass spectrometer contains – Ionizer – Mass analyzer – Detector Ion source charges the to-be-measured molecules. – Charge can be negative but often positive. – Two common types: MALDI and ESI. – John B. Fenn & Koichi Tanaka 2002 Nobel Prize in Chemistry for Electrospray and MALDI Mass analyzer separates ions according to the mass to charge ratio (m/z) of the ions. – Iontrap, TOF, Quadrupole, FTICR. Detector detects the ions.
Matrix Assisted Laser Desorption/Ionization Formation of singly charged ions Sample is co-crystallized with matrix (solid) Koichi Tanaka, Nobel Prize 2002 Ionization (1): MALDI Other ionization method exists.
Mass Analyzer (1) – TOF Time of Flight Detector Time of flight is proportional to sqrt(m/z) Other mass analyzer exists.
Drift region (D) MALDI Time-of-flight Putting Them Together MALDI TOF Average time in TOF: sec : average speed 1-2 x 10 5 km/h
MALDI-TOF Linear Mass range = ,000 Sensitivity and accuracy decrease rapidly with size !
MALDI-TOF Linear vs Reflectron Mode Reflectron gives much better resolution for mass < 6,000 Linear = poor resolution due to velocity variation of ions with the same m/z Reflectron = Contact lens for a near sighted machine!
Protein “identification” with intact mass We measure the intact mass of the protein. Then search in the protein database to find a protein with the same mass. Good idea but there are too many proteins with the same mass. In the rest of the lecture we study more sophisticated methods and why protein ID is important.
Complications isotopes widened peaks profile
Centroiding
Another example with lower resolution
Isotopes
Chemical Composition of Living Matter 27 of 92 natural elements are essential. Elements in biomolecules (organic matter): H, C, N, O, P, S These elements represent approximately 92% of dry weight. Organic Matter Organized in "building blocks" amino acids polypeptides ( proteins) monosaccharidesstarch, glycogen nucleic acidsDNA, RNA Back to Basics…
element nominal exact Percent average mass mass abundance mass C % % H % % O % % N % % S % % % excercise Mass (Weights) of Atoms and Molecules
Ethyl acetate C 4 H 8 O 2 4 C 12 4 x H 1 8 x O 16 2 x Nominal Mass: = 88 Monoisotopic Mass: Average Mass: = Mass or Molecular Weight of molecules
Amino Acids There are 20 amino acids. All have the same basic structure but with different side chains: Examples: side chain group H Glycine, or Gly, or G Arginine, or Arg, or R
All the 20 Structures * Picture copied from Dr. R.J. Huskey’s website: u/~rjh9u/aminacid.html
Peptides and Proteins H Glycine, or Gly, or G Arginine, or Arg, or R GR peptide bonds N-terminal C-terminal
Exact Mass of Amino Acid Residues in Proteins Note: Leu (L) = Ile (I) = Mass of Amino Acids Residues
Amino Acid Table AA CodesMono. IONSOURCE.COMIONSOURCE.COM AA Codes Mono. GlyG AspD AlaA GlnQ SerS LysK ProP GluE ValV MetM ThrT HisH CysC PheF LeuL ArgR IleI CMC AsnN TyrY TrpW
Cysteine Proteins are often treated so that cysteine becomes carboxyamidomethyl cysteine (CamC) or Carboxymethyl (CmC) in order to break the disulphide bonds. CamC =
tripeptide (MW )= More precisely: monoisotopic mass average mass Ala-Ser-Phe (ASF) Mass of Peptides and Proteins
In a mass spectrum Deconvolution adds all the isotopic peaks to the monoisotopic peak. So, the later process does not need to worry about the isotopes. Monoisotope peak isotope peaks
Check the difference
ESI and Multiply Charged Ions
Electrospray
Electrospray Ionization: Formation of Charged Droplets Formation of multiply charged ions Ionization (2) – ESI
Multiply Charged Ions The same molecules may be charged differently, and therefore form a few peaks in the spectrum m/z(M+2)/2(M+3)/3 For protein/peptide with positive charges, the charge is obtained from adding protons (which has mass approx. 1 dalton. As a result, a molecule with mass M will have peaks at (M+Z)/Z (M+1)/1
How to determine charge states? Isotope ions when resolution is enough. Check different charge states when resolution is not enough.
Exercise
Exercise
(A) “Multi-charge envelope” ( B) After “Charge-deconvolution algorithm”
Baseline
Baseline correction
Convex Hull Method convex not convex
Convex Hull A convex hull is such that all the data points are above the lines and their extensions.
How to calculate convex hull? Stack S contains all the data points that form the convex hull so far. Data point D[i] = (D[i].x, D[i].y). Algorithm: 1. S.push( D[0] ); s.push(D[1]) 2. for i from 2 to n 2.1 while D[i], S.top(), S.secondtop() are concave S.pop(); 2.2 S.push(D[i]); 3. return S S.top() S.secondtop() D[i]
Analyze the convex hull algorithm Correctness – The algorithm finishes. – The output is a convex hull. – The proof will be included in an assignment. Time complexity – O(n) time. – Proof: each point is checked only once, and added to (and therefore removed from) the stack at most once.
Summarize of spectrum preprocessing Baseline correction Centroiding Charge recognition and deconvolution Noise removal