Download presentation
Presentation is loading. Please wait.
Published byBarbara Bryan Modified over 8 years ago
1
Identify proteins
2
Proteomic workflow
3
Trypsin A typical sample We add a solution of 50 mM NH 4 HCO 3 (pH 7.8) containing trypsin (0.01-5 µg/µl). Volume depends on the dimension of the sample Incubation o/n at 37°C Surnatants are 0.22 µm filtered Depending on the volume, samples are concentrated / vacuum dried Peptide mixtures are then analysed by mass spectrometry
4
Trypsin
5
525.3144 HIQK 615.3283 LHSMK 689.3828 VNELSK 748.3698 TTMPLW 831.3843 EDVPSER 910.4741 EGIHAQQK 1267.7045 YLGYLEQLLR 1384.7300 FFVAPFPEVFGK 1580.8279 VPQLEIVPNSAEER 1694.0761 LLILTCLVAVALARPK 1759.9450 HQGLPQEVLNENLLR 1767.7589 DIGSESTEDQAMEDIK 2316.1369 EPMIGVNQELAYFYPELFR 2321.0813 QMEAESISSSEEIVPNSVEQK This set of masses consitutes a fingerprint of the protein. An MS analysis can allow identification of this protein. MH + Peptide sequence
6
Identification by search in protein sequences databases
7
http://www.matrixscience.com/search_form_select.html
10
525.14 831.34 1384.68 2321.08 MALDI-TOF spectrum of the trypsin digestion of the pictorial sample containing milk Signals corresponding to peptides of bovine α-S1 casein
11
Database search with MS data only MH + data
12
Database search results Several unassigned signals All the signals in the spectrum are inserted in the search box
13
Effect of multiple Peptide Masses on Protein Identification by Mass Fingerprinting Search m/z Mass Tolerance Da N° of Hits 1529.730.1204 1529.730.1 7 1252.700.1 1529.730.1 1252.700.1 1 1833.880.1 Moreover…… The number of peptides detected that belong to the same protein strongly influences the identification
14
Effect of Mass Accuracy and Mass Tolerance on Peptide Mass Fingerprinting Search Results Search m/z Mass Tolerance Da N° of Hits 15291478 1529.70.1164 1529.730.01 25 1529.7340.001 4 1529.73480.0001 2 Moreover……
15
Identification by MS data, (generally MALDI-TOF), suffers from: The complex mixtures of proteins of organic materials The uneven relative quantities of the different components
16
MS analysis What can give us more informations? The sequence of two or more peptides
17
Despite the quite similar name Casein α–S1 and Casein α–S2 are quite different proteins
18
Peptide sequencing by MS
19
MS/MS of Peptide Mixtures LC MS MS/MS
20
CHIP LC-MSMS Q-Tof
21
Single peptides are selected for fragmentation MS/MS Fragmentation spectra Tandem mass spectra Peptide mixture MS of the single peptides + + + What is MS/MS?
22
Interpretation of an MSMS spectrum to derive structural information is analogous to solving a puzzle Use the fragment ion masses as specific pieces of the puzzle to put the intact molecule back together. + + +
23
-HN--CH--CO--NH--CH--CO--NH- RiRi CH-R’ cici z n-i R” d i+1 v n-i w n-i low energy high energy Cleavages Observed in MS/MS of Peptides aiai x n-i bibi y n-i
24
Simple Fragmentation rules Ions of the “b” serieIons of the “y” serie
25
Fragmentation rules
26
Precursor ion Doubly charged m/z = 634.36 MH+ = 1267.72 C-term Arg
27
525.3144 HIQK 615.3283 LHSMK 689.3828 VNELSK 748.3698 TTMPLW (C-terminus of the protein) 831.3843 EDVPSER 910.4741 EGIHAQQK 1267.7045 YLGYLEQLLR 1384.7300 FFVAPFPEVFGK 1580.8279 VPQLEIVPNSAEER 1694.0761 LLILTCLVAVALARPK 1759.9450 HQGLPQEVLNENLLR 1767.7589 DIGSESTEDQAMEDIK 2316.1369 EPMIGVNQELAYFYPELFR 2321.0813 QMEAESISSSEEIVPNSVEQK Since the proteolytic enzyme is trypsin, all the peptides end either with Arg (R) or Lys (K). Y 1 ion will always be either 147 (K), or 175 (R) MH + Peptide sequence
28
m/z = 634.36 MH+ = 1267.72 C-term Arg 288 – 175 = 113 = Leu (L) RL 401 – 288 = 113 = Leu (L) L 175 288 401 529 – 401 = 128 = GLn (Q) Q 529 E 658 129=E 113=L 163=Y 113=L 57=G L 771 Y 934 G 991 L 1104 MH + - y 9 = 163 = Tyr (Y) Y Precursor ion
29
m/z = 634.36 MH+ = 1267.72 RLL 175 228 401 Q 529 E 658 129=E 113=L 57=G L 771 Y 934 G 991 L 1104 Y Precursor ion 277 334 163=Y 497 610 739 128=Q 867
30
m/z = 634.36 MH+ = 1267.72 RLLQELYGLY y1y1 y8y8 y7y7 y6y6 y5y5 y4y4 y3y3 y2y2 y9y9 b2b2 b3b3 b4b4 b6b6 b5b5 b7b7
31
MSMS Peptide Fragmentation signal b 1 y 1 b 2 y 2 b 3 y 3 b 4 y 4 b 5 y 5 Ala-Gly-His-Leu-….Phe-Glu-Cys-Tyr
32
Should we manually interpeter each fragmentation spectrum?
33
Peptide Sequencing This set of numbers are identificative of the fragmentation spectrum of this peptide
34
Signals in the fragmentation spectra can be predicted b seriey serie ---b1b1 Y y 10 --- 277.1547b2b2 L y9y9 1104.6412 334.1761b3b3 G y8y8 991.5571 497.2395b4b4 Y y7y7 934.5356 610.3235b5b5 L y6y6 771.4723 739.3661b6b6 E y5y5 658.3883 867.4247b7b7 Q y4y4 529.3457 980.5088b8b8 L y3y3 401.2871 1093.5928b9b9 L y2y2 288.2030 ---b 10 R y1y1 175.1190 b seriey serie ---b1b1 Y y 10 --- 277.1547b2b2 L y9y9 1177.6364 334.1761b3b3 G y8y8 1064.5524 497.2395b4b4 Y y7y7 1007.5309 683.3188b5b5 W y6y6 844.4676 812.3614b6b6 E y5y5 658.3883 940.4199b7b7 Q y4y4 529.3457 1053.5040b8b8 L y3y3 401.2871 1166.5881b9b9 L y2y2 288.2030 ---b 10 R y1y1 175.1190 A single change in the sequence changes the profile of expected signals
35
m/z 1410.6 Database IIGHFYDDWCPLK SPAFDSIMAETLK AFDSLPDDIHEK GGILAQSPFLIIK Real spectrum cross- correlated with theoretical spectrum 25050075010001250 20 40 60 80 100 x8 185.3 255.7 360.9 403.0 519.1 662.3 805.5 1007.4 1155.5 1226.8 1324.8 892.6 m/z Database searches compares the experimental MSMS spectra with the virtual spectra of the peptides generated by the in silico digestion of the proteins in the database
36
100 fmol BSA injected on column. BPC of m/z 300-2200, and typical MS/MS spectrum (right inset). 304050 Time [min ] MS trace MS/MS trace MS m/z 600200 1000 y2 b3 y3 y4 y5 b7 y7 b8 y6 b9 y9 y10 b11 b12 MS/MS NanoLC MS/MS
37
Database search
39
Database search with raw data from LC-MSMS MH + data Fragmentation spectra are automatically uploaded
43
Why Trypsin is preferred? Because MS sees only ions….. Upon fragmentation, the presence of a positively charged residue (Lys or Arg), will ensure the presence of a charge on fragments on the C- terminal side, generating the y serie The amino group at the N-terminus should ensure the presence of a charge on fragments on the N- terminal side, generating the b serie
45
For this kind of sample we do not use any fixed modification
46
Variable Modifications take into account modifications induced by sample treatment and/or sample deterioration
47
These informations depend on the instrument
48
A typical output of the results
51
Proteins are ranked as a function of decreasing score Proteins are grouped into families using a novel hierarchical clustering algorithm. If the family contains multiple members, the accessions, scores and descriptions are aligned with a dendrogram, which illustrates the degree of similarity between members. A short description is given of the hit, with the organism of provenance
52
-The Report Builder tab allows you to build a customised table of protein hits, which is particularly useful if you need a minimal list of proteins for a publication. Number of peptide matchesNumber of significant peptide matches (above the significance threshold) Number of indipendent sequences Number of significant indipendent sequences (above the significance threshold)
53
A hit Experimental m/z valueExperimental m/z transformed to a relative molecular mass molecular mass calculated from the matched peptide sequence Difference (error) between the experimental and calculated masses Ions score - If there are duplicate matches to the same peptide, then the lower scoring matches are shown in brackets. Expectation value for the peptide match. (The number of times we would expect to obtain an equal or higher score, purely by chance. The lower this value, the more significant the result). A letter U if the peptide sequence is unique to the protein family sequence Sequence of the peptide in 1-letter code. If the peptide sequence is modified, each affected residue is underlined.
54
Clicking the number These are other possible interpretations of the same fragmentation spectrum Highlighting the number (it is the spectrum number in the query)
55
The matched fragment ions are shown in tabular format below the spectrum. The ion series are those specified by the INSTRUMENT search parameter. If you choose to label the matches used for scoring, bold italic red means the series contributed to the score. Bold red means that the number of matches in the ion series is greater than would be expected by chance, indicating that the ion series is present. Non-bold red means that the number of matches in the ion series is no greater than would be expected by chance, so that the matches themselves may be by chance.
56
Advantages: - Univoque identifications - Multiple identifications - Few peptides are sufficient - Proteins in mixtures can be distinguished - Reduced relevance of protein contamination - Deductive results: no hypothesys is requested - Organisms can be recognized and differentiated
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.