Download presentation
Presentation is loading. Please wait.
1
Computational Methods for Biomarker Discovery in Proteomics and Glycomics Vijetha Vemulapalli School of Informatics Indiana University Capstone Advisor: Dr. Haixu Tang
2
What are Biomarkers? Substances present in increased or decreased amounts in body fluids or tissues that indicate exposure, disease or susceptibility to disease. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
3
Some Uses of Biomarkers Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Biomarkers are increasingly being used for the following purposes: Prognosis / Diagnosis of disease Monitoring response to medication With high sensitivity and throughput, proteomics and glycomics is capable of identifying many potential biomarkers simultaneously.
4
More on Biomarkers Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References A lot of times biomarkers have not been identified clearly. But based on the signature pattern of glycans and proteins, samples can be classified as healthy and diseased.
5
What is Proteomics? Proteomics: Proteomics is the study of proteins and proteomes using high- throughput technology. Proteome: All the proteins in a cell or bodily fluid at a given point of time under certain conditions. Proteins: A chain of amino acids including hormones, enzymes and antibodies. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References http://parasol.tamu.edu/groups/amatogroup/foldingserver/images/proteinL.gifhttp://biology.clc.uc.edu/graphics/bio104/cell.jpg
6
What is Glycomics? Glycoproteins: Proteins with attached polysaccharides. Glycans: Polysaccharide chain attached to a protein Glycome: The entire set of glycans that are present in a cell or a bodily fluid at a certain point of time under certain conditions. Glycomics: Study of structure and function of oligosaccharides in a cell or organism. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References http://www.glyfdis.org/images/bg_image.jpg
7
High Throughput Technologies to Identify Biomarkers Genome Scale Scanning Genome level Micro - arrays Transcriptome level Proteomics Proteome level Glycomics Glycome level Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References http://phy.asu.edu/phy598-bio/D4%20Notes%2006_files/image002.jpg
8
Transcriptome Why the Focus on Proteomics and Glycomics? Information content Genome Transcriptome Proteome Glycome Static Dynamic Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
9
Biomarker Discovery using Proteomics
10
Liquid Chromatography / Mass Spectrometry (LC/MS) Why LC/MS for analysis of proteomes? LC spreads complexity of the sample over time. MS identifies ions based on their mass/charge value. Software exists currently to identify proteins in a sample using data from a LC-MS experiment. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Liquid Chromatography Mass Spectrometry Data Protein sample
11
Liquid Chromatography (LC) Liquid Chromatography is a technique that separates ions or molecules dissolved in a solvent based on size of the ion/molecule, adsorption, ion-exchange or other similar characteristics. Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References http://wwwlb.aub.edu.lb/~webcrsl/high_p3.jpg
12
What is Mass Spectrometer? Mass Spectrometry (MS) is an instrument that identifies ions based on their mass-to-charge ratio. Source: http://www.chemguide.co.uk/analysis/masspec/howitworks.html & http://www.bmms.uu.se/ltq-ft.htmhttp://www.chemguide.co.uk/analysis/masspec/howitworks.htmlhttp://www.bmms.uu.se/ltq-ft.htm Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
13
Visualization of LC/MS Data : 2D Map Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
14
How Do We Find Biomarkers From LC-MS Data? Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References MS View Quantities of peptides identified from the sample Liquid Chromatography Mass Spectrometry Data Protein sample Identification software Identified Proteins and Peptides
15
How Do We Find Biomarkers From LC-MS Data? Continued… Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Sample 1 Sample 2 Sample 3 Sample N MSView Quantification 1 Quantification 2 Quantification 3 Quantification N Analyze to find Biomarkers
16
MSView Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References MSView Visualization Relative Quantification Components Purpose Visual comparison /Analysis Further analysis for Biomarker Discovery
17
Extracted Ion Chromatogram (XIC) Chromatogram created by plotting the intensity of the signal observed at a chosen m/z value in a series of mass spectra recorded as a function of retention time. Source: http://www.lcpackings.com/applications/Probot/images/dual_fract04B.png Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
18
Visualization: XIC Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
19
Relative Quantification using Peptide Identification Results Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Data from LC-MS experiment Identification of peptides Extracted Ion Chromatogram of peptide Peak selection Area calculation MS View
20
Actual data: Quantification: Peak Selection Algorithm Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References After Smoothing: Minima Maxima Selecting local maxima and minima Selecting peaks: Minima Maxima
21
Quantification: Sample Results Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
22
Biomarker Discovery using Glycomics
23
How does Capillary Electrophoresis (CE) work? Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References http://faculty.washington.edu/dovichi/UBUBTpage/research/Methods/CEintro/ceintro.GIF&imgrefurl=http://faculty.washington.edu/dovichi/UBUBTpage/research/Methods/CEintro/CE_LIF.html&h=531&w=6 84&sz=25&hl=en&start=3&um=1&tbnid=_JDf4X3dJn170M:&tbnh=108&tbnw=139&prev=/images%3Fq%3Dcapillary%2Belectrophoresis%26svnum%3D10%26um%3D1%26hl%3Den
24
What does the data look like? Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Samples from different CE experiments:
25
Biomarker Discovery using Glycomics – Overview Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Data from different samples Analysis of quantification for identifying Biomarkers Quantification of mapped peaks Mapping areas corresponding the same glycan from different samples CE Analyze
26
Direct Comparison: Dynamic Time Warping (DTW) DTW algorithm aligns two time series having similar curves but are skewed differently over time. Time Source: http://db-www.aist-nara.ac.jp/theme/bioinfo_kenji-h_dtw.png Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
27
Direct Comparison: DTW continued… Sakoe-Chuba Band is used to reduce time & space complexity. Parameters used in DTW: - Band width- Peak extention penalty - Difference in peak intensities. - Difference in peak direction Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Stan Aslvador and Philip Chan. FastDTW:Toward Accurate Dynamic Time Warping in Linear Time and Space, KDD Workshop on Mining Temporal and Sequential Data, 2004
28
Method: Dynamic Time Warping Consensus Sample Align to consensus sample Align next sample to consensus sample Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
29
Method continued… Corresponding peaks Aligned sample Unaligned sample Corresponding peaks Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Calculate Area Peak 1
30
Results Corresponding peaks Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
31
Summary Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References Proteomics - MSView Glycomics - CE Analyze LC-MS data Identified Peptides Quantification results for Biomarker Discovery CE Data Quantification results for Biomarker Discovery
32
Acknowledgements Dr. Haixu Tang- My advisor Dr. Randy J.ArnoldDr. Yehia Mechref Dr. Milos NovotnyDr. David E.Clemmer Dr. Sun Kim Dr. Jeong-Hyeon Choi Dr. Stephen J. Valentine Yin Wu Manolo D.Plasencia School of Informatics Funding: NIH/NCRR MetaCyt Initiative @ Indiana University Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
33
[ 1] Higgs, R.E., Knierman, M.D., Gelfanova, V., Butle,r J.P. and Hale, J.E. (2005) Comprehensive label-free method for the relative quantification of proteins from biological samples. J. Proteome Res., 4, 1442-1450. [2] Linsen, L., Locherbach, J., Berth, M., Becher, D. and Bernhardy, J. (2006) Visual Analysis of Gel-Free Proteome Data. IEEE Transactions on Visualization and Computer Graphics,12, 497-508. [3] Prakash, A., Mallick, P., Whiteaker, J., Zhang, H., Paulovich, A., Flory, M., Lee, H., Aebersold, R., and Schwikowski, B. (2006) Signal maps for mass spectrometry-based comparative proteomics. Mol. Cell. Proteomics 5, 423 –432 [4] Leptos, K. C., Sarracino, D. A., Jaffe, J. D., Krastins, B., and Church, G. M. (2006) MapQuant: open-source software for large-scale protein quantification. Proteomics 6, 1770 –1782 [5] Aebersold, R., and Mann, M. (2003) Mass spectrometry-based proteomics. Nature 422, 198 –207 Problem Definition Background LC-MS Method Results CE Method Results Acknowledgements References
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.