Daniel C. Liebler Vanderbilt University School of Medicine Vanderbilt, Tennessee Performance and Optimization of LC-MS/MS Platforms for Unbiased Discovery.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Protein Quantitation II: Multiple Reaction Monitoring
The Proteomics Core at Wayne State University
Immunity & Infection Research Centre Better Biomarkers in Transplantation. A Genome Canada Initiative for Human Health Rob McMaster, Gabriela Cohen-Freue,
Big Data & the CPTAC Data Portal Nathan Edwards, Peter McGarvey Mauricio Oberti, Ratna Thangudu Shuang Cai, Karen Ketchum Georgetown University & ESAC.
LESSON 1: What is Genetic Research? PowerPoint slides to accompany Using Bioinformatics : Genetic Research.
Methods: Metabolomics Workflow Introduction Figure 1a: 1 H NMR spectrum of blood serum sample from a breast cancer patient. Results The emerging area of.
PepArML: A model-free, result-combining peptide identification arbiter via machine learning Xue Wu, Chau-Wen Tseng, Nathan Edwards University of Maryland,
The Promise & Challenge of Health Care IT in Community Clinics: Insights from the California Community Clinics Initiative Prepared for the convening on.
Mass Spectrometry at The University of Louisville HSC
Bioinformatics pipeline for detection of immunogenic cancer mutations by high throughput mRNA sequencing Jorge Duitama 1, Ion Mandoiu 1, and Pramod Srivastava.
Metabolomics DNA RNA Protein Biochemicals (Metabolites) Genomics – 25,000 Genes Transcriptomics – 100,000 Transcripts Metabolomics – 2,800 Compounds Proteomics.
Bindley Bioscience Center Vision: Nurture interactive communication and interdisciplinary discovery with flexible laboratory project spaces and an open.
Previous Lecture: Regression and Correlation
FIGURE 5. Plot of peptide charge state ratios. Quality Control Concept Figure 6 shows a concept for the implementation of quality control as system suitability.
Proteomics Josh Leung Biology 1220 April 13 th, 2010.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
NCI’s Clinical Proteomic Technologies for Cancer: “Restructuring Proteomics to Succeed in Discovering Cancer Biomarkers” Joe.
Proteomics Informatics Workshop Part III: Protein Quantitation
Experimental Design and Biospecimens David F. Ransohoff, MD Division of Gastroenterology and Hepatology; Dept. of Medicine.
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
Isolation of N-linked glycopeptides from plasma Yong Zhou 1, Ruedi Aebersold 2, and Hui Zhang 1,3 * 1 Institute for Systems Biology, Seattle, Washington.
The CCE 5 th Annual Retreat Global Proteomics & Determination of Vitamin D Metabolites Update Jiri Adamec.
Proteomics Informatics – Data Analysis and Visualization (Week 13)
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
Production of polypeptides, Da, and middle-down analysis by LC-MSMS Catherine Fenselau 1, Joseph Cannon 1, Nathan Edwards 2, Karen Lohnes 1,
In God We Trust.
2007 GeneSpring MS GeneSpring for Metabolite BioMarker Analysis using Mass Spectrometry data Agilent Q-TOF VIP Visit Jan 16-17, 2007 Santa Clara, CA Thon.
False Biomarker Discovery due to Reactivity of a Commercial ELISA for CUZD1 with Cancer Antigen CA125 I. Prassas, D. Brinc, S. Farkona, F. Leung, A. Dimitromanolakis,
Human Proteome Project? Màster en bioquímica, biologia molecular i biomedicina Mòdul 4: Genòmica i Proteòmica Núria Colomé Calls.
Metrological Experiments in Biomarker Development (Mass Spectrometry—Statistical Issues) Walter Liggett Statistical Engineering Division Peter Barker Biotechnology.
Center for Human Health and the Environment
© 2010 SRI International - Company Confidential and Proprietary Information Quantitative Proteomics: Approaches and Current Capabilities Pathway Tools.
GSAT501 - proteomics Name, home-town Students – previous lab experience –Lab you hope to end up in? Teachers – what is your current project.
Mass spectrometry session. Summary Fiehn (1) Standardization important Reporting important, but has to be feasible Does not matter which MS instrument.
Bringing Metrology to Clinical Proteomic Research David Bunk Chemical Science and Technology Laboratory National Institute of Standards and Technology.
BIOMARKERS Diagnostics and Prognostics. OMICS Molecular Diagnostics: Promises and Possibilities, p. 12 and 26.
MS/MS Libraries of Identified Peptides and Recurring Spectra in Protein Digests Lisa Kilpatrick, Jeri Roth, Paul Rudnick, Xiaoyu Yang, Steve Stein Mass.
The Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial Fifth Annual African-American Prostate Cancer Disparity Summit September 24, 2009 Christine.
Karl Clauser Proteomics and Biomarker Discovery Breast Cancer Proteomics and the use of TCGA Mutational Data - Broad Institute update/issues Karl Clauser.
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
Proteomic Characterization of Alternative Splicing and Coding Polymorphism Nathan Edwards Center for Bioinformatics and Computational Biology University.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Using Predictive Classifiers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.
Overview of Mass Spectrometry
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
ProteoRed WG1-WG2 Meeting Salamanca, March,16th 2010 PME5: Quantitative LC-MS differential analysis F. Canals.
Novel Peptide Identification using ESTs and Genomic Sequence Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland,
Peptide-assisted annotation of the Mlp genome Philippe Tanguay Nicolas Feau David Joly Richard Hamelin.
QIBA DCE-MRI Analysis Algorithm Validation Specification and Testing Daniel Barboriak M.D. Duke University Medical Center
Minimize Database-Dependence in Proteome Informatics Apr. 28, 2009 Kyung-Hoon Kwon Korea Basic Science Institute.
What is proteomics? Richard Mbasu and Ben Richards.
Protein quantitation I: Overview (Week 5). Fractionation Digestion LC-MS Lysis MS Sample i Protein j Peptide k Proteomic Bioinformatics – Quantitation.
SRM working plan. La Cristalera, 5-6 November 2013.
Target Analyses in Parallel Reaction Monitoring Mode (PRM)
Novel blood and tissue Biomarkers for Breast and Prostate Cancers
Shotgun protein identification Creative Proteomics offers iTRAQ protein quantification service suited for unbiased untargeted biomarker discovery. Relative.
Table 1. Quality Parameters Being Considered for Evaluation
Richard Mbasu and Ben Richards
Algorithms and Computation: Bottom-Up Data Analysis Workflows
Bottom-Up Proteomics Data collection
Standards Development for Metabolomics
A perspective on proteomics in cell biology
Diagnostics and Prognostics
Summary of intralaboratory and interlaboratory variation for metrics for three LTQ and three LTQ-Orbitrap instruments in six replicate analyses of a tryptic.
Stability and variation of metrics over a range of sample injection amounts. Stability and variation of metrics over a range of sample injection amounts.
Performance metrics for triplicate analyses of a tryptic digest of the CPTAC yeast reference proteome on four LTQ-Orbitraps at three different sites in.
Illustration of chromatography metric C-2A applied to LC-MS/MS data from three Thermo LTQ systems in analyses of yeast proteome samples in CPTAC Study.
Bio-Rad Overview and Statement of Interests
Presentation transcript:

Daniel C. Liebler Vanderbilt University School of Medicine Vanderbilt, Tennessee Performance and Optimization of LC-MS/MS Platforms for Unbiased Discovery of Biomarker Candidates

Discovery Tissue Proximal fluids Clinical Validation Blood Population Verification Blood Population Bio-Specimens Plasma Tissue Proximal fluids Found in blood? higher in cancer? Biomarkers worth evaluating Biomarkers worth evaluating biomarker candidates A Functioning Pipeline for Cancer Biomarker Development Requires Both Discovery and Directed Assay Components “hypotheses” untargeted proteomics genomics

Overview of shotgun proteomics proteins peptides AVAGCPGR STYMAGGTVVK MSAVVGHLYK NALGHTSSSPGGVPR IDGEEPGLVAR QCCDGGVSWTK ANDMANYMORE digest MS-MS data encode sequences database search liquid chromatography- tandem mass spectrometry (LC-MS/MS) peptide fractionation

Shotgun proteomics for biomarker discovery Challenges > 10 6 range of protein concentrations in tissues, biofluids no technology yet capable of complete sampling of proteomes multiple instrument systems used variability in detection of proteins No systematic studies of variation in shotgun proteomics Impact of variation on performance for unbiased discovery unknown cancernormal shotgun proteome analysis compare inventories, identify differences few (<20) samples or sample pools low throughput Identify ~5,000+ proteins inventory differences ~ proteins

HUPO Study Key questions not addressed:  Is the technology inherently variable?  What are the sources of variation?  How reproducible are analyses of complex biological proteomes? “A HUPO test sample study reveals common problems in mass spectrometry-based proteomics” Bell et al. Nature Methods (2009) 6: protein mixture distributed to 27 labs; no standardized methods or data analysis only 7 labs correctly ID all 20 proteins Coaching  reanalyses  common bioinformatics all 27 labs correctly ID “most” proteins Conclusions: Variable performance within and between labs better databases and search engines needed, as is training in their use

CPTAC Unbiased Discovery Workgroup: Goals Evaluate and standardize performance of proteomic discovery platforms and standardize their use Identify and characterize sources of variation Develop quality control metrics Employ defined protein mixtures and biologically relevant proteomes Evaluate sensitivity for protein detection at defined levels of concentration

Discovery WG participants National Cancer Institute - OTIR - OBBR - TTB - NCICB - DCEG - DCB - DCTD - DCP - CCR - OCTR (SPOREs) Bethesda and Frederick, Md. National Institute of Standards and Technology Gaithersburg, Md. Texas A&M University College Station, Tex. Argonne National Laboratory Argonne, Ill. Memorial Sloan- Kettering Cancer Center, New York, NY Broad Institute of MIT and Harvard, Cambridge, Mass. University of California at San Francisco, San Francisco, Calif. Purdue University, West Lafayette, Ind. Harvard-affiliated Hospitals, Boston, MA Fred Hutchison Cancer Research Center, Seattle, Wash. M.D. Anderson Cancer Center, Houston, Tex. University of British Columbia & University of Victoria, Vancouver, British Columbia, Canada. Plasma Proteome Institute, Washington, D.C. New York University, New York, NY. Buck Institute, Novato, Calif. Lawrence Berkeley National Laboratory, Berkeley, Calif. Vanderbilt University School of Medicine, Nashville, Tenn. National Institute of Statistical Sciences Research Triangle Park, NC Indiana University, Predictive Physiology and Medicine, Inc. Bloomington, Ind. Purdue U., Indiana U. Schools of Medicine, Indianapolis, Ind.

CPTAC Discovery WG studies: from simple to complex proteomes samples instruments various instruments study LTQ, LTQ-Orbitrap SOP nonev. 1.0 v. 2.0v. 2.1v. 2.2 NCI-20 yeast yeast w/ BSA spike yeast w/ Sigma UPS spikes Sigma UPS 48 human protein mix 8 none SOP refinement Sample complexity Equivalent to HUPO study

Repeatability and reproducibility are robust across laboratories Run-to-run repeatability in analyses of yeast across 7 instruments/sites 65-80% of protein IDs repeat between runs on individual instruments Interlaboratory reproducibility 65-70% of protein IDs are reproduced across multiple instruments proteins peptides proteins peptides % peptides/proteins shared between runs Study 6 (yeast) Study 6, Study 8 (yeast) % peptides/proteins shared between runs

Saccharomyces cerevisiae proteome reference material Complex protein matrix (6,000 ORFs) Reproducible preparation possible Quantitative TAP tag studies (Ghaemmaghami, S. et al. (2003) Nature 425, ) provide calibration for expression levels Statistical power for modeling depth of coverage by proteomics platforms Use with human protein spikes enables modeling for biomarker discovery applications Made available to proteomics community (NIST) Ghaemmaghami, S. et al. (2003) Nature 425,

Modeling detection of human protein “biomarker” spikes in yeast matrix 48 human protein (UPS) spikes in yeast proteome background “simple” discovery system distinguishes differences equivalent to 0.5 – 1.3% of proteome

System map of LC-MS performance metrics Paul Rudnick Steve Stein (NIST) Over 40 performance metrics monitored

Diagnosing and correcting system malfunction CPTAC study 5

CPTAC Unbiased Discovery Workgroup: Key Achievements 1.First systematic, SOP-driven study of LC-MS/MS analytical systems across multiple laboratories 2.Quantitative assessment of repeatability and reproducibility in peptide vs. protein detection 3.Yeast reference proteome standard and accompanying datasets 4.Yeast reference proteome with spikes enables quantitative modeling of power to discover biomarker candidates 5.Performance metrics and software (“toolkit”) to monitor and troubleshoot system performance

Next steps for Discovery WG Evaluate performance of platforms to discriminate between cancer-relevant phenotypes Phase II Studies Human breast cancer cell model; responses to TKI Compare commonly employed quantitative methods for survey of differences Phase III studies Human tumor tissue specimens corresponding to defined clinical phenotypes Evaluate phenotype discrimination Implement methods, metrics and approaches developed in Phase I, Phase II studies

Backups

Why care about reproducibility in discovery proteomics? 1.Biomarker candidates come from comparing proteomes from different phenotypes 2.Need to know whether observed differences are due to biology or to variability in the analytical system. biomarker candidates cancernormal shotgun proteome analysis compare inventories, identify differences

Yeast proteome enables calibration and comparison of detection efficiency CN 50 = copy number with 50% detection probability

Metrics identify the greatest sources of variability