Metabolomics Metabolome Reflects the State of the Cell, Organ or Organism Change in the metabolome is a direct consequence of protein activity changes.

Slides:



Advertisements
Similar presentations
Institute on Research and Statistics, Sacramento 04/08/04
Advertisements

Analysis of High-Throughput Screening Data C371 Fall 2004.
Protein Quantitation II: Multiple Reaction Monitoring
Dimension reduction (1)
Regression, Correlation. Research Theoretical empirical Usually combination of the two.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Proposal for a Standard Representation of the Results of GC-MS Analysis: A Module for ArMet Helen Fuell 1, Manfred Beckmann 2, John Draper 2, Oliver Fiehn.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Metabolomics Bob Ward German Lab Food Science and Technology.
CALIBRATION Prof.Dr.Cevdet Demir
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Introduction to BioInformatics GCB/CIS535
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Theodore Alexandrov, Michael Becker, Sören Deininger, Günther Ernst, Liane Wehder, Markus Grasmair, Ferdinand von Eggeling, Herbert Thiele, and Peter Maass.
ESMRMB 2009 InfoRESO Fitting Tool for Arrays of Interrelated Datasets (FiTAID) Daniel G.Q. Chong 1 Johannes Slotboom 2 Christine Bolliger 1 Chris Boesch.
Lecture 24: Thurs., April 8th
Elaine Martin Centre for Process Analytics and Control Technology University of Newcastle, England The Conjunction of Process and.
Chapter 11: Inference for Distributions
Metabolomic Data Processing & Statistical Analysis
Proteomics Informatics Workshop Part III: Protein Quantitation
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Objectives of Multiple Regression
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Proteomics Informatics – Data Analysis and Visualization (Week 13)
Practical Issues in Microarray Data Analysis Mark Reimers National Cancer Institute Bethesda Maryland.
Exploring Metabolomic data with recursive partitioning Metabolomic Workshop NISS July 14-15, 2005.
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
2007 GeneSpring MS GeneSpring for Metabolite BioMarker Analysis using Mass Spectrometry data Agilent Q-TOF VIP Visit Jan 16-17, 2007 Santa Clara, CA Thon.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
Dr. Richard Young Optronic Laboratories, Inc..  Uncertainty budgets are a growing requirement of measurements.  Multiple measurements are generally.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
NMR Analysis of Protein Dynamics Despite the Typical Graphical Display of Protein Structures, Proteins are Highly Flexible and Undergo Multiple Modes Of.
Eran Yanowski, Eran Hornstein’s: Monitor drug impact on the transcriptome of mouse beta cells (primary and cell-line) using Transeq/RNA-Seq Report.
Epigenetic Analysis BIOS Statistics for Systems Biology Spring 2008.
Novel Algorithms for the Quantification Confidence in Quantitative Proteomics with Stable Isotope Labeling* Novel Algorithms for the Quantification Confidence.
Quality Assurance How do you know your results are correct? How confident are you?
PATTERN RECOGNITION : CLUSTERING AND CLASSIFICATION Richard Brereton
Introduction to Biostatistics and Bioinformatics Regression and Correlation.
Chimiometrie 2009 Proposed model for Challenge2009 Patrícia Valderrama
Simple linear regression Tron Anders Moger
Innovative Paths to Better Medicines Design Considerations in Molecular Biomarker Discovery Studies Doris Damian and Robert McBurney June 6, 2007.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Analyzing Expression Data: Clustering and Stats Chapter 16.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
Nuclear Magnetic Resonance
Spectral Interpretation General Process for Structure Elucidation of an Unknown Nat. Prod. Rep., 1999, 16,
Laser light stimulates vibration, causing some energy to be absorbed and the rest to be scattered as a slightly red-shifted photon. Raman Microspectroscopic.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Applying MetaboAnalyst
1 Statistics & R, TiP, 2011/12 Multivariate Methods  Multivariate data  Data display  Principal component analysis Unsupervised learning technique 
BPS - 5th Ed. Chapter 231 Inference for Regression.
Strategies for Metabolomic Data Analysis Dmitry Grapov, PhD.
Protein quantitation I: Overview (Week 5). Fractionation Digestion LC-MS Lysis MS Sample i Protein j Peptide k Proteomic Bioinformatics – Quantitation.
Coronal Hole recognition by He 1083 nm imaging spectroscopy O. Malanushenko (NSO) and H.P.Jones (NASA's GSFC) Tucson, Arizona, USA Solar Image Recognition.
JMP Discovery Summit 2016 Janet Alvarado
Regression Analysis AGEC 784.
Using Laser Raman Spectroscopy to Reduce False Positives of Autofluorescence Bronchoscopies: A Pilot Study  Michael A. Short, PhD, Stephen Lam, MD, Annette.
Quality Control at a Local Brewery
Nat. Rev. Nephrol. doi: /nrneph
Standards Development for Metabolomics
Basic Practice of Statistics - 3rd Edition Inference for Regression
Copyright © 2016 Elsevier Inc. All rights reserved.
Volume 140, Issue 5, Pages (March 2010)
Volume 18, Issue 1, Pages (January 2017)
Brandon Ho, Anastasia Baryshnikova, Grant W. Brown  Cell Systems 
Maria S. Robles, Sean J. Humphrey, Matthias Mann  Cell Metabolism 
Volume 11, Issue 4, Pages (April 2015)
Presentation transcript:

Metabolomics Metabolome Reflects the State of the Cell, Organ or Organism Change in the metabolome is a direct consequence of protein activity changes Not necessarily true for genomic, proteomic or transcriptomic changes Disease, environmental factors, Drugs, etc., perturbs the state of the metabolome Provides a system-wide view of the organism or cell’s response

NMR Metabolomics Overview Prepare the cells, tissue or biofluids Harvest the metabolome Collect the NMR data Analyze the NMR data Analyze the metabolite changes

NMR Metabolomics Data One-dimensional 1 H NMR spectrum Two-dimensional NMR spectra

2D 1 H- 13 C HSQC Experiment – workhorse of metabolomics Correlates all directly bonded 13 C- 1 H pairs generally requires 13 C-labeling (1.1% natural abundance)

2D 1 H- 1 H TOCSY Experiment – workhorse of metabolomics Correlates all 3-bonded 1 H- 1 H pairs in a molecules

NMR Metabolomics Process

Monitor in vivo protein and drug activity Forgue et al. (2006) J. Proteome Res. 5(8): Halouska & Powers (2006) J. Mag. Res. 178:88-95 Inactive Drug Active & Selective Drug Active & Not Selective Active Against Wrong Protein Differential NMR Metabolomics

NMR and Multivariate Statistics Extreme Sensitivity to Experimental Differences Want PCA Clustering to Result from Metabolome Change NOT Experimental Variability EVERYTHING should be a CONSTANT between samples or the study is invalid NMR experimental parameters temperatureBuffer (pH)shimmingTuning & matching lock90 o pulse acquisition parameters Spectral width Data pointsRecycle time Acquisition time Solvent removal Receiver gain processing parameter Zero fillingBaseline correction Window function Linear prediction Solvent removal phasing

Negative Impact of Noise in NMR PCA Clustering ATP ATP-glucose ATP Remove Noise ATP #2 ATP #9 Single NMR Sample with repeat data collection ATP #2 ATP #9 Higher PC2 dispersion (-10 to 10) and an outlier lower PC2 dispersion (-4 to 2) Differential NMR Metabolomics

The Role of NMR Signal-to-Noise in PCA Clustering Increasing Number of NMR Scans (S/N)

Differential NMR Metabolomics How to Quantify the Statistical Significance of Cluster Separations? Analyze Metabolomic Data Using Tree Diagrams Calculate distances between cluster centers  distance matrix Apply Standard Boot-Strapping Methods Randomize selection of cluster members to determine cluster center Generate 100 different distance matrices  100 different trees  consensus tree Bootstrap number -> how many times the consensus node appears in the set of 100 trees

Differential NMR Metabolomics Bootstrap Number and Statistical Significance of Cluster Separations Larger the Distance Between Clusters More Significant Larger bootstrap or smaller p-value > 50% is significant More Data Points Easier to Distinguish Between Clusters more data points (solid line)

Sample Replicates Affects Class Distinction Increasing number of replicates Significant increase in statistical significance of cluster from a modest increase in number or replicates

Ellipses and Tree Diagrams Define Classes P-value on each node identifies statistical significance (< 0.001) of cluster Ellipses represent 95% confidence limits from a normal distribution

Metabolite Identification Differential NMR Metabolomics  Orthogonal partial least squares discriminant analysis (OPLS-DA) a non-linear variant of PCA that minimizes class (group) variations S-plots and loadings identify which “bins” (NMR chemical shifts – metabolites) are strongly correlated with class separation S-plots loadings

Metabolite Identification Overlay of 2D 1 H- 13 C HSQC spectra for wild- type (red) and aconitase mutant (black) Differential NMR Metabolomics Grow cells in the presence of a 13 C-labeled metabolite Only observe metabolites derived from the 13 C-labeled metabolite provided to the cells

Hu et al. (2011) J. Am. Chem. Soc. 133: Convert Peak Intensities to Concentrations (HSQC0) Our 2D 1 H- 13 C HSQC calibration curve

Convert Peak Intensities to Concentrations (HSQC0) Can now compare changes between metabolites

Convert Concentrations to Heatmap  Provides two-levels of hierarchal clustering Identifies replicates with same overall changes Identifies metabolites with correlated changes between replicates  Provides a simple view of a large amount of data  Calculated with a statistical package, like R

Differential NMR Metabolomics Metabolite Network Mapping (Cytoscape) Metabolites increased (red), decreased (green) or unperturbed/undetected (grey)

Differential NMR Metabolomics Traditional Metabolic Pathway

Some Final thoughts  A number of different analytical methods can be used to analyze the metabolome NMR, GC-MS, LC-MS, CE-MS, FTIR, etc.  A variety of statistical techniques can be used to analyze metabolomics data PCA, PLS, OPLS-DA, HCM, SOM, SVN, etc.  Can combine multiple datasets (NMR and MS) for multivariate statistical analysis  Can incorporate proteomics, genomics and any other data source with metabolomics data to generate system-wide view of the organism or cell response