Metabolomics Research & Consultancy CJ Alexander, CA Hackett & JW McNicol With collaborations below as indicated from: 1The James Hutton Institute, UK.

Slides:



Advertisements
Similar presentations
Multiple Comparisons in Factorial Experiments
Advertisements

Data analysis support – opportunities with AQMeN Applied Quantitative Methods Network Alistair Geddes (Dundee University)
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Statistical association of genotype and phenotype.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Module 7: Estimating Genetic Variances – Why estimate genetic variances? – Single factor mating designs PBG 650 Advanced Plant Breeding.
QNT 531 Advanced Problems in Statistics and Research Methods
Analysis of Variance or ANOVA. In ANOVA, we are interested in comparing the means of different populations (usually more than 2 populations). Since this.
Lecture 8 Analysis of Variance and Covariance Effect of Coupons, In-Store Promotion and Affluence of the Clientele on Sales.
ASCA: analysis of multivariate data from an experimental design, Biosystems Data Analysis group Universiteit van Amsterdam.
Dr Paul Lewis Lecturer in Bioinformatics Lecturer in Bioinformatics Cardiff University Cardiff University Biostatistics & Bioinformatics Unit Biostatistics.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
EMBC2001 Using Artificial Neural Networks to Predict Malignancy of Ovarian Tumors C. Lu 1, J. De Brabanter 1, S. Van Huffel 1, I. Vergote 2, D. Timmerman.
Metabolomics Metabolome Reflects the State of the Cell, Organ or Organism Change in the metabolome is a direct consequence of protein activity changes.
Analysis of Variance and Covariance Effect of Coupons, In-Store Promotion and Affluence of the Clientele on Sales.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Innovative Paths to Better Medicines Design Considerations in Molecular Biomarker Discovery Studies Doris Damian and Robert McBurney June 6, 2007.
Genotype x Environment Interactions Analyses of Multiple Location Trials.
Lecture 22: Quantitative Traits II
A simple method to localise pleiotropic QTL using univariate linkage analyses of correlated traits Manuel Ferreira Peter Visscher Nick Martin David Duffy.
Genotype x Environment Interactions Analyses of Multiple Location Trials.
THE INHERITANCE OF PLANT HEIGHT IN HEXAPLOID WHEAT (Triticum aestivum L.) Nataša LJUBIČIĆ 1*, Sofija PETROVIĆ 1, Miodrag DIMITRIJEVIĆ 1, Nikola HRISTOV.
Genetic mapping and QTL analysis - JoinMap and QTLNetwork -
Chapter 12 Introduction to Analysis of Variance
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter 14 Repeated Measures and Two Factor Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh.
Looking for statistical twins
A comparison of PLS-based and other dimension reduction methods for tumour classification using microarray data Cameron Hurst Institute of Health and Biomedical.
Research Problems, Purposes, & Hypotheses
Early prediction of rice tolerance to salinity
Genotypic and Phenotypic Variance in Soybean Oil
Analysis of Variance and Covariance
Identifying QTLs in experimental crosses
NATIONAL NUTRITIONAL PHENOTYPE DATABASE
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
Inferential Statistics
An Introduction to Two-Way ANOVA
i) Two way ANOVA without replication
Genetics: Analysis and Principles
Genome Wide Association Studies using SNP
Justin D. Hackett, Benjamin J. Marcus, and Allen M. Omoto
Functional Genomics in Evolutionary Research
Genomic Investigation of Lupus in the Skin
Day 2: Session 8: Questions and follow-up…. James C. Fleet, PhD
Machine Learning Week 1.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Chapter Eight: Quantitative Methods
Hierarchical clustering approaches for high-throughput data
Multivariate Statistical Methods
Genomic Investigation of Lupus in the Skin
Proteomic Analysis Of The Potato Tuber Life Cycle
The Impact of Network Medicine in Gastroenterology and Hepatology
Understanding Multi-Environment Trials
RESEARCH METHODOLOGY ON ENVIRONMENTAL HEALTH PRACTICE IN WEST AFRICA
Transcriptional Landscape of Cardiomyocyte Maturation
Fixed, Random and Mixed effects
Volume 2, Issue 5, Pages (May 2016)
An Introduction to Correlational Research
Improving Overlap Farrokh Alemi, Ph.D.
Experimental Design All experiments consist of two basic structures:
Stephen C. Pratt, Mark J. Daly, Leonid Kruglyak 
Clustering The process of grouping samples so that the samples are similar within each group.
Volume 3, Issue 1, Pages (January 2010)
Roles of Defense Hormones in the Regulation of Ozone-Induced Changes in Gene Expression and Cell Death  Enjun Xu, Lauri Vaahtera, Mikael Brosché  Molecular.
WETLAND MANAGEMENT PLANNING Problem Analysis (Situation Analysis)
Copyright Pearson Prentice Hall
Volume 25, Issue 5, Pages e4 (May 2017)
Presentation transcript:

Metabolomics Research & Consultancy CJ Alexander, CA Hackett & JW McNicol With collaborations below as indicated from: 1The James Hutton Institute, UK 2Teagasc, Crops Environment and Land Use Programme, Ireland 3Aarhus University, Denmark Metabolomics is a rapidly developing field and scientists at The James Hutton Institute apply various technologies in plant breeding, quality & nutrition. BioSS provides consultancy support in design & analysis. We also investigate opportunities for further developments with new statistical techniques & data types. Conventional Statistical Analysis The statistical approach BioSS uses for the analysis of metabolomics datasets involves two strands: Multivariate where a PCA is used to explore the larger sources of variation amongst the experimental samples and identify the metabolites responsible for these differences Univariate where ANOVA is performed individually on each metabolite. A False Discovery Rate calculation allows the specific limit for significance to be chosen for those metabolites showing a significant treatment effects. A Hierarchical Cluster Analysis then illustrates the relationships among this subset Developments in Statistical Analysis Techniques such as ANOVA Simultaneous Components Analysis (ASCA) may provide an improvement on our conventional approach Essentially, for each ANOVA model term (main effects & interactions), ASCA performs a PCA on the table of effects. Example: potato tuber life cycle. A single factor experiment with 11 levels through life cycle stage ASCA biplot below shows variation of metabolites (marked ‘o’) and the vectors show the factor level loadings. PC1 distinguished metabolites which are increasing from those which are decreasing over the life cycle Linking Metabolomic & Transcriptomic Datasets LVT Shepherd1, PE Hedley1, JA Morris1, D McRae1 & JA Sungurtas1 HV Davies1 In a potato tuber bruising experiment, both microarray & metabolomic responses were available for the same samples in the same designed experiment. The treatment factors were: Genotypes: 3 Storage Times: 3 durations Which genes affect which metabolites? ANOVA performed separately for each of the genes & metabolites Eight possible groupings of treatment factor significances e.g. one grouping would be those responses which were significant for Genotype & Storage Time but no significant interaction Partition the genes & metabolites into their respective significance groups For each metabolite in a significance group, predict the response using a regression tree with all the genes Prune the regression tree 0.3 PC1 0.2 0.4 1a 1b 1c 1d 2a 2b 3 4a 4b 5 6 0.1 0.2 PC2 (19%) 0.0 0.0 -0.1 -0.2 -0.2 -0.2 -0.1 0.0 0.1 0.2 0.3 Stage PC1 (31%) mQTL Analysis A Foito1, S Byrne2,3, D Stewart1 & S Barth2 BioSS have analysed mQTL experiments with JHI collaborators on several crops (blackcurrant, raspberry, oats & potatoes). Shown here is a ryegrass mapping population experiment consisting of: Parent plants (P_m, P_f); F1 plants; 325 F2 offspring Phase 1: two replicate blocks were grown in the field Phase 2: metabolites measured in laboratory with GCMS Non Polar analysis. Such multi-phase experiments are becoming more common and benefit from BioSS input for the careful design and statistical analysis required For the metabolite Octacosanol, a graph of the LOD score profile identified a large QTL on linkage group 4 Consistent differences can be seen between the recessive QTL genotype b and the dominant genotype d for each batch Linkage Group 4 LOD score profile Means for batch number at different levels of D561623: Generation F2 0.5 25 0.0 20 -0.5 15 Octacosanol -1.0 Future Work Pre-processing of spectra Evaluation of other multivariate methods such as Bayesian Independent Component Analysis Utilising metabolic pathway databases (such as KEGG) LOD Score 10 -1.5 -2.0 5 -2.5 5 10 15 | | | Average sed Batch number D561623 b The potato bruising experiment received funding from the United States Department of Agriculture The ryegrass experiment received funding from Irish Department of Agriculture, Fisheries and Marine D561623 d