Quality Control at a Local Brewery

Slides:



Advertisements
Similar presentations
FMRI Methods Lecture 10 – Using natural stimuli. Reductionism Reducing complex things into simpler components Explaining the whole as a sum of its parts.
Advertisements

Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Multiple Analysis of Variance – MANOVA
PCA for analysis of complex multivariate data. Interpretation of large data tables by PCA In industry, research and finance the amount of data is often.
Mutidimensional Data Analysis Growth of big databases requires important data processing.  Need for having methods allowing to extract this information.
An Introduction to Multivariate Analysis
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Factor Analysis Continued
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
Visualizing and Exploring Data Summary statistics for data (mean, median, mode, quartile, variance, skewnes) Distribution of values for single variables.
Chapter 17 Overview of Multivariate Analysis Methods
Lecture 7: Principal component analysis (PCA)
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Regionalized Variables take on values according to spatial location. Given: Where: A “structural” coarse scale forcing or trend A random” Local spatial.
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
1 Pertemuan 21 Matakuliah: I0214 / Statistika Multivariat Tahun: 2005 Versi: V1 / R1 Analisis Struktur Peubah Ganda (I): Analisis Komponen Utama.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Tables, Figures, and Equations
Relationships Among Variables
Principal Component Analysis. Philosophy of PCA Introduced by Pearson (1901) and Hotelling (1933) to describe the variation in a set of multivariate data.
Separate multivariate observations
Factor Analysis Psy 524 Ainsworth.
Inferential Statistics: SPSS
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
Summarized by Soo-Jin Kim
Key Features and Results
Chapter 2 Dimensionality Reduction. Linear Methods
Principal Components Analysis BMTRY 726 3/27/14. Uses Goal: Explain the variability of a set of variables using a “small” set of linear combinations of.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Basic concepts in ordination
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Chapter 16 Data Analysis: Testing for Associations.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
PATTERN RECOGNITION : CLUSTERING AND CLASSIFICATION Richard Brereton
In the name of GOD. Zeinab Mokhtari 1-Mar-2010 In data analysis, many situations arise where plotting and visualization are helpful or an absolute requirement.
Chapter 6: Analyzing and Interpreting Quantitative Data
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Analyzing Expression Data: Clustering and Stats Chapter 16.
Applied Multivariate Statistics Cluster Analysis Fall 2015 Week 9.
Principal Component Analysis (PCA)
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
Principal Component Analysis
Presented by: Muhammad Wasif Laeeq (BSIT07-1) Muhammad Aatif Aneeq (BSIT07-15) Shah Rukh (BSIT07-22) Mudasir Abbas (BSIT07-34) Ahmad Mushtaq (BSIT07-45)
Principal Components Analysis ( PCA)
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Out of sample extension of PCA, Kernel PCA, and MDS WILSON A. FLORERO-SALINAS DAN LI MATH 285, FALL
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Dimensionality Reduction
Exploring Microarray data
Chapter 12: Regression Diagnostics
Principal Component Analysis (PCA)
Molecular phenotyping of HCS-2/8 cells as an in vitro model of human chondrocytes  J. Saas, Ph.D., K. Lindauer, Ph.D., B. Bau, M.Sc., M. Takigawa, D.D.S.,
CHAPTER 2: PSYCHOLOGICAL RESEARCH METHODS AND STATISTICS
Understanding Research Results: Description and Correlation
Clustering and Multidimensional Scaling
Descriptive Statistics vs. Factor Analysis
Multivariate Statistical Methods
Introduction to Statistical Methods for Measuring “Omics” and Field Data PCA, PcoA, distance measure, AMOVA.
Principal Component Analysis (PCA)
Multivariate Analysis of a Carbonate Chemistry Time-Series Study
Principal Component Analysis
Cluster Analysis.
Multidimensional Space,
Compositional PCA plot of samples (A) and OTU loadings (B) for the initial data set. Compositional PCA plot of samples (A) and OTU loadings (B) for the.
Unsupervised Learning
Presentation transcript:

Quality Control at a Local Brewery Data analysis of in-house sensory panel to measure batch-to-batch consistency of an American IPA brew

Sensory Evaluation Quantitative Descriptive Analysis® (QDR) A behavioral sensory evaluation approach Widely used in Food & Beverage Sciences Term coined in 1974 by Herbert Stone and Joel Sidel at Stanford Research Institute

Components to QDA Experienced subjects (tasters) Small panel size Considered at 3 Taverns: Experienced subjects (tasters) Small panel size Screened for sensitivity Descriptors or classes are given Subjects rate on a scale Conducted individually without discussion Quantitative results considered only Replication of results Production staff, marketing team 6-8 per tasting Not taken into account 13 different sight, flavor, and aroma profiles given Rated on a 9 point scale Same brew not repeated Brew tasted again after 30 and 60 days Different combination of tasters

Sensory analysis at 3 Taverns 13 descriptors or profiles Aroma: pine, citrus, tropical fruit (measured separately) Carbonation Hops Color Sweetness Foam Resin Clarity Alcohol Mouthfeel Lingering bitterness Each variable scored on a scale of 1-9 5 is ideal and is considered “true to style” <5 means too little is perceived >5 means too much is perceived

Focus and goals of study Analyze the tasting panel results of multiple batches of one brew Combinations of different tasters rated the brews after 0, 30, & 60 days Identify outlier batches Identify outlier profiles Identify outlier tasters Make recommendations for improved tasting panel performance

Problem: Plots of averages of each profile after 0, 30, & 60 days are not meaningful No noticeable change over time Not useful

Solution: Principal Component Analysis Pairwise correlations are made to quantify relationships Multivariate analysis – 13 dimensions for this project (# profiles) Reduces dimensions to 2 or 3 Variances are plotted on 2 or 3 axes Clustering indicates related samples Powerful visual representation of data with multiple variables Outliers can be identified

MATLAB’s pca() function [coeff score latent tsquared] = pca(X) X: an m x n matrix, where m are observations and n are variables Coeff: loadings or principal component (PC) coefficients. Give magnitude and direction for eigenvectors in “p” space Score: values of PC for observations. Represent coordinates of original data in PC space Latent: the eigenvalues – gives variance of each eigenvector. Tsquared: Hotelling t-squared statistic. Gives distance of observation to center of data. Input m x n data matrix, X Generate eigenvector coefficients Generate scores Rollins et al., 2011

Issue: PCA is sensitive to outliers PCA centers data but does not scale PCA is best when: data is scaled variables follow normal distribution To visualize data and gain insight, normal distribution of individual variables is not required For hypothesis testing, normality is required Fix: use zscore() Centers data about zero Scales so that the standard deviation is 1

PCA of Descriptors after 0 days

PCA of Descriptors after 0, 30, and 60 days Some variables group together as expected Becomes more spread-out after 30 days. Higher complexity of beer? Suggests beer flavors and aromas may homogenize with time

PCA of Batches – spread of batches over time Visually, 160022, 160024, & 160039 appear to be possible outliers Hotelling t-square shows similar distance from center of data (problem?) – need to correct

Distance Matrix – pdist() Calculates Euclidean distance (straight line) between pairs of observations Try other distance metrics? Manhattan?

Cluster Analysis of Batches In MATLAB: Use linkage() and dendrogram() for tree Find cluster correlation with cophenet()

Next Steps Improve batch analysis Analyze taster performance Overall: Hotelling T2 values Distance metric for high dimensionality Analyze taster performance Find outlier tasters Overall: Summarize findings and make recommendations, e.g. Identify which descriptors tasters struggle with Add or remove descriptors Replication of results: introduce “blind” batches for replication

Questions