Mixed Model Analysis of Highly Correlated Data: Tales from the Dark Side of Forestry Christina Staudhammer, PhD candidate Valerie LeMay, PhD Thomas Maness,

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Brief introduction on Logistic Regression
September 2000Department of Statistics Kansas State University 1 Statistics and Design of Experiments: Role in Research George A. Milliken, PhD Department.
Statistics : Role in Research. Statistics: A collection of procedures and processes to enable researchers in the unbiased pursuit of Knowledge Statistics.
Kriging.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.
1 Detection and Analysis of Impulse Point Sequences on Correlated Disturbance Phone G. Filaretov, A. Avshalumov Moscow Power Engineering Institute, Moscow.
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
Basic geostatistics Austin Troy.
How Many Samples are Enough? Theoretical Determination of the Critical Sampling Density for a Greek Clay Quarry. by K. Modis and S. Stavrou, Nat. Tech.
Using the Maryland Biological Stream Survey Data to Test Spatial Statistical Models A Collaborative Approach to Analyzing Stream Network Data Andrew A.
Deterministic Solutions Geostatistical Solutions
FOUR METHODS OF ESTIMATING PM 2.5 ANNUAL AVERAGES Yan Liu and Amy Nail Department of Statistics North Carolina State University EPA Office of Air Quality,
Applied Geostatistics
Deterministic Solutions Geostatistical Solutions
Ordinary Kriging Process in ArcGIS
Multiantenna-Assisted Spectrum Sensing for Cognitive Radio
Two and a half problems in homogenization of climate series concluding remarks to Daily Stew Ralf Lindau.
Introduction to Multilevel Modeling Using SPSS
Geo479/579: Geostatistics Ch13. Block Kriging. Block Estimate  Requirements An estimate of the average value of a variable within a prescribed local.
Day 7 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.
Tracking Pedestrians Using Local Spatio- Temporal Motion Patterns in Extremely Crowded Scenes Louis Kratz and Ko Nishino IEEE TRANSACTIONS ON PATTERN ANALYSIS.
On Model Validation Techniques Alex Karagrigoriou University of Cyprus "Quality - Theory and Practice”, ORT Braude College of Engineering, Karmiel, May.
Spatial Interpolation of monthly precipitation by Kriging method
Covariance structures in longitudinal analysis Which one to choose?
1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence.
Edoardo PIZZOLI, Chiara PICCINI NTTS New Techniques and Technologies for Statistics SPATIAL DATA REPRESENTATION: AN IMPROVEMENT OF STATISTICAL DISSEMINATION.
Chapter 6 BEST Linear Unbiased Estimator (BLUE)
Explorations in Geostatistical Simulation Deven Barnett Spring 2010.
Mixed Linear Models An Introductory Tutorial. What we have covered!!! The Linear model: Mean Structure or Fixed Effects Errors.
Geographic Information Science
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Geo479/579: Geostatistics Ch16. Modeling the Sample Variogram.
Spatial Statistics in Ecology: Continuous Data Lecture Three.
GEOSTATISICAL ANALYSIS Course: Special Topics in Remote Sensing & GIS Mirza Muhammad Waqar Contact: EXT:2257.
The Semivariogram in Remote Sensing: An Introduction P. J. Curran, Remote Sensing of Environment 24: (1988). Presented by Dahl Winters Geog 577,
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Definitions Random Signal Analysis (Review) Discrete Random Signals Random.
Interpreting Principal Components Simon Mason International Research Institute for Climate Prediction The Earth Institute of Columbia University L i n.
Population Pharmacokinetic Characteristics of Levosulpiride and Terbinafine in Healthy Male Korean Volunteers Yong-Bok Lee College of Pharmacy and Institute.
Semivariogram Analysis and Estimation Tanya, Nick Caroline.
Jeff J. Orchard, M. Stella Atkins School of Computing Science, Simon Fraser University Freire et al. (1) pointed out that least squares based registration.
Chapter 6 – Estimating Variance Components of Tack Measurements An important consideration in the manufacture of type of item is consistency; all items.
Vamsi Sundus Shawnalee. “Data collected under different conditions (i.e. treatments)  whether the conditions are different from each other and […] how.
Short course on space-time modeling Instructors: Peter Guttorp Johan Lindström Paul Sampson.
Autoregressive (AR) Spectral Estimation
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Machine Learning 5. Parametric Methods.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
Geo479/579: Geostatistics Ch12. Ordinary Kriging (2)
Information criteria What function fits best? The more free parameters a model has the higher will be R 2. The more parsimonious a model is the lesser.
Chapter 13 Understanding research results: statistical inference.
Computacion Inteligente Least-Square Methods for System Identification.
CARMA Models for Stochastic Variability (or how to read a PSD) Jim Barrett University of Birmingham.
Adv DSP Spring-2015 Lecture#11 Spectrum Estimation Parametric Methods.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Model Comparison. Assessing alternative models We don’t ask “Is the model right or wrong?” We ask “Do the data support a model more than a competing model?”
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Core Research Competencies:
Multilevel modelling: general ideas and uses
HST 583 fMRI DATA ANALYSIS AND ACQUISITION
Linear Mixed Models in JMP Pro
Maximum Likelihood Estimation
Clustering (3) Center-based algorithms Fuzzy k-means
Interpreting Principal Components
Two Sample t-test vs. Paired t-test
Paul D. Sampson Peter Guttorp
GAUSSIAN PROCESS REGRESSION WITHIN AN ACTIVE LEARNING SCHEME
Presentation transcript:

Mixed Model Analysis of Highly Correlated Data: Tales from the Dark Side of Forestry Christina Staudhammer, PhD candidate Valerie LeMay, PhD Thomas Maness, PhD Robert Kozak, PhD THE UNIVERSITY OF BRITISH COLUMBIA VANCOUVER, BRITISH COLUMBIA, CANADA

Staudhammer, et al.

Introduction - 1 Current Statistical Process Control (SPC) in Sawmills –Data Collection: Periodically, a few boards are pulled from a machine Thickness measured in 6-10 places with digital calipers –Data Analysis Control Charts are constructed to ensure that X, s 2 b, s 2 w are within a target range, e.g., –SPC is slow and labour-intensive, but important and effective

Staudhammer, et al. Introduction - 2 Recent advances in SPC –Laser Range Sensors Real-time measurements available at up to 1000 meas./sec. Each and every board (or cant) is measured –Research describing Rigid Body Motion Removes effect of ‘bouncing boards’ (or cants) enables board profiles to be analyzed, in addition to thickness –On-line machine diagnostics can be monitored to trace quality problems to specific saws

Staudhammer, et al. Interesting Issues A great increase in the amount of information available –the data from these devices is subject to noise External, e.g., wane Internal, e.g., measurement errors –The data are closely spaced and highly autocorrelated Boards are almost censused Observations are easily predicted from their neighbors. The process variance is underestimated, leading to too narrow control limits for SPC and false signals of an out of control process. An adequate statistical model to describe the data has not yet been described in the literature.

Staudhammer, et al. Objectives Research Objective –To establish a system for collecting and processing real-time quality control data for automated lumber manufacturing Presentation Objective –To present methods for estimation of the components of variance so that control charts can be constructed

Staudhammer, et al. Data Collection

Staudhammer, et al. Profile Data Profiles (y 1 – y 4 ) are computed using the laser readings and the known distance to the centre of the board. l4 l4 l2 l2 Laser 3 l1 l1 y 3 y 4 y 1 y 2 Laser 2 Laser 4 Laser 1 l3 l3

Staudhammer, et al. Sample Data - Profile

Staudhammer, et al. Simple Model y ijkm =  +  i +  j + k +  ijkm [1] where: i = 1 to b boards; j = 1 to s sides; k = 1 to r laser positions; m = 1 to n measurements along the board;  i = the ith board effect;  j = the jth side effect; k = the kth laser position effect; and  ijkm = the error associated with the mth measurement.

Staudhammer, et al. ● ● ● ● ● ● ● ● ● ● ● ● Model Details All effects are random, except sides Observations on a single side of a board are highly correlated, and thus the error covariance structure should be added to the model…

Staudhammer, et al. Error Covariance Structures Isotropic spatial covariance structures e.g., Exponential: (Other models include Gaussian, spherical, linear) Autoregressive covariance structures e.g., ARMA(1,1): Anisotropic spatial covariance structures e.g., Power:

Staudhammer, et al. Model Fitting Methods Models fit: [1] Simple Model [2] Model [1] plus isotropic spatial error covariance structure [3] Model [1] plus autoregressive error cov. structure [4] Model [1] plus anisotropic spatial error covariance structure Models were fit with SAS PROC MIXED A reduced dataset was used with 50 meas. per laser/side/board

Staudhammer, et al. Model Evaluation Tests for Maximum Likelihood Estimation (MLE) –e.g., Likelihood Ratio Test, Wald Test, etc. –Are tests appropriate? Fit Statistics for MLE –Information Criteria, e.g., Akaike’s Information Criteria (AIC) –Do not require setting arbitrary significance levels

Staudhammer, et al. Results Simple Model

Staudhammer, et al. Results Model [1] Plus Exponential Error Covariance Structure

Staudhammer, et al. Results Model [1] plus ARMA(1,1) Error Cov. Structure

Staudhammer, et al. Results Model [1] plus Anisotropic Power Error Cov. Structure

Staudhammer, et al. Model selection based on fit statistics –Lowest AIC indicates [4] with Anisotropic Power Structure –What is indicated by directional variograms? Discussion - 1

Staudhammer, et al. Semivariograms vs. Model [4]

Staudhammer, et al. Semivariograms vs. Model [4]

Staudhammer, et al. Model selection based on knowledge of system –Appropriateness of isotropic spatial vs. anisotropic spatial vs. autoregressive models of error covariance structure –Should there be a decrease in between- board variance component? Will a saw travelling at varying speeds yield a consistent ‘saw signature’? Discussion - 2

Staudhammer, et al. Conclusions Application of QC to automated processes is an important step toward more efficient lumber processing Model selection should be based on knowledge of the system as well as fit statistics Further testing should be done on datasets from different days/saws to ensure widespread applicability

Staudhammer, et al. Acknowledgements National Science and Engineering Research Council British Columbia Science Council Izaak Walton Killam Foundation Canadian Forest Products Weyerhauser Company Forintek Canada