Angular regression Louis-Paul Rivest S. Baillargeon, T. Duchesne, D. Fortin & A. Nicosia 1.

Slides:



Advertisements
Similar presentations
Detecting Statistical Interactions with Additive Groves of Trees
Advertisements

Pattern Recognition and Machine Learning
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Analysis of variance and statistical inference.
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Kin 304 Regression Linear Regression Least Sum of Squares
Copula Regression By Rahul A. Parsa Drake University &
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Deriving space use patterns from animal interaction mechanisms Jonathan Potts, Postdoctoral Fellow, University of Alberta, May 2013.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
2 – In previous chapters: – We could design an optimal classifier if we knew the prior probabilities P(wi) and the class- conditional probabilities P(x|wi)
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Dimension reduction (1)
The General Linear Model. The Simple Linear Model Linear Regression.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Biostatistics-Lecture 10 Linear Mixed Model Ruibin Xi Peking University School of Mathematical Sciences.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Parametric Inference.
Power Laws Otherwise known as any semi- straight line on a log-log plot.
“Ghost Chasing”: Demystifying Latent Variables and SEM
Some standard univariate probability distributions
CHAPTER 4: Parametric Methods. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Parametric Estimation X = {
Some standard univariate probability distributions
Discriminant Analysis Testing latent variables as predictors of groups.
Modern Navigation Thomas Herring
Linear Regression/Correlation
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Review of Lecture Two Linear Regression Normal Equation
Regression and Correlation Methods Judy Zhong Ph.D.
Xitao Fan, Ph.D. Chair Professor & Dean Faculty of Education University of Macau Designing Monte Carlo Simulation Studies.
Some standard univariate probability distributions Characteristic function, moment generating function, cumulant generating functions Discrete distribution.
Generalized Minimum Bias Models
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.
Chapter 3: Maximum-Likelihood Parameter Estimation l Introduction l Maximum-Likelihood Estimation l Multivariate Case: unknown , known  l Univariate.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Tutorial I: Missing Value Analysis
Generalized Spatial Dirichlet Process Models Jason A. Duan Michele Guindani Alan E. Gelfand March, 2006.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Classification and Regression Trees
Learning Theory Reza Shadmehr Distribution of the ML estimates of model parameters Signal dependent noise models.
MathematicalMarketing Slide 5.1 OLS Chapter 5: Ordinary Least Square Regression We will be discussing  The Linear Regression Model  Estimation of the.
Computacion Inteligente Least-Square Methods for System Identification.
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
Introduction We consider the data of ~1800 phenotype measurements Each mouse has a given probability distribution of descending from one of 8 possible.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Estimating standard error using bootstrap
Chapter 7. Classification and Prediction
Deep Feedforward Networks
Probability Theory and Parameter Estimation I
Linear Regression.
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
CH 5: Multivariate Methods
EC 331 The Theory of and applications of Maximum Likelihood Method
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
OVERVIEW OF LINEAR MODELS
The Multivariate Normal Distribution, Part 2
Chapter 4, Regression Diagnostics Detection of Model Violation
OVERVIEW OF LINEAR MODELS
Learning Theory Reza Shadmehr
Presentation transcript:

Angular regression Louis-Paul Rivest S. Baillargeon, T. Duchesne, D. Fortin & A. Nicosia 1

Summary 1- Animal movement in ecology 2- A general regression model for circular variable 3- Modeling the errors 4- Data analysis and simulation results 2

Animal movement in ecology 3 Study the interaction between an animal and its environment using 1.GIS data on land cover 2.GPS data on animal motion 3.Special software (ArGIS) is used to merge the data together

4 Dependent variable: y t motion angle at time t. Predicted value:  (y| x 1t, …) a compromise between several targets P t-1 Target 1: meadow PtPt y t-1 Target 2: Canopy gap x 2t x 1t P t+1 ytyt Animal movement in ecology

Animal movement 5 Ecologists are uneasy about combining targets’ directions. McClintock et al. (Ecological Monograph 2012) define a compromise z,t between the angles  t-1 and  z,t as They are not fully satisfied with their definition. They add a word of caution: without providing details. Not right for a non integer  z

A general circular regression model 6 Let (x 1, z 1 ),..., (x p, z p ) be explanatory variables measured on each unit where x is an angle and z is a positive linear variable. The mean direction of y given (x 1, z 1 ),..., (x p, z p ),  (y|x, z), is the direction of

Mixture model: Each target is associated to a state. Given state j the conditional model for y i is x ji plus some errors The unconditional mean direction of y i is  (y|x,z) with z=1 and  j  ρ j p j. Motivating example: latent classes 7 ρ j =E{cos(ε j )} is the mean resultant length of the deviations ε j in state j

A general circular regression model 8 Standardization:  1 = z 1 =1. Examples (z=1): Mean direction model x 1 =0, and x 2 =  /2 Rotation model x 1 =w, and x 2 =w+  /2

A general circular regression model 9 Examples (z=1): Decentred predictor (Rivest, 1997) x 1 =w, and x 2 =w+  /2, x 3 =0, and x 3 =  /2

A general circular regression model 10 Presnell & all (1998) model: z 1 = z 2 =0, z 3 = z 4 =w, x 1 =0, x 2 =  /2, x 3 =0, x 4 =  /2, Jammalamadaka & Sen Gupta (2001) models. The Moebius model of Downs & Mardia (2002) does not belong!

Models for the errors 11 We use the von Mises density for both specifications: with population MRL. This is a modified Bessel function

Modeling the errors 12 Option 1 (homogeneity model): The density of  does not depend on neither x nor z. It is von Mises with concentration parameter . von Mises variable

Modeling the errors 13 Option 2 (Consensus, Presnell et al, 1998): The concentration parameter of  is  ℓ, where ℓ is the length of It is large when all the angles x j point in the same direction. von Mises variable

Modeling the errors 14 The consensus model uses the parameters  to model the mean direction and the concentration of the dependent angle y. Wouldn’t it be better to use two independent sets of parameters, one for the direction and one for the error concentration, (Fisher, 1992 mixed models)?

Models for the errors 15 For the consensus model, the density of y given (x,z) is This is the conditional distribution for a multivariate von Mises model (Mardia, 1975). This density belongs to the exponential family and parameter estimation should be easy.

Parameter estimation 16 Strategy: 1.Maximize the von Mises Likelihood (use several starting values for the homogeneous errors) 2.Use the inverse of the Fisher information matrix to approximate the sampling distributions of the estimates (model based) 3.Calculate robust sandwich variance covariance matrices for the parameter estimates (valid even if the model assumptions are violated) Alternative estimation strategies: use the projected normal (Presnell & al., 1998) or the wrapped Cauchy as an error density.

Parameter estimation 17 Score functions [  i =  (y|x i, z i )] : 1-Homegenous errors 2-Consensur errors (  j =  j, ℓ i  =  ℓ i )

Parameter estimation 18 Data: (y i, x i, z i ), i=1,...,n Maximizing the von Mises likelihood with homogeneous errors leads to a max-cosine estimation criterion for the parameters {  j }: Numerical problems may occur. Example: data simulated from the homogeneous error model: n=50, p=2,  2 =0.5 x 1 =0, x 2  U(- ,  ),  =0.4

Parameter estimation 19 Properties: 1.The max-cosine estimator for  is consistent under the two error specifications, homogeneous and consensus; 2.When the errors are homogeneous, the consensus MLE might not be consistent. A lack of robustness to the specifications of the errors’ distribution is the price to pay for the numerical stability of the likelihood.

Parameter estimation 20 Properties: 1.Bias: In a one parameter model with homogenoeus errors, the consensus MLE underestimates  2 (by up to 20%) 2.MLE: The algorithm that maximizes the homogeneous likelihood must use several starting values (more that 1000!)

The conditional mean direction of y t is a compromise between y t-1 and x 0 : Under consensus errors with von Mises distributions (Mardia et al, 2007) Stationary distribution unknown for homogeneous errors. A time series model 21

This is a “Biased Correlated Random Walk” in Ecology: y t = direction of animal movement at time t x 0 =x 0t = direction of a target to which an animal might be attracted (“Directional Bias”) The estimation of  2 relies on the methods presented earlier. A time series model 22

y = direction of displacement x = distance traveled Presnell et al. (1998): projected normal errors Example 1: Periwinkle data 23 Presnell et al fitConsensus fit Homogeneous fit: Numerical problems

Example 1: Periwinkle data 24

y i = track orientation for pixel i x 1i = track orientation for pixel i-1 x 2i = angle for next meadow z 2i = log(distance to next meadow) x 3i = angle for next canopy gap z 3i = log(distance to next canopy gap) Example 2: Dancose (2011) digitized bison track data 25 K=218 trails for 5600 pixels Model considered

estim. s.e.(R) s.e.(FI) 1.06 beta S 0.07 beta S beta S beta NS Bison track data: homegenous model 26 Model The tracks are “biased” towards target meadows (TM) and canopy gaps. When approaching a meadow, the bisons zoom in. The weight of the TM angle is  log(D)/1.06  D

estim Homo estim Consense(FI) beta beta beta beta Bison track data: consensus model 27 Model The two sets of estimates are similar and lead to the same conclusion.

Discussion 28 Multivariate angular regression applies beyond animal movement: Meteo: ensemble prediction of wind direction Experimental psychology: real and perceived orientation of features Geophysics: direction of earthquake ground movement and direction of steepest descent Thank you!

References 29  Dancose, K., D. Fortin, and X. L. Guo Mechanisms of functional connectivity: the case of free-ranging bison in a forest landscape. Ecological Applications 21:  Downs, T. D. and Mardia, K. V. (2002) Circular regression. Biometrika,89,  Fisher, N.I., Lee A.J. (1992). Regression models for angular responses. Biometrics, 48,  Fortin & al (2005) Wolves influence elk movements: behavior shapes a trophic cascade in Yellowstone National Park, Ecology, 86(5), 2005, pp. 1320–1330  Jammalamadaka, S. R. and SenGupta, A. (2001) Topics in Circular Statistics. World Scientific: Singapour  Mardia, K.V. and Jupp, P.E. (1999) Directional Statistics,John Wiley, New York  Presnell, B., Morrison, S.P., and Littell, R.C. (1998). Projected multivariate linear models for directional data. JASA. 93(443):  Rivest, L.-P. (1997). A decentred predictor for circular-circular regression. Biometrika, 84,