Generalized Spatial Dirichlet Process Models Jason A. Duan Michele Guindani Alan E. Gelfand March, 2006.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Bayes rule, priors and maximum a posteriori
STATISTICS Joint and Conditional Distributions
Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Bayesian Spatial and Functional Data Analysis Using Gaussian Processes Alan E. Gelfand Duke University (with contributions from J. Duan, D. Dunson, M.
Dynamic Spatial Mixture Modelling and its Application in Cell Tracking - Work in Progress - Chunlin Ji & Mike West Department of Statistical Sciences,
Dynamic spatial mixture modelling and its application in Bayesian tracking for cell fluorescent microscopic imaging Chunlin Ji & Mike West Department of.
Chapter 2 Multivariate Distributions Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
Gibbs Sampling Methods for Stick-Breaking priors Hemant Ishwaran and Lancelot F. James 2001 Presented by Yuting Qi ECE Dept., Duke Univ. 03/03/06.
Hierarchical Dirichlet Processes
Bayesian dynamic modeling of latent trait distributions Duke University Machine Learning Group Presented by Kai Ni Jan. 25, 2007 Paper by David B. Dunson,
Pattern Recognition and Machine Learning
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
Dynamic Bayesian Networks (DBNs)
Sampling Distributions (§ )
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 10 Statistical Modelling Martin Russell.
Presenting: Assaf Tzabari
Space-time Modelling Using Differential Equations Alan E. Gelfand, ISDS, Duke University (with J. Duan and G. Puggioni)
Lecture II-2: Probability Review
5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.
5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.
Distribution Function properties. Density Function – We define the derivative of the distribution function F X (x) as the probability density function.
Sampling Distributions  A statistic is random in value … it changes from sample to sample.  The probability distribution of a statistic is called a sampling.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Jointly Distributed Random Variables
Fast Max–Margin Matrix Factorization with Data Augmentation Minjie Xu, Jun Zhu & Bo Zhang Tsinghua University.
Bayesian Inference Ekaterina Lomakina TNU seminar: Bayesian inference 1 March 2013.
Hierarchical Dirichelet Processes Y. W. Tech, M. I. Jordan, M. J. Beal & D. M. Blei NIPS 2004 Presented by Yuting Qi ECE Dept., Duke Univ. 08/26/05 Sharing.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
Bayesian Hierarchical Clustering Paper by K. Heller and Z. Ghahramani ICML 2005 Presented by HAO-WEI, YEH.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
Ch 2. Probability Distributions (1/2) Pattern Recognition and Machine Learning, C. M. Bishop, Summarized by Yung-Kyun Noh and Joo-kyung Kim Biointelligence.
The Dirichlet Labeling Process for Functional Data Analysis XuanLong Nguyen & Alan E. Gelfand Duke University Machine Learning Group Presented by Lu Ren.
Stick-Breaking Constructions
CS Statistical Machine learning Lecture 24
1 Clustering in Generalized Linear Mixed Model Using Dirichlet Process Mixtures Ya Xue Xuejun Liao April 1, 2005.
STATISTICS Joint and Conditional Distributions Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Lecture 2: Statistical learning primer for biologists
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
by Ryan P. Adams, Iain Murray, and David J.C. MacKay (ICML 2009)
Bayesian Density Regression Author: David B. Dunson and Natesh Pillai Presenter: Ya Xue April 28, 2006.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)
Gaussian Process Networks Nir Friedman and Iftach Nachman UAI-2K.
The Nested Dirichlet Process Duke University Machine Learning Group Presented by Kai Ni Nov. 10, 2006 Paper by Abel Rodriguez, David B. Dunson, and Alan.
Chapter 5 Joint Probability Distributions and Random Samples  Jointly Distributed Random Variables.2 - Expected Values, Covariance, and Correlation.3.
Crash course in probability theory and statistics – part 2 Machine Learning, Wed Apr 16, 2008.
Probability Theory and Parameter Estimation I
Bayesian Generalized Product Partition Model
STAT 311 REVIEW (Quick & Dirty)
Graduate School of Information Sciences, Tohoku University
Non-Parametric Models
The distribution function F(x)
Omiros Papaspiliopoulos and Gareth O. Roberts
Special Topics In Scientific Computing
Dirichlet process tutorial
Kernel Stick-Breaking Process
OVERVIEW OF BAYESIAN INFERENCE: PART 1
Generalized Spatial Dirichlet Process Models
Statistical NLP: Lecture 4
Volume 6, Issue 5, Pages e13 (May 2018)
Pattern Recognition and Machine Learning
Chapter 8: Estimating With Confidence
Sampling Distributions (§ )
Moments of Random Variables
Probabilistic Surrogate Models
Presentation transcript:

Generalized Spatial Dirichlet Process Models Jason A. Duan Michele Guindani Alan E. Gelfand March, 2006

Generalized Dirichlet Process Models – 2 Agenda Overview of Bayesian Spatial Statistics Models. Spatial Dirichlet Processes (SDP) Generalized Spatial Dirichlet Processes (GSDP) –Multivariate Stick-Breaking Random Probability Measure –Spatially Varying Probability Model –Simulation Based Model Fitting –Illustrative Example –Dynamic Model with GSDP Conclusions

Generalized Dirichlet Process Models – 3 Bayesian Spatial Model Suppose we have point-referenced spatial data such that

Generalized Dirichlet Process Models – 4 Bayesian Spatial Model We can build a hierarchical model for Customary choice for priors.

Generalized Dirichlet Process Models – 5 Criticism of Gaussian Model Two problems with this model: Gaussian assumption: –Multi-modality. –Heavy tail behavior. Stationary assumption: –Inappropriate for most environmental issues: dispersion of pollutant, meteorology, etc. Nonparametric models add flexibility.

Generalized Dirichlet Process Models – 6 Spatial Dirichlet Process (SDP) Sethuraman’s construction for the SDP: This is a stick-breaking construction for the random weights.

Generalized Dirichlet Process Models – 7 Motivations for GSDP SDP implies the same surface selection for for all locations. GSDP has a flexible surface selection. –Fewer random surfaces needed. –Better for small data sets. –Applications: brain imaging, species distribution, house prices, etc.

Generalized Dirichlet Process Models – 8 Definition of GSDP How to construct a consistent distribution? Multivariate stick-breaking.

Generalized Dirichlet Process Models – 9 Association Structure of GSDP Moments conditional on G Y(s) has heterogeneous variance and is nonstationary.

Generalized Dirichlet Process Models – 10 Association Structure of GSDP

Generalized Dirichlet Process Models – 11 Consistency and Continuity Requirements  The first requirement is met by the consistency of the multivariate stick-breaking.  The second requirement is established by the continuity property of the GSDP.

Generalized Dirichlet Process Models – 12 Multivariate Stick-Breaking

Generalized Dirichlet Process Models – 13 Multivariate Stick-Breaking Multivariate extension: We exemplify with a bivariate case Similarly, we define two events at one location s

Generalized Dirichlet Process Models – 14 Multivariate Stick-Breaking

Generalized Dirichlet Process Models – 15 Multivariate Stick-Breaking

Generalized Dirichlet Process Models – 16 Spatially Varying Weights We createthrough latent Gaussian processes: Define How to construct?

Generalized Dirichlet Process Models – 17 Spatially Varying Weights Define joint weight This definition of joint weights satisfies the consistency and continuity requirements.

Generalized Dirichlet Process Models – 18 Hierarchical Spatial Model Our model has the following hierarchical structure:

Generalized Dirichlet Process Models – 19 Hierarchical Spatial Model

Generalized Dirichlet Process Models – 20 GSDP Mixture Model Note the first two hierarchies of our model define a GSDP mixture of multivariate normal distributions: where This construction alleviates the undesirable almost discreteness property of the GSDP. The GSDP mixture model resonates the typical “random effect + nugget” spatial model.

Generalized Dirichlet Process Models – 21 Moments of GSDP Mixture where

Generalized Dirichlet Process Models – 22 Model Fitting The marginal model at each location is a DP model. The joint model has NO Polya Urn Scheme. In practice, we have to use a finite approximation. We only need

Generalized Dirichlet Process Models – 23 Model Fitting Evaluation of weightsis difficult. We directly sample the auxiliary variable

Generalized Dirichlet Process Models – 24 Model Fitting The likelihood function is The posterior conditional distribution of is truncated normal.

Generalized Dirichlet Process Models – 25 Full Conditionals

Generalized Dirichlet Process Models – 26 Full Conditionals

Generalized Dirichlet Process Models – 27 A Simulation Example 50 locations, 40 replicates

Generalized Dirichlet Process Models – 28 A Simulation Example has a bimodal normal mixture distribution.

Generalized Dirichlet Process Models – 29 Simulation Results: Marginal Density Thick Curve: GSDP Dashed Curve: SDP Dotted Curve: True Distribution

Generalized Dirichlet Process Models – 30 Simulation Results: Joint Density Very close locations: 26 and 50. Predictive density: GSDPTrue DensityPredictive density: SDP

Generalized Dirichlet Process Models – 31 Simulation Results: Joint Density Slightly Distant locations: 33 and 50. Predictive density: GSDPTrue DensityPredictive density: SDP

Generalized Dirichlet Process Models – 32 Simulation Results: Joint Density Very Distant locations: 49 and 50. Predictive density: GSDPTrue DensityPredictive density: SDP

Generalized Dirichlet Process Models – 33 Results: Common Surface Selection

Generalized Dirichlet Process Models – 34 Dynamic GSDP Model We can model it by embeding GSDP in a dynamic linear model: In practice repeated measurements are made in consecutive time periods: has temporal correlation.

Generalized Dirichlet Process Models – 35 Dynamic GSDP: Simulation 54 locations, 41 periods

Generalized Dirichlet Process Models – 36 Dynamic GSDP: Simulation = 0.7  We fit the Dynamic GSDP to 50 locations and 40 periods, leaving the last period and 4 locations out to validate prediction.

Generalized Dirichlet Process Models – 37 Dynamic GSDP: Prediction

Generalized Dirichlet Process Models – 38 Dynamic GSDP: Prediction Predictive density: T+1True Density: T+1 Very close locations: 23 and 50.

Generalized Dirichlet Process Models – 39 Dynamic GSDP: Prediction Predictive density: T+1True Density: T+1 Very distant locations: 49 and 50.

Generalized Dirichlet Process Models – 40 Conclusion GSDP is a more flexible generalization of SDP Methodology Development –Number of surfaces for finite approximation –Discrete and multivariate data GSDP –Other constructions of random weights Future Applications of GSDP –Species distribution –Brain Imaging –Spatial demand function –House prices