1 Spatial processes and statistical modelling Peter Green University of Bristol, UK BCCS GM&CSS 2008/09 Lecture 8.

Slides:



Advertisements
Similar presentations
Mathematical Preliminaries
Advertisements

Fundamentals of Probability
Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
STATISTICS Joint and Conditional Distributions
Detection of Hydrological Changes – Nonparametric Approaches
STATISTICS Univariate Distributions
STATISTICS Random Variables and Distribution Functions
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
UNITED NATIONS Shipment Details Report – January 2006.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
1. 2 Unknown Backprojection usually produce a blurred version of the image.
Chapter 7 Sampling and Sampling Distributions
Solve Multi-step Equations
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
Discrete time Markov Chain
Break Time Remaining 10:00.
Discrete Math Recurrence Relations 1.
Detection Chia-Hsin Cheng. Wireless Access Tech. Lab. CCU Wireless Access Tech. Lab. 2 Outlines Detection Theory Simple Binary Hypothesis Tests Bayes.
PP Test Review Sections 6-1 to 6-6
EU market situation for eggs and poultry Management Committee 20 October 2011.
5-1 Chapter 5 Theory & Problems of Probability & Statistics Murray R. Spiegel Sampling Theory.
What I am after from gR2002 Peter Green, University of Bristol, UK.
15. Oktober Oktober Oktober 2012.
Chapter 6 The Mathematics of Diversification
Copyright © 2013, 2009, 2006 Pearson Education, Inc.
Constant, Linear and Non-Linear Constant, Linear and Non-Linear
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Discrete Mathematics 3. MATRICES, RELATIONS, AND FUNCTIONS Lecture 5 Dr.-Ing. Erwin Sitompul
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
: 3 00.
Statistical Inferences Based on Two Samples
Analyzing Genes and Genomes
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
12 System of Linear Equations Case Study
Clock will move after 1 minute
Intracellular Compartments and Transport
PSSA Preparation.
IP, IST, José Bioucas, Probability The mathematical language to quantify uncertainty  Observation mechanism:  Priors:  Parameters Role in inverse.
Experimental Design and Analysis of Variance
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Simple Linear Regression Analysis
Multiple Regression and Model Building
Energy Generation in Mitochondria and Chlorplasts
Select a time to count down from the clock above
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
9. Two Functions of Two Random Variables
1 Undirected Graphical Models Graphical Models – Carlos Guestrin Carnegie Mellon University October 29 th, 2008 Readings: K&F: 4.1, 4.2, 4.3, 4.4,
Commonly Used Distributions
Math Review with Matlab:
Probabilistic Reasoning over Time
1 Spatial processes and statistical modelling Peter Green University of Bristol, UK IMS/ISBA, San Juan, 24 July 2003.
Presentation transcript:

1 Spatial processes and statistical modelling Peter Green University of Bristol, UK BCCS GM&CSS 2008/09 Lecture 8

2 Continuous space Discrete space –lattice –irregular - general graphs –areally aggregated Point processes –other object processes Spatial indexing

3 Space vs. time apparently slight difference profound implications for mathematical formulation and computational tractability

4 Requirements of particular application domains agriculture (design) ecology (sparse point pattern, poor data?) environmetrics (space/time) climatology (huge physical models) epidemiology (multiple indexing) image analysis (huge size)

5 Key themes conditional independence –graphical/hierarchical modelling aggregation –analysing dependence between differently indexed data –opportunities and obstacles literal credibility of models Bayes/non-Bayes distinction blurred

6 Why build spatial dependence into a model? No more reason to suppose independence in spatially-indexed data than in a time- series However, substantive basis for form of spatial dependent sometimes slight - very often space is a surrogate for missing covariates that are correlated with location

7 Discretely indexed data

8 Modelling spatial dependence in discretely-indexed fields Direct Indirect –Hidden Markov models –Hierarchical models

9 Hierarchical models, using DAGs Variables at several levels - allows modelling of complex systems, borrowing strength, etc.

10 Modelling with undirected graphs Directed acyclic graphs are a natural representation of the way we usually specify a statistical model - directionally: disease symptom past future parameters data …… whether or not causality is understood. But sometimes (e.g. spatial models) there is no natural direction

11 Conditional independence In model specification, spatial context often rules out directional dependence (that would have been acceptable in time series context) X0X0 X1X1 X2X2 X3X3 X4X4

12 Conditional independence In model specification, spatial context often rules out directional dependence X 20 X 21 X 22 X 23 X 24 X 00 X 01 X 02 X 03 X 04 X 10 X 11 X 12 X 13 X 14

13 Conditional independence In model specification, spatial context often rules out directional dependence X 20 X 21 X 22 X 23 X 24 X 00 X 01 X 02 X 03 X 04 X 10 X 11 X 12 X 13 X 14

14 Directed acyclic graph in general: for example: ab c d p(a,b,c,d)=p(a)p(b)p(c|a,b)p(d|c) In the RHS, any distributions are legal, and uniquely define joint distribution

15 Undirected (CI) graph X 20 X 21 X 22 X 00 X 01 X 02 X 10 X 11 X 12 Absence of edge denotes conditional independence given all other variables But now there are non-trivial constraints on conditional distributions Regular lattice, irregular graph, areal data...

16 Undirected (CI) graph X 20 X 21 X 22 X 00 X 01 X 02 X 10 X 11 X 12 then and so The Hammersley-Clifford theorem says essentially that the converse is also true - the only sure way to get a valid joint distribution is to use ( ) ( ) clique Suppose we assume

17 Hammersley-Clifford X 20 X 21 X 22 X 00 X 01 X 02 X 10 X 11 X 12 A positive distribution p(X) is a Markov random field if and only if it is a Gibbs distribution - Sum over cliques C (complete subgraphs)

18 Partition function X 20 X 21 X 22 X 00 X 01 X 02 X 10 X 11 X 12 Almost always, the constant of proportionality in is not available in tractable form: an obstacle to likelihood or Bayesian inference about parameters in the potential functions Physicists call the partition function

19 Markov properties for undirected graphs The situation is a bit more complicated than it is for DAGs. There are 4 kinds of Markovness: P – pairwise –Non-adjacent pairs of variables are conditionally independent given the rest L – local –Conditional only on adjacent variables (neighbours), each variable is independent of all others

20 G – global –Any two subsets of variables separated by a third are conditionally independent given the values of the third subset. F – factorisation –the joint distribution factorises as a product of functions of cliques In general these are different, but F G L P always. For a positive distribution, they are all the same.

21 Gaussian Markov random fields: spatial autoregression is a multivariate Gaussian distribution, and If V C (X C ) is - ij (x i -x j ) 2 /2 for C={i,j} and 0 otherwise, then is the univariate Gaussian distribution where

22 Inverse of (co)variance matrix: dependent case ABCD non-zero A B C D ABCDABCD Gaussian random fields

23 Non-Gaussian Markov random fields Pairwise interaction random fields with less smooth realisations obtained by replacing squared differences by a term with smaller tails, e.g.

24 Discrete-valued Markov random fields Besag (1974) introduced various cases of for discrete variables, e.g. auto-logistic (binary variables), auto-Poisson (local conditionals are Poisson), auto-binomial, etc.

25 Auto-logistic model is Bernoulli( p i ) with - a very useful model for dependent binary variables ( NB various parameterisations ) (X i = 0 or 1)

26 Statistical mechanics models The classic Ising model (for ferromagnetism) is the symmetric autologistic model on a square lattice in 2-D or 3-D. The Potts model is the generalisation to more than 2 colours and of course you can usefully un-symmetrise this.

27 Auto-Poisson model is Poisson For integrability, ij must be 0, so this only models negative dependence: very limited use.

28 Hierarchical models and hidden Markov processes

29 Chain graphs If both directed and undirected edges, but no directed loops: can rearrange to form global DAG with undirected edges within blocks

30 Chain graphs If both directed and undirected edges, but no directed loops: can rearrange to form global DAG with undirected edges within blocks Hammersley-Clifford within blocks

31 Hidden Markov random fields We have a lot of freedom modelling spatially-dependent continuously- distributed random fields on regular or irregular graphs But very little freedom with discretely distributed variables use hidden random fields, continuous or discrete compatible with introducing covariates, etc.

32 Hidden Markov models z0z0 z1z1 z2z2 z3z3 z4z4 y1y1 y2y2 y3y3 y4y4 e.g. Hidden Markov chain observed hidden

33 Hidden Markov random fields Unobserved dependent field Observed conditionally- independent discrete field (a chain graph)

34 Spatial epidemiology applications independently, for each region i. Options : CAR, CAR+white noise (BYM, 1989) Direct modelling of,e.g. SAR Mixture/allocation/partition models: Covariates, e.g.: cases expected cases relative risk

35 Spatial epidemiology applications Spatial contiguity is usually somewhat idealised

36 relative risk parameters Richardson & Green ( JASA, 2002) used a hidden Markov random field model for disease mapping observed incidence expected incidence hidden MRF Spatial epidemiology applications

37 Chain graph for disease mapping e Y z k based on Potts model

38 Larynx cancer in females in France SMRs