Rick Paik Schoenberg, UCLA Statistics

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Spatial point patterns and Geostatistics an introduction
Contributions of Prof. Tokuji Utsu to Statistical Seismology and Recent Developments Ogata, Yosihiko The Institute of Statistical Mathematics , Tokyo and.
Correlation and regression
Andrea Bertozzi University of California Los Angeles Thanks to contributions from Martin Short, George Mohler, Jeff Brantingham, and Erik Lewis.
Yan Y. Kagan Dept. Earth and Space Sciences, UCLA, Los Angeles, CA ,
Yan Y. Kagan Dept. Earth and Space Sciences, UCLA, Los Angeles, CA , EARTHQUAKE.
Nonstationary covariance structures II NRCSE. Drawbacks with deformation approach Oversmoothing of large areas Local deformations not local part of global.
Simulation Modeling and Analysis
Yan Y. Kagan Dept. Earth and Space Sciences, UCLA, Los Angeles, CA , GLOBAL EARTHQUAKE.
1 1.MLE 2.K-function & variants 3.Residual methods 4.Separable estimation 5.Separability tests Estimation & Inference for Point Processes.
Yan Y. Kagan Dept. Earth and Space Sciences, UCLA, Los Angeles, CA , Statistical.
Epidemic Type Earthquake Sequence (ETES) model  Seismicity rate = "background" + "aftershocks":  Magnitude distribution: uniform G.R. law with b=1 (Fig.
Earthquake predictability measurement: information score and error diagram Yan Y. Kagan Department of Earth and Space Sciences, University of California.
1 Some Current Problems in Point Process Research: 1. Prototype point processes 2. Non-simple point processes 3. Voronoi diagrams.
Nonstationary covariance structures II NRCSE. Drawbacks with deformation approach Oversmoothing of large areas Local deformations not local part of global.
Yan Y. Kagan Dept. Earth and Space Sciences, UCLA, Los Angeles, CA , Rules of the Western.
If we build an ETAS model based primarily on information from smaller earthquakes, will it work for forecasting the larger (M≥6.5) potentially damaging.
Statistics of Seismicity and Uncertainties in Earthquake Catalogs Forecasting Based on Data Assimilation Maximilian J. Werner Swiss Seismological Service.
The interevent time fingerprint of triggering for induced seismicity Mark Naylor School of GeoSciences University of Edinburgh.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
Stability and accuracy of the EM methodology In general, the EM methodology yields results which are extremely close to the parameter estimates of a direct.
Forecasting occurrences of wildfires & earthquakes using point processes with directional covariates Frederic Paik Schoenberg, UCLA Statistics Collaborators:
FULL EARTH HIGH-RESOLUTION EARTHQUAKE FORECASTS Yan Y. Kagan and David D. Jackson Department of Earth and Space Sciences, University of California Los.
Analysis of complex seismicity pattern generated by fluid diffusion and aftershock triggering Sebastian Hainzl Toni Kraft System Statsei4.
Yan Y. Kagan Dept. Earth and Space Sciences, UCLA, Los Angeles, CA ,
© by Yu Hen Hu 1 ECE533 Digital Image Processing Image Segmentation.
Goodness of fit testing for point processes with application to ETAS models, spatial clustering, and focal mechanisms (USGS) Frederic Paik Schoenberg,
1. Difficulty of point process model evaluation. 2. RELM and CSEP. 3. Numerical summaries (L-test, N-test, etc.). 4. Functional summaries (error diagrams,
Spatial Statistics in Ecology: Point Pattern Analysis Lecture Two.
Basic Time Series Analyzing variable star data for the amateur astronomer.
Ilya Zaliapin Department of Mathematics and Statistics University of Nevada, Reno IUGG General Assembly * Monday, June 29, 2015 Yehuda Ben-Zion Department.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
Ilya Zaliapin Department of Mathematics and Statistics University of Nevada, Reno USC ISC * Thursday, July 23, 2015 Yehuda Ben-Zion Department of Earth.
Relative quiescence reported before the occurrence of the largest aftershock (M5.8) with likely scenarios of precursory slips considered for the stress-shadow.
Methods for point patterns. Methods consider first-order effects (e.g., changes in mean values [intensity] over space) or second-order effects (e.g.,
Robust Regression. Regression Methods  We are going to look at three approaches to robust regression:  Regression with robust standard errors  Regression.
1 Producing Omori’s law from stochastic stress transfer and release Mark Bebbington, Massey University (joint work with Kostya Borovkov, University of.
California Earthquake Rupture Model Satisfying Accepted Scaling Laws (SCEC 2010, 1-129) David Jackson, Yan Kagan and Qi Wang Department of Earth and Space.
1 1.Definitions & examples 2.Conditional intensity & Papangelou intensity 3.Models a) Renewal processes b) Poisson processes c) Cluster models d) Inhibition.
Yan Y. Kagan Dept. Earth and Space Sciences, UCLA, Los Angeles, CA ,
GNS Science Testing by hybridization – a practical approach to testing earthquake forecasting models David Rhoades, Annemarie Christophersen & Matt Gerstenberger.
Jiancang Zhuang Inst. Statist. Math. Detecting spatial variations of earthquake clustering parameters via maximum weighted likelihood.
Abstract The space-time epidemic-type aftershock sequence (ETAS) model is a stochastic process in which seismicity is classified into background and clustering.
Why Model? Make predictions or forecasts where we don’t have data.
Manuel Gomez Rodriguez
BAE 5333 Applied Water Resources Statistics
Some tricks for estimating space-time point process models.
Global smoothed seismicity models and test results
Manuel Gomez Rodriguez
Summary of Prev. Lecture
Statistics in MSmcDESPOT
Machine Learning Basics
USGS Getty Images Frederic Paik Schoenberg, UCLA Statistics
Clustering (3) Center-based algorithms Fuzzy k-means
Model evaluation for forecasts of wildfire activity and spread
Spatial Point Pattern Analysis
"Did your model account for earthworms?" Rick Paik Schoenberg, UCLA
Diagnostics and Transformation for SLR
Spatial Point Pattern Analysis
Rick Paik Schoenberg, UCLA Statistics
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
R. Console, M. Murru, F. Catalli
Model Comparison.
Discrete Event Simulation - 4
Lecture 2 – Monte Carlo method in finance
Spatial Point Pattern Analysis
Diagnostics and Transformation for SLR
Human-centered Machine Learning
Outline Texture modeling - continued Markov Random Field models
Presentation transcript:

Rick Paik Schoenberg, UCLA Statistics A survey of goodness-of-fit tests for point process models for earthquake occurrences Rick Paik Schoenberg, UCLA Statistics Some point process models in seismology Pixel-based methods Numerical summaries Error diagrams Residuals: rescaling, thinning, superposition Comparative methods, tessellation residuals Example

N(A) = # of points in the set A. S = [0 , T] x X. Some point process models in seismology. Point process: random (s-finite) collection of points in some space, S. N(A) = # of points in the set A. S = [0 , T] x X. Simple: No two points at the same time (with probability one). Conditional intensity: l(t,x) = limDt,Dx -> 0 E{N([t, t+Dt) x Bx,Dx) | Ht} / [DtDx]. Ht = history of N for all times before t, Bx,Dx = ball around x of size Dx. * A simple point process is uniquely characterized by l(t,x). (Fishman & Snyder 1976) Poisson process: l(t,x) doesn’t depend on Ht. N(A1), N(A2), … , N(Ak) are independent for disjoint Ai, and each Poisson.

Some cluster models of clustering: Neyman-Scott process: clusters of points whose centers are formed from a stationary Poisson process. Typically each cluster consists of a fixed integer k of points which are placed uniformly and independently within a ball of radius r around each cluster’s center. Cox-Matern process: cluster sizes are random: independent and identically distributed Poisson random variables. Thomas process: cluster sizes are Poisson, and the points in each cluster are distributed independently and isotropically according to a Gaussian distribution. Hawkes (self-exciting) process: parents are formed from a stationary Poisson process, and each produces a cluster of offspring points, and each of them produces a cluster of further offspring points, etc. l(t, x) = m + ∑ g(t-ti, ||x-xi||). ti < t

Aftershock activity typically follows the modified Omori law (Utsu 1971): g(t) = K/(t+c)p.

Stationary (homogeneous) Poisson process: l(t,x) = m. Commonly used in seismology: Stationary (homogeneous) Poisson process: l(t,x) = m. Inhomogeneous Poisson process: l(t, x) = f(t, x). (deterministic) ETAS (Epidemic-Type Aftershock Sequence, Ogata 1988, 1998): l(t, x) = m(x) + ∑ g(t - ti, ||x - xi||, mi), ti < t where g(t, x, m) = K exp{am} (t+c)p (x2 + d)q

2. Pixel-based methods. Compare N(Ai) with ∫A l(t, x) dt dx, on pixels Ai. (Baddeley, Turner, Møller, Hazelton, 2005) Problems: * If pixels are large, lose power. * If pixels are small, residuals are mostly ~ 0,1. * Smoothing reveals only gross features.

(Baddeley, Turner, Møller, Hazelton, 2005)

3. Numerical summaries. a) Likelihood statistics (LR, AIC, BIC) 3. Numerical summaries. a) Likelihood statistics (LR, AIC, BIC). Log-likelihood = ∑ logl(ti,xi) - ∫ l(t,x) dt dx. b) Second-order statistics. * K-function, L-function (Ripley, 1977) * Weighted K-function (Baddeley, Møller and Waagepetersen 2002, Veen and Schoenberg 2005) * Other weighted 2nd-order statistics: R/S statistic, correlation integral, fractal dimension (Adelfio and Schoenberg, 2009)

Weighted K-function Usual K-function: K(h) ~ ∑∑i≠j I(|xi - xj| ≤ h), Weight each pair of points according to the estimated intensity at the points: Kw(h)^~ ∑∑i≠j wi wj I(|xi - xj| ≤ h), where wi = l(ti , xi)-1. (asympt. normal, under certain regularity conditions.) Lw(h) = centered version = √[Kw(h)/π] - h, for R2

Model: l(x,y;a) = a m(x,y) + (1- a)n. h (km)

3. Numerical summaries. a) Likelihood statistics (LR, AIC, BIC) 3. Numerical summaries. a) Likelihood statistics (LR, AIC, BIC). Log-likelihood = ∑ logl(ti,xi) - ∫ l(t,x) dt dx. b) Second-order statistics. * K-function, L-function (Ripley, 1977) * Weighted K-function (Baddeley, Møller and Waagepetersen 2002, Veen and Schoenberg 2005) * Other weighted 2nd-order statistics: R/S statistic, correlation integral, fractal dimension (Adelfio and Schoenberg, 2009) c) Other test statistics (mostly vs. stationary Poisson). TTT, Khamaladze (Andersen et al. 1993) Cramèr-von Mises, K-S test (Heinrich 1991) Higher moment and spectral tests (Davies 1977) Problems: -- Overly simplistic. -- Stationary Poisson not a good null hypothesis (Stark 1997)

4. Error Diagrams Plot (normalized) number of alarms vs 4. Error Diagrams Plot (normalized) number of alarms vs. (normalized) number of false negatives (failures to predict). (Molchan 1990; Molchan 1997; Zaliapin & Molchan 2004; Kagan 2009). Similar to ROC curves (Swets 1973). Problems: -- Must focus near axes. [consider relative to given model (Kagan 2009)] -- Does not suggest where model fits poorly.

Suppose N is simple. Rescale one coordinate: move each point 5. Residuals: rescaling, thinning, superposing Rescaling. (Meyer 1971; Merzbach & Nualart 1986; Nair 1990; Schoenberg 1999; Vere-Jones and Schoenberg 2004): Suppose N is simple. Rescale one coordinate: move each point {ti, xi} to {ti , ∫oxi l(ti,x) dx} [or to {∫oti l(t,xi) dt), xi }]. Then the resulting process is stationary Poisson. Problems: * Irregular boundary, plotting. * Points in transformed space hard to interpret. * For highly clustered processes: boundary effects, loss of power.

Thinning. (Westcott 1976): Suppose N is simple, stationary, & ergodic.

Thinning: Suppose inf l(ti ,xi) = b. Keep each point (ti ,xi) with probability b / l(ti ,xi) . Can repeat many times --> many stationary Poisson processes (but not quite ind.!)

Then Mk --> stationary Poisson. Superposition. (Palm 1943): Suppose N is simple & stationary. Then Mk --> stationary Poisson.

Superposition: Suppose sup l(t , x) = c. Superpose N with a simulated Poisson process of rate c - l(t , x) . As with thinning, can repeat many times to generate many (non-independent) stationary Poisson processes. Problems with thinning and superposition: Thinning: Low power. If b = inf l(ti ,xi) is small, will end up with very few points. Superposition: Low power if c = sup l(ti ,xi) is large: most of the residual points will be simulated.

-- Better: consider difference between log-likelihoods, in each pixel. 6. Comparative methods, tessellation. -- Can consider difference (between competing models) between residuals over each pixel. Problem: Hard to interpret. If difference = 3, is this because model A overestimated by 3? Or because model B underestimated by 3? Or because model A overestimated by 1 and model B underestimated by 2? Also, when aggregating over pixels, it is possible that a model will predict the correct number of earthquakes, but at the wrong locations and times. -- Better: consider difference between log-likelihoods, in each pixel. (Wong & Schoenberg 2009). Problem: pixel choice is arbitrary, and unequal # of pts per pixel…..

Problem: pixel choice is arbitrary, and unequal # of pts per pixel….. -- Alternative: use the Voronoi tessellation of the points as cells. Cell i = {All locations closer to point (xi,yi) than to any other point (xj,yj) }. Now 1 point per cell. If l is locally constant, then cell area ~ Gamma (Hinde and Miles 1980)

8. Example: using focal mechanisms in ETAS

In ETAS (Ogata 1998), l(t,x,m) = f(m)[m(x) + ∑i g(t-ti, x-xi, mi)], where f(m) is exponential, m(x) is estimated by kernel smoothing, and i.e. the spatial triggering component, in polar coordinates, has the form: g(r, q) = (r2 + d)q . Looking at inter-event distances in Southern California, as a function of the direction qi of the principal axis of the prior event, suggests: g(r, q; qi) = g1(r) g2(q - qi | r), where g1 is the tapered Pareto distribution, and g2 is the wrapped exponential.

ETAS: no use of focal mechanisms. Summary of principal direction of motion in an earthquake, as well as resulting stress changes and tension/pressure axes.

tapered Pareto / wrapped exp.  biv. normal (Ogata 1998)  Cauchy/ ellipsoidal (Kagan 1996) 

Thinned residuals: Data  tapered Pareto / wrapped exp.  Cauchy/ ellipsoidal (Kagan 1996)  biv. normal (Ogata 1998) 

Tapered pareto / wrapped exp. Cauchy / ellipsoidal