Regression problems for magnitude

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Lesson 10: Linear Regression and Correlation
The Simple Regression Model
Welcome to PHYS 225a Lab Introduction, class rules, error analysis Julia Velkovska.
Lecture 6 Bootstraps Maximum Likelihood Methods. Boostrapping A way to generate empirical probability distributions Very handy for making estimates of.
Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
Kriging.
Chapter 27 Inferences for Regression This is just for one sample We want to talk about the relation between waist size and %body fat for the complete population.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
STA305 week 31 Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Objectives (BPS chapter 24)
Variance and covariance M contains the mean Sums of squares General additive models.
The Simple Linear Regression Model: Specification and Estimation
Chapter 10 Simple Regression.
9. SIMPLE LINEAR REGESSION AND CORRELATION
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
4. TESTS 5. CONCLUSIONS 2. METHOD We base the study on the method proposed by Zúñiga and Wyss (1995) which was originally intended for finding suitable.
Introduction to Probability and Statistics Linear Regression and Correlation.
Estimation 8.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Chapter 7 Probability and Samples: The Distribution of Sample Means
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Standard error of estimate & Confidence interval.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Inferences for Regression
Chapter 6 & 7 Linear Regression & Correlation
Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.
R. Kass/W03P416/Lecture 7 1 Lecture 7 Some Advanced Topics using Propagation of Errors and Least Squares Fitting Error on the mean (review from Lecture.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Center for Sustainable Transportation Infrastructure Harmonization of Friction Measuring Devices Using Robust Regression Methods Samer Katicha 09/09/2013.
Managerial Economics Demand Estimation & Forecasting.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Section 10.1 Confidence Intervals
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 21 The Simple Regression Model.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
Linear Regression Linear Regression. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Purpose Understand Linear Regression. Use R functions.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
R. Kass/Sp07P416/Lecture 71 More on Least Squares Fit (LSQF) In Lec 5, we discussed how we can fit our data points to a linear function (straight line)
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Chapter 4 Basic Estimation Techniques
Part 5 - Chapter
(5) Notes on the Least Squares Estimate
Basic Estimation Techniques
The Simple Linear Regression Model: Specification and Estimation
Inference for Regression
How to handle missing data values
Basic Estimation Techniques
Linear Regression.
CONCEPTS OF ESTIMATION
The Simple Linear Regression Model: Specification and Estimation
Chapter 7: The Normality Assumption and Inference with OLS
Presentation transcript:

Regression problems for magnitude Silvia Castellaro1, Peter Bormann2, Francesco Mulargia1 and Yan Y. Kagan3 1 Sett. Geofisica, Università Bologna (Italy) 2 GFZ, Potsdam (Germany) 3 UCLA, Los Angeles IUGG (Perugia), 11 July 2007

The need for a unified magnitude A large variety of earthquake size indicator exists (ms, mb, md, mL, M0, Me, Mw ,etc.) Each one with a different meaning

Ignoring the fact that a single indicator of size may be inadequate in seismic hazard estimates, The state of the art is to use on account of its better definition in seismological terms Mw

The magnitude conversion problem In converting magnitude, It is commonly assumed that the relation Mx – My is linear (this is justified as long as none of them shows a much stronger saturation than the other) Least-Squares Linear Regression is so popular that it is mostly applied without checking whether its basic requirements are satisfied

Linear Least-Squares Regression BASIC ASSUMPTIONS The uncertainty in the independent variable is at least one order of magnitude smaller than the one on the dependent variable, Both data and uncertainties are normally distributed, Residuals are constant.

Fail to satisfy the basic assumptions may: Lead to wrong magnitude conversions, Have severe consequences on the b-value of the Gutenberg-Richter magnitude-frequency distribution, which is the basis for probabilistic seismic hazard estimates

Which regression relation? Standard Linear Regression Other Regressions

Here we focus on the performance of Standard Regression Orthogonal Regression = s2y / s2x

s2x  0, s2y > 0 s2x > 0, s2y  0 s2x > 0, s2y > 0 SR Standard least-squares Regression ISR Inverse Standard least-squares Regression GOR General Orthogonal Regression OR Orthogonal Regression. Special case of GOR with s2x ~ s2y h =1 s2x  0, s2y > 0 Y  b X + a s2x > 0, s2y  0 Y  b X + a s2x > 0, s2y > 0 Y = b X + a

It has already been demonstrated that GOR produces better results than SR/ISR (Castellaro et al., GJI, 2006) On normally, log-normally and exponentially distributed variables On normally, log-normally and exponentially dstributed errors On different amount of errors

GOR SR ISR Example: (X, Y) exponentially distributed, Exponentially distributed errors added to X and Y, True slope (b) = 1.

However the point with GOR is That the error ratio between the y and the x variables (h = s2y / s2x) needs to be known. In practice h is mostly ignored since the seismological data centers do not publish standard deviations for their average event magnitudes.

Ratios h between variances Absolute values of errors sx, sy To define the performance of the different procedures we run enough simulations to cover the ranges of Slopes b Ratios h between variances Absolute values of errors sx, sy which may be enocuntered when converting magnitudes

Parameters used in the simulations In order to produce realistic simulations, parameters are inferred from the study of CENC (Chinese Earthquake Network Center), GRSN (German Regional Seismic Network) and Italian official catalogues 0.5 < btrue < 2 0.05 < sx, sy < 0.50 0.25 < h < 0.3

Generation of the datasets 1) 103 couples of magnitudes (Mx, My) with 3.5 < Mx, My < 9.5 2) Sampled from exponential distribution (Utsu, 1999, Kagan, 2002, 2002b, 2005, Zailiapin et al. 2005) 3) From (Mx, My) to (mx, my) by addition of errors sampled from Gaussian distributions with deviations sx and sy Steps 1) to 3) are repeated 103 times and 103 SR, ISR, GOR and OR regressions are performed to obtain the average bSR, bISR, bGOR, bOR and their deviations.

Results GOR is always the best fitting procedure However, if h is unknown…

Attention should be paid to the mb-MS relation which is not linear (due to saturation of the short-period mb for strong earthquakes), has an error ratio of about 2 and usually a rather large absolute scatter in the mb data

If nothing is known about the variable variances, compare your case to the whole set of figures posted in www.terraemoti.net to get some examples for chosing the best regression procedure

A typical Italian dataset Data no. ms Mw 109 mL 121 mb 204 s ms 0.28 mL 0.22 mb 0.37 Mw 0.18 Italian earthquakes in 1981-1996. s computed for each earthquake each time it was recorded by at least 3 stations.

Mw-ms

Mw-mL

Mw-mb

The magnitude conversion problem may appear a solved problem while it is not! For example, some authors in the BSSA (2007) state “this work likely represent the final stage of calculating local magnitude relation ML-Md by regression analysis…” but they forgot to consider the variable errors at all! It follows that…

BSSA, (2007) While the most realistic result should be…

The use of SR without any discussion on the applicability of the model is unfortunately still too common

The problem of the magnitude conversion can of course be approached also through other techniques IN EUROPE Panza et al., 2003; Cavallini and Rebez, 1996; Kaverina et al., 1996; Gutdeutsch et al., 2002; Grünthal and Wahlström, 2003; Stromeyer et al., 2004: Apply the OR for magnitude-intensity relations but h = 1 Gutdeutsch et al., 2002 finds the mL-ms relation through OR (h = 1) for the Kàrnik (1996) catalouge of central-southern Europe Grünthal and Wahlström, 2003 applied the c2 regression to central Europe Gutdeutsch and Keiser are studing the c2 regressions for magnitudes

The software to run SR, ISR, OR and GOR is available on www.terraemoti.net