WAR, POWER-LAWS & MLE Miles D. Townes The George Washington University Draft, links, and STATA do-files available at


Abstract: Richardson (1948) discovered a power-law relationship between the number of deaths in a “fatal quarrel” and the relative frequency of such quarrels. Of late this study has attracted renewed interest from IR scholars, but often the power-law relationship is estimated by OLS regression. In this study, I demonstrate why OLS is inappropriate, using MLE and simulation techniques to arrive at correct estimates of the power-law parameters. I also extend these techniques to interpretation of the power-law result, towards a better understanding of what Richardson's Law means for International Relations and international conflict.

Power Laws Power laws are described by the distribution p(x) = C·x^(-α), where C is a normalizing constant. This relationship can also be described as log p(x) = c – α log x, with c = log C. So why not use OLS regression? Because it introduces bias: “First, the errors are hard to estimate because they are not well-described by the usual regression formulas, which are based on assumptions that do not apply in this case. Second, a fit to a power-law distribution can account for a large fraction of the variance even when the fitted data do not follow a power law... And third, the fits extracted by regression methods do not satisfy basic requirements on probability distributions, such as normalization, and hence cannot be correct.” (Clauset et al., 2007; 22)
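As a point of comparison with the log-linear form above, the following is a sketch of the normalized density and the maximum-likelihood estimator for the continuous case, as given by Clauset et al. It assumes continuous data, a lower bound x_min > 0 above which the power law holds, and α > 1; the symbols n (number of observations at or above x_min) and σ (approximate standard error) are introduced here for illustration and do not appear on the slide.

```latex
% Assumptions: continuous x, lower bound x_min > 0, exponent alpha > 1.
% Normalized power-law density (integrates to 1 over x >= x_min):
\[
  p(x) = \frac{\alpha - 1}{x_{\min}} \left( \frac{x}{x_{\min}} \right)^{-\alpha},
  \qquad x \ge x_{\min}
\]
% Maximum-likelihood estimate of alpha from observations x_1, ..., x_n:
\[
  \hat{\alpha} = 1 + n \left[ \sum_{i=1}^{n} \ln \frac{x_i}{x_{\min}} \right]^{-1}
\]
% Approximate standard error of the estimate for large n:
\[
  \sigma \approx \frac{\hat{\alpha} - 1}{\sqrt{n}}
\]
```

Unlike a regression line fitted to a log-log plot, this estimator is derived from a density that is normalized by construction, which addresses the third objection in the quotation.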

Simulation MLE and OLS estimates for simulated power-law data with α = 1.5. Adapting techniques suggested by Clauset et al. (2007), I generated a simulated dataset of 1000 observations with a known α = 1.5. The MLE and OLS estimates differ in two ways. First, OLS in fact estimates not α, but the slope of the distribution function, -(α – 1). Second, even correcting for the unit difference, the OLS estimate is still biased. Though the difference appears small, the 95% confidence interval for OLS ( , ) nonetheless excludes the true value of α = 1.5, per the simulated data. This is a critical problem for correct interpretation of the estimation results.
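The simulation described above can be sketched in Python rather than in the author's STATA do-files, which are not reproduced here. This is a minimal illustration under stated assumptions: the data are continuous with x_min = 1, sampling uses inverse-transform draws, and OLS is fitted to the log of the empirical complementary cumulative distribution (one plausible reading of "the distribution function"). The function names and the random seed are hypothetical.

```python
import numpy as np


def simulate_power_law(n, alpha, xmin, rng):
    """Draw n values from a continuous power law by inverse-transform sampling."""
    u = rng.random(n)  # uniform on [0, 1)
    return xmin * (1.0 - u) ** (-1.0 / (alpha - 1.0))


def mle_alpha(x, xmin):
    """Maximum-likelihood estimate of alpha for continuous data with x >= xmin."""
    x = np.asarray(x, dtype=float)
    return 1.0 + x.size / np.sum(np.log(x / xmin))


def ols_ccdf_slope(x):
    """Least-squares slope of log empirical CCDF regressed on log x."""
    xs = np.sort(np.asarray(x, dtype=float))
    n = xs.size
    ccdf = 1.0 - np.arange(n) / n  # empirical P(X >= x) at each sorted point
    slope, _intercept = np.polyfit(np.log(xs), np.log(ccdf), 1)
    return slope


rng = np.random.default_rng(42)  # seed chosen arbitrarily for reproducibility
x = simulate_power_law(1000, alpha=1.5, xmin=1.0, rng=rng)

alpha_mle = mle_alpha(x, xmin=1.0)
se_mle = (alpha_mle - 1.0) / np.sqrt(x.size)  # approximate standard error

slope = ols_ccdf_slope(x)   # estimates -(alpha - 1), not alpha
alpha_ols = 1.0 - slope     # correct for the unit difference

print(f"MLE alpha: {alpha_mle:.3f} (approx. SE {se_mle:.3f})")
print(f"OLS slope: {slope:.3f}, implied alpha: {alpha_ols:.3f}")
```

A single run like this shows only one draw. Judging bias would require repeating the simulation many times and comparing the average estimates with the true value.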

Richardson's Law

Correlates of War These graphs plot the year and magnitude of observed conflicts in three Correlates of War (COW) datasets, which I also combine into a single dataset: All Wars.

MLE results Comparison of ML estimates for the three COW datasets plus the combined dataset. These results suggest that the datasets reflect three distinct processes, but there is nonetheless good reason to think that these differences reflect coding artifacts from COW and not systematic differences among the types of conflict.

Next Steps
-Further verification of STATA syntax.
-Calculation of p-value for each COW dataset.
-Using power-law data to debunk “long cycles” of war.
-Using α to test:
  Are there differences in α by continent/region?
  Are there differences in α by time period?
  Are great powers more violent than lesser powers?
  Are nuclear-power interactions less violent than those among non-nuclear powers?
  Do international organizations exert a pacifying effect?

Testing the Power-Law Clauset et al. suggest five steps to test whether a distribution fits a power law:
1. Determine the best fit of the power law to the data, including both alpha and xmin.
2. Calculate the KS statistic for goodness-of-fit for step 1.
3. Simulate steps 1 and 2 for a large number of synthetic datasets with alpha and xmin the same as step 1.
4. Calculate the p-value as the fraction of KS statistics for the synthetic data whose value exceeds the KS statistic for the real data.
5. If the p-value is sufficiently small, the power-law distribution can be rejected.
Using this test for the All Wars dataset, p = .83. I cannot reject the power-law distribution. In fact, it looks like a solid fit.
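The five steps above can be sketched in Python as follows. This is a simplified illustration, not the author's STATA implementation: it treats xmin as known rather than estimating it in step 1, it uses only the observations at or above xmin, and it assumes continuous data. The function names, the number of synthetic datasets, and the example data are all hypothetical.

```python
import numpy as np


def mle_alpha(x, xmin):
    """Step 1 (simplified): ML estimate of alpha, with xmin taken as given."""
    return 1.0 + x.size / np.sum(np.log(x / xmin))


def ks_distance(x, alpha, xmin):
    """Step 2: largest gap between the empirical CDF and the fitted power-law CDF."""
    xs = np.sort(x)
    n = xs.size
    model_cdf = 1.0 - (xs / xmin) ** (1.0 - alpha)
    ecdf_hi = np.arange(1, n + 1) / n  # empirical CDF just after each point
    ecdf_lo = np.arange(0, n) / n      # empirical CDF just before each point
    return max(np.max(ecdf_hi - model_cdf), np.max(model_cdf - ecdf_lo))


def power_law_p_value(x, xmin, n_sims, rng):
    """Steps 3 and 4: p-value from refitting synthetic power-law datasets."""
    x = np.asarray(x, dtype=float)
    x = x[x >= xmin]  # simplification: discard data below xmin
    alpha_hat = mle_alpha(x, xmin)
    d_obs = ks_distance(x, alpha_hat, xmin)
    exceed = 0
    for _ in range(n_sims):
        u = rng.random(x.size)
        synth = xmin * (1.0 - u) ** (-1.0 / (alpha_hat - 1.0))
        alpha_s = mle_alpha(synth, xmin)  # refit each synthetic dataset
        if ks_distance(synth, alpha_s, xmin) > d_obs:
            exceed += 1
    return exceed / n_sims


rng = np.random.default_rng(0)  # arbitrary seed

# Example data 1: genuine power-law draws (alpha = 1.5, xmin = 1).
power_data = (1.0 - rng.random(1000)) ** (-1.0 / 0.5)
p_power = power_law_p_value(power_data, xmin=1.0, n_sims=200, rng=rng)

# Example data 2: shifted exponential draws, which are not power-law distributed.
expo_data = 1.0 + rng.exponential(1.0, size=1000)
p_expo = power_law_p_value(expo_data, xmin=1.0, n_sims=200, rng=rng)

# Step 5: a sufficiently small p-value rejects the power-law hypothesis.
print(f"p-value, power-law data: {p_power:.2f}")
print(f"p-value, exponential data: {p_expo:.2f}")
```

Refitting alpha on every synthetic dataset matters: comparing against KS statistics computed with the original estimate would overstate how well the real data fit.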