“Dirty Pool” Revisited: When Less is More Robert S. Erikson, Pablo M. Pinto, Kelly T. Rader Department of Political Science Download paper at:

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Economics 105: Statistics GH 24 due Wednesday. Hypothesis Tests on Several Regression Coefficients Consider the model (expanding on GH 22) Is “race” as.
CHAPTER 8 MULTIPLE REGRESSION ANALYSIS: THE PROBLEM OF INFERENCE
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Lecture 8 Relationships between Scale variables: Regression Analysis
Empirical Analysis Doing and interpreting empirical work.
Sociology 601 Class 10: October 1, : Small sample comparisons for two independent groups. o Difference between two small sample means o Difference.
Chapter 10 Simple Regression.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
The Role of Financial System in Economic Growth Presented By: Saumil Nihalani.
Sociology 601 Class12: October 8, 2009 The Chi-Squared Test (8.2) – expected frequencies – calculating Chi-square – finding p When (not) to use Chi-squared.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Inferences About Process Quality
Stat 217 – Day 15 Statistical Inference (Topics 17 and 18)
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
On Comparing Classifiers: Pitfalls to Avoid and Recommended Approach Published by Steven L. Salzberg Presented by Prakash Tilwani MACS 598 April 25 th.
Hypothesis Testing: Two Sample Test for Means and Proportions
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Simple Linear Regression Analysis
The Two Sample t Review significance testing Review t distribution
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Introduction to Multilevel Modeling Using SPSS
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Chapter 13: Inference in Regression
Chapter 8 Introduction to Hypothesis Testing
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Inferential Statistics 2 Maarten Buis January 11, 2006.
STA302/ week 911 Multiple Regression A multiple regression model is a model that has more than one explanatory variable in it. Some of the reasons.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Instructor Resource Chapter 5 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Why Model? Make predictions or forecasts where we don’t have data.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Chapter 20 Testing Hypothesis about proportions
Discussion of time series and panel models
Three Frameworks for Statistical Analysis. Sample Design Forest, N=6 Field, N=4 Count ant nests per quadrat.
Section 3.3: The Story of Statistical Inference Section 4.1: Testing Where a Proportion Is.
to accompany Introduction to Business Statistics
[title of your research project] [author] INST 381 Fall 2015.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Chapter 22 Comparing Two Proportions.  Comparisons between two percentages are much more common than questions about isolated percentages.  We often.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Logistic Regression and Odds Ratios Psych DeShon.
1/25 Introduction to Econometrics. 2/25 Econometrics Econometrics – „economic measurement“ „May be defined as the quantitative analysis of actual economic.
Chapter 9 Introduction to the t Statistic
Test of a Population Median. The Population Median (  ) The population median ( , P 50 ) is defined for population T as the value for which the following.
Test of a Population Median. The Population Median (  ) The population median ( , P 50 ) is defined for population T as the value for which the following.
AP Statistics Chapter 14 Section 1.
Virtual COMSATS Inferential Statistics Lecture-26
CHAPTER 26: Inference for Regression
Regression Models - Introduction
The Regression Model Suppose we wish to estimate the parameters of the following relationship: A common method is to choose parameters to minimise the.
Fixed, Random and Mixed effects
Seminar in Economics Econ. 470
Presentation transcript:

“Dirty Pool” Revisited: When Less is More Robert S. Erikson, Pablo M. Pinto, Kelly T. Rader Department of Political Science Download paper at:

Do Democracies Trade more with other Democracies? How the literature approaches this question: Convention: –Measure units as dyad-year e.g.: Sierra Leone-Ecuador 1997 –Measure democracy as democracy score for the least democratic dyad member –Relate democracy to trade levels while controlling for other stuff Main issues: –Fixed effects? –Lagged dependent variable on RHS? –Dyads as unit not an issue.

Do Democracies Trade more with other Democracies? Methodological issues discussed in 2001 symposium in IO 90,000+ dyad-years, but only up to 115 countries over 30 years We present a cautionary tale about using dyads as units of analysis Large data set with a massive N can create overconfidence in hypothesis testing

Symposium on Research Design and Methods in IR: –Editors' Introduction (pp )Editors' Introduction Peter Gourevitch, David A. Lake –Dirty Pool (pp )Dirty Pool Donald P. Green, Soo Yeon Kim, David H. Yoon –Clear and Clean: The Fixed Effects of the Liberal Peace (pp )Clear and Clean: The Fixed Effects of the Liberal Peace John R. Oneal, Bruce Russett –Throwing out the Baby with the Bath Water: A Comment on Green, Kim, and Yoon (pp )Throwing out the Baby with the Bath Water: A Comment on Green, Kim, and Yoon Nathaniel Beck, Jonathan N. Katz –Proper Nouns and Methodological Propriety: Pooling Dyads in International Relations Data (pp )Proper Nouns and Methodological Propriety: Pooling Dyads in International Relations Data Gary King

Our contribution Randomization tests on dyads to infer the correct p-values Switch to nation-years as units of analysis: proportion of trade with democracies as a function of regime Difference-in-difference analysis of change in trading partners after a regime shock Cross-sectional analysis of trade by regime type

GKY (current dollars)

GKY sample (constant dollars)

Expanded Trade Data

Randomization Tests! Simulate the state of the world if the null hypothesis were true Scramble the key independent variable to break the relationship between it and the outcome Estimate coefficients and significance tests using preferred model, knowing the “answer” is zero Repeat at least 1000 times See if the actual estimated effect is rare compared to distribution of spurious effects Sample estimate of Fisher’s exact test (1935)

Randomization Tests! Drop the country labels on the floor and scramble them. Pick them up randomly and reinsert. Albania might become “Bolivia” for example, in all dyads involving true Albania. Rescramble 1000 times. With random country labels and the equation rerun, the data present a distribution around the null outcome (presumably zero, but with complications). T Where in the distribution of observations does the observed value from the actual equation fall? That position in the cumulative distribution yields the p- value with the randomization test.

Advantages of Randomization Tests Unlike parametric tests… –Don’t rely on distributional assumptions about  ij –Don’t rely on distributional assumptions about the test statistic –Standard errors and significance tests determined empirically –Especially useful when theoretical standard errors are hard to derive Like the democratic trade models! But, do assume exchangeability of errors

Figure 1: Randomization tests results

Figure 1 (cont): Randomization tests results

Country-year Models

Figure 2: Dem. trade and regime (residuals) Residual proportion of trade with other democracies by residual regime Data: nation-year; controls: nation and year dummies. Y = X + ε std. err =.044 p =.000 N= 6459 Adj. r 2 =0.011

Figure 3: Democratic shocks Democratic trade at t n relative to democratic shock at t 0

Figure 4: Democratic shocks Residualized democratic trade at t n relative to democratic shock at t 0

Figure 5: Anti-democratic shocks Democratic trade at t n relative to anti-democratic shock at t 0

Figure 6: Anti-democratic shock Residualized democratic trade at t n relative to anti-democratic shock at t 0

Figure 7: Effect of regime shocks Change in democratic trade as a function of democratic shocks 2.6 = mean change in democratic trade per year x 6 years

Figure 8: Country analysis Residual democratic trade over mean regime score (year adjusted) Data: countries Y = 1.10 X + ε st. err =.185 p =.000 N= 154 adj. r 2 =0.183

Conclusions Dyads are inappropriate units of analysis for democratic trade hypothesis: –Randomization tests show hypothesis testing in traditional regression results are too optimistic (p-values up to 41 trillion times larger!) Randomization tests are useful for testing claims, especially when the appropriate parametric test is difficult to figure out Move to nation-year as unit of analysis –We find no regime effect Less is more : –Event and difference in differences analyses of shocks seem to confirm democratic trade hypothesis