Xavier Sala-i-Martin Columbia University June 2008.

Slides:

Advertisements

Similar presentations

World Inequality and Globalization by Bob Sutcliffe Presented by Meg Spearman April 13, 2007 PUAF 699I Professor Milanovic Presented by Meg Spearman April.

Advertisements

Sampling: Final and Initial Sample Size Determination

Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.

Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.

Poverty, Inequality, and the World Distribution of Income By Xavier Sala-i-Martin.

Poverty, Inequality, and the World Distribution of Income By Xavier Sala-i-Martin.

Objectives (BPS chapter 24)

World Distribution of Household Wealth James Davies, Susanna Sandström, Anthony Shorrocks and Edward Wolff World Institute for Development Economics Research.

Xavier Sala-i-Martin Columbia University June 2009.

Poverty, Inequality, and Development

Sample size computations Petter Mostad

The Simple Regression Model

Topic 2: Statistical Concepts and Market Returns

Evaluating Hypotheses

The World Income Distribution of Income: Falling Poverty and…Convergence, Period Sala-i-Martin (2006)

The Basics of Regression continued

2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.

International Workshop on Industrial Statistics Dalian, China June 2010 Shyam Upadhyaya UNIDO Benchmarking of monthly/quarterly.

Scot Exec Course Nov/Dec 04 Ambitious title? Confidence intervals, design effects and significance tests for surveys. How to calculate sample numbers when.

Measuring Inequality A practical workshop On theory and technique San Jose, Costa Rica August 4 -5, 2004.

Correlation & Regression

Review of normal distribution. Exercise Solution.

Inference for regression - Simple linear regression

Poverty, Inequality, and the World Distribution of Income By Xavier Sala-i-Martin.

Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.

Figure 14.1 Income levels, growth rates and population, 1980–2010 Data source: World Development Indicators online; GDP per capita in constant 2000 US.

Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.

Montecarlo Simulation LAB NOV ECON Montecarlo Simulations Monte Carlo simulation is a method of analysis based on artificially recreating.

PARAMETRIC STATISTICAL INFERENCE

1 G Lect 10a G Lecture 10a Revisited Example: Okazaki’s inferences from a survey Inferences on correlation Correlation: Power and effect.

ECON Poverty and Inequality. Measuring poverty To measure poverty, we first need to decide on a poverty line, such that those below it are considered.

Confidence Interval Proportions.

Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.

Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.

The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.

Inequality The “Haves” and the “Have Nots”. Course Themes Inequality – Crime Corporate Crime – Health Issues – War and Conflict – Race / Ethnicity – Gender.

Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.

Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.

Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.

Statistics PSY302 Quiz One Spring A _____ places an individual into one of several groups or categories. (p. 4) a. normal curve b. spread c.

1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.

Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.

CATCH UP AND EMERGING DIVERGENCES: Can it Reduce Inequality? Deepak Nayyar Institute of Social Studies The Hague 8th October 2015.

Poverty, Inequality, and the World Distribution of Income By Xavier Sala-i-Martin.

Chapter 8. Process and Measurement System Capability Analysis

From the population to the sample The sampling distribution FETP India.

Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.

Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.

ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.

1 Measuring Poverty: Inequality Measures Charting Inequality Share of Expenditure of Poor Dispersion Ratios Lorenz Curve Gini Coefficient Theil Index Comparisons.

Statistical Inference: Poverty Indices and Poverty Decompositions Michael Lokshin DECRG-PO The World Bank.

ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.

The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.

Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.

Estimating standard error using bootstrap

Economic growth, debt and inequality

The simple linear regression model and parameter estimation

Xavier Sala-i-Martin Columbia University June 2009

Global summary of the AIDS epidemic, 2008

Global summary of the AIDS epidemic, 2008

CONCEPTS OF ESTIMATION

World Distribution of Household Wealth

Soil-transmitted helminth infections: updating the global picture

Western & Central Europe

Statistics PSY302 Review Quiz One Spring 2017

Poverty Maps for Sri Lanka

Children (<15 years) estimated to be living with HIV as of end 2005

Regional HIV and AIDS statistics and features, end of 2004

How Confident Are You?.

Statistical inference for the slope and intercept in SLR

Presentation transcript:

Xavier Sala-i-Martin Columbia University June 2008

Goal Estimate WDI Estimate Poverty Rates and Counts Estimate Income Inequality across the world’s citizens

Data GDP Per capita (PPP-Adjusted). We usually use these data as the “mean” of each country/year distribution of income (for example, when we estimate growth regressions) Note: I decompose China and India into Rural and Urban Use local surveys to get relative incomes of rural and urban Apply the ratio to PWT GDP and estimate per capita income in Rural and Urban and treat them as separate data points (as if they were different “countries”) Using GDP Per Capita we know…

GDP Per Capita Since 1970

Annual Growth Rate of World Per Capita GDP

β-Non-Convergence

σ -Divergence (191 countries)

Histogram Income Per Capita (countries)

Adding Population Weights

Back

Population-Weighted β-convergence ( )

We can use Survey Data Problem Not available for every year Not available for every country Survey means do not coincide with NA means But NA Numbers do not show Personal Situation: Need Individual Income Distribution

Surveys not available every year Can Interpolate Income Shares (they are slow moving animals) Regression Near-Observation Cubic Interpolation Others

Missing Countries Can approximate using neighboring countries

Method: Step 1: Interpolate Break up our sample of countries into regions(World Bank region definitions). Interpolate the quintile shares for country-years with no data, according to the following scheme, and in the following order: Group I – countries with several years of distribution data We calculate quintile shares of years with no income distribution data that are WITHIN the range of the set of years with data by cubic spline interpolation of the quintile share time series for the country. We calculate quintile shares of years with no data that are OUTSIDE this range by assuming that the share of each quintile rises each year after the data time series ends by beta/2^i, where i is the number of years after the series ends, and beta is the coefficient of the slope of the OLS regression of the data time series on a constant and on the year variable. This extrapolation adjustment ensures that 1) the trend in the evolution of each quintile share is maintained for the first few years after data ends, and 2) the shares eventually attain their all-time average values, which is the best extrapolation that we could make of them for years far outside the range of our sample. Group II – countries with only one year of distribution data. We keep the single year of data, and impute the quintile shares for other years to have the same deviations from this year as does the average quintile share time series taken over all Group I countries in the given region, relative to the year for which we have data for the given country. Thus, we assume that the country’s inequality dynamics are the same as those of its region, but we use the single data point to determine the level of the country’s income distribution. Group III – countries with no distribution data. We impute the average quintile share time series taken over all Group I countries in the given region.

Step 2: Find the σ of the lognormal distribution using least squares

Step 3: Compute the resulting normal distributions, and the poverty and inequality statistics

Step 4 (to generate confidence intervals): Generate a new data set of quintiles Having obtained our point estimates, we obtain our standard errors by reproducing our original set of income distribution data by drawing samples of the sample size given in the country information sheets for the WIDER database from each estimated lognormal distribution corresponding to a country-year with data, calculating the sample quintile shares for each of these samples, discarding the sample

Step 5: Repeat steps 1 through 4 using the original values of σ and μ to generate samples in step 4 Repeat the steps 1 through 4 to generate a new set of poverty and inequality measures for each country-year and the world as a whole over the 34 years. We repeat the procedure N (300) times. Note that we do not use our estimates to generate income shares for country-years with no data, but we obtain the data by the procedure described above in order to keep the data-generating process identical to the one we used to obtain point estimates. Note also that in all iterations, we generate our samples from the lognormals with parameters given by the point estimates we obtain from the true, rather than synthetic data.

Step 6: Find the mean and the standard deviation of poverty and inequality measures Note that we have as many “observations” of the poverty and inequality measures as we have iterations of step 5. For this paper we used N=300. We can now estimate the mean and standard deviation of these “observations. If our assumption about the nature of the sampling in the surveys as roughly i.i.d., our assumption that the country-year distributions are lognormal, and our assumption that the interpolation provides reasonable estimates of quintile shares for country-years with no data are all correct, the standard deviation of the estimates for the N iterations should converge to the population standard deviation of the (complicated) estimator that we use to obtain our point estimates.

Results

Back

Poverty Rates

Rates or Headcounts? Veil of Ignorance: Would you Prefer your children to live in country A or B? (A) people and poor (poverty rate = 50%) (B) people and poor (poverty rate =33%) If you prefer (A), try country (C) (C) people and poor.

Poverty Counts

Gini and Atkinson Index (coef=1)

Sen Index (=Income*(1-gini))

Atkinson Welfare Level

MLD and Theil

MLD Decomposition (t=total, w=within, and b=between country inequality)

Theil Decomposition (t=total, w=within, and b=between country inequality)

Regional Analysis

Sub Saharan Africa

East Asia

South Asia

Latin America

Middle East and North Africa

Eastern Europe

Former Soviet Union

Counts (all regions, $1/day)

Sensitivity of Functional form: Poverty Rates ($1/day) with Kernel, Normal, Gamma, Adjusted Normal, Weibull distributions

Sensitivity of Functional form: Gini ($1/day) with Kernel, Normal, Gamma, Weibull distributions

Sensitivity of GDP Source: Poverty Rates ($1/day) with PWT, WB, and Maddison

Sensitivity of Source of GDP: Gini with PWT, WB, and Maddison

Sensitivity of Interpolation Method: Poverty Rates 1$/day with Nearest, Linear, Cubic and Baseline

Sensitivity of Interpolation Method: Gini with Nearest, Linear, Cubic and Baseline

Preliminary Results on Confidence Intervals with Lognormal: Gini

Preliminary Results on Confidence Intervals with Lognormal: MLD