Social Assistance Pilots Program SA Pilots Seminar Hybrid Means Testing (HMT) Model Development Roman Semko CASE Ukraine March, 2010.

Slides:



Advertisements
Similar presentations
Saksaganskogo Str. 1a 01601, Kyiv Ukraine Тел , NATIONAL AGENCY ON PREPARATION AND ORGANIZATION.
Advertisements

ICES 3° International Conference on Educational Sciences 2014
 Econometrics and Programming approaches › Historically these approaches have been at odds, but recent advances have started to close this gap  Advantages.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 7: Demand Estimation and Forecasting.
Multiple Linear Regression Model
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Lecture 22 Multiple Regression (Sections )
1 Regression Analysis Regression used to estimate relationship between dependent variable (Y) and one or more independent variables (X). Consider the variable.
Topic 3: Regression.
Palestinian Central Bureau of Statistics (PCBS) Palestine Poverty Maps 2009 March
Lorelei Howard and Nick Wright MfD 2008
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Hypothesis Testing II The Two-Sample Case.
Chapter 7: Demand Estimation and Forecasting
Pensioners’ Employment and Their Families Well-being in Russia Oxana Sinyavskaya, Independent Institute for Social Policy Family in the Stream of Changes:
Off-farm labour participation of farmers and spouses Alessandro Corsi University of Turin.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Has Public Health Insurance for Older Children Reduced Disparities in Access to Care and Health Outcomes? Janet Currie, Sandra Decker, and Wanchuan Lin.
Social Assistance Pilots Program SA Pilots Seminar Social assistance to able to work unemployed based on their participation in public works Liudmyla Kotusenko.
Social Assistance Pilots Program SA Pilots Seminar Ways for improving housing subsidies system Liudmyla Kotusenko CASE Ukraine March 2010.
MORE TARGETING, LESS REDISTRIBUTION? AN ENQUIRY INTO THE ROLE OF POLICY DESIGN IVE MARX, LINA SALANAUSKAITE, GERLINDE VERBIST CENTRUM VOOR SOCIAAL BELEID.
Additional analysis of poverty in Scotland 2013/14 Communities Analytical Services July 2015.
Investment Analysis and Portfolio Management First Canadian Edition By Reilly, Brown, Hedges, Chang 6.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
1. 2 Introduction Purpose of the ICP UN System of National Accounts calls for comparisons of GDP across countries be using PPPs The Approach Collection.
Managerial Economics Demand Estimation & Forecasting.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Chapter 7: Demand Estimation and Forecasting McGraw-Hill/Irwin Copyright © 2011 by the McGraw-Hill Companies, Inc. All rights reserved.
Modeling and Forecasting Household and Person Level Control Input Data for Advance Travel Demand Modeling Presentation at 14 th TRB Planning Applications.
Chapter 13 Multiple Regression
Discussion of time series and panel models
1 Ministry of Finance and Ministry of Samurdhi Welfare Benefits Board The Targeting Formula: Analysis Using Pilot Data Welfare Workshop Colombo,
Improving of Household Sample Surveys Data Quality on Base of Statistical Matching Approaches Ganna Tereshchenko Institute for Demography and Social Research,
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Using microsimulation model to get things right: a wage equation for Poland Leszek Morawski, University of Warsaw Michał Myck, DIW - Berlin Anna Nicińska,
Randomized Assignment Difference-in-Differences
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
1 Measuring Poverty: Inequality Measures Charting Inequality Share of Expenditure of Poor Dispersion Ratios Lorenz Curve Gini Coefficient Theil Index Comparisons.
Effects of migration and remittances on poverty and inequality A comparison between Burkina Faso, Kenya, Nigeria, Senegal, South Africa, and Uganda Y.
Kwon, Tae Hee (SungKyunKwan University ) The Relation between Work-Family Balance and Quality of Life for Married Women in Korea.
Experience of Legal Aid Reform in Ukraine Andriy VYSHNEVSKY Director of the Centre for Legal Reform and Law Drafting under the Ministry of Justice of Ukraine.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Chapter 11 – With Woodruff Modications Demand Management and Forecasting Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
R ETURN TO COMMUTING IN S WEDEN Sergii Troshchenkov PhD student L.A.S.E.R.
Chapter 7 Demand Estimation and Forecasting
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Chapter 6: Multiple Regression – Additional Topics
Chapter 7. Classification and Prediction
Regression Analysis AGEC 784.
26134 Business Statistics Week 5 Tutorial
Multiple Regression Analysis: Further Issues
Statistics: The Z score and the normal distribution
Multiple Regression Analysis and Model Building
Chapter 6: Multiple Regression – Additional Topics
Chapter 6: Multiple Regression – Additional Topics
Multiple Regression Analysis: Further Issues
Trip Generation II Meeghat Habibian Transportation Demand Analysis
Estimation of Employment for Cities, Towns and Rural Districts
Chapter 6: Multiple Regression – Additional Topics
Multiple Regression Analysis: Further Issues
The Simple Regression Model
Chapter 7: Demand Estimation and Forecasting
Chapter 7: Demand Estimation and Forecasting
Poverty and Inequality Statistics: Development of Methodology in the Russian Federation Geneva, 5-6 May 2015.
Presentation transcript:

Social Assistance Pilots Program SA Pilots Seminar Hybrid Means Testing (HMT) Model Development Roman Semko CASE Ukraine March, 2010

1.Introduction to modeling 2.Data analysis 3.Methods for estimation 4.Simulations 5.Income from assets (agriculture) 6.Double-blind experiment results 7.Model comparisons and conclusions Content Content 2

Concept Concept 3 The World Bank has developed a methodology for income estimation which is based on regression analysis – HYBRID MEANS TESTING (HMT)The World Bank has developed a methodology for income estimation which is based on regression analysis – HYBRID MEANS TESTING (HMT) Under HTM method, eligibility to the SA program is assessed based on the households income modelingUnder HTM method, eligibility to the SA program is assessed based on the households income modeling Total income is divided into two parts: easy to verify (e.g., pension, stipend) and hard to verify (e.g., dividends, shadow wage)Total income is divided into two parts: easy to verify (e.g., pension, stipend) and hard to verify (e.g., dividends, shadow wage) The final goal is to estimate hard to verify share of the income based on a set of variables, which can be accurately measured and reflect the hard to verify incomeThe final goal is to estimate hard to verify share of the income based on a set of variables, which can be accurately measured and reflect the hard to verify income Hard to verify income is divided into income which is not generated by long-term assets (estimated by regression model) and income from assets (estimated by formulas)Hard to verify income is divided into income which is not generated by long-term assets (estimated by regression model) and income from assets (estimated by formulas)

The main goal of the model is to predict most precisely total family income The main goal of the model is to predict most precisely total family income Source: Finance Ministry of Ukraine Data and Knowledge Methods Equation which estimates applicant’s income based on the available information: Y = β 1 *X 1 + β 2 *X 2 + β 3 *X 3 + … YX 1, X 2, X 3, …  total income  hard to verify  family structure  type and sector of employment  education  region  other Model Criteria 1.Theoretical validity 2.Simplicity 3.Goodness of fit 4.Significance of explanatory variables Application and Simulation 4

Pilots datasetHBS 2008 Good model should use all available relevant information for income prediction Good model should use all available relevant information for income prediction 5 10,622 observations of households with total income > 3,000 observations of families with declared income Cannot be used separately for model estimation since total income is not available A lot of information could/should be used to guarantee acceptable level of precision Declared income (DI) is an important indicator in total income (TI) assessment MATCHING CharacteristicsTICharacteristicsTIDICharacteristicsDI

Pilots dataset HBS 2008 Observations are matched in a way to guarantee the highest similarity between them Observations are matched in a way to guarantee the highest similarity between them 6 Procedure 1. Form groups based on the follow-ing variables: type of settlement, type of assistance, household’s size, # of children, working persons, pensioners, sex of the single-heads household 2. Match each observations from HBS to the observation from pilots dataset from the same groups based on the similar characteristics: age of the head, education of the head, etc. using Euclidean distance function 3. Each observation from pilots dataset is used for matching no more than 2 times 4. Aggregate the groups if there are no good candidate for HBS observation from corresponding group from pilots dataset and match again Observation 1 Observation … Observation K 1 Group 1 Observation 1 Observation … Observation K … Group … Observation 1 Observation … Observation K N Group N Observation 1 Observation … Observation L 1 Group 1 Observation 1 Observation … Observation L … Group … Observation 1 Observation … Observation L N Group N

Total vs. declared income comparison (without SA) Data comparison: a main difference between HBS and pilots applicants occurs in their incomes, while most of other characteristics are similar Data comparison: a main difference between HBS and pilots applicants occurs in their incomes, while most of other characteristics are similar 7

For some regions average income in HBS significantly differs from the Personal Disposable Income (PDI) Statistics For some regions average income in HBS significantly differs from the Personal Disposable Income (PDI) Statistics 8 Statistics, PDI HBS Lviv Lutsk Rivne Ternopil Ivano- Frankivsk Uzhgorod Chernivtsi Vinnytsa Zhytomyr Chernigiv Sumy Kharkiv Poltava Cherkasy Kirovograd Odesa Kherson Mykolayiv Zaporizhzhya Dnipropetrovsk Lugansk Donetsk Simferopol Kyiv Khmelnytsky Differences in income without SA per capita compared to Chernivtsi region, UAH – >200 – – <100 Lviv Lutsk Rivne Ternopil Ivano- Frankivsk Uzhgorod Chernivtsi Vinnytsa Zhytomyr Chernigiv Sumy Kharkiv Poltava Cherkasy Kirovograd Odesa Kherson Mykolayiv Zaporizhzhya Dnipropetrovsk Lugansk Donetsk Simferopol Kyiv Khmelnytsky

Bayesian econometrics allows combining data with aggregated publications of regional PDI Bayesian econometrics allows combining data with aggregated publications of regional PDI 9 Standard estimation Bayesian estimation Calibration Researcher artificially determines the model coefficient(s), e.g., if regional macrodata say that income in Kyiv city is 1108 UAH higher than in AR of Crimea, than it is assumed that for Kyiv city applicants income is 1108 UAH higher than for AR of Crimea applicants, other things equal Coefficients are determined based on the collected observations using standard regression tools (classical econometrics) Combines both approaches. Estimated coefficient lies between calibrated and estimated in a standard way Does not lead to significant changes within regions but for regions across Ukraine changes are significant: average predicted income for regions has changes

Linear model is the most simple Linear model is the most simple Linear Description R2R2 Predictions 58 % (large cities – 65%, small cities – 63%, villages – 48%) Linear relation between income and family characteristics Dependent variable is under the logarithm (log-linear) Independent variables (IVs) include easy to verify income Other IVs are: number of children, of working persons, of the elderly, type and sector of employments of household heads, education level – declared income – predicted income Concept: the more income the applicant declares, the lower the additional predicted income is – a sort of a “zero sum game” 10

Nonlinear model is performing well when income differences are high – for the whole HBS sample Nonlinear model is performing well when income differences are high – for the whole HBS sample NonLinear Description R2R2 Predictions R 2 -square is not bounded in [0%,100%] region Nonlinear relation between income and family characteristics. The form of relation: cubic or quadratic – since total income sorted in ascending order increases as a polynomial of 2 nd or 3 rd order Dependent variable is under the logarithm (log-linear) Independent variables are as in the linear model – declared income – predicted income Concept: as for the linear model 11

Two-step model is effective when there is a large number of families with zero and nonzero hard to verify incomes Two-step model is effective when there is a large number of families with zero and nonzero hard to verify incomes Two-stage Description R2R2 Predictions 47 % (no division by cities) At first stage probability that family has shadow income is estimated and then linear relations between income and family characteristics with a hazard of having shadow income is used for estimation Dependent variable is under the logarithm (log-linear) and does not include salary – declared income – predicted income Concept: Stable additional income is added to the declared – “the game with constant markup”. 12

1. DEPENDENT VARIABLE Informal (shadow) salary was incorporated into the dependent variable (hard to verify income) since it is not easy to verify income Each model needs a set of adjustments in order to become fully useful Each model needs a set of adjustments in order to become fully useful 13 Adjustments 3. TIME INCONSISTENCIES In order to compare incomes across different time period, average growth rates of PDI and its elements were used for time adjustment 2. EXPLANATORY VARIABLES (EVs) Some EVs which can be used for predictions are hard to verify, e.g., number of mobile phones cannot be accurately measured 4. FAMILY HEADS The definitions of family heads are standar- dized: male co-head and female co-head are used instead of voluntary definitions Prediction does not change significantly unless dependent variable is redefined. If the dependent variable is redefined, additional predicted income becomes more stable and decreases with the increase of declared income at a lower rate Description

Average predicted income exceeds declared by 26% Average predicted income exceeds declared by 26% 14 Declared vs. Predicted income (by models)

27% families will be excluded from the SA programs 27% families will be excluded from the SA programs 15 Number of beneficiaries (hypothetical scenario)

Average assistance will drop significantly, except for low income and fuel subsidies Average assistance will drop significantly, except for low income and fuel subsidies 16 Average assistance (hypothetical scenario)

Total budget for SA expenditures will decrease by 27% Total budget for SA expenditures will decrease by 27% 17 Total expenditures on SA (hypothetical scenario)

New approachCurrent situation Income from agriculture assets is calculated based on the developed normatives Income from agriculture assets is calculated based on the developed normatives 18 Agriculture income is calculated as income per hectar Normatives are not unified across regions Income calculation per hectar and per each animal Differentiation between cities and villages Normatives are unified since they are based on the same methodology and data Calculation procedure Information, certified by the village/city council Is not applied to families with disable persons or elderly (>70) If applicant lives closer than 10 km to the city – apply city normatives Income from land is a product of land area and normatives Income from payi is calculated sepatately Income from lifestock is the product of number of livestock heads times the normative Average predicted income exceed declared by 28%

New approach Current situation Example of income calculation from agriculture Example of income calculation from agriculture 19 NORMATIVELAND AREANORMATIVEANIMALSAGROINCOME CROPSANIMALS Only farm- stead area of 0.56 hectars, located in village (Donetsk region) per hectar per month Possess one cow and 10 chickens Only related through the hayfields and pasturage Only farm- stead area of 0.56 hectars, located in village (Donetsk region) per hectar per month Possess one cow and 10 chickens (the same) for cow, and 4.48 for one chicken UAH per month UAH per month

Declared Income = 211 UAH Double-blind experiment: case study Double-blind experiment: case study 20 Family description Father: unemployed and not registered in employment center Mother: housewife Age: <18Age: <3 Commission case and home visit DENIAL Model result LESS than Eligibility threshold = 255 UAH Model prediction = 308 UAH immediate decision – risky family, need home visit GRANT? WHILE MORE than Eligibility threshold = 255 UAH DENIAL

Cases of SA denials through commission, based on home inspections Cases of SA denials through commission, based on home inspections 21 Type of assistance Declared income Predicted income Absolute difference Relative difference UAH % Low income Low income Low income Child care Child care Child care Child care Child care Child care Single mothers068 - Single mothers

Predicted income helps to select families for home inspection Predicted income helps to select families for home inspection 22 Families selected for inspection Comments Each family has a chance to be selected for home inspection The probability of selections increases as the predicted income is significantly different from declared in absolute and relative terms

Model comparison Model comparison 23 Characte- ristics MODEL Linear (no matching) NonlinearTwo-step model Linear (matching) Linear (Bayesian + matching or not) Theoretical background Weak since linear relations are rare in nature Stronger since takes into account nonlinearities Strong if selection bias is expected to be Same as linear (no matching) SimplicityVery simpleSimpleSlightly complex Simple in estimation but complex in data matching Very complex in estimation R-squareHigh- - Information as an input Only HBS HBS and pilots dataset Very effective use of information Influence on applicant High MediumHigh

Conclusions Conclusions 24 Income estimates generated by the models significantly differ from the incomes declared by the SA applicants Further empirical tests with the models are needed Initially model results should be used only as an advice rather than a criterion for granting SA benefits The models may be used as an instrument for selecting families for home inspections