ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS. RESULTS

Slides:



Advertisements
Similar presentations
Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis.
Advertisements

Tests of Hypotheses Based on a Single Sample
Unido.org/statistics International workshop on industrial statistics 8 – 10 July, Beijing Non response in industrial surveys Shyam Upadhyaya.
Using Business Taxation Data as Auxiliary Variables and as Substitution Variables in the Australian Bureau of Statistics Frank Yu, Robert Clark and Gabriele.
Nonparametric estimation of non- response distribution in the Israeli Social Survey Yury Gubman Dmitri Romanov JSM 2009 Washington DC 4/8/2009.
Correlation and regression Dr. Ghada Abo-Zaid
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Econ 140 Lecture 121 Prediction and Fit Lecture 12.
Chapter 10 Simple Regression.
Chapter 12 Simple Regression
2.3. Measures of Dispersion (Variation):
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
1 Tax Base and Revenue Forecasting Tuan Minh Le Presented at Workshop on Public Finance Management in Bhutan Washington D.C., August
Editing of mixed source data for turnover statistics Jeffrey Hoogland (SN) Work Session on Statistical Data Editing (Ljubljana, Slovenia, 9-11 May 2011)
Chapter 2 – Simple Linear Regression - How. Here is a perfect scenario of what we want reality to look like for simple linear regression. Our two variables.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Use of administrative data in statistics - challenges and opportunities ICES III End Panel Discussion Montreal, June 2007 Heli Jeskanen-Sundström Statistics.
Chapter 7 Estimation: Single Population
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Ch4 Describing Relationships Between Variables. Pressure.
The new multiple-source system for Italian Structural Business Statistics based on administrative and survey data Orietta Luzi, Ugo Guarnera, Paolo Righi.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Multivariate selective editing via mixture models: first applications to Italian structural business surveys Orietta Luzi, Guarnera U., Silvestri F., Buglielli.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Luxembourg, October Workshop on Backasting SHORT-TERM BUSINESS STATISTICSwww.stat.gov.lt BACKCASTING OF TIME SERIES: PROBLEMS AND RESULTS.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
4-6 September 2013, Vilnius Quality in Statistics: Administrative Data and Official Statistics USING ADMINISTRATIVE DATA SOURCES IN OFFICIAL.
Estimating standard error using bootstrap
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 13 Simple Linear Regression
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Multiple Regression Prof. Andy Field.
Chapter 9: Inferences Involving One Population
Section 11.1 Day 2.
Chapter Six Normal Curves and Sampling Probability Distributions
Chapter 11 Simple Regression
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
CHILE: INDEX OF SALES SELECTED SERVICES
Regression composite estimation for the Finnish LFS from a practical perspective Riku Salonen.
Chapter 9 Hypothesis Testing.
CHAPTER 29: Multiple Regression*
Quality Aspects and Approaches in Business Statistics
An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude.
CONCEPTS OF ESTIMATION
Chapter 14 – Correlation and Simple Regression
ADMINISTRATIVE DATA IN ANNUAL BUSINESS STATISTICS OF LATVIA
VAT data in Business Register and Business Statistics
The European Statistical Training Programme (ESTP)
Italian situation in the following areas:
Pieter Vlag senior statistical researcher
A New Business Statistics in Finland - Quarterly Investments
Interval Estimation and Hypothesis Testing
Estimation of Employment for Cities, Towns and Rural Districts
ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS
Use of monthly tax return data
Product moment correlation
SIMPLE LINEAR REGRESSION
Global Trends: Global Financial Crisis
IMPROVEMENT OF SBS DATA QUALITY
RENATA TUMĖNIENĖ Chief specialist Enterprises statistics division
Parallel Session: BR maintenance Quality in maintenance of a BR:
The Swedish survey on turnover in the service sector
Chapter 9 Hypothesis Testing: Single Population
2.3. Measures of Dispersion (Variation):
Business Statistics For Contemporary Decision Making 9th Edition
Item 4.1: Annual labour market flows
Presentation transcript:

ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS. RESULTS ANDRIUS ČIGINAS Chief specialist Sampling methods sub-division, Methodology and Quality division ANDRIUS.CIGINAS@STAT.GOV.LT Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

STS surveys: 1) quarterly survey of service enterprises; 2) quarterly survey of construction enterprises; 3) monthly survey of industrial enterprises; 4) monthly survey of domestic trade enterprises. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Basic variable and indicator Variable of interest in each survey population – income from actual for specific survey economical activity. Indicator of interest in each population – sum of income. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Auxiliary variable The VAT declaration data can be obtained from the State Tax Inspectorate database. Very important derivative variable is selling, which is calculated using data of taxes paid by enterprises and is close to income of enterprises. Selling exists for considerable part of sample and non-sample enterprises, which are VAT payers. It can include income from all economical actions of enterprise. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Availability and use of VAT data The monthly VAT declarations data is available in 45 days since the end of accounting period. For quarterly statistical surveys the VAT data of all three months of the quarter is available at the moment of estimation of sums of income. For monthly statistical surveys only the VAT data of the last month is available. We shall use it, because compared with other auxiliary information it is more “hot” information about potential sizes of income of enterprises. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Editing of VAT data I The sampling designs and estimation domains of each survey are connected with economical activity groups and sizes groups of enterprises. Assuming the data of survey have been checked, for every “natural” group of enterprises we examine properties of empirical distributions of variable in order to detect not typical observations of selling of such group of enterprises. We choose the lower and upper bounds for values of separate distribution and use they to edit outlying observations. Main conclusion: primary VAT data can not be used for straightforward imputation. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Editing of VAT data II The main purpose is to prepare VAT data for estimators, which use auxiliary information. So, we need to edit pairs (income, selling) which income is not known (ussually it is non-sample enterprises). Assuming that such pairs in separate group of enterprises have the same distribution as the pairs (income, selling) where both components are known, we apply the method of nearest neighbour to edit selling for non-sample enterprises, like we do it in the first step of editing. Result - the sums of VAT data were calibrated. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Two ways to proceed To apply combined estimator right now. That is, we can use direct (which do not use auxiliary information) estimator of sum of income for that part of the estimation domain for which selling is not available and use regression or ratio estimator for the remaining part of estimation domain. So, we try it. To stop and complete the set of selling values. That is, to estimate “non-existing” selling. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Completion of selling We shall use the model of simple regression to complete the set of values of selling. Using variable “number of employees of the last year respective period” we apply model where we assume that parameters of such model can be different for separate economical activity group of enterprises. More, we do not include into the model of prediction that outlying (influential) pairs of model data, for which we observed considerable change of they “number of employees” during one year. Let the predicted values for all population are denoted sellingX. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Editing of VAT data III After completion of selling data we shall return to the situation after second step of VAT data editing and further examine pairs (income, selling), where both components are known, by using simple regresion model which is closely related to application of ratio or regression estimators of sum of income. Model is different for each group of enterprises. For influential or outlying pairs, where selling still considerably differs from income, using the results of selling completion model we shall input if this value differs from income less. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Estimation We shall use combination of direct and regression (or ratio) estimators to estimate the sums of income. That is the direct estimate can be always used for strata of sample design or for parts of estimation domains, where we have no selling data or we can not detect relations between income and selling. Were compared: direct estimator HT, which does not use selling; combination of direct and ratio estimators HTS1, when selling is only edited; combination of direct and regression estimators HTR1, when selling is only edited; combination of direct and ratio estimators HTS2, with estimated selling; combination of direct and regression estimators HTR2, with estimated selling. The estimates of sums and the estimates of coefficients of variation for 2005-2008 years data were compared. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Improvement of accuracy of estimates SURVEY YEAR PERIOD HT HTS1 HTR1 HTS2 HTR2 service 7 3 1,94 0,22 0,21 0,20 8 2 3,65 0,26 0,28 0,29 0,31 construction 5 4 3,81 0,74 0,77 0,73 0,75 1 2,80 0,72 0,70 0,69 Industry 6 1,12 0,39 0,37 0,40 industry 0,32 0,33 trade 10 2,82 1,01 0,94 0,91 9 0,71 0,42 0,45 0,46 0,44 Here are typical examples of estimates of coefficients of variation (in percents) for actual surveys: Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results

Basic results Estimated possible reduction of surveys samples sizes, if accuracy remains the same as for direct estimate or remains acceptable: 1) survey of service enterprises: 24-29%, 2) survey of construction enterprises: 25-34 %, 3) survey of industrial enterprises: 23-28%, 4) survey of domestic trade enterprises: 13-30%. Luxembourg, 29-30 June 2009, Workshop II Analysis of possibility to use Tax authority data in STS. Results