The treatment of uncertainty in the results

Slides:



Advertisements
Similar presentations
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Advertisements

Sampling: Final and Initial Sample Size Determination
Estimation of Sample Size
Introduction to statistics in medicine – Part 1 Arier Lee.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Limitations of Analytical Methods l The function of the analyst is to obtain a result as near to the true value as possible by the correct application.
Formalizing the Concepts: Simple Random Sampling.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
RESEARCH A systematic quest for undiscovered truth A way of thinking
Variables, sampling, and sample size. Overview  Variables  Types of variables  Sampling  Types of samples  Why specific sampling methods are used.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Quality Control: Analysis Of Data Pawan Angra MS Division of Laboratory Systems Public Health Practice Program Office Centers for Disease Control and.
Lecture 5.  It is done to ensure the questions asked would generate the data that would answer the research questions n research objectives  The respondents.
COMPLIMENTARY TEACHING MATERIALS Farm Business Management: The Fundamentals of Good Practice Peter L. Nuthall.
Sampling: Samples are studied to make statements or decisions about the population when either the population is not available or it is too large to be.
Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.
Typical farms and hybrid approaches
GS/PPAL Research Methods and Information Systems
Short Training Course on Agricultural Cost of Production Statistics
SUR-2250 Error Theory.
Confidence Intervals Topics: Essentials Inferential Statistics
Sample design strategies
Sampling.
Overview of probability and statistics
Measurement, Quantification and Analysis
Research Methods and Statistics
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Allocation of costs in complex cropping and mixed farming systems
ESTIMATION.
Making inferences from collected data involve two possible tasks:
Sampling Why use sampling? Terms and definitions
Agricultural cost of production statistics: main concepts
Quality Assurance in the clinical laboratory
Cost of Production: Uses and Users
Modeling approaches for the allocation of costs
Introduction to estimation: 2 cases
SOCIAL NETWORK AS A VENUE OF PARTICIPATION AND SHARING AMONG TEENAGERS
Al-Imam Mohammad Ibn Saud University Large-Sample Estimation Theory
PCB 3043L - General Ecology Data Analysis.
Understanding Results
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Distribution of the Sample Means
Sampling And Sampling Methods.
SAMPLING (Zikmund, Chapter 12.
Sampling Design.
Introduction to Instrumentation Engineering
CONCEPTS OF ESTIMATION
2. Stratified Random Sampling.
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
Confidence Intervals Topics: Essentials Inferential Statistics
The European Statistical Training Programme (ESTP)
Sampling and Sample Size Calculations
Sampling Distributions
Week Three Review.
SAMPLING (Zikmund, Chapter 12).
Sampling and Power Slides by Jishnu Das.
Sampling.
What are their purposes? What kinds?
Dr. Donald K. McConnell Jr.
Sampling Sampling is choosing a sample.
Business Statistics: A First Course (3rd Edition)
Sampling Sampling is choosing a sample.
Sampling: How to Select a Few to Represent the Many
Sampling and estimation
BUSINESS MARKET RESEARCH
STA 291 Spring 2008 Lecture 13 Dustin Lueker.
Sampling Techniques Assist. Prof. Maha A. AL-Nuaimi Ph.D/ Comm. Med
Chapter 13: Item nonresponse
Business Statistics For Contemporary Decision Making 9th Edition
Presentation transcript:

The treatment of uncertainty in the results Short Training Course on Agricultural Cost of Production Statistics

1 – Uncertainty: definition and characteristics (1/2) Uncertainty is generated by the randomness of the data collection and estimation process. Uncertainty impacts the precision of the results but not the accuracy: results may be uncertain (highly dispersed around their mean) but accurate (the estimated mean is close to the “true” mean) and vice-versa. Uncertain and inaccurate Certain and inaccurate Uncertain and accurate Certain and accurate

1 – Uncertainty: definition and characteristics (2/2) Errors generating uncertainty (loss in precision) include: Sampling errors: the loss in precision due to the selection of a random sample of the population Errors due to estimation procedures such as regression or any technique which involves statistical inference. Non-random (or non-sampling) errors are systematic errors made at different stages of the data collection and compilation process: Asking the wrong questions, inappropriate sample coverage, systematic editing mistakes, etc. These errors can be minimized through an appropriate planning, management and follow-up. They are difficult/costly to correct ex-post. These errors affect accuracy, not uncertainty/precision  

2 – Why is it important to take uncertainty into account ? Important policy decisions are based on surveys: uncertainty may lead to adverse policy impacts based on badly designed policies... or no decision at all !   Surveys always lead to uncertain results, even when designed in a statistically sound way: users have to elaborate strategies that take uncertainty into account Varying degrees of uncertainty may be accepted, depending on the nature of the survey: lower uncertainty may be required for surveys touching vital topics such as public health => It is essential to provide users with information on uncertainty, be it of a quantitative or qualitative nature  

3 – Sampling errors: definition and characteristics They originate from the extrapolation of the results obtained for a randomly selected sample to the population of interest as a whole   They provide a quantitative information on the quality of the sample selection process: => high sampling errors lead to uncertain results: they will be difficult to interpret and of little use The magnitude of the sampling error depends on: The type of sample selection method: simple random, stratified, multistage, etc. The size of the sample

4 – Sampling errors: measurement Uncertainty can be appropriately measured by the variance or standard-deviation of the variable of interest: => Sampling errors are measured by the variance of the variable of interest due to the sampling procedure (sampling variance) Sampling variance formulas are known for simple sample designs, i.e. when the inclusion probability of the final sampling unit is known and when the variable of interest to estimate has a “simple” expression (total or mean, for example). => See corresponding Annex of the manual When sampling variance can be measured, point estimates of the variable of interest can be completed with confidence bands For complex sample designs (multistage, etc.) and/or complex functions to estimate (ratio of two variables for example) sampling variance can only be approximated using complex methods

5 – Other random errors The process of collecting and compiling data may include stages where estimations are made using statistical inferences, such as: The correction of non-response using regression techniques, hot or cold-deck imputation, etc. An example was given in this seminar with the use of model-based techniques for cost allocation   These statistical operations generate uncertainty in the results: in most cases, it can be measured by simulation or other techniques. These estimations generally affect part of the sample, rendering its impact on the whole sample difficult to measure.

6 – Global or combined uncertainty As seen, different sources of uncertainty exist, some sampling-related and others not   When the estimated objective variable is affected by several sources of uncertainty , the combined uncertainty can be measured: if the two sources of uncertainty (1 and 2) are uncorrelated In more complicated cases, uncertainty cannot be summed. For example when: The objective variable is a function of other variables, etc. The sources of uncertainty are correlated

7 – How to present information on uncertainty (1/2) When quantifiable, uncertainty measurements should be included in the results along with the point estimates. Most commonly, these include: Sampling standard-deviation and associated confidence bands, using the usual risk probabilities of 5% or 10% Standard-deviation due to estimation methods and associated confidence bands Combined standard-deviation, when there are several uncertainty sources Any other information on the probability distribution of the estimate

8 – How to present information on uncertainty (2/2) Even when uncertainty cannot be measured: it can be taken into account in the presentation of the results. Common options are: To present the results for sub-groups of the population of interest, such as deciles, quintiles, etc. To provide inter-decile and inter-quartile ranges The variables used for the grouping should be: The variable of interest itself: for example, if average farm income is the objective variable, average income by income deciles/quintiles can be provided Variables correlated to the variable of interest: farm income by deciles/quintiles of farm size, etc. The statistician should provide qualitative information on its possible magnitude

9 – Country example: Zambia Total Cost Quintile (ZMK/50 maize kg) Share of total maize production (%) 1 2 3 4 5 farmer mean per 50 kg bag mean 31.4% 27.1% 20.1% 12.8% 8.7% Costs of production (ZMK/50kg) -----------------------------------Mean-------------------------------- - Total cash expenditures 6,411 12,239 14,969 19,449 27,482 16,111 18,630 Expenditures plus household labor and assets (excl. land) 15,567 29,078 42,776 64,341 118,953 54,152 40,739 Land annual rental 3,364 4,835 6,633 9,152 15,102 7,818 4,720 Total Cost (incl. land cost) 18,931 33,914 49,409 73,493 134,055 61,970 45,459 Source: MACO/CSO Crop Forecast Survey 2010. Note: a) Fertilizer and seed costs include both subsidized and commercially acquired inputs. Countries (developed and developing) do not generally disseminate error/uncertainty estimates even though such estimations are done and used internally

10 – References Handbook on agricultural cost of production statistics (Draft), pp. 53 – 55 and Annex 1, Global Strategy Publications, 2012 Bell (1999), A Beginners Guide to Uncertainty Measurement, Measurement Good Practice Guide n°11 (Issue 2) Sampling Methods for Agricultural Surveys (1989), FAO Statistical Development Series 3, FAO, Rome. Accessible online at: http://www.fao.org/fileadmin/templates/ess/ess_test_folder/Publications/SDS/3_sampling_method_for_agricultural_survey.pdf