1 Joint Research Centre (JRC) Using remote sensing for crop and land cover area estimation

Slides:



Advertisements
Similar presentations
ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.
Advertisements

Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Accuracy Assessment of Thematic Maps
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
VALIDATION OF REMOTE SENSING CLASSIFICATIONS: a case of Balans classification Markus Törmä.
Machine Learning CMPT 726 Simon Fraser University
Accuracy Assessment Chapter 14. Significance Accuracy of information is surprisingly difficult to address We may define accuracy, in a working sense,
Lecture 14: Classification Thursday 18 February 2010 Reading: Ch – 7.19 Last lecture: Spectral Mixture Analysis.
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Sampling What is it? –Drawing a conclusion about the entire population from selection of limited elements in a.
A new sampling method: stratified sampling
February 15, 2006 Geog 458: Map Sources and Errors
Formalizing the Concepts: Simple Random Sampling.
Global Land Cover: Approaches to Validation Alan Strahler GLC2000 Meeting JRC Ispra 3/02.
Chapter Outline  Populations and Sampling Frames  Types of Sampling Designs  Multistage Cluster Sampling  Probability Sampling in Review.
Module 2.1 Monitoring activity data for forests using remote sensing REDD+ training materials by GOFC-GOLD, Wageningen University, World Bank FCPF 1 Module.
1 JRC – Ispra Area frames for land cover estimation: Improving the European LUCAS survey Javier Gallego Jacques Delincé.
Accuracy Assessment. 2 Because it is not practical to test every pixel in the classification image, a representative sample of reference points in the.
A Statistically Valid Method for Using FIA Plots to Guide Spectral Class Rejection in Producing Stratification Maps Mike Hoppus & Andrew Lister USDA-Forest.
Chapter 9 Accuracy assessment in remotely sensed categorical information 遥感类别信息精度评估 Jingxiong ZHANG 张景雄 Chapter 9 Accuracy assessment in remotely sensed.
Ten State Mid-Atlantic Cropland Data Layer Project Rick Mueller Program Manager USDA/National Agricultural Statistics Service Remote Sensing Across the.
Transforming a sample design for taking into account new statistical needs, new information or new technological instruments for data collection Elisabetta.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
earthobs.nr.no Land cover classification of cloud- and snow-contaminated multi-temporal high-resolution satellite images Arnt-Børre Salberg and.
Sampling: Theory and Methods
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
Classification & Vegetation Indices
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
11 july 2008 European Conference on Quality COMPARISON OF VALIDATION PROCEDURES TO DETECT MEASUREMENT ERRORS IN AN AREA FRAME SAMPLE SURVEY Laura Martino,
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Selection of Elements Population Element the individual subject on which the measurement is taken; e.g., the population.
1 Ratio estimation under SRS Assume Absence of nonsampling error SRS of size n from a pop of size N Ratio estimation is alternative to under SRS, uses.
Image Classification 영상분류
Remote Sensing for agricultural statistics Main uses and cost-effectiveness in developing countries Insert own member logo here Pietro Gennari, Food and.
Accuracy Assessment Having produced a map with classification is only 50% of the work, we need to quantify how good the map is. This step is called the.
Benchmarking the efficiency of coarse resolution satellite images for area estimation. J. Gallego, M. El Aydam – MARS AGRI4CAST.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Accuracy of Land Cover Products Why is it important and what does it all mean Note: The figures and tables in this presentation were derived from work.
Digital Image Processing
Area estimation in the MARS project. A summary history J. Gallego,– MARS AGRI4CAST.
Chuvieco and Huete (2009): Fundamentals of Satellite Remote Sensing, Taylor and Francis Emilio Chuvieco and Alfredo Huete Fundamentals of Satellite Remote.
Remote Sensing Classification Accuracy
Nairobi 1-2 October Some Approaches to Agricultural Statistics NOTES 1. PLACE, DATE AND EVENT NAME 1.1. Access the slide-set.
Chapter 3: Maximum-Likelihood Parameter Estimation l Introduction l Maximum-Likelihood Estimation l Multivariate Case: unknown , known  l Univariate.
NTTS 2011 Brussels February 22, Joint Research Centre (JRC) Sampling Very High Resolution Images for Area Estimation
BOT / GEOG / GEOL 4111 / Field data collection Visiting and characterizing representative sites Used for classification (training data), information.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
2011 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) Aihua Li Yanchen Bo
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
JRC Place on dd Month YYYY – Event Name 1 Land cover change Objective: estimate land cover changes, in particular between agriculture and non-agriculture.
LUCAS 2006 J. Gallego, MARS AGRI4CAST. Sampling scheme Adaptation of the Italian AGRIT First phase: Systematic sampling of unclustered points (single.
Housekeeping –5 sets of aerial photo stereo pairs on reserve at SLC under FOR 420/520 –June 1993 photography.
INRA Rabat, October 14, Crop area estimates in the EU. The use of area frame surveys and remote sensing NOTES 1.
Accuracy Assessment of Thematic Maps THEMATIC ACCURACY.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section B 1.
26. Classification Accuracy Assessment
Chapter 3: Maximum-Likelihood Parameter Estimation
Making inferences from collected data involve two possible tasks:
Accuracy Assessment of Thematic Maps
Graduate School of Business Leadership
Meeting-6 SAMPLING DESIGN
Incorporating Ancillary Data for Classification
The GISCO task force “Remote Sensing for Statistics”
REMOTE SENSING Multispectral Image Classification
REMOTE SENSING Multispectral Image Classification
Housekeeping 5 sets of aerial photo stereo pairs on reserve at SLC under FOR 420/520 June 1993 photography.
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Parametric Methods Berlin Chen, 2005 References:
Institute for Protection and Security of the Citizen
Presentation transcript:

1 Joint Research Centre (JRC) Using remote sensing for crop and land cover area estimation

2 Classified images as main information for area estimation Area is sometimes estimated by counting pixels in a classified image Sources of area estimation error: Mixed pixels (boundary). Error depends on Resolution, geometry (% of mixed pixels) Misclassification of pure pixels. Error depends Radiometric separability of different classes suitable resolution: most pixels should be pure

3 Direct area estimation by photo-interpretation (polygon area measurement) Example: CORINE Land Cover By photo-interpretation of TM images Nearly homogeneous rules in most European Countries Nomenclature of 44 classes Minimum polygon size: 25 ha Some mixed classes such as agro-forestry, complex agricultural patterns, etc. In the early times of CLC (90’s), it was often presented as a source of direct land cover area estimators –But further analysis has shown that this is only acceptable if there is no alternative

4 CORINE Land Cover 2000 Partial view (rasterised 100m)

5 confusion matrix with “pure LUCAS points” (excluding points too close to boundaries)

6

7 Land cover change: Example of straight estimation Consider CORINE Land Cover (CLC90) and CLC2000 Direct overlay gives an “estimate”of ~20% of change in land cover type Remaking the photo-interpretation of both layers gives <5% change in land cover type. Probably closer to reality No sampling error, but Bias due to Photo-interpretation errors, Scale effect.

8 Land cover change the example of “total arable land” Comparing changes derived from “CLC change ” and from agricultural statistics. Very different figures (in kha)

9 Pixel counting as area estimator Errors from misclassification of pure pixels No sampling error if complete image Possible large bias Λ = confusion matrix for the population

10 Pixel counting as area estimator Rule of thumb: do not use pixel counting if your expected commission/omission error is significantly larger than the targeted accuracy. Example: if you want an accuracy of ± 5% (semi- confidence interval?), do not use pixel counting unless you are confident that your classification accuracy is >>90%. Gaussian distribution does not protect against bias or subjectivity

11 Pixel counting as area estimator Example with maximum likelihood supervised classification (discriminant analysis) Region of ~ 100,000 km 2 Area of cereals ~ 2 Mha Accuracy of classification ~ 70% Tuning the parameters (a priori prob.), we can easily get an area of pixels classified as cereals between 1.5 and 2.5 Mha. If we think the area is 2.3 Mha, we will tune the classification to get that figure. It may be right, but we are using RS as a “sexy dress” to make our belief more attractive. There may be a tendency to underestimate changes if we use historical statistical data as a reference

12 Pixel counting as area estimator (2) We can tune the parameters to balance commission and omission errors on a test sample This gives a good protection against bias if the sample is statistically valid (random, systematic, etc…) Random sample ≠ hap-hazard set We are implicitly using a calibration estimator. We better use a calibration estimator explicitely.

13 Correcting bias with a confusion matrix Bias  Commission error – omission error If we have a confusion matrix, we can correct the bias, Cannot we? Ex: Photo-interpretation made for the EU LUCAS survey Raw confusion matrix (simplified nomenclature) without taking into account the weights derived from the sampling plan: Let us look at the class “forest and wood” Commission < Omission  We should increase the estimates by ca. 12% Right?

14 Bias and confusion matrix But in LUCAS the sampling rate of the non-agricultural strata is 5 times lower  the corresponding rows of the confusion matrix should be multiplied by 5  Weighted confusion matrix Commission > Omission  We should reduce the estimates by ca. 13%

15 Bias and confusion matrix The classification bias can be corrected if we have a confusion matrix But the confusion matrix has to be properly weighted Otherwise the bias correction can be completely wrong Weights = inverse of sampling probability  We need to know the sampling probability  Reference (field) data must be collected according to a sampling plan Hap-hazard data collection for bias correction is risky There are better ways than (omission error – commission error) to correct bias Calibration estimator if the field data are collected with a non-clustered sampling plan Regression estimator if the field data are collected with a clustered sampling plan

16 Combining ground survey and satellite images to improve the accuracy of estimates Main approaches: calibration and regression estimators. Common features: combine accurate information on a sample (ground survey) with less accurate information in the whole area, or most of it. Unbiasedness is provided by the ground survey. The more accurate the ground survey, the higher the added value of RS. Variant if ground data are too difficult/expensive (e.g: forest in very large areas): Accurate information from high or medium resolution on a sample of images Less accurate information from coarse resolution (AVHRR, VEGETATION, MODIS, MERIS)

17 RS to improve ground survey estimates Calibration estimators with confusion matrices A : Confusion matrix on a sample of test pixels Λ g : ground truth totals Λ c : pixels classified by class Λ : Confusion matrix on the population Λ g : ground truth totals (unknown to be estimated) Λ c : pixels classified by class Error matrices:

18 Calibration estimators with confusion matrices Straightforward identities: Estimators: Relative efficiency of the same order of regression estimator.

19 Satellite images to improve ground survey estimates Regression estimator Y: Ground data (% of wheat) X: Classified satellite image (% od pixels classified as wheat) Difference estimator if slope b pre-defined: less efficient, but more robust. Ratio estimator if a = 0

20 Regression estimator

21 Regression estimator An efficiency = 2 means that : n segments + regression ~ 2n segments (only ground survey) Criterion to assess cost-efficiency Relative efficiency ( coarse approximation) Relative efficiency of the same order of calibration estimator. Regression is not very suitable for point sampling: only 4 points in the regression plot: (0,0), (0,1), (1,0), (1,1) better approximation:

22 Regression estimator is not always reliable n = 39 but unreliable regression (maximum Belsley’s β = 4.7)  use tools to detect influential observations

23 Regression Estimator n = 24 but reliable regression (maximum Belsley’s β = 0.8)

24 Regression estimator Caution!!!! X must be the same variable in the sample and outside the sample Use all pixels (including mixed pixels) to compute X on the sample Do not use the same sample for training pixels and for regression, or at least use a classification with a similar behaviour for training and test pixels (few parameters to estimate) If this is not respected, regression estimator can degrade the ground survey estimates

25 Practical obstacles for operational use of remote sensing In the 80’s-early 90’s: cost efficiency was insufficient Cost of images Cost/time of image processing. In the late 90’s RS area estimation became nearly cost-efficient with Landsat TM, but…. no continuity of the mission. Timeliness: 1-2 months after ground survey estimates Autonomy of official organisations. Currently new image types need to be better assessed (e.g: DMCII)

26 Remote sensing over-marketing We have the solution. Which is your problem? ?

27 Small area estimators Small area

28 Small area estimation: a simplified example Proportion of wheat Large region L Small area S image X L X S Sample n segments 0 segments mean Y L ? Estimator Well… Actually it is a bit more complex. See e.g. Battese G. E., Harter R.M., Fuller W.A., 1988, An error-components model for prediction of county crop areas using survey and satellite data. Journal of the American Statistical Association, 83, pp

29 Small area estimators Small area estimators use The sample inside the area (possibly n=0) A covariable inside the area (classified satellite image) The link between variable and covariable outside the area. Small area estimators are model- dependent

30 Remote sensing and area estimation Improving an area sampling frame with satellite images Stratification: strata defined by an indicative land cover pattern Two-phase sampling: large random or systematic pre-sample and subsampling with unequal probability. Stratification and two-phase (double) sampling efficiency is generally moderate (often around 1.5) but the operation is not too expensive and is valid for several years.

31 Efficiency of stratification V nostr Variance that we would have got with the same sample size without stratification. But we do not have such a sample…. For stratified random sampling: How much did we gain with the stratification? Do not use:

32 Substituting ground data with remote sensing data When a proper ground survey is not possible Principles remain the same, with –A sample of HR-VHR images instead of the ground data (<10 m?) –A wall-to-wall (complete as much as possible) cover of medium resolution images (TM for example) Differences: –The sampling plan (size of PSUs) has to take into account the size of HR/VHR images. –The main non-sampling error (commission/omission errors) needs to be assessed:  Some ground observations, approximately balanced, are better than no ground data at all  If no ground data at all can be collected, assess commission/omission errors in an area with similar landscape