Accuracy Assessment. 2 Because it is not practical to test every pixel in the classification image, a representative sample of reference points in the.

Slides:



Advertisements
Similar presentations
Lecture 7 Forestry 3218 Forest Mensuration II Lecture 7 Forest Inventories Avery and Burkhart Chapter 9.
Advertisements

© 2011 Pearson Education, Inc
Major Operations of Digital Image Processing (DIP) Image Quality Assessment Radiometric Correction Geometric Correction Image Classification Introduction.
Accuracy Assessment of Thematic Maps
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
Evaluation.
VALIDATION OF REMOTE SENSING CLASSIFICATIONS: a case of Balans classification Markus Törmä.
Evaluation.
Accuracy Assessment Chapter 14. Significance Accuracy of information is surprisingly difficult to address We may define accuracy, in a working sense,
Lecture 14: Classification Thursday 18 February 2010 Reading: Ch – 7.19 Last lecture: Spectral Mixture Analysis.
AA_Vis_1. Using the software “MultiSpec,” students follow protocols to prepare a “clustered” image of their GLOBE Study Site.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Experimental Evaluation
Spatial data quality February 10, 2006 Geog 458: Map Sources and Errors.
February 15, 2006 Geog 458: Map Sources and Errors
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson6-1 Lesson 6: Sampling Methods and the Central Limit Theorem.
Copyright, © Qiming Zhou GEOG1150. Cartography Quality Control and Error Assessment.
Formalizing the Concepts: Simple Random Sampling.
Global Land Cover: Approaches to Validation Alan Strahler GLC2000 Meeting JRC Ispra 3/02.
9. GIS Data Collection.
Data Acquisition Lecture 8. Data Sources  Data Transfer  Getting data from the internet and importing  Data Collection  One of the most expensive.
Sampling Designs Avery and Burkhart, Chapter 3 Source: J. Hollenbeck.
Image Classification: Redux
Chapter 9 Accuracy assessment in remotely sensed categorical information 遥感类别信息精度评估 Jingxiong ZHANG 张景雄 Chapter 9 Accuracy assessment in remotely sensed.
Methods of Validating Maps of Deforestation and Selective Logging Carlos Souza Jr. Instituto do Homem e Meio Ambiente da Amazônia—Imazon.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Co-authors: Maryam Altaf & Intikhab Ulfat
Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 5 of Data Mining by I. H. Witten, E. Frank and M. A. Hall 報告人:黃子齊
Sampling: Theory and Methods
1 Evaluating Model Performance Lantz Ch 10 Wk 5, Part 2 Right – Graphing is often used to evaluate results from different variations of an algorithm. Depending.
Classification & Vegetation Indices
Introduction to Remote Sensing. Outline What is remote sensing? The electromagnetic spectrum (EMS) The four resolutions Image Classification Incorporation.
Land Cover Classification Defining the pieces that make up the puzzle.
Resolution A sensor's various resolutions are very important characteristics. These resolution categories include: spatial spectral temporal radiometric.
Data Sources Sources, integration, quality, error, uncertainty.
Image Classification Digital Image Processing Techniques Image Restoration Image Enhancement Image Classification Image Classification.
Image Classification 영상분류
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Accuracy Assessment Having produced a map with classification is only 50% of the work, we need to quantify how good the map is. This step is called the.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Accuracy of Land Cover Products Why is it important and what does it all mean Note: The figures and tables in this presentation were derived from work.
Understanding Sampling
Chuvieco and Huete (2009): Fundamentals of Satellite Remote Sensing, Taylor and Francis Emilio Chuvieco and Alfredo Huete Fundamentals of Satellite Remote.
Remote Sensing Classification Accuracy
Error & Uncertainty: II CE / ENVE 424/524. Handling Error Methods for measuring and visualizing error and uncertainty vary for nominal/ordinal and interval/ratio.
AP Statistics Section 10.1 C Determining Necessary Sample Size.
Data Collection & Sampling Dr. Guerette. Gathering Data Three ways a researcher collects data: Three ways a researcher collects data: By asking questions.
BOT / GEOG / GEOL 4111 / Field data collection Visiting and characterizing representative sites Used for classification (training data), information.
Ka-fu Wong © 2003 Chap 6- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
RELIABILITY OF DISEASE CLASSIFICATION Nigel Paneth.
Housekeeping –5 sets of aerial photo stereo pairs on reserve at SLC under FOR 420/520 –June 1993 photography.
Part 4: Contextual Classification in Remote Sensing * There are different ways to incorporate contextual information in the classification process. All.
Accuracy Assessment Accuracy Assessment Error Matrix Sampling Method
Accuracy Assessment of Thematic Maps THEMATIC ACCURACY.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section B 1.
Risk Identification and Evaluation Chapter 2
26. Classification Accuracy Assessment
Distortions in imagery:
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
26. Classification Accuracy Assessment
Accuracy Assessment of Thematic Maps
Incorporating Ancillary Data for Classification
The GISCO task force “Remote Sensing for Statistics”
REMOTE SENSING Multispectral Image Classification
REMOTE SENSING Multispectral Image Classification
2. Stratified Random Sampling.
Housekeeping 5 sets of aerial photo stereo pairs on reserve at SLC under FOR 420/520 June 1993 photography.
Assessment of data quality
ALI assignment – see amended instructions
Presentation transcript:

Accuracy Assessment

2 Because it is not practical to test every pixel in the classification image, a representative sample of reference points in the image with known class values is used

Sources of Errors

4 Ground Reference Test pixels Locate ground reference test pixels (or polygons if the classification is based on human visual interpretation) in the study area. These sites are not used to train the classification algorithm and therefore represent unbiased reference information. It is possible to collect some ground reference test information prior to the classification, perhaps at the same time as the training data. Most often collected after the classification using a random sample to collect the appropriate number of unbiased observations per class.

5 Sample Size Sample size N to be used to assess the accuracy of a land- use classification map for the binomial probability theory: P - expected percent accuracy, q = 100 – p, E - allowable error, Z = 2 (from the standard normal deviate of 1.96 for the 95% two-sided confidence level).

6 Sample Size For a sample for which the expected accuracy is 85% at an allowable error of 5% (i.e., it is 95% accurate), the number of points necessary for reliable results is:

7 Sample Size With expected map accuracies of 85% and an acceptable error of 10%, the sample size for a map would be 51:

8 Accuracy assessment “best practices”  reference points per class is ideal  Reference points should be derived from imagery or data acquired at or near the same time as the classified image  If no other option is available, use the original image to visually evaluate the reference points (effective for generalized classification schemes)

9 Sample Design There are basically five common sampling designs used to collect ground reference test data for assessing the accuracy of a remote sensing–derived thematic map: 1. random sampling, 2. systematic sampling, 3. stratified random sampling, 4. stratified systematic unaligned sampling, and 5. cluster sampling.

Sample Methods

11 Commonly Used Methods of Generating Reference Points Random: no rules are used; created using a completely random process Stratified random: points are generated proportionate to the distribution of classes in the image Equalized random: each class has an equal number of random points

12 Commonly Used Methods of Generating Reference Points The latter two are the most widely used to make sure each class has some points With a “stratified random” sample, a minimum number of reference points in each class is usually specified (i.e., 30) For example, a 3 class image (80% forest, 10% urban, 10% water) & 30 reference points:  completely random: 30 forest, 0 urban, 1 water  stratified random: 24 forest, 3 urban, 3 water  equalized random: 10 forest, 10 urban, 10 water

13 Surveying  Uses expensive field equipment and crews  Most accurate method for large scale, small areas GPS  Collection of satellites used to fix locations on Earth’s surface Field Spectrometer High resolution imagery Primary data capture

14 Total Station

GPS “Handhelds” geographic coordinates text photos audio video Bluetooth, WiFi

16 Field Spectrometer

17 High Resolution Imagery SensorPanchromaticMultispectralBands QuickBird65 cm2.62 mblue, green, red, NIR, panchromatic WorldView-150 cmnaPanchromatic WorldView-246 cm *1.85 m * coastal, blue, green, yellow, red, red edge, NIR, NIR2, panchromatic IKONOS82 cm3.2 mblue, green, red, NIR, panchromatic OrbView-31 m4 mblue, green, red, NIR, panchromatic GeoEye-141 cm *1.65 mblue, green, red, NIR, panchromatic

High Resolution Imagery - GeoEye

19 High Resolution Imagery - GeoEye

20 High Resolution Imagery - IKONOS

21 High Resolution Imagery – IKONOS

22 High Resolution Imagery – QuickBird

23 High Resolution Imagery – QuickBird

24 High Resolution Imagery – QuickBird

25 High Resolution Imagery – WorldView1

26 High Resolution Imagery – WorldView2

27 Error Matrix A tabular result of the comparison of the pixels in a classified image to known reference information Rows and columns of the matrix contain pixel counts Permits the calculation of the overall accuracy of the classification, as well as the accuracy of each class

28 Error Matrix Example classified image reference data

29 Evaluation of Error Matrices Ground reference test information is compared pixel by pixel (or polygon by polygon when the remote sensor data are visually interpreted) with the information in the remote sensing–derived classification map. Agreement and disagreement are summarized in the cells of the error matrix. Information in the error matrix may be evaluated using simple descriptive statistics or multivariate analytical statistical techniques.

30 Types of Accuracy  Overall accuracy  Producer’s accuracy  User’s accuracy  Kappa coefficient

31 Descriptive Statistics The overall accuracy of the classification map is determined by dividing the total correct pixels (sum of the major diagonal) by the total number of pixels in the error matrix (N ). The total number of correct pixels in a category is divided by the total number of pixels of that category as derived from the reference data (i.e., the column total). This statistic indicates the probability of a reference pixel being correctly classified and is a measure of omission error. This statistic is the producer’s accuracy because the producer (the analyst) of the classification is interested in how well a certain area can be classified.

32 Producer’s Accuracy classified image reference data  The probability a reference pixel is being properly classified  measures the “omission error” (reference pixels improperly classified are being omitted from the proper class)  150/180 = 83.33%

33 Overall Accuracy The total number of correctly classified samples divided by the total number of samples measures the accuracy of the entire image without reference to the individual categories sensitive to differences in sample size biased towards classes with larger samples

34 Overall Accuracy classified image reference data  Based on how much of each image class was correctly classified  total number of correctly classified pixels divided by the total number of pixels in the matrix  ( )/500 = 360/500 = 72%

35 Descriptive Statistics If the total number of correct pixels in a category is divided by the total number of pixels that were actually classified in that category, the result is a measure of commission error. This measure, called the user’s accuracy or reliability, is the probability that a pixel classified on the map actually represents that category on the ground.

36 User’s Accuracy classified image reference data  The probability a pixel on the map represents the correct land cover category  measures the “commission error” (image pixels improperly classified are being committed to another reference class)  150/200 = 75%

37 Kappa Analysis K hat Coefficient of Agreement: Kappa analysis yields a statistic,, which is an estimate of Kappa. It is a measure of agreement or accuracy between the remote sensing–derived classification map and the reference data as indicated by a) the major diagonal, and b) the chance agreement, which is indicated by the row and column totals (referred to as marginals).

38 Kappa Coefficient Expresses the proportionate reduction in error generated by the classification in comparison with a completely random process. A value of 0.82 implies that 82% of the errors of a random classification are being avoided

39 Kappa Coefficient The Kappa coefficient is not as sensitive to differences in sample sizes between classes and is therefore considered a more reliable measure of accuracy; Kappa should always be reported

40 Kappa Coefficient A Kappa of 0.8 or above is considered a good classification; 0.4 or below is considered poor

41 Kappa Coefficient Where: r = number of rows in error matrix n ij = number of observations in row i, column j n i = total number of observations in row i n j = total number of observations in column j M = total number of observations in matrix

42 Kappa Coefficient classified image reference data

Error matrix of the classification map derived from hyperspectral data of the Mixed Waste Management Facility on the Savannah River Site. Error matrix of the classification map derived from hyperspectral data of the Mixed Waste Management Facility on the Savannah River Site. 121/121=