A methodology for translating positional error into measures of attribute error, and combining the two error sources Yohay Carmel 1, Curtis Flather 2 and.

Slides:



Advertisements
Similar presentations
By Venkata Sai Pulluri ( ) Narendra Muppavarapu ( )
Advertisements

© McGraw-Hill Higher Education. All rights reserved. Chapter 3 Reliability and Objectivity.
Ensemble Methods An ensemble method constructs a set of base classifiers from the training data Ensemble or Classifier Combination Predict class label.
Spatial monitoring of late-successional forest habitat over large regions with nearest-neighbor imputation Janet Ohmann 1, Matt Gregory 2, Heather Roberts.
GIS Error and Uncertainty Longley et al., chs. 6 (and 15) Sources: Berry online text, Dawn Wright.
Desheng Liu, Maggi Kelly and Peng Gong Dept. of Environmental Science, Policy & Management University of California, Berkeley May 18, 2005 Classifying.
1 Walking on a Graph with a Magnifying Glass Stratified Sampling via Weighted Random Walks Maciej Kurant Minas Gjoka, Carter T. Butts, Athina Markopoulou.
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
 What is it? Vagueness Measurement errors Boundary identification  Why is it important to quantify? Critical applications of GIS Military, Emergency.
VALIDATION OF REMOTE SENSING CLASSIFICATIONS: a case of Balans classification Markus Törmä.
Supervised classification performance (prediction) assessment Dr. Huiru Zheng Dr. Franscisco Azuaje School of Computing and Mathematics Faculty of Engineering.
Ensemble Learning: An Introduction
Geog 458: Map Sources and Errors Uncertainty January 23, 2006.
Long-term changes in Mediterranean vegetation in Israel and California Yohay Carmel Faculty of Agricultural Engineering The Technion -- Israel Institute.
Topic 3: Regression.
February 15, 2006 Geog 458: Map Sources and Errors
Copyright, © Qiming Zhou GEOG1150. Cartography Quality Control and Error Assessment.
Global Land Cover: Approaches to Validation Alan Strahler GLC2000 Meeting JRC Ispra 3/02.
Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Accuracy Assessment. 2 Because it is not practical to test every pixel in the classification image, a representative sample of reference points in the.
Area Objects and Spatial Autocorrelation Chapter 7 Geographic Information Analysis O’Sullivan and Unwin.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
earthobs.nr.no Land cover classification of cloud- and snow-contaminated multi-temporal high-resolution satellite images Arnt-Børre Salberg and.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Economic Cooperation Organization Training Course on “Drought and Desertification” Alanya Facilities, Antalya, TURKEY presented by Ertan TURGU from Turkish.
FAPBED Checkpoint Presentation: Feature Identification Danilo Scepanovic Josh Kirshtein Mentor: Ameet Jain.
SIMULATION USING CRYSTAL BALL. WHAT CRYSTAL BALL DOES? Crystal ball extends the forecasting capabilities of spreadsheet model and provide the information.
Indiana GIS Conference, March 7-8, URBAN GROWTH MODELING USING MULTI-TEMPORAL IMAGES AND CELLULAR AUTOMATA – A CASE STUDY OF INDIANAPOLIS SHARAF.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Multiple Collinearity, Serial Correlation,
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
Accuracy Assessment Having produced a map with classification is only 50% of the work, we need to quantify how good the map is. This step is called the.
1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.
Vegetation Mapping An Interagency Approach The California Department of Forestry and Fire Protection and the USDA Forest Service Mark Rosenberg: Research.
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.
Extent and Mask Extent of original data Extent of analysis area Mask – areas of interest Remember all rasters are rectangles.
Multiple Discriminant Analysis
Gap-filling and Fault-detection for the life under your feet dataset.
Change Detection The process of identifying differences in the state of an object or phenomenon by observing it at different times. Change detection applications:
Remotely sensed land cover heterogeneity
Remote Sensing Classification Accuracy
H. Lexie Yang1, Dr. Melba M. Crawford2
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Bayesian Travel Time Reliability
NATIONAL CONFERENCE ON STUDENT ASSESSMENT JUNE 22, 2011 ORLANDO, FL.
Mapping Spatial Distribution of Land Cover Classification Errors Maria João Pereira, Amílcar Soares CERENA – Centre for Natural Resources and Environment.
Environmental Modeling Validating GIS Models. 1. A Habitat Model Issues: ► Mapping Florida Scrub Jay habitat in the Kennedy Space Center in the Kennedy.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
A Framework and Methods for Characterizing Uncertainty in Geologic Maps Donald A. Keefer Illinois State Geological Survey.
Two Approaches to Estimation of Classification Accuracy Rate Under Item Response Theory Quinn N. Lathrop and Ying Cheng Assistant Professor Ph.D., University.
U.S. Department of the Interior U.S. Geological Survey Chandra Giri Ying Zhong 7/15/2015 Cropland Extent Mapping in South America.
Puulajeittainen estimointi ja ei-parametriset menetelmät Multi-scale Geospatial Analysis of Forest Ecosystems Tahko Petteri Packalén Faculty.
Potential impacts of map error on land cover change detection Nick Cuba Clark University 2/25/12 1.
US Croplands Richard Massey Dr Teki Sankey. Objectives 1.Classify annual cropland extent, Rainfed-Irrigated, and crop types for the US at 250m resolution.
26. Classification Accuracy Assessment
HydroEurope 2017 Week 2: Flood Map Accuracy & Resilience Estimation
CHARACTERIZING SPATIAL DATA UNCERTAINTY IN GIS
Figure 1. Spatial distribution of pinyon-juniper and ponderosa pine forests is shown for the southwestern United States. Red dots indicate location of.
The GISCO task force “Remote Sensing for Statistics”
Sequential Pattern Discovery under a Markov Assumption
REMOTE SENSING Multispectral Image Classification
Assessment of data quality
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Computer Vision Chapter 4
Somi Jacob and Christian Bach
Volume 54, Issue 6, Pages (June 2007)
Outlines Introduction & Objectives Methodology & Workflow
Presentation transcript:

A methodology for translating positional error into measures of attribute error, and combining the two error sources Yohay Carmel 1, Curtis Flather 2 and Denis Dean 3 1 The Thechnion, Haifa, Israel 2 USDA, Forest Service 3 Colorado State University

Part 1: bridging the gap between positional error and classification error

classification error -- difference in pixel class between the map and a reference

Positional error (misregistration, location error) Is the gap between the true location of an item and its location on the map / image

Positional error may translate to thematic error positional error largely affects overall thematic error (often more than classification error) (Townshend et al 1992, Dai and Khorram 1998)

Positional error: RMSE = 2.51 m classification error: Accuracy Matrix: Goal 1: find a common denominator for both error types Goal 2: combine the two error types to get an overall estimate of error (in the context of temporal change) THE PROBABILITY THAT AN OBSERVED TRANSITION IS CORRECT

positional error affects thematic error

Expressing positional error in terms of thematic error Shift = 1, 0 Shift = 15, 7 Shift = 2, 3

Expressing positional error in terms of thematic error

Error model: step 1 (and step 2) of 5 RMSE Positional Error Classification Error

positional accuracy matrix A LOC classification accuracy matrix A CLASS Combined A BOTH A LOC 1, 1 *A CLASS 2, 1 / n +1 A LOC 2, 1 *A CLASS 2, 2 / n +2 A LOC 3, 1 *A CLASS 2, 3 / n +3 A BOTH 2,1 = + + Classified error model: step 3 (of 5) Combining the two accuracy matrices

Classified Reference Error model: step 3 Combined Error Matrix

Error model: step 4 Calculating the combined PCC, and the Combined user accuracy, p(C) Classified Reference PCC = 0.70 THE PROBABILITY THAT AN OBSERVED STATE IS CORRECT

Error model: step 5 Calculating multi-temporal indices One such index is The probability that an observed transition is correct The context of this model is temporal change. The goal is to provide indices for the reliability of an observed change

Example: vegetation changes in Hastings Nature Reserve, California

Example: vegetation changes in Hastings, California RMSE 1939 = 3.53 m 1995 = 2.51 m User accuracy for: Grass in 1939 = 0.92 Trees in 1995 = 0.91 positional accuracy Classification accuracy C1C1 C4C4

The probability that an observed transition is correct p(C 1 C 2 …C n ) = p(C 1 ) * p(C 2 ) * … * p(C n ) C1C1 C2C2 C3C3 C4C

Error model: step 5 p(C 1 C 2 …C n ) = p(C 1 ) * p(C 2 ) * … * p(C n ) C1C1 C2C2 C3C3 C4C4 The probability that an observed transition is correct: This probability may be calculated as the product of the respective user-accuracy value for the respective year and class

Example: vegetation changes in Hastings, California GT 0.53 p(GRASS 1939 TREE 1995 ) = p(G 1939 G 1956 G 1971 T 1995 ) = GGGT

Transition type Nature of transition Proportion in 1939 Probability of being correct, given : positional error Classif. error Combined error GGGG Grassland does not change CGGGChaparral burnt in 1955 fire All 69 transitions involving forest Averaged across the entire study area Indices of accuracy of multi-temporal datasets

(1)errors in each time step are independent of errors in other time steps (2)positional and classification errors are independent of one another A simulation study was conducted in order to evaluate how robust is the model in general and in particular -- to violations of two assumptions:

Maps for simulations High autocorrelation, equal class proportions Low autocorrelation, unequal class proportions Int. J. Rem. Sens. 2004

Original map Spatial error Classification error Both error types Simulation

Model simulations were conducted under a range of values for: Number of map categories (2-4) Class Proportions in the original map Auto-correlation in the original map Auto-correlation in classification error Classification error rate Positional error rate Correlation in error structure between time steps Correlation between the two error types

Some results of simulation runs

When models assumptions are not met, model fit decreases Maximum correlation found in real datasets* *IEEE GRSL 2004 Maximum correlation found in real datasets* *IEEE GRSL 2004

Transition type Nature of transition Proportion in 1939 Probability of being correct, given : positional error Classif. error Combined error GGGG Grassland does not change CGGGChaparral burnt in 1955 fire All 69 transitions involving forest Averaged across the entire study area Indices of accuracy of multi-temporal datasets

PART 2: Controlling data-uncertainty via aggregation Pixel size = 0.3 m Grid cell size = 15 m = 2500 pixels

Map aggregation = image degradation overlay a grid of cells on the image (cell >>pixel) and define the larger cell as the basic unit Pixel size = 0.3 m Grid cell size = 15 m = 2500 pixels

‘soft’ aggregation ‘hard’ aggregation Map aggregation = image degradation overlay a grid of cells on the image (cell >>pixel) and define the larger cell as the basic unit

Impact of positional error is largely reduced when cells are aggregated a b c

At the pixel level: only 55% of the pixels remained unaffected by a minor shift At the grid cell level: Impact of positional error is largely reduced when cells are aggregated

This trade-off calls for a model that quantifies the process to aid decisions on optimal level of aggregation Aggregation: Gain in accuracyBUTloss of information

A geometric approach to the impact of positional error

Effective positional error at the grid cell level  is the proportion of pixels that transgress into neighboring cells (RMSE units)

Positional error at the GRID CELL level p(loc) is the probability that positional error translates into attribute error p A (loc) is the same probability – in the context of a larger grid cell

The impact of aggregation on thematic accuracy 0.23 A p(loc) cell size error 0.6 m m m0.01

Conclusions positional error has a large impact on thematic accuracy, particularly in the context of change But can be easily mitigated: increase MMU to >10X[positional error] and do not worry about it. Within overall thematic error at the pixel level – classification error component is typically smaller than the positional error component, but is more difficult to get rid of by aggregation.

TODA THANK YOU