Download presentation
Presentation is loading. Please wait.
Published byWillis O’Neal’ Modified over 9 years ago
1
Variance of Similar Neighbors compared to Random Imputation Nearest Neighbor Conference August 28-30, 2006 Kenneth B. Pierce Jr and Janet L. Ohmann Forestry Sciences Lab, PNW Research Station, Corvallis
2
Project Objectives Map fuels and vegetation using Gradient Nearest Neighbor (GNN) imputationMap fuels and vegetation using Gradient Nearest Neighbor (GNN) imputation Produce maps of plot-level tree attributes as complete coveragesProduce maps of plot-level tree attributes as complete coverages Provide a high degree of analytical flexibility for end-usersProvide a high degree of analytical flexibility for end-users Provide robust accuracy assessmentProvide robust accuracy assessment California Sierra (Mediterranean) Coastal Oregon (Maritime) Eastern Washington (Temperate steppe)
3
Presentation Objectives Give an brief overview of Gradient Nearest Neighbor (GNN) imputation as a techniqueGive an brief overview of Gradient Nearest Neighbor (GNN) imputation as a technique Describe the use of imputation for mapping natural variabilityDescribe the use of imputation for mapping natural variability Describe the use of imputation for mapping sampling sufficiencyDescribe the use of imputation for mapping sampling sufficiency Examine the variability among nearest neighbors in gradient space versus a random set of neighborsExamine the variability among nearest neighbors in gradient space versus a random set of neighbors Examine the change in variability when restricting plot selection to those well represented in gradient spaceExamine the change in variability when restricting plot selection to those well represented in gradient space
4
Major Steps in GNN Imputation mapping: 1) Assembling Data1) Assembling Data 2) Statistical Modeling (CCA)2) Statistical Modeling (CCA) 3) Imputation/Map Creation3) Imputation/Map Creation 4) Accuracy Assessment4) Accuracy Assessment 5) Applications and Risk Assessment5) Applications and Risk Assessment
6
Statistical Modeling: Canonical Correspondence Analysis Multivariate statistical methodMultivariate statistical method –results in a weight for each spatial variable as to its relationship with the multiple response variables Modeling Variables-used as model Y’sModeling Variables-used as model Y’s –Structure models (BAC, BAH, STPH, CWD) –Species models Mapping Variables-retained with plot-map linkMapping Variables-retained with plot-map link
7
Neighbors in Gradient Space Direct gradient analysis allows assignment of a multi- dimensional location to each predicted pixelDirect gradient analysis allows assignment of a multi- dimensional location to each predicted pixel
8
A Pixel in Plotland (example 0.5 * elevation + 0.25 * precip)
9
A Pixel in Plotland Sample plot locations in gradient space (example 0.5 * elevation + 0.25 * precip)
10
A Pixel in Plotland Target Location in Gradient Space Sample plot locations in gradient space (example 0.5 * elevation + 0.25 * precip)
11
A Pixel in Plotland Five closest neighbors (example 0.5 * elevation + 0.25 * precip)
12
A Pixel in Plotland Twenty closest neighbors (example 0.5 * elevation + 0.25 * precip)
13
A Pixel in Plotland Interplot Distances (example 0.5 * elevation + 0.25 * precip)
14
How far is far in gradient space?
15
Major Steps in GNN mapping: 1) Data Preparation/Screening1) Data Preparation/Screening 2) Statistical Modeling2) Statistical Modeling 3) Imputation/Map Creation3) Imputation/Map Creation 4) Accuracy Assessment4) Accuracy Assessment 5) Applications and Risk Assessment5) Applications and Risk Assessment
16
Imputing/Assigning plot id’s Nearest neighbor (single neighbor, retains covariance, MSN-like)Nearest neighbor (single neighbor, retains covariance, MSN-like) Summary statistic of multiple neighbors (single value, kNN-like)Summary statistic of multiple neighbors (single value, kNN-like) Etc. (i.e. many other contortions possible)Etc. (i.e. many other contortions possible)
17
Process Uncertainty/Natural VariabilityProcess Uncertainty/Natural Variability –Uncontrollable (often unmeasurable) Natural disturbances Demographic stochasticity Anthropogenic disturbances Sampling UncertaintySampling Uncertainty –Not entirely uncontrollable Limited sampling Spatial averaging Temporal sample variation Sources of Uncertainty For Ecological Detectives Hilborn & Mangel 1997
18
Map integral (Value of Map)Map integral (Value of Map) –Confusion matrices/Kappa (local) –Correlation statistics (local) –Regional histograms (regional) Map explicit (Map of Values)Map explicit (Map of Values) –Confidence maps (Process) –Support (Sampling) Accuracy assessments “obsessive transparency”
19
Overview of maps Vegetation mapVegetation map –the predicted value Neighbor Count mapNeighbor Count map –a measure of sampling sufficiency for a specific ecological location Natural Variability mapNatural Variability map –the variability in response at the most similar locations
20
Natural variability maps Variability maps are created by calculating the variance for the 5 nearest neighbors at each location (a value other than 5 could certainly be used)Variability maps are created by calculating the variance for the 5 nearest neighbors at each location (a value other than 5 could certainly be used)
21
Sampling sufficiency maps Centile thresholds are selected from the histogram of interplot distancesCentile thresholds are selected from the histogram of interplot distances Gradient distance grids are retained for the 20 nearest neighbors during imputationGradient distance grids are retained for the 20 nearest neighbors during imputation The 20 distance grids are compared to the threshold values and a count grid is created where a value of 20 indicates 20 plots were within the threshold valueThe 20 distance grids are compared to the threshold values and a count grid is created where a value of 20 indicates 20 plots were within the threshold value
22
4m Aerial Photo
23
Expected value Basal Area m 2 /ha 0 61
24
10 th Quantile Threshold map 0 20 Neighbors out of 20 within the threshold distance
25
20th Quantile Threshold map 0 20 Neighbors out of 20 within the threshold distance
26
50th Quantile Threshold map 0 20 Neighbors out of 20 within the threshold distance
27
Natural Variability 1 - 6 6.1 - 8 8.1 - 10 10.1 - 12 12.1 - 15 15.1 - 18 18. 1 - 21 21. 1 - 25 25. 1 - 29 Standard deviation of 5 nearest neighbors for BA (m2/ha)
28
“Premise of Imputation” TheoremTheorem –Places similar in X-values should be similar in Y-values. PostulatePostulate –The 5 plots most similar to a location in X-values should have reduced variance in Y-values compared to 5 random plots
29
Methods 1.Create 1000 random spatial locations 2.Sample the plot ids from the 5 nearest neighbors and the 10-th and 20-th centile sufficiency grids 3.Select an attribute and query the plot data with the five nearest neighbor ids 4.Calculate the variance for the five nearest neighbors at each of the 1000 sample points 5.Plot the density of the variance values (Black line)
30
data Random sets of 5 values
31
Methods continued Create 1000 sets of 5 random plots and repeat the variance calculation and density plot (Open circles)Create 1000 sets of 5 random plots and repeat the variance calculation and density plot (Open circles) Subset the random locations and plot data sets into groups based on their sufficiency scores: 0, 5, >15 [# of 20 nearest neighbors w/in the threshold value]Subset the random locations and plot data sets into groups based on their sufficiency scores: 0, 5, >15 [# of 20 nearest neighbors w/in the threshold value] Plot densities by subgroupsPlot densities by subgroups Create and plot random sets from appropriate subgroupsCreate and plot random sets from appropriate subgroups
32
data Random sets of 5 values Bootstrap set All imputed Neighbors >=15 Neighbors >5 Neighbors <15 Neighbors <=5
33
Bootstrap set All imputed Neighbors >=15 Neighbors >5 Neighbors <15 Neighbors <=5
34
Is this a general result? Sorta.Sorta.
39
Conclusions
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.