Modeling & Analyzing Massive Terrain Data Sets (STREAM Project) Pankaj K. Agarwal Workshop on Algorithms for Modern Massive Data Sets.

Slides:



Advertisements
Similar presentations
I/O and Space-Efficient Path Traversal in Planar Graphs Craig Dillabaugh, Carleton University Meng He, University of Waterloo Anil Maheshwari, Carleton.
Advertisements

Yang Yang, Miao Jin, Hongyi Wu Presenter: Buri Ban The Center for Advanced Computer Studies (CACS) University of Louisiana at Lafayette 3D Surface Localization.
Efficient access to TIN Regular square grid TIN Efficient access to TIN Let q := (x, y) be a point. We want to estimate an elevation at a point q: 1. should.
Lars Arge 1/43 Big Terrain Data Analysis Algorithms in the Field Workshop SoCG June 19, 2012 Lars Arge.
Lars Arge 1/13 Efficient Handling of Massive (Terrain) Datasets Lars Arge A A R H U S U N I V E R S I T E T Department of Computer Science.
. Computing Contour Maps & Answering Contour Queries Pankaj K. Agarwal Joint work with Lars Arge ThomasMolhave Thomas Molhave Bardia Sadri.
CSE554ContouringSlide 1 CSE 554 Lecture 4: Contouring Fall 2013.
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
3D and Surface/Terrain Analysis
Lasse Deleuran 1/37 Homotopic Polygonal Line Simplification Lasse Deleuran PhD student.
I/O-Algorithms Lars Arge January 31, Lars Arge I/O-algorithms 2 Random Access Machine Model Standard theoretical model of computation: –Infinite.
I/O-Efficient Batched Union-Find and Its Applications to Terrain Analysis Pankaj K. Agarwal, Lars Arge, Ke Yi Duke University University of Aarhus.
I/O-Efficient Construction of Constrained Delaunay Triangulations Pankaj K. Agarwal, Lars Arge, and Ke Yi Duke University.
Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein.
lecture 4 : Isosurface Extraction
I/O-Algorithms Lars Arge Spring 2009 February 2, 2009.
Lecture 16 Terrain modelling: the basics
I/O-Algorithms Lars Arge Spring 2009 January 27, 2009.
I/O-Algorithms Lars Arge Spring 2007 January 30, 2007.
I/O-Algorithms Lars Arge Aarhus University February 16, 2006.
I/O-Algorithms Lars Arge Aarhus University February 7, 2005.
I/O-Algorithms Lars Arge University of Aarhus February 13, 2005.
I/O-Algorithms Lars Arge Spring 2009 March 3, 2009.
I/O-Algorithms Lars Arge Aarhus University February 6, 2007.
I/O-Algorithms Lars Arge Spring 2006 February 2, 2006.
Lars Arge 1/14 A A R H U S U N I V E R S I T E T Department of Computer Science Efficient Handling of Massive (Terrain) Datasets Professor Lars Arge University.
I/O-Algorithms Lars Arge Aarhus University March 5, 2008.
I/O-Efficient Structures for Orthogonal Range Max and Stabbing Max Queries Second Year Project Presentation Ke Yi Advisor: Lars Arge Committee: Pankaj.
I/O-Algorithms Lars Arge Aarhus University February 9, 2006.
I/O-Algorithms Lars Arge Aarhus University February 14, 2008.
I/O-Efficient Batched Union-Find and Its Applications to Terrain Analysis Pankaj K. Agarwal, Lars Arge, and Ke Yi Duke University University of Aarhus.
I/O-Algorithms Lars Arge University of Aarhus March 7, 2005.
Generating Raster DEM from Mass Points via TIN Streaming Jack Snoeyink UNC Chapel Hill Jonathan Shewchuk UC Berkeley Tim Thirion UNC Chapel Hill Martin.
Week 10. GIS Data structure II
From Elevation Data to Watershed Hierarchies Pankaj K. Agarwal Duke University Supported by ARO W911NF
Abstract Shortest distance query is a fundamental operation in large-scale networks. Many existing methods in the literature take a landmark embedding.
Lars Arge 1/12 Lars Arge. 2/12  Pervasive use of computers and sensors  Increased ability to acquire/store/process data → Massive data collected everywhere.
I/O-Algorithms Lars Arge Spring 2008 January 31, 2008.
Processing Terrain Data in the River Proximity Arc Hydro River Workshop December 1, 2010 Erin Atkinson, PE, CFM, GISP Halff Associates, Inc.
Terrain Mapping and Analysis
TerraStream: From Elevation Data to Watershed Hierarchies Thursday, 08 November 2007 Andrew Danner (Swarthmore), T. Moelhave (Aarhus), K. Yi (HKUST), P.
I/O-Algorithms Lars Arge Fall 2014 August 28, 2014.
Heavily based on slides by Lars Arge I/O-Algorithms Thomas Mølhave Spring 2012 February 9, 2012.
Bin Yao Spring 2014 (Slides were made available by Feifei Li) Advanced Topics in Data Management.
Bin Yao Spring 2014 (Slides were made available by Feifei Li) Advanced Topics in Data Management.
Creating Watersheds and Stream Networks
Terracost: Hazel, Toma, Vahrenhold, Wickremesinghe Terracost: A Versatile and Scalable Approach to Computing Least-Cost-Path Surfaces for Massive Grid-Based.
Spatial Databases: Digital Terrain Model Spring, 2015 Ki-Joune Li.
Lecture 2: External Memory Indexing Structures CS6931 Database Seminar.
I/O-Efficient Batched Union-Find and Its Applications to Terrain Analysis Pankaj K. Agarwal, Lars Arge, Ke Yi Duke University University of Aarhus.
Adding the third dimension In high relief areas variables such as altitude, aspect and slope strongly influence both human and physical environments –a.
CSE554ContouringSlide 1 CSE 554 Lecture 4: Contouring Fall 2015.
SpaDaGIS Efficiency Issues in Multi-resolution Terrain Modeling Leila De FlorianiPaola Magillo Leila De Floriani * Paola Magillo Department of Computer.
1 Overview Importing data from generic raster files Creating surfaces from point samples Mapping contours Calculating summary attributes for polygon features.
References [1] Pankaj Agarwal, Lars Arge and Andrew Danner. From Point Cloud to Grid DEM: A Scalable Approach. In Proc. 12th Intl. Symp. on Spatial Data.
Flow Modeling on Massive Grids Laura Toma, Rajiv Wickremesinghe with Lars Arge, Jeff Chase, Jeff Vitter Pat Halpin, Dean Urban in collaboration with.
Laura TomaSimplified External memory Algorithms for Planar DAGs Simplified External Memory Algorithms for Planar DAGs July 2004 Lars Arge Laura Toma Duke.
Lecture 1: Basic Operators in Large Data CS 6931 Database Seminar.
TerraSTREAM: Terrain Processing Pipeline MADALGO – Center for Massive Data Algorithmics, a Center of the Danish National Research Foundation What TerraSTREAM.
Digital Elevation Models (DEMs)Constructing Grid DEMConstructing Contour Map Constructing TIN DEM Grid DEM Quality Metric A Digital Elevation Model (DEM)
Terrain modelling: the basics
Digital Terrain Analysis for Massive Grids
Modeling & Analyzing Massive Terrain Data Sets
Distributed Flow Routing
Surface Analysis Tools
Advanced Topics in Data Management
Spatial Databases: Digital Terrain Model
Spatial Databases: Digital Terrain Model
Presentation transcript:

Modeling & Analyzing Massive Terrain Data Sets (STREAM Project) Pankaj K. Agarwal Workshop on Algorithms for Modern Massive Data Sets

Diverse elevation point data: density, distribution, accuracy Lidar 0.15m v. accuracy; altitude 700m and 2300m Photogrammetry 0.76m v. accuracy (5ft contours) RTK-GPS 0.10m v. accuracy

Constructing Digital Elevation Models Grid DEM: Elevation stored at uniform grid points TIN: Triangulation; elevation stored at vertices Contours Maps: Iso-contour lines at regular intervals

Natural feature extraction Extracted foredunes using profile curvature and elevation threshold Ma Maps of topo parameters: computed simultaneously with interpolation using partial derivatives of the RST function, terrain features: combined thresholds of topo parameters

Tracking Evolving Features slip faces: slope > 30deg 1995 new slip face in how to automatically track features that change some of their properties over time? dune crests: profile curvature threshold

Terrain Analysis Flow Analysis Watershed hierarchy Visibility Navigation

Challenge: Massive Data Sets LIDAR – NC Coastline: 200 million points – over 7 GB – Neuse River basin (NC): 500 million points – over 17 GB – Applachian Range: 50GB-5T B Output is also large – 10ft grid: 10GB – 5ft grid: 40GB

Approximation Algorithms Exact computation expensive Many practical problems are intractable Multiple & often conflicting optimization criteria Suffices to find a near-optimal solution Tunable Approximation algorithms – Tradeoff between accuracy and efficiency – User specifies tolerance

I/O-Bottleneck Data resides in secondary memory Disk access is 10 6 times slower than main memory access – Maximize useful data transferred with each access – Amortize large access time by transferring large contiguous blocks of data I/O – not CPU – is often the bottleneck for processing massive data sets track magnetic surface read/write arm read/write head

I/O-efficient Algorithms [AV88] Traditional algorithms optimize CPU computation – Not sensitive to penalty of disk access I/O model – Memory is finite – Data is transferred in blocks (B: block size) – Complexity: #disk blocks transferred (#I/Os) Disk RAM CPU M B B ~ 2K-8K External Memory Algorithms [Vitter]

The TerraStream Modules (Classification)(DEM Construction)ConditioningFlow Routing Flow AccumulationWatershed HierarchiesQuality MetricsContour Lines

TIN DEM

Level Sets, Contours

Contour Maps

Usage of contour lines (also called iso-contours, isogons, etc) goes back to at least 17 th century Philosophical Transactions of Royal Society of London, 1779

Find a seed point on each contour and traverse the triangulation to trace each contour Use a simple data structure (e.g. contour trees) to compute seed points Query time: O(log N + T) T: #contour edges Contour map: O(Nlog N +T) T: #contour map edges For massive terrains I/O efficiency is bad: O(N+T) instead of O((N+T)/B) Computing Contour Map: Internal Memory Algorithm

Storing a TIN on a disk so that it can be traversed with as few I/Os possible Is there a good ordering of the triangles?

Our results Computing contour maps: O(Sort(N)+T/B) I/Os Answering contour queries – Preprocessing Time: O(Sort(N)) I/Os – Space: O(N/B) disk blocks – Query: O(log B N+T/B) Individual contours can be retrieved in O(T/B)I/Os from this ordering Ordering Theorem: C-ordering Ordering Theorem: A total ordering, called C-ordering, of triangles can be computed in O(Sort(N)) I/Os s.t. the subsequence of triangles intersecting a contour appears along the contour and contours in a level set are broken in nested order.

Height Graph H*: Dual graph of H

maximum saddle minimum regular Critical Points

Simple Terrains

Positive & Negative Saddles

Cut Trees

Surgery on Terrain

Surgery & Contours

Nesting of Contours [][][][]

Ongoing Work Compute C-ordering for general 2- manifolds Maintain C-ordering for hierarchical representation of terrains Denoising contours

Collaborators Sponsored by – Army Research Office W911NF Lars Arge Helena Mitasova Andy Danner Thomas Molhave Ke Yi Bardia Sadri

I/O-Efficient Algorithms Answering a contour query: Preprocessing O(Nlog B N), Space: O(N/B) blocks Query: O(log B N+T/B)

Front-ends ArcGIS ExtensionGRASS ExtensionCommand Line Tools Front-End GUIMapInfo