Tony Rees and Glenelg Smith Divisional Data Centre + Remote Sensing Facility CSIRO Marine Research, Australia Application of c-squares.

Slides:



Advertisements
Similar presentations
Remote Visualisation System (RVS) By: Anil Chandra.
Advertisements

Tony Rees Divisional Data Centre CSIRO Marine Research, Australia Application of c-squares spatial indexing to an archive of remotely.
C-squares - a new simple, XML friendly, display/ query/ exchange format for representing spatial data extents at the metadata level System concept and.
Multidimensional Data Rtrees Bitmap indexes. R-Trees For “regions” (typically rectangles) but can represent points. Supports NN, “where­am­I” queries.
Evolving concepts in the architecture of OBIS, the Ocean Biogeographic Information System Tony Rees CSIRO Marine Research 29 November 2004 Phoebe Zhang.
From portions of Chapter 8, 9, 10, &11. Real world is complex. GIS is used model reality. The GIS models then enable us to ask questions of the data by.
Rapid spatial indexing and web mapping using the “c-squares” global grid Tony Rees Manager, Divisional Data Centre 23 March 2007 CSIRO.
Spatial Mining.
Multidimensional Data. Many applications of databases are "geographic" = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Bioinformatics GIS Applications Anatoly Petrov.
Spatial Indexing, Search, and Mapping for Species level databases Tony Rees, CSIRO Marine and Atmospheric Research (CMAR), Hobart, Tasmania, Australia.
Multiple-key indexes Index on one attribute provides pointer to an index on the other. If V is a value of the first attribute, then the index we reach.
Fractal Image Compression
Exploring large marine datasets using an interactive website and Google Earth Jon Blower, Dan Bretherton, Keith Haines, Chunlei Liu, Adit Santokhee Reading.
Geographic Information Systems
Tony Rees Divisional Data Centre CSIRO Marine Research, Australia Application of c-squares spatial indexing to an archive of remotely.
So What is GIS??? “A collection of computer hardware, software and procedures that are used to organize, manage, analyze and display.
NPS Introduction to GIS: Lecture 1
Chap8: Trends in DBMS 8.1 Database support for Field Entities 8.2 Content-based retrieval 8.3 Introduction to spatial data warehouses 8.4 Summary.
COMP 451/651 Multiple-key indexes
Attribute databases. GIS Definition Diagram Output Query Results.
Tony Rees – C-squares Oct Nested Grids: the c-squares global grid Tony Rees CSIRO Marine and Atmospheric Research, Hobart for:
GIS Analysis. Questions to answer Position – what is here? Condition – where are …? Trends – what has changed? Pattern – what spatial patterns exist?
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
Distributed storage for structured data
Obtaining MISR Data and Information Jeff Walter Atmospheric Science Data Center April 17, 2009.
Canada’s Natural Resources – Now and for the Future Using Open Source to analyze Canada’s National Forest Inventory. Chris West and Brian Low Pacific Forestry.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
GIS Concepts ‣ What is a table? What is a table? ‣ Queries on tables Queries on tables ‣ Joining and relating tables Joining and relating tables ‣ Summary.
Chapter 1: Introduction to Spatial Databases 1.1 Overview 1.2 Application domains 1.3 Compare a SDBMS with a GIS 1.4 Categories of Users 1.5 An example.
Alok 1Northwestern University Access Patterns, Metadata, and Performance Alok Choudhary and Wei-Keng Liao Department of ECE,
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
IST 210 Introduction to Spatial Databases. IST 210 Evolution of acronym “GIS” Fig 1.1 Geographic Information Systems (1980s) Geographic Information Science.
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
C-squares - a simple, XML friendly, query/ display/ exchange format for representing spatial data at the metadata level System concept and development.
OBIS Portal Architecture Concepts plus potential for utilization as a basis for Regional OBIS Nodes Tony Rees, CSIRO Marine Research, Hobart (and OBIS.
Global Land Cover Facility The Global Land Cover Facility (GLCF) is a member of the Earth Science Information Partnership (ESIP) Federation providing data,
Approximate Encoding for Direct Access and Query Processing over Compressed Bitmaps Tan Apaydin – The Ohio State University Guadalupe Canahuate – The Ohio.
8. Geographic Data Modeling. Outline Definitions Data models / modeling GIS data models – Topology.
OBIS and species distributions Tony Rees discussion presentation, March 2003 Some fundamental intentions for OBIS... –Choose any species and discover its.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
C-FIND Performance Issues. Multiple Patient IDs PACS increasingly have records transferred between facilities (IHE and otherwise) Transferred patient.
CSIRO Marine Research Data Centre linked databases - CAAB, MarLIN and Divisional Data Warehouse.
Spatial Search of Orbital Swath Data Abstract: The high volume of today's remotely sensed Earth Science data creates.
MySQL spatial indexing for GIS data in a web 2.0 internet application Brian Toone Samford University
Raster data models Rasters can be different types of tesselations SquaresTrianglesHexagons Regular tesselations.
Assessing the Phenological Suitability of Global Landsat Data Sets for Forest Change Analysis The Global Land Cover Facility What does.
Emerging Technologies for Games Deferred Rendering CO3303 Week 22.
Pad++: A Zooming Graphical Interface for Exploring Alternate Interface Physics Presented By: Daniel Loewus-Deitch.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
CAAB and taxon management at CSIRO Marine Research Tony Rees Divisional Data Centre CSIRO Marine Research, Hobart
Maths & Technologies for Games Spatial Partitioning 2
MarLIN: a research data metadatabase for CSIRO Marine Research Tony Rees Divisional Data Centre CSIRO Marine Research, Hobart contact:
Towards Unifying Vector and Raster Data Models for Hybrid Spatial Regions Philip Dougherty.
ERDDAP The Next Generation of Data Servers Bob Simons DOC / NOAA / NMFS / SWFSC / ERD Monterey, CA Disclaimer: The opinions expressed.
System concept and development by: Tony Rees Divisional Data Centre CSIRO Marine Research, Australia c-squares - a new method for representing, querying,
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
1 Advanced Scene Management. 2 This is a game-type-oriented issue Bounding Volume Hierarchies (BVHs) Binary space partition trees (BSP Trees) “Quake”
System concept and development by: Tony Rees Divisional Data Centre CSIRO Marine Research, Australia c-squares - a new method for representing, querying,
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Spatial Data Management
Catalogs contain hundreds of millions of objects
Backtrack Orbit Search Algorithm
Multidimensional Access Structures
Tony Rees CSIRO Marine Research 29 November 2004
Using Partitions and Fragments
COMP 430 Intro. to Database Systems
Geographic Information Systems
SEO Report to WGISS Brian Killough CEOS Systems Engineering Office (SEO) WGISS-46 Meeting October 22, 2018.
Presentation transcript:

Tony Rees and Glenelg Smith Divisional Data Centre + Remote Sensing Facility CSIRO Marine Research, Australia Application of c-squares spatial indexing to an archive of remotely sensed data

The starting point … Archive of c.60,000 satellite AVHRR scenes, current, accumulating at another 15+ per day Region covered varies for each scene Metadata (satellite, orbit, time parameters) and computed polygon (e.g. 7 points per side) available for each scene Determining which scenes include any given point, on- the-fly (to answer user’s request) is a standard requirement, but not trivial – available methods may be slow, or inaccurate, or both...

example in more detail... Example – may need to query which out of these four scenes includes point “x” (in reality, stop/start points of scenes also vary, also other scenes cross at different angle/s) x

Options for spatial indexing Bounding polygons Require a GIS system (or similar) for spatial queries (polygon overlays); can be slow to test many thousands of scenes Can be problems with footprints which cross the date line, or include a pole Bounding rectangles Computationally cheap, rapid to query, no GIS required Typically a poor fit to actual data footprints (typically these are not square, or even nearly so) – many “false positives” Can be problems with footprints which cross the date line, or include a pole Gridded (tile based) representation Potentially simple to query, no GIS required May be poor fit to actual data footprints if tile size is large (smaller tiles best, however index gets larger – tradeoff required here) Compromise approach – slightly inexact compared with polygon overlays, but should be better performance for real-time queries (most computation has been done in advance) Problems with footprints which cross the date line, or include a pole, can be pre- handled

C-squares basics Based on a tiled (gridded) representation of the earth’s surface, at choice of 10 x 10, 5 x 5, 1 x 1, 0.5 x 0.5 degree tiles, etc. (0.5 x 0.5 degree tiles are used in this example) Every tile has a unique, hierarchical alphanumeric ID (incorporates the ID of all parent, grandparent, etc. in every child tile ID) Dataset (=scene) extents are represented by a list of all the tiles included in, or intersected by the footprint Spatial search comprises looking for one or more tile IDs in the set associated with any dataset (= simple text search). (more details – see )

C-squares nomenclature – ten degree squares (follows WMO 10 x 10º square nomenclature)

Adaptive square size – 10 x 10, 5 x 5, 1 x 1, 0.5 x 0.5 degrees (etc.) – produces quadtree-like efficiencies (“compaction”) (this example shows detail to 1- degree square level) 10 degree square: 3213:*** 5 degree square: 3214:1** 1 degree square: 3315:100 - can reduce storage requirement by up to 90%, cf. constant- resolution encoding

A real-world example NOAA Jun :23 10 x 10 degree squares (28) (base level of hierarchy, cannot compact) 5 x 5 degree squares (99) = 36 after “compaction” 1 x 1 degree squares (1,982) = 515 after comp. 0.5 x 0.5 degree squares (7,691) = 704 after comp. 0.5 x 0.5 degree squares - detail

Metadata table concept – 2 “search” tables only Scene details (1 row per scene) Scene <> c-square pairs ( rows per scene)

Clickable map interface

Example search result

SQL for spatial search (example for 0.5 degree search square) select distinct A.scene_id, B.satellite, B.scene_date_time, B.image_location from satdata_csq_all A, scene_info B where ( (sqrsize = 0.5 and (A.csquare = search_csq -- e.g. 3414:100:1 (0.5 degree square) or A.csquare = substr(search_csq,1,8)|| ': * ' -- 1 level of compression or A.csquare = substr(search_csq,1,6)|| ' ** : * ' -- 2 levels of compression or A.csquare = substr(search_csq,1,4)||': *** : * ') -- 3 levels of compression ) -- (other supported search square size options inserted here) ) and (startdate is null or B.scene_date_time >= startdate) and (enddate is null or B.scene_date_time <= enddate) and (sat = 'any' or B.satellite = sat) and A.scene_id = B.scene_id order by B.scene_date_time, B.satellite;

Summary of steps involved (1) Encoding – using custom polygon-to-c-squares algorithm (first attempt: 3+ mins per scene; second attempt: 30 seconds per scene to encode) (2) Storage – first attempt: all scene/c-square pairs in one large table, second attempt: partitioned by 10 x 10º “bins” (100 smaller tables, for faster searching) (3) Querying – using standard SQL and text matches (4) (optional) Visualising – generating footprints, using web call to multi-purpose c-squares mapper More information: c-squares website - CMR Satellite Data spatial search: