Introduction to Geospatial Information Management and Spatial Databases Lecture 4 1.

Slides:



Advertisements
Similar presentations
Introduction to Spatial Databases
Advertisements

Three-Step Database Design
The Role of Error Map and attribute data errors are the data producer's responsibility, GIS user must understand error. Accuracy and precision of map and.
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
Center for Modeling & Simulation.  A Map is the most effective shorthand to show locations of objects with attributes, which can be physical or cultural.
GIS for Environmental Science
CS 599 – Spatial and Temporal Databases Realm based Spatial data types: The Rose Algebra Ralf Hartmut Guting Markus Schneider.
1 Enviromatics Spatial database systems Spatial database systems Вонр. проф. д-р Александар Маркоски Технички факултет – Битола 2008 год.
Introduction to Spatial Database System Presented by Xiaozhi Yu.
Raster Based GIS Analysis
Spatial Mining.
Query Processing in Databases Dr. M. Gavrilova.  Introduction  I/O algorithms for large databases  Complex geometric operations in graphical querying.
Spatial Database Systems
Geodatabases by Shawn J. Dorsch Spatial Databases Part 2.
Spatial Information Systems (SIS) COMP Spatial queries and operations.
Introduction to GIS Ghassan Mikati, Ph.D GIS Expert.
Geographical Database
1 Shen FEii CSCI599-Fall2000 Spatial Database: Accomplishments and Research Needs presented by Fei Shen from S.Shekar,S.Chawla, S.Ravada,A.Fetterer, X.Liu,C.T.Lu.
What is Where? Lecture 5 Introduction to GISs Geography 176A Department of Geography, UCSB Summer 06, Session B.
Information Systems and GIS Chapter 2 Slides from James Pick, Geo-Business: GIS in the Digital Organization, John Wiley and Sons, Copyright © 2008.
Geographic Information Systems
Geographic Information Systems : Data Types, Sources and the ArcView Program.
Spatial databases: Introduction Geog 495: GIS database design.
Chapter 14 The Second Component: The Database.
An Introduction to Spatial Databases R. H. Guting VLDB Journal v3, n4, October 1994 Speaker: Giovanni Conforti.
Spatial SQL: A Query and Presentation Language Max J. Egenhofer (Presenter: Yi-Shin Chen)
Chapter 1: Introduction to Spatial Databases 1.1 Overview 1.2 Application domains 1.3 Compare a SDBMS with a GIS 1.4 Categories of Users 1.5 An example.
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
DBST 652 Emerging Database Technologies Many challenging areas are supported by database technology Eg. Web.
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
1 C. Shahabi CSCI599-Fall2000 Introduction to Spatial Database Systems by Cyrus Shahabi from Ralf Hart Hartmut Guting’s VLDB Journal v3, n4, October 1994.
Spatial Data Models. What is a Data Model? What is a model? (Dictionary meaning) A set of plans (blueprint drawing) for a building A miniature representation.
Preparing Data for Analysis and Analyzing Spatial Data/ Geoprocessing Class 11 GISG 110.
Spatial Databases ENVE/CE 424/524. Definitions Database – an integrated set of data on a particular subject Spatial database - database containing geographic.
Spatial Database Souhad Daraghma.
Chapter 1: Introduction to Spatial Databases 1.1 Overview 1.2 Application domains 1.3 Compare a SDBMS with a GIS 1.4 Categories of Users 1.5 An example.
GIS 1110 Designing Geodatabases. Representation Q. How will we model our real world data? A. Typically: Features Continuous Surfaces and Imagery Map Graphics.
IST 210 Introduction to Spatial Databases. IST 210 Evolution of acronym “GIS” Fig 1.1 Geographic Information Systems (1980s) Geographic Information Science.
Applied Cartography and Introduction to GIS GEOG 2017 EL Lecture-2 Chapters 3 and 4.
Introduction In ArcGIS, there are three basic data structures:
 Trace the incremental algorithm for constructing convex data on the sample point data given below. Intermediate steps should be shown First, two steps.
Spatial Concepts and Data Models Reading: Shekhar & Chawla Chapter 2 November 22, 2005.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
INTRODUCTION TO GEOGRAPHICAL INFORMATION SCIENCE RSG620 Week 1, Lecture 2 April 11, 2012 Department of RS and GISc Institute of Space Technology, Karachi.
1 Spatial Data Models and Structure. 2 Part 1: Basic Geographic Concepts Real world -> Digital Environment –GIS data represent a simplified view of physical.
Spatial Databases - Introduction Spring, 2015 Ki-Joune Li.
Creating and Maintaining Geographic Databases. Outline Definitions Characteristics of DBMS Types of database Relational model SQL Spatial databases.
Spatial Query Processing Spatial DBs do not have a set of operators that are considered to be basic elements in a query evaluation. Spatial DBs handle.
Spatial DBMS Spatial Database Management Systems.
Intro to GIS | Summer 2012 Attribute Tables – Part 1.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
Lecture 10 Creating and Maintaining Geographic Databases Longley et al., Ch. 10, through section 10.4.
Kedar Gawande Position Paper ITK 478 Fall / 04 / 2006.
Which are Spatial Data? Names address NASA satellites imagery Weather and Climate Data SSN Medical Imaging NASA satellites imagery - terabytes of.
1 Overview of Query Evaluation Chapter Outline  Query Optimization Overview  Algorithm for Relational Operations.
Lecture 3: Spatial Data Management Dr. Taysir Hassan Abdel Hamid Associate Professor, Information Systems Dept., Faculty of Computers and Information.
UNIT 3 – MODULE 6: Data Analysis. TERMINOLOGY There are several terms that are important to know when discussing data analysis: – Entity – an individual.
Overview of Mining Spatial Data
Chapter 1: Introduction to Spatial Databases 1. 1 Overview 1
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Spatial Database Systems
Physical Structure of GDB
Introduction to Spatial Databases (1)
Geographic Information Systems
Query Processing in Databases Dr. M. Gavrilova
Spatial Database Systems
Spatial Databases - Introduction
Chapter 1: Introduction to Spatial Databases 1. 1 Overview 1
Value of SDBMS Non-spatial queries: Spatial Queries:
Spatial Databases - Introduction
Presentation transcript:

Introduction to Geospatial Information Management and Spatial Databases Lecture 4 1

Value of SDBMS  Traditional (non-spatial) database management systems provide:  Persistence across failures.  Allows concurrent access to data.  Scalability to search queries on very large datasets which do not fit inside main memories of computers.  Efficient for non-spatial queries, but not for spatial queries.  Non-spatial queries:  List the names of all bookstore with more than ten thousand titles.  List the names of ten customers, in terms of sales, in the year  Spatial Queries:  List the names of all bookstores with 10 km of Chiayi city.  List all customers who live in Taipei city and its adjoining counties. 2

Value of SDBMS – Spatial Data Examples  Examples of non-spatial data  Names, phone numbers, addresses of people  Examples of Spatial data  Census data  NASA satellites imagery - terabytes of data per day  Weather and Climate Data  Rivers, farms, ecological impact 3

Value of SDBMS – Users, Application Domains  Many important application domains have spatial data and queries.  Army Field Commander: Has there been any significant enemy troop movement since last night?  Insurance Risk Manager: Which homes are most likely to be affected in the next great flood on the Changhua county?  Medical Doctor: Based on this patient's MRI, have we treated somebody with a similar condition ?  Molecular Biologist: Is the topology of the amino acid biosynthesis gene in the genome found in any other sequence feature map in the database ?  Astronomer: Find all blue galaxies within 2 arcmin of quasars. 4

Value of SDBMS – Users, Application Domains  Various fields/applications require management of geometric, geographic or spatial data:  A geographic space: surface of the earth  Man-made space: layout of VLSI design  Model of a rat brain 5

What is a SDBMS  A SDBMS is a software module that  can work with an underlying DBMS (e.g., MySQL, GIS and Spatial Extentions)  supports spatial data models, spatial abstract data types (ADTs) and a query language from which these ADTs are callable  supports spatial indexing, efficient algorithms for processing spatial operations, and domain specific rules for query optimization  Example: Oracle Spatial data cartridge, ESRI SDE  can work with Oracle 8i DBMS  Has spatial data types (e.g. polygon), operations (e.g. overlap) callable from SQL3 query language  Has spatial indices, e.g. R-trees 6

What is a SDBMS  Common challenge: dealing with large collections of relatively simple geometric objects. (e.g., rectangle, point, polygon)  Different from image and pictorial database systems:  Containing sets of objects in space rather than images or pictures of a space 7

SDBMS Example  Consider a spatial dataset with:  County boundary (dashed white line)  Census block - name, area, population, boundary (dark line)  Water bodies (dark polygons)  Satellite Imagery (gray scale pixels)  Storage in a SDBMS table:  create table census_blocks ( name string, area float, population number, boundary polygon ); 8

Modeling Spatial Data in Traditional DBMS  A row in the table census_blocks  Question: Is polyline datatype supported in DBMS? 9

Spatial Data Types and Traditional Databases  Traditional relational DBMS  Support simple data types, e.g. number, strings, date  Modeling spatial data types is tedious  Example: next slide shows modeling of polygon using (numbers)  Three new tables: polygon, edge, points. Note: Polygon is a polyline where last point and first point are same  A simple unit square represented as 16 rows across 3 tables  Simple spatial operators, e.g. area(), require joining tables  Tedious and computationally inefficient 10

Mapping “census_table” into a Relational Database 11

Evolution of DBMS technology 12

Spatial Data Types and Post-relational Databases  Post-relational DBMS  Support user defined abstract data types  Spatial data types (e.g. polygon) can be added  Choice of post-relational DBMS  Object oriented (OO) DBMS  Object relational (OR) DBMS  A spatial database is a collection of (spatial data types), (operators), (indices), processing strategies, etc. and can work with many post- relational DBMS as well as programming languages like Java, Visual Basic etc. 13

 GIS is a (software ) to visualize and analyze spatial data using spatial analysis functions such as  Search Thematic search, search by region, (re-)classification  Location analysis Buffer, corridor, overlay  Terrain analysis Slope/aspect, catchment, drainage network  Flow analysis Connectivity, shortest path  Distribution Change detection, proximity, nearest neighbor  Spatial analysis/Statistics Pattern, centrality, autocorrelation, indices of similarity, topology: hole description  Measurements Distance, perimeter, shape, adjacency, direction  GIS uses SDBMS  to store, search, query, share large spatial data sets How is a SDBMS different from a GIS ? 14

 SDBMS focuses on  (Efficient storage), (querying), sharing of large spatial datasets  Provides simpler set based query operations  Example operations: search by region, overlay, nearest neighbor, distance, adjacency, perimeter etc.  Uses (spatial indices) and (query optimization) to speedup queries over large spatial datasets.  SDBMS may be used by applications other than GIS  Astronomy, Genomics, Multimedia information systems,...  Will one use a GIS or a SDBM to answer the following:  How many neighboring countries does USA have?  Which country has highest number of neighbors? How is a SDBMS different from a GIS ? 15

Three meanings of the acronym GIS  Geographic Information Services  Web-sites and service centers for casual users, e.g. travelers  Example: Service (e.g. AAA, mapquest) for route planning  Geographic Information Systems  Software for professional users, e.g. cartographers  Example: ESRI Arc/View software  Geographic Information Science  Concepts, frameworks, theories to formalize use and development of geographic information systems and services  Example: design spatial data types and operations for querying 16

Components of a SDBMS  Recall: a SDBMS is a software module that  can work with an underlying DBMS  supports spatial data models, spatial ADTs and a query language from which these ADTs are callable  supports spatial indexing, algorithms for processing spatial operations, and domain specific rules for query optimization  Components include  spatial data model, query language, query processing, file organization and indices, query optimization, etc. 17

Spatial Taxonomy, Data Models  Spatial Taxonomy:  multitude of descriptions available to organize space.  Topology models homeo-morphic relationships, e.g. overlap  Euclidean space models distance and direction in a plane  Graphs models connectivity, Shortest-Path  Spatial data models  rules to identify identifiable objects and properties of space  Object model helps manage identifiable things, e.g. mountains, cities, land-parcels etc.  Field model helps manage continuous and amorphous phenomenon, e.g. wetlands, satellite imagery, snowfall etc. 18

 A collection of concepts to describe to describe:  structure of a database  data relationships  data semantics  data constraints  Data Model Operations: operations for specifying database retrievals and updates. Data Models 19

Modeling*  Without lose of generality, assume 2-D and GIS application, two basic things need to be represented:  Objects in space: cities, forests, or rivers  modeling single objects  Space: say something about every point in space (e.g., partition of a country into districts)  modeling spatially related collections of objects 20

Modeling*  Fundamental abstractions for modeling single objects:  Point: object represented only by its location in space, e.g., center of a state  Line (actually a curve or ployline): representation of moving through or connections in space, e.g., road, river  Region: representation of an extent in 2-D space, e.g., lake, city 21

Modeling*  Instances of spatially related collections of objects:  Partition: set of region objects that are required to be disjoint (adjacency or region objects with common boundaries), e.g., thematic maps  Networks: embedded graph in plane consisting of set of points (vertices) and lines (edges) objects, e.g. highways, power supply lines, rivers 22

Modeling*  Spatial relationships  Topological relationships: e.g., adjacent, inside, disjoint.  Direction relationships: e.g., above, below, or north_of, southwest_of, …  Metric relationships: e.g., distance  There are 6 valid possible topological relationships between two simple regions (no holes, connected):  disjoint, in, touch, equal, cover, overlap A B 23

Modeling*  SDBMS data model must be extended by ADTs at the level of atomic data types (such as integer, string), or better be open for user-defined types (OR-DBMS approach):  relation states (sname: STRING; area: REGION; spop: INTEGER)  relation cities (cname: STRING; center: POINT; ext: REGION; cpop: INTEGER);  relation rivers (rname: STRING; route: LINE) 24

Spatial Query Language  Spatial query language  Spatial data types, e.g. point, linestring, polygon, …  Spatial operations, e.g. overlap, distance, nearest neighbor, …  Callable from a query language (e.g. SQL3) of underlying DBMS SELECTS.name FROMSenator S WHERE S.district.Area() <

Query Processing  Efficient algorithms to answer spatial queries  Common Strategy – (filter) and (refine)  Filter Step:Query Region overlaps with MBRs of B,C and D  Refine Step: Query Region overlaps with B and C 26

Querying* … Fundamental spatial algebra operations:  Spatial selection: returning those objects satisfying a spatial predicate with the query object  “All cities in Taiwan” SELECT sname FROM cities c WHERE c.center inside Taiwan.area  “All rivers intersecting a query window” SELECT * FROM rivers r WHERE r.route intersects Window  “All big cities no more than 50 Kms from Taichung” SELECT cname FROM cities c WHERE dist(c.center,Taichung.center) < 100 and c.pop > 500k (conjunction with other predicates and query optimization) 27

Querying* …  Spatial join: A join which compares any two joined objects based on a predicate on their spatial attribute values.  “For each river pass through Taichung, find all cities within less than 50 Kms.” SELECT r.rname, c.cname, FROM rivers r, cities c WHERE r.route intersects Taichung.area and dist(r.route,c.area) < 50 Km 28

File Organization and Indices  A difference between GIS and SDBMS assumptions  GIS algorithms: dataset is loaded in main memory (a)  SDBMS: dataset is on secondary storage e.g disk (b)  SDBMS uses space filling curves and spatial indices to efficiently search disk resident large spatial datasets 29

Organizing spatial data with space filling curves  Issues:  Sorting is not naturally defined on spatial data  Many efficient search methods are based on sorting datasets  Space filling curves  Impose an ordering on the locations in a multi-dimensional space  Examples: row-order, z-order, Hilbert curve (higher spatial correlation)  Allow use of traditional efficient search methods on spatial data 30

Spatial Indexing  To expedite spatial selection (as well as other operations such as spatial joins, …)  It organizes space and the objects in it in some way so that only parts of the space and a subset of the objects need to be considered to answer a query.  Two main approaches:  Dedicated spatial data structures (e.g., R-tree)  Spatial objects mapped to a 1-D space to utilize standard indexing techniques (e.g., B-tree) 31

Summary  SDBMS is valuable to many important applications  SDBMS is a software module  works with an underlying DBMS  provides spatial ADTs callable from a query language  provides methods for efficient processing of spatial queries  Components of SDBMS include  spatial data model, spatial data types and operators,  spatial query language, processing and optimization  spatial data mining  SDBMS is used to store, query and share spatial data for GIS as well as other applications 32