Presentation is loading. Please wait.

Presentation is loading. Please wait.

Spatial DBMS issues OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-13 1 Spatial DBMS Research GISt lunch meeting Wilko Quak.

Similar presentations


Presentation on theme: "Spatial DBMS issues OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-13 1 Spatial DBMS Research GISt lunch meeting Wilko Quak."— Presentation transcript:

1 Spatial DBMS issues OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-13 1 Spatial DBMS Research GISt lunch meeting Wilko Quak

2 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-132Spatial DBMS issues Overview Introduction to DBMS Query Processing Benchmarking a spatial DBMS The GeoInfoNed project MonetDB Discussion

3 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-133Spatial DBMS issues Introduction to DBMS query processing Slides borrowed from Dr. Yang He

4 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-134Spatial DBMS issues Query processing overview Review relational algebra Query processing introduction stages of query processing query optimisation relational algebra tree

5 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-135Spatial DBMS issues Relational algebra (1) a relational languages proposed by Codd implementable basis of high-level (SQL) query execution a collection of simple, 'low-level’ operations used to manipulate relations input is one or more relations output is one relation

6 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-136Spatial DBMS issues Relational algebra (2) Relational operations unary operators Restrict (Select)  Project  binary operators Cartesian productX Union  Intersection  Difference- Join Divide  P

7 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-137Spatial DBMS issues e.g. two relations Student and Registration Example relations Student ( SID, Name, Gender ) Registration ( SID, CID, Mark ) SIDNameGender S1KateF S2JohnM S3KateF S4FredM Student SIDCIDMark S1C165 S1C245 S2C280 S2C460 S3C150 S3C275 S4C370 Registration

8 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-138Spatial DBMS issues e.g. “Identify all male students” in SQL in relational algebra  (Gender= ‘ M ’ ) ( Student ) Queries examples (1) Select SIDNameGender S2JohnM S4FredM SELECT SID, Name, Gender FROM Student WHERE Gender=′M′;

9 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-139Spatial DBMS issues e.g. “List student’s name and gender.” in SQL In relational algebra  Name, Gender ( Student ) Queries examples (2) Project NameGender KateF JohnM FredM SELECT Name, Gender FROM Student;

10 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1310Spatial DBMS issues Queries examples (3) e.g. “Show student ID, name, their course ID and marks” in SQL in relational algebra SELECT s.SID, Name, CID, Mark FROM Student s, Registration r WHERE s.SID = r.SID; (  SID, Name ( Student) ) (  (Registration) ) SID,CID,Mark  SID, Name, CID, Mark ( Student Registration ) or ProjectNatural Join

11 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1311Spatial DBMS issues A user query may require several operations to be performed relational algebra is a procedural language so query operations are evaluated in the order specified a complex query can be executed in different ways, so an efficient one should be used as efficiency is an important DBMS requirement – query optimisation Queries in relational algebra

12 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1312Spatial DBMS issues Query processing Four stages involved in query processing query decomposition or parsing query optimization code generation runtime query execution

13 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1313Spatial DBMS issues Query optimization (1) refers to the activity of choosing an efficient execution strategy or plan for processing a query rule-based and cost-based strategies database statistics in system catalog used for cost estimation is a prime objective of the query processing

14 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1314Spatial DBMS issues Query optimization (3) In a query processing, disk access takes most time The main objective of the query optimisation is to minimize the number of disk accesses Many DBMSs use heuristic rules for query optimization e.g. “Perform selection and projection operations as early as possible to reduce the cardinality of the relation and the subsequent process of that relation”

15 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1315Spatial DBMS issues Query processing – an example e.g. “Show student ID, name, their course ID and marks” in SQL it can be transformed into relational algebra query SELECT s.SID, Name, CID, Mark FROM Student s, Registration r WHERE s.SID = r.SID; (  SID, Name ( Student) ) (  (Registration) ) SID,CID,Mark  SID, Name, CID, Mark ( Student Registration ) or The first one is better: much less disk access than the second

16 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1316Spatial DBMS issues e.g. Relational algebra query tree (2) Student Registration  SID, Name  SID,CID,Marke Leaf nodes Intermediate nodes Root (  SID, Name ( Student) ) (  (Registration) ) SID,CID,Mark

17 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1317Spatial DBMS issues Spatial Query processing In spatial query processing the operator is a spatial operator, for the rest it is the same as non-spatial query processing: Spatial Select Find all objects within given rectangle [99.99%] Spatial Join (overlay in GIS terms) Find all restaurants within national parks

18 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1318Spatial DBMS issues DBMS Benchmarking Categorization of DBMS usage Implications for benchmarking benchmark choices

19 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1319Spatial DBMS issues Categories of DBMS users Static usage: Predefined queries with changing parameters Queries can be hand optimized Dynamic usage (browsing): Many different queries Query optimizer is important Access via object-relational mapping (e.g. Hibernate) Not discussed here All categories need different benchmarking

20 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1320Spatial DBMS issues Benchmarking static DBMS usage Notes: Critical factor is testing the ‘query processor’. Query optimizer is not important Benchmark: Make small set of simple queries that test one operation

21 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1321Spatial DBMS issues Benchmarking dynamic DBMS usage Notes: Critical factor is testing the ‘query optimizer’. Very hard to get quality reproducible results. It is very hard to assess the quality of the query optimizer but a small testset might give some insight: select city.name,river.name from city,river where city.inhabitants > X and distance(city.geometry,river.geometry) < Y;

22 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1322Spatial DBMS issues Other benchmarking considerations Functionality Usability update behaviour

23 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1323Spatial DBMS issues GeoInfoNed – RGI-232

24 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1324Spatial DBMS issues GeoInfoNed -- What and Why Build a spatially enabled DBMS because: A DBMS is at the core of many system. If you improve the core the whole system improves. There is a need for an (open source) experimentation platform for Geo DBMS research.

25 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1325Spatial DBMS issues Who CWI – Leading DBMS experts with MonetDB TUDelft/OTB – Knowledge of spatial processes CycloMedia – Huge dataset and interesting problems RWS/AGI – Large and diverse datasets and interesting problems

26 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1326Spatial DBMS issues How At CWI there is the MonetDB DBMS. First we will extend it with basic spatial types (According to OpenGIS). Together with our ‘Problem Holder’ partners we will find directions for more extensions. MonetDB already has support for: Image Data, XML storage and querying etc.

27 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1327Spatial DBMS issues Example Is there a relationship between traffic accidents and objects near the road? GeoInfoNed

28 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1328Spatial DBMS issues MonetDB Introduction* Hardware trends MonetDB design considerations MonetDB architecture *Slides borrowed from CWI

29 OTB Research Institute for Housing, Urban and Mobility Studies Hardware Trends 50% p/year: - cpu speed - mem size - mem bandwidth - disk bandwidth 1% p/year: - mem latency 10% p/year: - disk latency

30 OTB Research Institute for Housing, Urban and Mobility Studies Latency is the enemy! Commercial DBMS products (oracle, DB2, SQLserver) stem from OLTP roots focus on minimizing random I/Os => depend on latency! MonetDB: built for bulk access optimize CPU and memory performance Latency is one of the killing factors in Friso’s simplicial homology implementation

31 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1331Spatial DBMS issues MonetDB design considerations Multi-model database kernel support Extensible data types, operators, accelerators Database hot-set is memory resident Simple data structures are better Index management should be automatic Do not replicate the operating system Optimize when you know the situation Cooperative transaction management

32 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1332Spatial DBMS issues Monet kernels MAPI protocol JDBC C-mapi lib Perl End-user application ODBC PHP Python SQL XQuery MonetDB product family Here a MATLAB interface and Frank’s life would be easier

33 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1333Spatial DBMS issues MonetDB - Physical data organization Binary Association Tables

34 OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-1334Spatial DBMS issues Discussion


Download ppt "Spatial DBMS issues OTB Research Institute for Housing, Urban and Mobility Studies 2007-04-13 1 Spatial DBMS Research GISt lunch meeting Wilko Quak."

Similar presentations


Ads by Google