26SEP2007 FOSS4G2007 Database Shootout: Benchmarking spatial DMBSs Wim de Haas Wilko Quak.

Slides:



Advertisements
Similar presentations
GIS for Aviation: Case for eTOD Management
Advertisements

Online Construction Estimation Developed By:- Vatsal S. Shah ( ) Dilip K. Babariya ( ) Jimit V. Rajani ( ) Guided by:-
Paging: Design Issues. Readings r Silbershatz et al: ,
Database Architectures and the Web
The State of the Art in Distributed Query Processing by Donald Kossmann Presented by Chris Gianfrancesco.
VoipNow Core Solution capabilities and business value.
1 Cheriton School of Computer Science 2 Department of Computer Science RemusDB: Transparent High Availability for Database Systems Umar Farooq Minhas 1,
Overview Distributed vs. decentralized Why distributed databases
Lecture 4: Introduction to PHP 3 PHP & MySQL
Fall 2001CS 4471 Chapter 2: Performance CS 447 Jason Bakos.
1 Foundations of Software Design Lecture 27: Java Database Programming Marti Hearst Fall 2002.
Chapter 7 Managing Data Sources. ASP.NET 2.0, Third Edition2.
Chapter 8 GIS software. Introduction Chapter 1 : four technical parts of GIS(network, hardware, software, database ). This chapter 8 : concerned with.
The Premier Software Usage Analysis and Reporting Toolset Maximizing Value for Software Users.
11 SERVER CLUSTERING Chapter 6. Chapter 6: SERVER CLUSTERING2 OVERVIEW  List the types of server clusters.  Determine which type of cluster to use for.
Setting Up a Sandbox Presented by: Kevin Brunson Chief Technology Officer.
Structure Commander Technical Presentation. Copyright (C) MCS 2013, All rights reserved. 2 STRUCTURE COMMANDER Introduction Product Overview.
Open Source Geospatial software for the the Army Corps of Engineers Wetlands Regulatory program FOSS4G2007 Victoria, B.C. Canada Michael Smith Engineer.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Mobile Mapping Systems (MMS) for infrastructural monitoring and mapping are becoming more prevalent as the availability and affordability of solutions.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
Windows Server ® Virtualization Infrastructure Planning and Design Published: November 2007 Updated: January 2012.
GIS Application Development for the Internet:
Managing Multi-User Databases AIMS 3710 R. Nakatsu.
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
MySQL Introduction to the MySQL products. Agenda Company Overview Open Source & MySQL Momentum Why MySQL? MySQL OEM, Community & Enterprise offerings.
PostgreSQL and relational databases As well as assignment 4…
by Marc Comeau. About A Webmaster Developing a website goes far beyond understanding underlying technologies Determine your requirements.
Database Architectures and the Web Session 5
Database and Database Users. Outline Database Introduction An Example Characteristics of the Database Actors on the Scene Advantages of using the DBMS.
Performance Concepts Mark A. Magumba. Introduction Research done on 1058 correspondents in 2006 found that 75% OF them would not return to a website that.
Introduction: Databases and Database Users
Project Title : CyberGIS Project Members : M.S.R Perera D.S Kulasuriya W.M.D Jeewantha Project Title : CyberGIS Project Members : M.S.R Perera D.S Kulasuriya.
PHP With Oracle 11g XE By Shyam Gurram Eastern Illinois University.
Why does my perfectly working App Crash and Burn in Production? Matt Kramer Project Manager, STL Boeing Scalability Test Lab cell.
Winrunner Usage - Best Practices S.A.Christopher.
LiveCycle Data Services Introduction Part 2. Part 2? This is the second in our series on LiveCycle Data Services. If you missed our first presentation,
9. Introduction to ArcObjects Most GIS analysis carried out within a GIS consists of a labor- intensive sequence of steps. Automating a GIS makes it possible.
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
Lessons Learned from GIS Application Development in a State Government Context Yueming Wu, Ph.D. West Virginia Department of Environmental Protection 2006.
Software Development Software Testing. Testing Definitions There are many tests going under various names. The following is a general list to get a feel.
Chapter 6 PHP Interacts with Mysql Database. Introduction In PHP, there is no consolidated interface. Instead, a set of library functions are provided.
Introduction of Geoprocessing Topic 7a 4/10/2007.
ZOPE By: Mukesh Lal Advisor: Dr. Chung E. Wang Second Reader: Dr. Gopal Rao Department of Computer Science California State University, Sacramento.
Spatial DBMS Buyer's Guide OTB Research Institute for Housing, Urban and Mobility Studies FOSS4G Cape Town 1 A spatial DBMS buyer’s guide Wim.
Building Quality into Web Applications - Meeting the Challenges of Testing and Usability Paula Duchnowski CQA, CSTE (608)
Introduction to the Adapter Server Rob Mace June, 2008.
PHP Features. Features Clean syntax. Object-oriented fundamentals. An extensible architecture that encourages innovation. Support for both current and.
Parallel and Distributed Simulation Memory Management & Other Optimistic Protocols.
Distributed Database Systems Overview
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
INTRODUCTION TO WEB APPLICATION Chapter 1. In this chapter, you will learn about:  The evolution of the Internet  The beginning of the World Wide Web,
MIS 105 LECTURE 1 INTRODUCTION TO COMPUTER HARDWARE CHAPTER REFERENCE- CHP. 1.
Course FAQ’s I do not have any knowledge on SQL concepts or Database Testing. Will this course helps me to get through all the concepts? What kind of.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
GISt lunch meeting OTB Research Institute for Housing, Urban and Mobility Studies Writing a DBMS buyers guide Wim de Haas Wilko Quak Based.
DATABASE CONNECTIVITY TO MYSQL. Introduction =>A real life application needs to manipulate data stored in a Database. =>A database is a collection of.
10JUN2004 Dutch ministry implements Open(Source) WebGIS Wim de Haas Chiel Stroeven Michel Grothe Geo-Information and ICT Department.
“Request For System Change” Sushil Bhatnagar MBA(IT) 4 th Semester Sikkim Manipal University (SMU DE) Roll No. : LC Code. : IICE College (02086)
Managing Enterprise GIS Geodatabases
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
King saud university Introduction to Database Systems (Chapter 1: Databases and Database Users)
Oracle Architecture - Structure. Oracle Architecture - Structure The Oracle Server architecture 1. Structures are well-defined objects that store the.
Introduction of Geoprocessing Lecture 9 3/24/2008.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Introduction to Performance Testing Performance testing is the process of determining the speed or effectiveness of a computer, network, software program.
1 Chapter The Impact of Database Customer centric approach - A highly personal approach Marketing databases are essential to the marketing process.
BIG DATA/ Hadoop Interview Questions.
Introduction of Week 3 Assignment Discussion
Chapter 2: Performance CS 447 Jason Bakos Fall 2001 CS 447.
Presentation transcript:

26SEP2007 FOSS4G2007 Database Shootout: Benchmarking spatial DMBSs Wim de Haas Wilko Quak

Delft University of Technology 26SEP2007 FOSS4G20072 You’re in the eye of the storm! Early morning: Brock Anderson about WMS/PostGIS/Shapefile performance This afternoon: Kevin Neufeld about tips for the PostGIS power user Now: Reflect on the factor 10

Delft University of Technology 26SEP2007 FOSS4G20073 Overview Introduction What are the problems A classification of Spatial DBMS users How can we help them Benchmark proposal First test results Next steps

Delft University of Technology 26SEP2007 FOSS4G20074 Introducing the Ministry of Transport, Public Works and Water Management Our core tasks are: to offer protection against floods to guarantee safe and reliable connections over land, water and through the air to ensure clean and sufficient water Rijkswaterstaat (RWS) is the executive branche of the Ministry of Transport

Delft University of Technology 26SEP2007 FOSS4G20075 Business drivers How to keep track of all these assets? How to ensure consistency & coherence in operations and change of Rijkswaterstaat? How to facilitate decisionmaking and communication Enter the Digitaal Topografisch Bestand –3D –1:1000 –EUR 60M

Delft University of Technology 26SEP2007 FOSS4G20076 DTB waterway and highway

Delft University of Technology 26SEP2007 FOSS4G20077 DTB Birds eye view

Delft University of Technology 26SEP2007 FOSS4G20078 DTB Amsterdam Airport

Delft University of Technology 26SEP2007 FOSS4G20079 Enter IVRI The new system for data acquisition and maintenance for the DTB Oracle 10g ArcGIS 9.2 Summit Evolution

Delft University of Technology 26SEP2007 FOSS4G First comment on Murphy’s Law Murphy was an optimist Oracle and ESRI in a blame game Took extra time in the project Triggered to be less dependent on Oracle Oracle Spatial is not cheap, so can we use PostGIS as the main datastore for Spatial data?

Delft University of Technology 26SEP2007 FOSS4G Why bother … Stonebraker2007: Where to find dramatic differences in Spatial DBMSs? We define “dramatically outperform” to mean at least a factor 10 advantage […then] customers will be inclined to try the new architecture

Delft University of Technology 26SEP2007 FOSS4G Where to expect Dramatic differences? Linux vs Windows. (No) MySQL vs PostGIS (Yes) Choice of FileSystem (Maybe) Functionality Difference (Yes) Choice of Parameters (Maybe)

Delft University of Technology 26SEP2007 FOSS4G Problems with testing DBMS vendors do not want published results –Oracle explicitly forbids publishing benchmark results Hardware –Moore’s Law –I/O Release Frequency of Software Spatial testing cannot be done on synthetic data Too many parameters Benchmark results are outdated before they are publised

Delft University of Technology 26SEP2007 FOSS4G Benchmark consideration: Weird Cases department diagonal query geometry flat query geometry

Delft University of Technology 26SEP2007 FOSS4G Benchmark consideration: Hot vs Cold

Delft University of Technology 26SEP2007 FOSS4G Solution Do not publish the result of the benchmark Publish a framework that lets people do their own benchmarking No “One size fits all”: Buyer’s guide Help different users to find best DBMS

Delft University of Technology 26SEP2007 FOSS4G Classification of spatial DMBS users Four classes: 1. Server Builders: publish spatial data via web server 2. GIS User: Load various datasets and perform complex analyses 3. Data Maintainer: Maintain one core dataset 4. Power Users: All of the above and more

Delft University of Technology 26SEP2007 FOSS4G Class 1: Web Server Builders You do not really need a DBMS for this (You use a fraction of DBMS functionality) Only one query counts: Find everything within BBOX

Delft University of Technology 26SEP2007 FOSS4G Class 2: GIS users Main interest is functionality Spend more time on loading data Need a good query optimiser Analysis

Delft University of Technology 26SEP2007 FOSS4G Class 3: Dataset Maintainers Limited number of queries Transactions are an issue Clustering of data after updates is interesting

Delft University of Technology 26SEP2007 FOSS4G Class 4: Power users Do their own testing Need a platform to discuss their findings

Delft University of Technology 26SEP2007 FOSS4G Benchmark components 1. Functionality test Literature review Factual testing 2. Very simple performance test script with few parameters BBOX Query Fixed Dataset (Propasal OpenStreetMap dataset) 3. Configurable test suite Full Suite that tests every corner of DBMS For specialists only

Delft University of Technology 26SEP2007 FOSS4G Test 1 – Functionality: MySQL vs PostGIS MySQL Almost all operations in MySQL return the same result as the corresponding MBR- based functions. PostGIS Full OpenGIS support. Functionality of MySQL is only suited for simple WMS support and no spatial operations are done on geometry

Delft University of Technology 26SEP2007 FOSS4G Test 2: simple BBOX select Write simple script that generates a lot of rectangle queries. Paremeter: DBMS size query box size experiment length

Delft University of Technology 26SEP2007 FOSS4G Test 2: grow DBMS size Question: Does query response time depend on DBMS size or on core memory? Experiment: Run same test on more an more copies of same database.

Delft University of Technology 26SEP2007 FOSS4G Test 2 – result: PostGIS vs MySQL

Delft University of Technology 26SEP2007 FOSS4G Test 2 – result: Conclusions As long as dataset fits in core memory differences are small. MySQL can do more with less memery. MySQL degrades faster if you run out of memory

Delft University of Technology 26SEP2007 FOSS4G Test 3: Comprehensive Test Suite Create set of killer polygons so that every line of source code will be touched by running operations. Test Query optimizer Test Join Operator –Must be done with Skewed Data

Delft University of Technology 26SEP2007 FOSS4G Conclusions (overall) This is a work in progress The factor 10 is not within reach

Delft University of Technology 26SEP2007 FOSS4G How to proceed Finish the work and publish these TU Delft wiki or osgeo.org wiki? Start a Special Interest Group a.k.a. Chapter?

Delft University of Technology 26SEP2007 FOSS4G Questions