GISt lunch meeting OTB Research Institute for Housing, Urban and Mobility Studies 2008-04-25 1 Writing a DBMS buyers guide Wim de Haas Wilko Quak Based.

Presentation transcript:

GISt lunch meeting, OTB Research Institute for Housing, Urban and Mobility Studies
Writing a DBMS buyer's guide
Wim de Haas, Wilko Quak
Based on a presentation at FOSS4G 2007 on benchmarking

Overview
- Original idea: benchmarking
- Complications of benchmarking
- New idea: buyer's guide
- What should be in this guide

Benchmark consideration: Weird Cases department
- diagonal query geometry
- flat query geometry

Benchmark consideration: Hot vs Cold

Why bother with benchmarking…
Where to find dramatic differences in Spatial DBMSs?
Stonebraker2007: We define “dramatically outperform” to mean at least a factor 10 advantage […then] customers will be inclined to try the new architecture

Where to expect dramatic differences?
- Linux vs Windows (No)
- Choice of DBMS (Only in specific cases)
- Choice of file system (No)
- Functionality difference (Yes)
- Choice of parameters (Maybe)

Problems with testing
- DBMS vendors do not want published results: Oracle explicitly forbids publishing benchmark results
- Hardware: Moore’s Law
- Release frequency of software
- Spatial testing cannot be done on synthetic data
- Too many parameters
- Benchmark results are outdated before they are published

Solution
Don’t spend our time on producing benchmark results:
- Write a buyer’s guide: we need a classification of users.
- Let people do their own testing: tell them what and when to test, and help them with a test suite.

Classification of spatial DBMS users
Four classes:
1. Server Builders: publish spatial data via web server
2. GIS Users: load various datasets and perform complex analyses
3. Data Maintainers: maintain one core dataset
4. Power Users: all of the above and more

Class 1: Web Server Builders
- You do not really need a DBMS for this (you use a fraction of DBMS functionality)
- Only one query counts: find everything within BBOX

Class 2: GIS users
- Main interest is functionality
- Spend more time on loading data
- Need a good query optimiser
- Analysis

Class 3: Dataset Maintainers
- Limited number of queries
- Transactions are an issue
- Clustering of data after updates is interesting

Class 4: Power users
- Do their own testing
- Need a platform to discuss their findings

Test suite proposal
1. Very simple performance test script with few parameters
   - BBOX query
   - Fixed dataset (proposal: OpenStreetMap dataset)
2. Configurable test suite
   - Full suite that tests every corner of the DBMS
   - For specialists only

Test 1: simple BBOX select
Write a simple script that generates a lot of rectangle queries.
Parameters:
- DBMS size
- query box size
- experiment length
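A minimal sketch of such a query generator, assuming a PostGIS-style target. The table name `osm_lines`, the column name `geom`, the SRID and the extent are hypothetical placeholders, and "experiment length" is interpreted here as the number of queries issued.

```python
import random

def random_bboxes(extent, box_size, count, seed=0):
    """Yield `count` square query boxes of side `box_size` inside `extent`."""
    xmin, ymin, xmax, ymax = extent
    rng = random.Random(seed)  # fixed seed so every DBMS sees the same boxes
    for _ in range(count):
        x = rng.uniform(xmin, xmax - box_size)
        y = rng.uniform(ymin, ymax - box_size)
        yield (x, y, x + box_size, y + box_size)

def bbox_sql(box, table="osm_lines", column="geom", srid=4326):
    """Format one box as a bounding-box overlap query (PostGIS syntax assumed)."""
    x0, y0, x1, y1 = box
    return (f"SELECT count(*) FROM {table} "
            f"WHERE {column} && ST_MakeEnvelope({x0}, {y0}, {x1}, {y1}, {srid});")

# Hypothetical extent and parameters, chosen only for illustration.
EXTENT = (3.0, 50.5, 7.5, 53.7)
boxes = list(random_bboxes(EXTENT, box_size=0.01, count=1000))
queries = [bbox_sql(b) for b in boxes]
```

Other systems would need a different `bbox_sql`; the box generator itself is DBMS-independent, which keeps the workload identical across products.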

Test 1: grow DBMS size
Question: Does query response time depend on DBMS size or on core memory?
Experiment: Run the same test on more and more copies of the same database.
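The experiment could be driven by a loop like the following sketch. The callbacks `load_copy` and `run_query` are hypothetical hooks that a tester would implement for a specific DBMS; here they are replaced by stand-ins so the sketch runs without a database.

```python
import time
from statistics import mean

def time_queries(run_query, queries):
    """Return the mean wall-clock time per query, in seconds."""
    timings = []
    for q in queries:
        start = time.perf_counter()
        run_query(q)
        timings.append(time.perf_counter() - start)
    return mean(timings)

def growth_experiment(load_copy, run_query, queries, max_copies):
    """Load one more copy of the dataset per round and re-time the same queries."""
    results = []
    for copies in range(1, max_copies + 1):
        load_copy(copies)  # hypothetical hook: add copy number `copies` to the database
        results.append((copies, time_queries(run_query, queries)))
    return results

# Stand-in callbacks: record the load calls, execute nothing.
loaded = []
results = growth_experiment(loaded.append, lambda q: None, ["q1", "q2"], max_copies=3)
```

If the mean time stays flat until the data no longer fits in core memory and then jumps, that points to memory rather than database size as the deciding factor.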

Test 1 – result: PostGIS vs MySQL

Test 2: Comprehensive Test Suite
- Create a set of killer polygons so that every line of source code will be touched by running operations.
- Test query optimizer
- Test join operator
- Must be done with skewed data
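One way to obtain skewed data for such tests is to concentrate most points in a few dense clusters over a sparse uniform background. The sketch below is an illustration only: the cluster centres, the spread and the background fraction are assumptions, not values from the presentation.

```python
import random

def skewed_points(count, centers, spread, background=0.1, seed=0):
    """Return `count` (x, y) points: mostly clustered, a few uniform in the unit square."""
    rng = random.Random(seed)
    points = []
    for _ in range(count):
        if rng.random() < background:
            # Sparse background: uniform over the unit square.
            points.append((rng.uniform(0, 1), rng.uniform(0, 1)))
        else:
            # Dense cluster: Gaussian scatter around a randomly chosen centre.
            cx, cy = rng.choice(centers)
            points.append((rng.gauss(cx, spread), rng.gauss(cy, spread)))
    return points

CENTERS = [(0.2, 0.2), (0.8, 0.7)]  # hypothetical cluster centres
pts = skewed_points(10_000, centers=CENTERS, spread=0.02)
```

Data like this makes selectivity estimates hard, because the same query box returns almost nothing in one place and thousands of rows in another, which is what exercises the query optimizer and the join operator.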

What should be in the Buyer’s guide
Performance is not an issue. What are issues:
- Details of functionality (topology, coordinate transforms)
- Total cost of ownership (open-source vs proprietary)
- Configuration (faster disks or faster CPU)
- Ease of use (2 days of programming == A LOT OF HARDWARE)
- Use of standards (vendor lock-in, system integration)
Can we answer these questions?

Discussion