109.07.2012Michael Soffner A Variability Model for Query Optimizers Michael Soffner 1, Norbert Siegmund 1, Marko Rosenmüller 1, Janet Siegmund 1, Thomas.

Slides:



Advertisements
Similar presentations
Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
Advertisements

1 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
Implementing Reflective Access Control in SQL Lars E. Olson 1, Carl A. Gunter 1, William R. Cook 2, and Marianne Winslett 1 1 University of Illinois at.
CSE 6331 © Leonidas Fegaras XML and Relational Databases 1 XML and Relational Databases Leonidas Fegaras.
Marko Rosenmüller - University of Magdeburg, Germany 1 Feature-oriented Refactoring of Berkeley DB Dagstuhl Seminar on Software Engineering for Tailor-made.
Introduction to Structured Query Language (SQL)
Query Execution Professor: Dr T.Y. Lin Prepared by, Mudra Patel Class id: 113.
Marko Vrhovnik 1, Holger Schwarz 1, Oliver Suhre 2, Bernhard Mitschang 1, Volker Markl 3, Albert Maier 2, Tobias Kraft 1 1 Universität Stuttgart 2 IBM.
Bitmap Index Buddhika Madduma 22/03/2010 Web and Document Databases - ACS-7102.
A First Attempt towards a Logical Model for the PBMS PANDA Meeting, Milano, 18 April 2002 National Technical University of Athens Patterns for Next-Generation.
Geographic Information Systems
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Introduction to Structured Query Language (SQL)
Attribute databases. GIS Definition Diagram Output Query Results.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 18 Object Database Management Systems.
Database Management COP4540, SCS, FIU An Introduction to database system.
Query Processing Presented by Aung S. Win.
Database Management System Lecture 2 Introduction to Database management.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
Chapter 8 Architecture Analysis. 8 – Architecture Analysis 8.1 Analysis Techniques 8.2 Quantitative Analysis  Performance Views  Performance.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
1 CS 430 Database Theory Winter 2005 Lecture 1: Introduction.
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
Physical Database Design Chapter 6. Physical Design and implementation 1.Translate global logical data model for target DBMS  1.1Design base relations.
G-SPARQL: A Hybrid Engine for Querying Large Attributed Graphs Sherif SakrSameh ElniketyYuxiong He NICTA & UNSW Sydney, Australia Microsoft Research Redmond,
Database Management 9. course. Execution of queries.
Ashwani Roy Understanding Graphical Execution Plans Level 200.
GUS: 0262 Fundamentals of GIS Lecture Presentation 3: Relational Data Model Jeremy Mennis Department of Geography and Urban Studies Temple University.
CPS216: Advanced Database Systems Notes 02:Query Processing (Overview) Shivnath Babu.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
1 CS 430 Database Theory Winter 2005 Lecture 17: Objects, XML, and DBMSs.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Daniel J. Abadi · Adam Marcus · Samuel R. Madden ·Kate Hollenbach Presenter: Vishnu Prathish Date: Oct 1 st 2013 CS 848 – Information Integration on the.
Efficiently Processing Queries on Interval-and-Value Tuples in Relational Databases Jost Enderle, Nicole Schneider, Thomas Seidl RWTH Aachen University,
Object Persistence (Data Base) Design Chapter 13.
Generative Programming. Automated Assembly Lines.
Chapter 18 Object Database Management Systems. McGraw-Hill/Irwin © 2004 The McGraw-Hill Companies, Inc. All rights reserved. Outline Motivation for object.
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
SPARQL Query Graph Model (How to improve query evaluation?) Ralf Heese and Olaf Hartig Humboldt-Universität zu Berlin.
Component Based SW Development and Domain Engineering 1 Component Based Software Development and Domain Engineering.
The Volcano Optimizer Generator Extensibility and Efficient Search.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
1 Biometric Databases. 2 Overview Problems associated with Biometric databases Some practical solutions Some existing DBMS.
Benjamin Post Cole Kelleher.  Availability  Data must maintain a specified level of availability to the users  Performance  Database requests must.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Dec. 13, 2002 WISE2002 Processing XML View Queries Including User-defined Foreign Functions on Relational Databases Yoshiharu Ishikawa Jun Kawada Hiroyuki.
Chapter 8 Physical Database Design. Outline Overview of Physical Database Design Inputs of Physical Database Design File Structures Query Optimization.
1 Execution Strategies for SQL Subqueries Mostafa Elhemali, César Galindo- Legaria, Torsten Grabs, Milind Joshi Microsoft Corp.
An Introduction Student Name: Riaz Ahmad Program: MSIT( ) Subject: Data warehouse & Data Mining.
Mapping the Data Warehouse to a Multiprocessor Architecture
Data Mining and Decision Support
1 Database Systems, 8 th Edition 1 Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: –How business intelligence.
Chapter 18 Object Database Management Systems. Outline Motivation for object database management Object-oriented principles Architectures for object database.
1 Chapter 2 Database Environment Pearson Education © 2009.
Oracle9i Developer: PL/SQL Programming Chapter 11 Performance Tuning.
Lecture 15: Query Optimization. Very Big Picture Usually, there are many possible query execution plans. The optimizer is trying to chose a good one.
7 1 Database Systems: Design, Implementation, & Management, 7 th Edition, Rob & Coronel 7.6 Advanced Select Queries SQL provides useful functions that.
Database Systems, 8 th Edition SQL Performance Tuning Evaluated from client perspective –Most current relational DBMSs perform automatic query optimization.
Concepts of Database Management, Fifth Edition Chapter 3: The Relational Model 2: SQL.
CHAPTER 19 Query Optimization. CHAPTER 19 Query Optimization.
Artificial Intelligence
An Introduction to database system
Database Performance Tuning and Query Optimization
Chapter 11 Database Performance Tuning and Query Optimization
Query Optimization.
Presentation transcript:

Michael Soffner A Variability Model for Query Optimizers Michael Soffner 1, Norbert Siegmund 1, Marko Rosenmüller 1, Janet Siegmund 1, Thomas Leich 2, Gunter Saake 1 1 University of Magdeburg, Germany 2 METOP GmbH, Germany

Michael Soffner Motivation Variability Approach System Analysis Unified Variability Model Outline

Michael Soffner Database vendors continuously extend functionality to fit to new application domains Leads to over bloated systems that have decreased performance and manageability Specialized systems outperform RDBMS, e.g., Sensor Networks and Data Warehouses (Stonebraker2005) Driving factors for Query Optimizer extensions SQL  conformity to standard New indexes, operations, statistics Result: Increased search space and reduced performance Motivation

Michael Soffner Goal: Specialized query processors by introducing variability Selection of only needed functionality and omitting the rest Variability through Software Product Lines (SPLs) Our Approach Fig.1 Benefits of tailored Query Optimizers

Michael Soffner Software Product Lines (SPLs) Use Features to describe a concept in a domain model

Michael Soffner Product Derivation Configuration Feature Model Reusable Implementation Artifacts Program GeneratorFinal Product Domain Engineering Application Engineering

Michael Soffner 3 Steps to a unified model Overall Process Course Model System Analysis Unification Classification by Jarke (1984) Oracle, PostgreSQL, SQLite Unified Model

Michael Soffner Generally distinguishes logical and physical optimization Optimizer Functionality Classification (Jarke) LogicalPhysical Standardization Transformation into a standardized representation (e.g. predicate normalization) Evaluation Algorithm General algorithm that generates the program to a given query (e.g. recursive search) Simplification Elimination of redundancies, (e.g. idempotency rules) Operations Physical implementations of logical operations (e.g. nested loop join) Amelioration Generating semantically equal queries with better performance (e.g. heuristics) Strategy Concepts to find best query plan (e.g. cost-based approach)

Michael Soffner Customizable through #ifdef compiler flags  static configuration All logical optimization features are optional Only B-Tree indexes Allows statistics to be omitted statically SQLite

Michael Soffner Most logical optimization feature aim to standardize the input query No features for special heuristics Includes inline set returning functions Two evaluation algorithms: exhaustive search, genetic algorithm Four index types: b-tree, hash-based and multi-dimension-based indexes (GIS support) PostgreSQL

Michael Soffner Special feature: predicates pushing, rewrite materialized views Most Access Paths Configuration through Hints Oracle

Michael Soffner Goal: System-independent Variability Model Identification of feature that implement same functionality 1. Integration A1: Same functionality but different names A2: Same names but different functionality Only semantic descriptions allow a decision Basis: Documentation and Source Code Example: Nested Loop Variability Model: Unification Process SQLitePostgreSQL NameNested LoopNestpath SourceSource-code Comment typedef definition and cost calculation algorithm

Michael Soffner 2. Unification 1:1 Mapping (Mapping of one Features into one unified Feature) 1:n Mapping (Compose multiple system-dependent Features into one unified Feature) Variability Model: Unification Process 2 FeatureSQLitePostgreSQLOracle No. of Table Entries N/ANo. of Table EntriesNo. of Rows Pull Up Subqueries Subquery Flattening Pull Up SubqueriesSubquery Unnesting From ClauseN/AFrom Clause N/A Where Clause N/APull Up SublinksN/A

Michael Soffner Variability Model

Michael Soffner Provide a basis for implementing configurable query optimizer Unified semantic description of query optimizer functionality (Taxonomy/Ontology) Provides a foundation for a (semi-)automatic configuration of query optimizers based on application requirements Provide a basis for modeling dependencies between query optimizers and deeper layers of DBMSs, e.g., Storage Engine Conclusion

Michael Soffner [Stonebraker2005] M. Stonebraker and U. Cetintemel. One Size Fits All: An Idea Whose Time Has Come and Gone. In Proceedings of the International Conference on Data Engineering (ICDE),pages 2-11, [Jarke84] M. Jarke and J. Koch. Query optimization in database systems. ACM Computing Surveys (CSUR), 16: , June ACM ID: References

Michael Soffner Thanks for your attention!

Michael Soffner Specialized Systems outperform RDBMS DBMS originally designed to support Business Data Other Application Domains need different paradigms Argued that the One Size Fits all is not applicable in future DBMS Mentioned Domains: Sensor Networks (Streaming Applications) Textdatabase (Google  BigData) Data Warehouses ? One Size Fits All? (Stonebreaker)