U NIVERSITY U NIVERSITY OF T ORONTO U NIVERSITY OF T ORONTO Bionic databases are coming. What will they look like? *Ryan Johnson & Ippokratis Pandis**

Slides:



Advertisements
Similar presentations
How We Manage SaaS Infrastructure Knowledge Track
Advertisements

Deep Packet Inspection Which Implementation Platform? Sarang Dharmapurikar Cisco.
1 NUMA-aware algorithms: the case of data shuffling Yinan Li* Ippokratis Pandis Rene Mueller Vijayshankar Raman Guy Lohman *University of Wisconsin - MadisonIBM.
Extreme Performance with Oracle Data Warehousing
1 Copyright © 2012 Oracle and/or its affiliates. All rights reserved. Convergence of HPC, Databases, and Analytics Tirthankar Lahiri Senior Director, Oracle.
Critical Sections: Re-emerging Concerns for DBMS Ryan JohnsonIppokratis Pandis Anastasia Ailamaki Carnegie Mellon University École Polytechnique Féderale.
@ Carnegie Mellon Databases Data-oriented Transaction Execution VLDB 2010 Ippokratis Pandis Ryan Johnson Nikos Hardavellas Anastasia Ailamaki Carnegie.
EISCambridge 2009MoN8 Exploring Markov models for gate-limited service and their application to network-based services Glenford Mapp and Dhawal Thakker.
Distributed Processing, Client/Server and Clusters
 Open source software framework designed for storage and processing of large scale data on clusters of commodity hardware  Created by Doug Cutting and.
IT Infrastructure Database Tuning – Fall 2015.
1 Database Servers on Chip Multiprocessors: Limitations and Opportunities Nikos Hardavellas With Ippokratis Pandis, Ryan Johnson, Naju Mancheril, Anastassia.
Daniel Schall, Volker Höfner, Prof. Dr. Theo Härder TU Kaiserslautern.
Challenges and Opportunities for System Software in the Multi-Core Era or The Sky is Falling, The Sky is Falling!
10 REASONS Why it makes a good option for your DB IN-MEMORY DATABASES Presenter #10: Robert Vitolo.
1. Aim High with Oracle Real World Performance Andrew Holdsworth Director Real World Performance Group Server Technologies.
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
Meanwhile RAM cost continues to drop Moore’s Law on total CPU processing power holds but in parallel processing… CPU clock rate stalled… Because.
Chapter 1.3: Data Models and DBMS Architecture Title: Anatomy of a Database System Authors: J. Hellerstein, M. Stonebraker Pages:
Operating System Support for Database Management
CPSC-608 Database Systems Fall 2011 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes #11.
Computer Organization and Architecture
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications C. Faloutsos – A. Pavlo How to Scale a Database System.
M.A.Doman Model for enabling the delivery of computing as a SERVICE.
Layers and Views of a Computer System Operating System Services Program creation Program execution Access to I/O devices Controlled access to files System.
1: IntroductionData Management & Engineering1 Course Overview: CS 395T Semantic Web, Ontologies and Cloud Databases Daniel P. Miranker Objectives: Get.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
An Introduction to Infrastructure Ch 11. Issues Performance drain on the operating environment Technical skills of the data warehouse implementers Operational.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
1 Intro to Info Tech Database Management Systems Copyright 2003 by Janson Industries This presentation can be viewed on line at:
MapReduce April 2012 Extract from various presentations: Sudarshan, Chungnam, Teradata Aster, …
Introduction. Outline What is database tuning What is changing The trends that impact database systems and their applications What is NOT changing The.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Performance Concepts Mark A. Magumba. Introduction Research done on 1058 correspondents in 2006 found that 75% OF them would not return to a website that.
Introduction to Cloud Computing
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Meet with the AppEngine Márk Gergely eu.edge. What is AppEngine? It’s a tool, that lets you run your web applications on Google's infrastructure. –Google's.
Introduction to Hadoop and HDFS
Contents HADOOP INTRODUCTION AND CONCEPTUAL OVERVIEW TERMINOLOGY QUICK TOUR OF CLOUDERA MANAGER.
Architectural Support for Fine-Grained Parallelism on Multi-core Architectures Sanjeev Kumar, Corporate Technology Group, Intel Corporation Christopher.
Data Structures & Algorithms and The Internet: A different way of thinking.
Multi-core Programming Introduction Topics. Topics General Ideas Moore’s Law Amdahl's Law Processes and Threads Concurrency vs. Parallelism.
® 1 VLSI Design Challenges for Gigascale Integration Shekhar Borkar Intel Corp. October 25, 2005.
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
March 9, 2015 San Jose Compute Engineering Workshop.
© 2008 Quest Software, Inc. ALL RIGHTS RESERVED. Perfmon and Profiler 101.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
CompSci 100E 39.1 Memory Model  For this course: Assume Uniform Access Time  All elements in an array accessible with same time cost  Reality is somewhat.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Big Data Directions Greg.
Proposal for an Open Source Flash Failure Analysis Platform (FLAP) By Michael Tomer, Cory Shirts, SzeHsiang Harper, Jake Johns
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
CS 540 Database Management Systems
Storage Systems CSE 598d, Spring 2007 OS Support for DB Management DB File System April 3, 2007 Mark Johnson.
Hardware Trends CSE451 Andrew Whitaker. Motivation Hardware moves quickly OS code tends to stick around for a while “System building” extends way beyond.
Hardware Trends CSE451 Andrew Whitaker. Motivation Hardware moves quickly OS code tends to stick around for a while “System building” extends way beyond.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
BIG DATA/ Hadoop Interview Questions.
Proactive Index Design Using QUBE Lauri Pietarinen Courtesy of Tapio Lahdenmäki November 2010 IDUG 2010.
Gorilla: A Fast, Scalable, In-Memory Time Series Database
The Tail at Scale Dean and Barroso, CACM Feb 2013 Presenter: Chien-Ying Chen (cchen140) CS538 - Advanced Computer Networks 1.
Shaopeng, Ho Architect of Chinac Group
CS 540 Database Management Systems
The Future? Or the Past and Present?
Automated ad placement
The Tail At Scale Dean and Barroso, CACM 2013, Pages 74-80
CSE8380 Parallel and Distributed Processing Presentation
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
Presentation transcript:

U NIVERSITY U NIVERSITY OF T ORONTO U NIVERSITY OF T ORONTO Bionic databases are coming. What will they look like? *Ryan Johnson & Ippokratis Pandis** CIDR 2013 ***

U NIVERSITY U NIVERSITY OF T ORONTO Dark silicon trumps Moore’s Law? More transistors coming, but we can’t use them “complicated” (grows as transistors shrink) power # transistors voltage frequency Dark silicon = Amdahl + Power

U NIVERSITY U NIVERSITY OF T ORONTO The DBMS is at a crossroads Data deluge continues Dark silicon looms Specialized HW becomes economical + =

U NIVERSITY U NIVERSITY OF T ORONTO Didn’t database machines fail? (yes) Economies of scale – Cloud: Amazon, Facebook, Google – Appliances: Exadata, Netezza Alternatives yield diminishing returns – OoO: mined out – MHz: mined out – Multicore: days numbered FPGA allows to “patch” hardware NRE: $O(10 9 ) for each new Intel process/chip Past reasons for failure smaller/missing now DIRECT

U NIVERSITY U NIVERSITY OF T ORONTO OLTP is fast enough. Why care about it? Operational analytics – OLTP “solved” but takes over the server – OLTP “solutions” make analytics harder Data acquisition is the new OLTP – Not just bread and butter transactions any more – Facebook, Twitter graphs, ad serving, financials, sensors, smart devices,... – App devs learning hard way that ACID is nice Goal: free resources for other/more uses

U NIVERSITY U NIVERSITY OF T ORONTO Death by a thousand paper cuts disk log insert lock wait latch wait queues cache miss jump or branch “Software-friendly” latencies (10μs or longer) Fine-grained latencies (under 10μs) Offload small latencies to specialized hardware Transition point is roughly context switch cost

U NIVERSITY U NIVERSITY OF T ORONTO Control flow in HW? HW good at control flow too – Many algs are state machines (sorting networks, regexp, FFT, etc.) – OS/server basic block size: 3 (load/compare/jump) Toolchains needs a lot of work – Everybody needs it, hard problem, out of scope Software platform needs to adapt – Computer architects are stuck – Ship computation to specialized components – Structure/design HW to make SW’s job easier

U NIVERSITY U NIVERSITY OF T ORONTO One possible bionic DBMS Columnar database Log files Log buffer Overlay manager Queuing engine Log insertion Enhanced scanner Space mgt. & bpool Log sync & recovery Query engine Route & schedule Database overlay Tree probe engine CC, SMO & reorg

U NIVERSITY U NIVERSITY OF T ORONTO Example: Tree-based indexing 100 operations wait 300ns 100 operations Tree probe in software Key compare End of node Value fetch is inode is leaf target key not found target key found Tree probe in hardware Simple operation, why waste Intel’s best on it?

U NIVERSITY U NIVERSITY OF T ORONTO Conclusions Custom database hardware is coming – Alternatives yield diminishing returns – Cloud/vertical providers have economies of scale How to benefit OLTP and data gathering? – Speedup unlikely, focus on power, latency offload – Must integrate with analytics Software and tools need to adapt – How to fit HW into SW ecosystem? – How to design hardware quickly?

U NIVERSITY U NIVERSITY OF T ORONTO Beware, bionic DBMSs are coming!