Relational Cloud: A Database-as-a-Service for the Cloud Carlo Curino, Evan Jones, Raluca Ada Popa, Nirmesh Malviya, Eugene Wu, Sam Madden, Hari Balakrishnan,

Presentation transcript:

Relational Cloud: A Database-as-a-Service for the Cloud
Carlo Curino, Evan Jones, Raluca Ada Popa, Nirmesh Malviya, Eugene Wu, Sam Madden, Hari Balakrishnan, Nickolai Zeldovich
Presented by Arka Bhattacharya (for CS 294, Berkeley)
(Some slides are taken from the CIDR '11 talk.)

THE STARTUP STORY

Motivation
- Why move to the cloud?
  - Economies of scale (hardware and licensing costs)
  - Pay per use and lower administrative costs
- Present players:
  - Amazon RDS (MySQL on EC2)
  - Microsoft SQL Azure

Problems
- Problems arising:
  - Efficient multi-tenancy (provider)
  - Elastic scalability (provider)
  - Privacy (user)
- Note: Relational Cloud is mainly for OLTP workloads & DAS architectures, consistency guarantees

1. Efficient Multi-tenancy – Placement & Migrations
- Problem: Consolidate databases onto the smallest number of servers, balancing load without affecting performance
- Solution: Kairos, SIGMOD '11
  - Up to 17:1 consolidation
- Key insight: A single database server per machine hosting multiple logical databases (as opposed to a DB in a VM, or multiple DB servers per machine)
  - Reduces redundant work; allows group commits, lower RAM wastage, code sharing, and cheaper context switches

Kairos (cont'd)
- Measure the RAM, CPU, and disk usage of each database, and estimate the combined load
  - RAM: Probe table to gauge working set size; additive
  - Disk: Derive a model by testing the DBMS with different write rates and working set sizes and measuring the amount of I/O
  - CPU: additive
- Frame consolidation as an optimization problem (non-linear programming)
  - Solving takes time
  - After lots of heuristics, the optimization terminates in 8 minutes for 20 servers and 100 workloads!
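To make the consolidation problem concrete, here is a minimal sketch that packs workloads onto servers under the additive RAM and CPU assumption from the slide. It is not the Kairos solver: it swaps the non-linear program for a simple first-fit-decreasing heuristic, ignores disk entirely (the slide says disk needs an empirical model), and the `Workload`, `Server`, and `consolidate` names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Workload:
    name: str
    ram_gb: float  # estimated working-set size
    cpu: float     # estimated CPU load, in cores


@dataclass
class Server:
    ram_gb: float
    cpu: float
    placed: list = field(default_factory=list)

    def fits(self, w: Workload) -> bool:
        # Assumption from the slide: RAM and CPU demands simply add up.
        used_ram = sum(p.ram_gb for p in self.placed)
        used_cpu = sum(p.cpu for p in self.placed)
        return used_ram + w.ram_gb <= self.ram_gb and used_cpu + w.cpu <= self.cpu


def consolidate(workloads, ram_gb, cpu):
    """First-fit decreasing: put each workload on the first server with room."""
    servers = []
    for w in sorted(workloads, key=lambda w: (w.ram_gb, w.cpu), reverse=True):
        target = next((s for s in servers if s.fits(w)), None)
        if target is None:
            target = Server(ram_gb, cpu)
            if not target.fits(w):
                raise ValueError(f"{w.name} does not fit on an empty server")
            servers.append(target)
        target.placed.append(w)
    return servers


# Hypothetical workloads and server size, for illustration only.
demo = [Workload("a", 6, 1), Workload("b", 5, 1), Workload("c", 4, 1),
        Workload("d", 3, 1), Workload("e", 2, 1)]
servers = consolidate(demo, ram_gb=10, cpu=4)
```

A greedy heuristic like this gives no optimality guarantee; it only illustrates what "estimate combined load, then place" means in practice.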

2. Elastic Scalability – Database Partitioning
- Problem: Partition an OLTP database into N chunks so as to maximize performance
- Solution: Schism, VLDB 2010
  - Close to optimal
- Key insight: Minimize the number of distributed transactions
  - Advantage over hashing and round-robin
  - Use a workload trace to find good partitions
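The key insight can be illustrated with a small sketch that builds a co-access graph from a workload trace and counts how many transactions become distributed under a given tuple-to-partition assignment. The trace, the tuple ids, and both assignments are made up for illustration, and the "workload-aware" assignment is chosen by hand rather than by a graph partitioner.

```python
from collections import Counter
from itertools import combinations


def co_access_graph(trace):
    """Edge weight = number of transactions that touch both tuples."""
    edges = Counter()
    for txn in trace:
        for a, b in combinations(sorted(set(txn)), 2):
            edges[(a, b)] += 1
    return edges


def distributed_txns(trace, assignment):
    """Count transactions whose tuples live on more than one partition."""
    return sum(1 for txn in trace if len({assignment[t] for t in txn}) > 1)


# Hypothetical trace: each transaction is the set of tuple ids it accesses.
trace = [{1, 2}, {1, 2}, {3, 4}, {3, 4}, {2, 3}]

# Round-robin by tuple id versus a hand-picked workload-aware placement.
round_robin = {t: t % 2 for t in (1, 2, 3, 4)}
workload_aware = {1: 0, 2: 0, 3: 1, 4: 1}

graph = co_access_graph(trace)
rr_cost = distributed_txns(trace, round_robin)
wa_cost = distributed_txns(trace, workload_aware)
```

In this toy trace, round-robin splits every transaction across partitions, while the workload-aware placement cuts only the lightest edge of the co-access graph.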

Schism (cont'd)

Schism (cont'd)
- Use a classifier to capture the partitioning in compact form, for efficient query routing
- Lots of heuristics to choose a good workload sample
  - Sampling, blanket-statement filtering, etc.
- Graph partitioning is fast (< 40 sec)
- Achieves almost linear scalability!
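One way to picture a "compact form" for routing is a handful of range rules on a key, the kind of output a classifier trained on the partition labels might yield. The sketch below is only that: the boundaries, partition numbers, and the `route` function are hypothetical, and no classifier is trained here.

```python
from bisect import bisect_right

# Hypothetical rules: key < 1000 -> partition 0, key < 5000 -> 1, otherwise 2.
BOUNDS = [1000, 5000]
PARTITIONS = [0, 1, 2]


def route(key: int) -> int:
    """Return the partition for a key using the sorted range boundaries."""
    return PARTITIONS[bisect_right(BOUNDS, key)]
```

A router holding two boundaries can place any key with a binary search, instead of storing one lookup entry per tuple.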

3. Privacy
- Problem:
  - Prevent the DBA from snooping on data
  - Ensure data security when the application or DBMS server is compromised
- Solution: CryptDB, SOSP 2011
  - Low overhead (~22.5%)
- Key insight: Adjustable security

CryptDB: Onions
(Figure: three encryption onions; layer labels as they appear on the slide)
- Onion 1: any value; DET (equality join); DET (equality selection); RND
- Onion 2: any value; OPE (inequality join); OPE (inequality select); RND
- Onion 3: int value; HOM
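The idea of adjustable security can be sketched as a toy state machine: each onion starts at its outermost layer (RND) and is peeled only as far as a query's operator requires. No encryption is performed here; the onion names `eq` and `ord`, the `NEEDS` mapping, and the `Column` class are hypothetical, and the join layers are left out for brevity.

```python
# Layers from outermost (most secure) to innermost; a toy model, no real crypto.
ONIONS = {
    "eq": ["RND", "DET"],
    "ord": ["RND", "OPE"],
}
# Assumed mapping from SQL operator to the onion and layer it needs.
NEEDS = {"=": ("eq", "DET"), "<": ("ord", "OPE")}


class Column:
    def __init__(self):
        # Every onion starts fully wrapped at its outermost layer.
        self.level = {name: 0 for name in ONIONS}

    def exposed(self, onion):
        """Name of the layer currently visible to the DBMS for this onion."""
        return ONIONS[onion][self.level[onion]]

    def prepare(self, op):
        """Peel the relevant onion just far enough for `op`; never re-wrap."""
        onion, layer = NEEDS[op]
        needed = ONIONS[onion].index(layer)
        self.level[onion] = max(self.level[onion], needed)
        return self.exposed(onion)


col = Column()
before = col.exposed("eq")
after_eq = col.prepare("=")
ord_untouched = col.exposed("ord")
```

A column that is never compared for equality or order stays at RND, so the server learns only as much as the workload actually requires.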

Overall architecture
(Figure: system architecture diagram; recoverable labels: DB stats, Partitions & placements)

Relational Cloud
- Advantages:
  - Unmodified DB backends
  - Workload-aware consolidation
  - Workload-aware sharding
  - High availability via replication of front-end servers
  - SQL over encrypted data