@andy_pavl o OLTP on NVM: YM MV. The Last Six Months ? PDL Retreat October 2013 PDL Visit Day May 2014.

Slides:



Advertisements
Similar presentations
MOST COMMON ORACLE SYSTEM-LEVEL TUNING OPPORTUNITIES MARCH2009 Jeff Lippe Master Solution Architect.
Advertisements

OLTP is Totally Different by Michael Stonebraker.
C.R.E.A.M. C ACHE R ULES E VERYTHING A ROUND M E.
Introduction to the new mainframe © Copyright IBM Corp., All rights reserved. Chapter 2: z/OS Overview.
What will my performance be? Resource Advisor for DB admins Dushyanth Narayanan, Paul Barham Microsoft Research, Cambridge Eno Thereska, Anastassia Ailamaki.
1 Overview of Storage and Indexing Chapter 8 (part 1)
Chapter 6: Database Evolution Title: AutoAdmin “What-if” Index Analysis Utility Authors: Surajit Chaudhuri, Vivek Narasayya ACM SIGMOD 1998.
Chapter 3 Data Storage and Access Methods Title: Operating Systems Support for Database Management Author: Michael Stonebraker Pages: 217 – 223 Group 01:
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Distributed and Streaming Evaluation of Batch Queries for Data-Intensive Computational Turbulence Kalin Kanov Department of Computer Science Johns Hopkins.
CPSC-608 Database Systems Fall 2010 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes #5.
File Organizations and Indexing Lecture 4 R&G Chapter 8 "If you don't find it in the index, look very carefully through the entire catalogue." -- Sears,
1.1 CAS CS 460/660 Introduction to Database Systems File Organization Slides from UC Berkeley.
1 Overview of Storage and Indexing Chapter 8 1. Basics about file management 2. Introduction to indexing 3. First glimpse at indices and workloads.
Anti-Caching in Main Memory Database Systems Justin DeBrabant Brown University
Chapter 1 Database and Database Users Dr. Bernard Chen Ph.D. University of Central Arkansas.
Cloud Computing Lecture Column Store – alternative organization for big relational data.
Continuous resource monitoring for self-predicting DBMS Dushyanth Narayanan 1 Eno Thereska 2 Anastassia Ailamaki 2 1 Microsoft Research-Cambridge, 2 Carnegie.
Lecture 11: DMBS Internals
The Worlds of Database Systems Chapter 1. Database Management Systems (DBMS) DBMS: Powerful tool for creating and managing large amounts of data efficiently.
Course Introduction Introduction to Databases Instructor: Joe Bockhorst University of Wisconsin - Milwaukee.
Database and Database Users. Outline Database Introduction An Example Characteristics of the Database Actors on the Scene Advantages of using the DBMS.
Task Scheduling for Highly Concurrent Analytical and Transactional Main-Memory Workloads Iraklis Psaroudakis (EPFL), Tobias Scheuer (SAP AG), Norman May.
Logging in Flash-based Database Systems Lu Zeping
In-Memory Database 전준민, 정주성, 이한민, 곽하녹.
VLDB2012 Hoang Tam Vo #1, Sheng Wang #2, Divyakant Agrawal †3, Gang Chen §4, Beng Chin Ooi #5 #National University of Singapore, †University of California,
Oracle Advanced Compression – Reduce Storage, Reduce Costs, Increase Performance Session: S Gregg Christman -- Senior Product Manager Vineet Marwah.
H-Store: A Specialized Architecture for High-throughput OLTP Applications Evan Jones (MIT) Andrew Pavlo (Brown) 13 th Intl. Workshop on High Performance.
1 Overview of Storage and Indexing Chapter 8 (part 1)
IN-MEMORY OLTP By Manohar Punna SQL Server Geeks – Regional Mentor, Hyderabad Blogger, Speaker.
A Few Things You Don’t Know You Have To Worry About Yet David Boyes CMG Philadelphia, Nov 2008.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
Database Systems Lecture 1. In this Lecture Course Information Databases and Database Systems Some History The Relational Model.
CPSC-608 Database Systems Fall 2015 Instructor: Jianer Chen Office: HRBB 315C Phone: Notes #5.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Bits Eugene Wu, Carlo Curino, Sam Madden
@andy_pavl o OLTP on the NVM SDV: YMMV January Retreat Thesis Defense December Retreat Job Interviews Moved to CMU.
DMBS Internals I February 24 th, What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
DMBS Architecture May 15 th, Generic Architecture Query compiler/optimizer Execution engine Index/record mgr. Buffer manager Storage manager storage.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Overview of Storage and Indexing Chapter 8.
October 15-18, 2013 Charlotte, NC Accelerating Database Performance Using Compression Joseph D’Antoni, Solutions Architect Anexinet.
File System Department of Computer Science Southern Illinois University Edwardsville Spring, 2016 Dr. Hiroshi Fujinoki CS 314.
The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Presenter: Chao-Han Tsai (Some slides adapted from the Google’s series lectures)
Troubleshooting Dennis Shasha and Philippe Bonnet, 2013.
1 Lecture 16: Data Storage Wednesday, November 6, 2006.
1 Overview of Storage and Indexing Chapter 8. 2 Review: Architecture of a DBMS  A typical DBMS has a layered architecture.  The figure does not show.
Persistent Memory (PM)
Microsoft Ignite /22/2018 7:21 PM BRK2007
Lecture 16: Data Storage Wednesday, November 6, 2006.
Database Management Systems (CS 564)
Windows Azure Migrating SQL Server Workloads
OLTP on NVM:
Chapter Overview Understanding the Database Architecture
CS 286: Memory Paging and Virtual Memory
Lecture 11: DMBS Internals
Adda Quinn 1974 Nancy Wheeler Jenkins 1978.
Real world In-Memory OLTP
Predictive Performance
Computer Architecture
Anti-Caching in Main Memory Database Systems
Hybrid Indexes Reducing the Storage Overhead of
Introduction to Database Systems
September Workshop and Advisory Board Meeting Presenter Affiliation
Cloud computing mechanisms
September Workshop and Advisory Board Meeting Presenter Affiliation
Benchmarking Cloud Serving Systems with YCSB
File Storage and Indexing
Department of Computer Science
CS222/CS122C: Principles of Data Management UCI, Fall 2018 Notes #03 Row/Column Stores, Heap Files, Buffer Manager, Catalogs Instructor: Chen Li.
Presentation transcript:

@andy_pavl o OLTP on NVM: YM MV

The Last Six Months ? PDL Retreat October 2013 PDL Visit Day May 2014

Prison Life Washing Dishes Not Fighting Repentant Cafeteria Thievery Shankings Making Pruno G O OD EV IL

NVM OLTP Lightweight CC Logical Logging Snapshots Heavyweight CC ARIES Logging Making Pruno DR AM SSD/ HDD

Overview 9 Understand the performance characteristics of NVM to develop an optimal DBMS architecture for OLTP workloads.

OLTP Workloads 10 Transactions have three defining characteristics: –Short-lived. –Small footprint. –Repetitive.

Intel NVM Emulator 11 Instrumented motherboard that slows down access to the memory controller. Two execution interfaces: –NUMA (NVM-only) –PMFS (DRAM+NVM)

NUMA Interface – NVM- Only 12 Virtual CPU where all memory access uses the NVM portion of DRAM. No change to application code.

PMFS Interface – DRAM+NVM 13 Special filesystem designed for byte- addressable NVM. Avoids overhead of traditional filesystems.

DBMS Architectures 14 Disk-oriented. Main memory-oriented.

Disk-oriented DBMS 15 Pessimistic assumption that the data a txn needs is not in memory Based on the design assumptions made in the 1970s. –Ingres (Berkeley) –System R (IBM)

Primary Storage Buffer Pool Application ARIES LOG

Memory-oriented DBMS 17 Assume that all data fits in memory. Avoid the overhead of concurrency control + recovery. –SmallBase (AT&T) –Hekaton (Microsoft) –H-Store/VoltDB (Me & others…)

Snapshots Primary Storage Application CMD LOG

Experimental Evaluation 19 Compare the DBMS architectures on the two NVM interfaces. Yahoo! Cloud Serving Benchmark: –10 million records (~10GB) –8x database / memory –Variable skew

Evaluated Systems 20 NVM-Only –H-Store (v2014) –MySQL (v5.5) NVM+DRAM –H-Store + Anti-Caching (v2014) –MySQL (v5.5)

21 YCSB // TXN/SEC H-Store NUMA Interface (NVM-Only) Read-Only Workload 2x Latency Relative to DRAM SKEW AMOUNT (HIGH → LOW) MySQL

22 YCSB // TXN/SEC Anti-Caching PMFS Interface (NVM+DRAM) Read-Only Workload 2x Latency Relative to DRAM SKEW AMOUNT (HIGH → LOW) MySQL

23 YCSB // TXN/SEC NUMA Interface (NVM-Only) Write-Heavy Workload 2x Latency Relative to DRAM SKEW AMOUNT (HIGH → LOW) H-Store MySQL

24 YCSB // TXN/SEC PMFS Interface (NVM+DRAM) Write-Heavy Workload 2x Latency Relative to DRAM SKEW AMOUNT (HIGH → LOW) Anti-Caching MySQL

Discussion 25 NVM latency did not make a big difference in performance. Logging is major bottleneck in DBMS performance on NVM. MySQL wastes NVM space.

Database CMU

Database CMU 27 N-Store H-Store + Anti-Caching ThomasDB MongoDB

N-STORE

N-Store 29 NVM-only Architecture. Hybrid OLTP/OLAP DBMS: –High-performance txn processing. –Low-latency analytical operations. –Instant recovery.

30 N-Store – Shadow Paging Shadow Page Table Master Page Table DB Root X X X X

H-STORE

H-Store + Anti-Caching 32 Allows DBMS to mange DBs that are larger than amount of DRAM. Reducing memory overhead of evicted data. Using ML to reduce disk I/O.

MAIN TUPLE TABLE HEADER { EVICTED TUPLE TABLE INDEXES ANTI-CACHE Ø ANTI-CACHE BLOCKSLRU CHAIN 33

H-Store + Anti-Caching 34 Bloom filter usage tracking. Multi-faceted indexes. Cold storage approximations. Semantic block clustering.

ThomasD B

36 High-performance, low- overhead incremental computation platform. Maintains mapping between function invocations and data.

37 ThomasDB – Data Marts PUBLIC DATABASE PRIVATE DATABASE UDF Update! Materialized View ANALYTICS PROGRAM

38 ThomasDB – Preliminary Results MAP FUNCTION OVER WIKIPEDIA CORPUS

MongoDB

Distributed Document Database Design 40 Automatic tool selects the near-optimal physical design: –Sharding Keys –Denormalization –Indexes

Vigilante DBA 41 Automated framework for finding DB applications and fixing them. Build a large catalog of database applications from public sources.

Summary 42 Lots of database stuff in the works. Always looking for industry collaborators.

@ANDY _ PAVLO END db.cs.cmu.edu

This talk is in compliance with the Federal Bureau of Prisons (FBP) Act (1930) Pub. L. No , 46 Stat All reference herein to any specific commercial products, process, or service by trade name, trademark, manufacturer, or otherwise, does not necessarily constitute or imply its endorsement, recommendation, or favoring by the FBP. The views and opinions of authors expressed herein do not necessarily state or reflect those of the FBP, and shall not be used for evaluation by including, but not limited, to FBP parole board, its directors, and other corrections officers. Regulations are contained in Title 28, Chapter V of the Code of Federal Regulations (CFR). Contact with parolees for research questions is regulated under 28 C.F.R All complaints about the validity of said research should be directed to the 28 C.F.R. 542 Administrative Remedy Program for a formal review of an issue relating to any aspect of the evaluation that "does not apply to inmates confined in other non-federal facilities."

Andy Pavlo Col. Stan Zdonik Mike Stonebraker Justin DeBrabant Joy Arulraj Subramanya Dulloor Rajesh Sankaran Jeff Parkhurst