Storage Manager Scalability on CMPs Ippokratis Pandis CIDR 2009 - Gong Show.

Slides:



Advertisements
Similar presentations
From A to E: Analyzing TPCs OLTP Benchmarks Pınar Tözün Ippokratis Pandis* Cansu Kaynak Djordje Jevdjic Anastasia Ailamaki École Polytechnique Fédérale.
Advertisements

1 Copyright © 2012 Oracle and/or its affiliates. All rights reserved. Convergence of HPC, Databases, and Analytics Tirthankar Lahiri Senior Director, Oracle.
Critical Sections: Re-emerging Concerns for DBMS Ryan JohnsonIppokratis Pandis Anastasia Ailamaki Carnegie Mellon University École Polytechnique Féderale.
@ Carnegie Mellon Databases Data-oriented Transaction Execution VLDB 2010 Ippokratis Pandis Ryan Johnson Nikos Hardavellas Anastasia Ailamaki Carnegie.
© 2010 Ippokratis Pandis Aether: A Scalable Approach to Logging VLDB 2010 Ryan Johnson Ippokratis Pandis Radu Stoica Manos Athanassoulis Anastasia Ailamaki.
Improving OLTP scalability using speculative lock inheritance Ryan Johnson, Ippokratis Pandis, Anastasia Ailamaki.
OLTP on Hardware Islands Danica Porobic, Ippokratis Pandis*, Miguel Branco, Pınar Tözün, Anastasia Ailamaki Data-Intensive Application and Systems Lab,
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
High Performing Cache Hierarchies for Server Workloads
1 Database Servers on Chip Multiprocessors: Limitations and Opportunities Nikos Hardavellas With Ippokratis Pandis, Ryan Johnson, Naju Mancheril, Anastassia.
ECE 454 Computer Systems Programming Parallel Architectures and Performance Implications (II) Ding Yuan ECE Dept., University of Toronto
PERFORMANCE ANALYSIS OF MULTIPLE THREADS/CORES USING THE ULTRASPARC T1 (NIAGARA) Unique Chips and Systems (UCAS-4) Dimitris Kaseridis & Lizy K. John The.
Parallel Database Systems
Thread-Level Transactional Memory Decoupling Interface and Implementation UW Computer Architecture Affiliates Conference Kevin Moore October 21, 2004.
Scaling up a Web-Based Intelligent Tutoring System Jozsef Patvarczki, Shane Almeida, and Neil Heffernan Computer Science Department Our research team has.
G Robert Grimm New York University Disconnected Operation in the Coda File System.
Extended Memory Semantics for Thread Synchronization Sheng Li, Ying Zhou Operating System Progress Report Nov 1 st, 2007 Sheng Li, Ying Zhou Operating.
Hoard: A Scalable Memory Allocator for Multithreaded Applications -- Berger et al. -- ASPLOS 2000 Emery Berger, Kathryn McKinley *, Robert Blumofe, Paul.
SCHEDULER ACTIVATIONS Effective Kernel Support for the User-level Management of Parallelism Thomas E. Anderson, Brian N. Bershad, Edward D. Lazowska, Henry.
G Robert Grimm New York University SGI’s XFS or Cool Pet Tricks with B+ Trees.
CS533 Concepts of Operating Systems Class 2 Thread vs Event-Based Programming.
Tianzheng Wang Ryan Johnson Alan Fekete Ippokratis Pandis The Serial Safety Net: Efficient concurrency control on modern hardware 1.
Threads Chapter 4. Modern Process & Thread –Process is an infrastructure in which execution takes place  (address space + resources) –Thread is a program.
Database System Architectures  Client-server Database System  Parallel Database System  Distributed Database System Wei Jiang.
Single-Chip Multi-Processors (CMP) PRADEEP DANDAMUDI 1 ELEC , Fall 08.
File Systems and N/W attached storage (NAS) | VTU NOTES | QUESTION PAPERS | NEWS | VTU RESULTS | FORUM | BOOKSPAR ANDROID APP.
Performance and Scalability. Performance and Scalability Challenges Optimizing PerformanceScaling UpScaling Out.
Hardware.  Learn what hardware is  Learn different input and output devices  Learn what the CPU is.
Multi-core Programming Thread Profiler. 2 Tuning Threaded Code: Intel® Thread Profiler for Explicit Threads Topics Look at Intel® Thread Profiler features.
Parallel Programming Models Jihad El-Sana These slides are based on the book: Introduction to Parallel Computing, Blaise Barney, Lawrence Livermore National.
Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University.
Uncovering the Multicore Processor Bottlenecks Server Design Summit Shay Gal-On Director of Technology, EEMBC.
1 COMPSCI 110 Operating Systems Who - Introductions How - Policies and Administrative Details Why - Objectives and Expectations What - Our Topic: Operating.
H-Store: A Specialized Architecture for High-throughput OLTP Applications Evan Jones (MIT) Andrew Pavlo (Brown) 13 th Intl. Workshop on High Performance.
Enterprise Grid in Financial Services Nick Werstiuk
CSC 7600 Lecture 28 : Final Exam Review Spring 2010 HIGH PERFORMANCE COMPUTING: MODELS, METHODS, & MEANS FINAL EXAM REVIEW Daniel Kogler, Chirag Dekate.
Databases Illuminated
Classic Model of Parallel Processing
Data Management for Decision Support Session-4 Prof. Bharat Bhasker.
Computer Network Lab. Korea University Computer Networks Labs Se-Hee Whang.
Processor Level Parallelism. Improving the Pipeline Pipelined processor – Ideal speedup = num stages – Branches / conflicts mean limited returns after.
Parallelism without Concurrency Charles E. Leiserson MIT.
CPU The Central Processing Unit (CPU), has 3 main parts: Control Unit Arithmetic and Logic Unit Registers. These components are connected to the rest.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
CS533 Concepts of Operating Systems Jonathan Walpole.
Shouqing Hao Institute of Computing Technology, Chinese Academy of Sciences Processes Scheduling on Heterogeneous Multi-core Architecture.
University of Michigan Electrical Engineering and Computer Science 1 Embracing Heterogeneity with Dynamic Core Boosting Hyoun Kyu Cho and Scott Mahlke.
Unit - 4 Introduction to the Other Databases.  Introduction :-  Today single CPU based architecture is not capable enough for the modern database.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 4: Threads.
GPFS: A Shared-Disk File System for Large Computing Clusters Frank Schmuck & Roger Haskin IBM Almaden Research Center.
Synchronization in Distributed File Systems Advanced Operating System Zhuoli Lin Professor Zhang.
On Transactional Memory, Spinlocks and Database Transactions Khai Q. Tran Spyros Blanas Jeffrey F. Naughton (University of Wisconsin Madison)
Programming Multi-Core Processors based Embedded Systems A Hands-On Experience on Cavium Octeon based Platforms Lab Exercises: Lab 1 (Performance measurement)
EJB Enterprise Java Beans JAVA Enterprise Edition
Shared Nothing Architecture Allen Archer. What is Shared Nothing architecture? It is a distributed architecture in which each node is independent and.
My Coordinates Office EM G.27 contact time:
Tuning Threaded Code with Intel® Parallel Amplifier.
Elec/Comp 526 Spring 2015 High Performance Computer Architecture Instructor Peter Varman DH 2022 (Duncan Hall) rice.edux3990 Office Hours Tue/Thu.
COMPSCI 110 Operating Systems
EECS 582 Midterm Review Mosharaf Chowdhury EECS 582 – W16.
The Multikernel: A New OS Architecture for Scalable Multicore Systems
COMPSCI 110 Operating Systems
Introduction to NewSQL
Multi-Processing in High Performance Computer Architecture:
Multi-Processing in High Performance Computer Architecture:
EECS 582 Midterm Review Mosharaf Chowdhury EECS 582 – F16.
What is Parallel and Distributed computing?
Database Servers on Chip Multiprocessors: Limitations and Opportunities Nikos Hardavellas With Ippokratis Pandis, Ryan Johnson, Naju Mancheril, Anastassia.
Design Components are Code Components
Distributed Systems and Concurrency: Distributed Systems
Presentation transcript:

Storage Manager Scalability on CMPs Ippokratis Pandis CIDR Gong Show

Scalability on CMP 2 Sun Niagara I - 32 HW Contexts Update intensive workload No database lock conflicts Linear (ideal) scaling Scalability problems Are database systems ready for modern hardware??

Problem? Too many active concurrent threads Physical contention on centralized structures L2 CPU-0 L1 CPU-1 L1 CPU-2 L1 CPU-N L1 Many HW Contexts (16-64+) 20 cycles 1-25 GB/sec L2 CPU-0 L1 CPU-1 L1 CPU-2 L1 CPU-N L1 L2 CPU-0 L1 CPU-1 L1 CPU-2 L1 CPU-N L1

Breaking the barrier 4 Distribute everything! Locking, Logging, etc… Rethink parallel databases on CMP context Instead of Moving data to computation Unregulated access to resources Distribute computation closer to data Control which thread is accessing each resource

Take-away Message Focus on Scalability!! – Not Single-thread Performance Centralized mechanisms are evil Distribute computation to data – Instead of moving data to computation Use Shore-mt*!! *EDBT2009 – Soon at