Tom Van Steenkiste Supervisor: Predrag Buncic

Slides:



Advertisements
Similar presentations
High throughput chain replication for read-mostly workloads
Advertisements

Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
An Adaptable Benchmark for MPFS Performance Testing A Master Thesis Presentation Yubing Wang Advisor: Prof. Mark Claypool.
Lecture 6 – Google File System (GFS) CSE 490h – Introduction to Distributed Computing, Winter 2008 Except as otherwise noted, the content of this presentation.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Farsite: Ferderated, Available, and Reliable Storage for an Incompletely Trusted Environment Microsoft Reseach, Appear in OSDI’02.
5.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 5: Working with File Systems.
FLANN Fast Library for Approximate Nearest Neighbors
Northwestern University 2007 Winter – EECS 443 Advanced Operating Systems The Google File System S. Ghemawat, H. Gobioff and S-T. Leung, The Google File.
Lecture The Client/Server Database Environment
Google Distributed System and Hadoop Lakshmi Thyagarajan.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
ObliviStore High Performance Oblivious Cloud Storage Emil StefanovElaine Shi
FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment.
CSC 456 Operating Systems Seminar Presentation (11/13/2012) Leon Weingard, Liang Xin The Google File System.
Some key-value stores using log-structure Zhichao Liang LevelDB Riak.
Software Engineering for Business Information Systems (sebis) Department of Informatics Technische Universität München, Germany wwwmatthes.in.tum.de Data-Parallel.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Ideas to Improve SharePoint Usage 4. What are these 4 Ideas? 1. 7 Steps to check SharePoint Health 2. Avoid common Deployment Mistakes 3. Analyze SharePoint.
Fragmentation in Large Object Repositories Russell Sears Catharine van Ingen CIDR 2007 This work was performed at Microsoft Research San Francisco with.
Integration of a key-value database in the parameter management system of the ALFA framework CERN openlab Summer Students Lightning Talk Sessions Tom Van.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
MapReduce and GFS. Introduction r To understand Google’s file system let us look at the sort of processing that needs to be done r We will look at MapReduce.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
1 An Adaptive File Distribution Algorithm for Wide Area Network Takashi Hoshino, Kenjiro Taura, Takashi Chikayama University of Tokyo.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Department of Computing, School of Electrical Engineering and Computer Sciences, NUST - Islamabad KTH Applied Information Security Lab Secure Sharding.
CSI 3125, Preliminaries, page 1 SERVLET. CSI 3125, Preliminaries, page 2 SERVLET A servlet is a server-side software program, written in Java code, that.
CHAPTER 7 CLUSTERING SERVERS. CLUSTERING TYPES There are 2 types of clustering ; Server clusters Network Load Balancing (NLB) The difference between the.
Predrag Buncic Future IT challenges for ALICE Technical Workshop November 6, 2015.
Scalability == Capacity * Density.
Parallel IO for Cluster Computing Tran, Van Hoai.
Implementation of Simple Cloud-based Distributed File System Group ID: 4 Baolin Wu, Liushan Yang, Pengyu Ji.
Distributed Systems Architecure. Architectures Architectural Styles Software Architectures Architectures versus Middleware Self-management in distributed.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Fault – Tolerant Distributed Multimedia Streaming Web Application By Nirvan Sagar – Srishti Ganjoo – Syed Shahbaaz Safir
Apache Ignite Compute Grid Research Corey Pentasuglia.
Outline Introduction and motivation, The architecture of Tycho,
Machine Learning Library for Apache Ignite
OpenLegacy Training Day Four Introduction to Microservices
性能测试那些事儿 刘博 ..
The Client/Server Database Environment
Software Architecture in Practice
Alternative system models
Spark Presentation.
CSE-291 Cloud Computing, Fall 2016 Kesden
Noah Treuhaft UC Berkeley ROC Group ROC Retreat, January 2002
Chapter 9: The Client/Server Database Environment
Cloud Data Anonymization Using Hadoop Map-Reduce Framework With Qos Evaluation and Behaviour analysis PROJECT GUIDE: Ms.S.Subbulakshmi TEAM MEMBERS: A.Mahalakshmi( ).
Introduction to Microservices Prepared for
Algorithms for Big Data Delivery over the Internet of Things
Introduction to Networks
Introduction to Networks
Elastic Consistent Hashing for Distributed Storage Systems
Understanding System Characteristics of Online Erasure Coding on Scalable, Distributed and Large-Scale SSD Array Systems Sungjoon Koh, Jie Zhang, Miryeong.
A Survey on Distributed File Systems
Providing Secure Storage on the Internet
Oracle Architecture Overview
Architectures of distributed systems Fundamental Models
Architectures of distributed systems Fundamental Models
Architectures of distributed systems
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Outline Introduction LSM-tree and LevelDB Architecture WiscKey.
Architectures of distributed systems Fundamental Models
Building global and highly-available services using Windows Azure
5/7/2019 Map Reduce Map reduce.
Introducing NTFS Reliability Security Long file names Efficiency
Presentation transcript:

Tom Van Steenkiste Supervisor: Predrag Buncic Integration of a key-value database in the parameter management system of the ALFA framework Tom Van Steenkiste Supervisor: Predrag Buncic

Detector parameters geometry position calibration … ROOT ASCII (SQL) ? 28/08/2015 Tom Van Steenkiste

Key-value database requirements 1kB - 50MB 200 files read - 60 files written 200MB read - 30MB written C/C++ interface 28/08/2015 Tom Van Steenkiste

Key-value database requirements 1kB, 10kB, 100kB, 1MB, 10MB, and 50MB 3.33 reads per write random access pattern C/C++ interface 28/08/2015 Tom Van Steenkiste

Key-value database candidates RAMCloud Riak … 28/08/2015 Tom Van Steenkiste

Case-study: RAMCloud low latency scalability C++ 1MB limit chunking? 28/08/2015 Tom Van Steenkiste

Case-study: RIAK high availability scalability fault tolerance configurable 28/08/2015 Tom Van Steenkiste

Case-study: Riak C/C++ client very buggy last commit 1 year ago no future support 28/08/2015 Tom Van Steenkiste

Case-study: Riak 3rd party C/C++ client very buggy last commit 1 year ago incomplete 28/08/2015 Tom Van Steenkiste

Case-study: Riak Java client official library future support available now 28/08/2015 Tom Van Steenkiste

Case-study: Riak 28/08/2015 Tom Van Steenkiste

Case-study: Riak Java client easily configurable any protobuf language distributed 28/08/2015 Tom Van Steenkiste

Performance framework existing benchmarks not full pipeline only certain use cases sometimes wrong 28/08/2015 Tom Van Steenkiste

Performance framework 28/08/2015 Tom Van Steenkiste

Performance framework keyvalueclusterperf full pipeline universal highly configurable easily extendable 28/08/2015 Tom Van Steenkiste

Case-study: RAMCloud object write speed 0B: 15µs 1MB: very slow +1MB: not possible 28/08/2015 Tom Van Steenkiste

Case-study: Riak requirements 1kB, 10kB, 100kB, 1MB, 10MB, and 50MB 3.33 reads per write random access pattern C/C++ interface 28/08/2015 Tom Van Steenkiste

Case-study: Riak 28/08/2015 Tom Van Steenkiste

Case-study: Riak read vs write latency Size Read duration[s] Write duration[s] % Difference 1kB 0.003456 0.003455 -0.033927 10kB 0.003673 0.003675 0.066285 100kB 0.008708 0.005512 1MB 0.085394 0.085384 -0.010775 10MB 0.936064 0.935816 -0.026501 28/08/2015 Tom Van Steenkiste

Case-study: Riak read vs write latency small differences under load costs amortized no need for separate analysis 28/08/2015 Tom Van Steenkiste

Case-study: Riak sample size 28/08/2015 Tom Van Steenkiste

Case-study: Riak sample size 28/08/2015 Tom Van Steenkiste

Case-study: Riak sample size amount of simulations 250 is sufficient convergence small amount of simulations multiple iterations 28/08/2015 Tom Van Steenkiste

Case-study: Riak storage backend Bitcask log-structured LevelDb compression 28/08/2015 Tom Van Steenkiste

Case-study: Riak storage backend object size: 1kB 28/08/2015 Tom Van Steenkiste

Case-study: Riak storage backend Bitcask best latency nodes out-of-memory LevelDb similar performance compressed storage 28/08/2015 Tom Van Steenkiste

Case-study: Riak object sizes: latency Riak nodes: 4 28/08/2015 Tom Van Steenkiste

Case-study: Riak object sizes: throughput Riak nodes: 4 28/08/2015 Tom Van Steenkiste

Case-study: Riak object sizes always same trend strong increase in latency constant throughput large objects high latency but only large objects (not realistic) 28/08/2015 Tom Van Steenkiste

Case-study: Riak cluster size: latency 28/08/2015 Tom Van Steenkiste

Case-study: Riak cluster size: throughput 28/08/2015 Tom Van Steenkiste

Case-study: Riak cluster size larger cluster better latency better throughput (better availability) 28/08/2015 Tom Van Steenkiste

Case-study: Riak configuration object size: 1kB 28/08/2015 Tom Van Steenkiste

Case-study: Riak configuration object size: 1MB 28/08/2015 Tom Van Steenkiste

Case-study: Riak configuration small objects noWrite: improved performance noRead: degraded performance large objects noWrite: degraded performance see further… (difficulties in simulation) 28/08/2015 Tom Van Steenkiste

Case-study: Riak security 28/08/2015 Tom Van Steenkiste

Case-study: Riak security authentication weak collision resistance (pre-image) encryption not required 1 entry point load balancer firewall 28/08/2015 Tom Van Steenkiste

Case-study: Riak security SHA1 sufficient for weak collision resistance a lot of implementations VMAC build for speed only C++, no Java 28/08/2015 Tom Van Steenkiste

Case-study: Riak security object size: 1kB 28/08/2015 Tom Van Steenkiste

Case-study: Riak security object size: 1MB 28/08/2015 Tom Van Steenkiste

Case-study: Riak security small objects VMAC very efficient large objects strange results IO bound operation? incorrect results? see further… (difficulties in simulation) 28/08/2015 Tom Van Steenkiste

Case-study: Riak amount of Java clients 28/08/2015 Tom Van Steenkiste

Case-study: Riak amount of Java clients object size: 1kB 28/08/2015 Tom Van Steenkiste

Case-study: Riak amount of Java clients object size: 1MB 28/08/2015 Tom Van Steenkiste

Case-study: Riak amount of Java clients small objects more Java clients better less Riak nodes better large objects more Riak nodes needed 28/08/2015 Tom Van Steenkiste

Case-study: Riak availability and fault tolerance replication policy configurable default: 3 replicas multi-node multi-datacentre tuneable read/write consistency 28/08/2015 Tom Van Steenkiste

Case-study: Riak simulation difficulties clean database background processes e.g. Bitcask merging other processes same nodes same network Be careful when interpreting results! 28/08/2015 Tom Van Steenkiste

Case-study: Riak future work other databases C/C++ client for Riak further investigation in security different load balancer exact value distribution and access pattern 28/08/2015 Tom Van Steenkiste

Case-study: Riak more information all code on GitHub Doxygen JavaDoc discussion results in report including software user manual in appendix 28/08/2015 Tom Van Steenkiste

Thank you! 28/08/2015 Tom Van Steenkiste