Evaluation of NoSQL databases for DIRAC monitoring and beyond

Slides:



Advertisements
Similar presentations
CHEP 2012 – New York City 1.  LHC Delivers bunch crossing at 40MHz  LHCb reduces the rate with a two level trigger system: ◦ First Level (L0) – Hardware.
Advertisements

Chapter 1 Introduction 1.1A Brief Overview - Parallel Databases and Grid Databases 1.2Parallel Query Processing: Motivations 1.3Parallel Query Processing:
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
Stuart K. PatersonCHEP 2006 (13 th –17 th February 2006) Mumbai, India 1 from DIRAC.Client.Dirac import * dirac = Dirac() job = Job() job.setApplication('DaVinci',
Identifying and Incorporating Latencies in Distributed Data Mining Algorithms Michael Sevilla.
Pilots 2.0: DIRAC pilots for all the skies Federico Stagni, A.McNab, C.Luzzi, A.Tsaregorodtsev On behalf of the DIRAC consortium and the LHCb collaboration.
Scalability By Alex Huang. Current Status 10k resources managed per management server node Scales out horizontally (must disable stats collector) Real.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
DIRAC Web User Interface A.Casajus (Universitat de Barcelona) M.Sapunov (CPPM Marseille) On behalf of the LHCb DIRAC Team.
ZhangGang, Fabio, Deng Ziyan /31 NoSQL Introduction to Cassandra Data Model Design Implementation.
Elasticsearch in Dashboard Data Management Applications David Tuckett IT/SDC 30 August 2013 (Appendix 11 November 2013)
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
` tuplejump The data engineering platform. A startup with a vision to simplify data engineering and empower the next generation of data powered miracles!
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Introduction to Hadoop and HDFS
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
OSG Area Coordinator’s Report: Workload Management February 9 th, 2011 Maxim Potekhin BNL
Processing of the WLCG monitoring data using NoSQL J. Andreeva, A. Beche, S. Belov, I. Dzhunov, I. Kadochnikov, E. Karavakis, P. Saiz, J. Schovancova,
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.
Streamlining Monitoring Infrastructure in IT-DB-IMS Charles Newey ›
DDM Monitoring David Cameron Pedro Salgado Ricardo Rocha.
CERN IT Department CH-1211 Geneva 23 Switzerland t CF Computing Facilities Agile Infrastructure Monitoring CERN IT/CF.
Hadoop IT Services Hadoop Users Forum CERN October 7 th,2015 CERN IT-D*
Stairway to the cloud or can we take the highway? Taivo Liik.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Distributed Time Series Database
Transformation System report Luisa Arrabito 1, Federico Stagni 2 1) LUPM CNRS/IN2P3, France 2) CERN 5 th DIRAC User Workshop 27 th – 29 th May 2015, Ferrara.
1 A lightweight Monitoring and Accounting system for LHCb DC'04 production V. Garonne R. Graciani Díaz J. J. Saborido Silva M. Sánchez García R. Vizcaya.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Agile Infrastructure Monitoring HEPiX Spring th April.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Update on the tool hunt & MonALISA monitoring.
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
Monitoring with InfluxDB & Grafana
FroNtier Stress Tests at Tier-0 Status report Luis Ramos LCG3D Workshop – September 13, 2006.
WLCG Transfers Dashboard A unified monitoring tool for heterogeneous data transfers. Alexandre Beche.
1 Benchmarking Cloud Serving Systems with YCSB Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan and Russell Sears Yahoo! Research.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN Agile Infrastructure Monitoring Pedro Andrade CERN – IT/GT HEPiX Spring 2012.
OSG Area Coordinator’s Report: Workload Management February 9 th, 2011 Maxim Potekhin BNL
Alfresco Monitoring with OpenSource Tools Miguel Rodriguez Technical Account Manager.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Present and Future Pedro Andrade (CERN IT) 31 st August.
Gorilla: A Fast, Scalable, In-Memory Time Series Database
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
Metrics data published Via different methods Monitoring Server
A presentation on ElasticSearch
Kibana, Grafana and Zeppelin on Monitoring data
Image taken from: slideshare
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
WinCC-OA Log Analysis SCADA Application Service - Reporting
Monitoring with Clustered Graphite & Grafana
Running virtualized Hadoop, does it make sense?
Experience in CMS with Analytic Tools for Data Flow and HLT Monitoring
DI4R, 30th September 2016, Krakow
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Conditions Data access using FroNTier Squid cache Server
New developments on the LHCb Bookkeeping
Monitoring of the infrastructure from the VO perspective
1 Demand of your DB is changing Presented By: Ashwani Kumar
Overview of big data tools
Project Goals Collect and permanently store the data flowing around ONAP system into several Big Data storages, each in different category. Also serve.
Academic & More Group 4 谢知晖 王逸雄 郭嘉宋 程若愚.
Performance And Scalability In Oracle9i And SQL Server 2000
Indexing with ElasticSearch
Presentation transcript:

Evaluation of NoSQL databases for DIRAC monitoring and beyond Adrian Casajus Ramo, Federico Stagni, Luca Tomassetti, Zoltan Mathe On behalf of the LHCb collaboration

Motivation Develop a system for real time monitoring and data analysis: Focus on monitoring the jobs (not accounting) Requirements Optimized for time series analysis Efficient data storage, data analysis and retrieval Easy to maintain Scale Horizontally East to create complex reports (dashboards) Why? Current system is based on MySQL: is not designed for real time monitoring (more for accounting) does not scale to hundred of million rows (>500 million). It requires ~400 second to generate a one-month duration plot is not for real time analysis is not schema-less: Often change the data format Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Motivation Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Technologies used Database: Data visualization: InfluxDB is a distributed time series database with no dependency OpenTSDB is a distributed time series database based on HBase ElasticSearch is a distributed search and analytic engine Data visualization: Grafana Metric dashboard and graph editor for InfluxDB, Graphite and OpenTSDB Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Motivation Grafana dashboard: Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Technologies used Database: Data visualization: InfluxDB is a distributed time series database with no dependency OpenTSDB is a distributed time series database based on HBase ElasticSearch is a distributed search and analytic engine Data visualization: Grafana Metric dashboard and graph editor for InfluxDB, Graphite and OpenTSD Kibana Flexible analytic and visualization framework Developed for creating complex dashboards Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Technologies used Kibana dashboard: Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Technologies used Database: Data visualization: Communication InfluxDB is a distributed time series database with no dependencies OpenTSDB is a distributed time series database based on HBase ElasticSearch is a distributed search and analytic engine Data visualization: Grafana Metric dashboard and graph editor for InfluxDB, Graphite and OpenTSD Kibana Flexible analytic and visualization framework Developed for creating complex dashboards Communication RabbitMQ Robust messaging system Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Overview of the System Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Hardware and data format RabbitMQ one physical machine 12 VMs provided by CERN OpenStack Each VM has 4 core, 8 GB memory and 80GB disk We used 3 clusters with 4 nodes Data format: The records are sent to the RabbitMQ in JSON format. Each record must contain a minimum of four elements: metric, time, key/value pairs, value For example: {"Status": "Done", ”time": 1404086442, "JobSplitType": "MCSimulation", "MinorStatus": "unset", "Site": "ARC.Oxford.uk", "value": 10, ”metric": ”WMSHistory", "User": "phicharp", "JobGroup": "00037468", "UserGroup": "lhcb_mc”} Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Performance comparison We have recorded ~600 million records during ~1.5 month We defined 5 different queries Running jobs grouped by Site Running jobs grouped by JobGroup Running jobs grouped by JobSplitType Failed jobs grouped by JobSplitType Waiting jobs grouped by JobSplitType Query intervals: 1, 2, 7 and 30 day Random interval: Start and end time are generated randomly between 2015-02-05, 15:00:00 and 2015-03-12 15:00:00 The high workload is generated by 10, 50, 100 clients (python threads) to measure the response time and the throughput REST APIs are used to retrieve the data from the DB All clients are used a random query and a random period All clients are continuously running parallel during 7200 second InfluxDB has not scaled after 2 days Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Results: 10 client Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Results: 50 client Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Results: 100 client Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Response time of all experiments Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Throughput of all experiments Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

Conclusions ElasticSearch was faster than OpenTSDB and InfluxDB It is easy to maintain Marvel is a very good tool for monitoring the cluster license required… It can be easily integrated to the DIRAC portal OpenTSDB was slower than ElasticSearch but it may scale better by adding more nodes to the cluster It is not easy to maintain (lot of parameters which have to be correctly set) Very good monitoring of the cluster. InfluxDB is a new time series database, which is easy to use, but it does not scale Kibana can fulfil our needs But we’ll look at integration in the DIRAC portal According to our experience we decided to use ElasticSerach for real time monitoring of jobs, and for all real time DIRAC monitoring systems Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015

? Question, comments Thanks! Evaluation of NoSQL databases for DIRAC monitoring and beyond, CHEP2015