M. Grigorieva, M. Golosova.  Separates data access layer and visualization  Built around common key PanDA objects: jobs, resources, etc.  BigPanDAMon.

Slides:



Advertisements
Similar presentations
Database Futures Workshop CERN Michael Dahlinger, GSI
Advertisements

Query Methods (SQL). What is SQL A programming language for databases. SQL (structured Query Language) It allows you add, edit, delete and run queries.
1Key – Report Creation with DB2. DB2 Databases Create Domain for DB2 Test Demo.
RIVELA Database for the Research on Venice and the Lagoon Dr. Pierpaolo Campostrini Dr. Caterina Dabalà Dr. Stefania De Zorzi Prof. Renzo Orsini RIVELA.
Overview on ZHT 1.  General terms  Overview to NoSQL dabases and key-value stores  Introduction to ZHT  CS554 projects 2.
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
Objectives In this session, you will learn to:
BY VAIBHAV NACHANKAR ARVIND DWARAKANATH Evaluation of Hbase Read/Write (A study of Hbase and it’s benchmarks)
Introduction to Backend James Kahng. Install Node.js.
The Relational Model Codd (1970): based on set theory Relational model: represents the database as a collection of relations (a table of values --> file)
Session-01. Hibernate Framework ? Why we use Hibernate ?
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
First Indico Workshop Database Technology Pedro Ferreira May 2013 CERN.
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
ATLAS DQ2 Deletion Service D.A. Oleynik, A.S. Petrosyan, V. Garonne, S. Campana (on behalf of the ATLAS Collaboration)
Persistence Store Project Proposal.
NoSQL continued CMSC 461 Michael Wilson. MongoDB  MongoDB is another NoSQL solution  Provides a bit more structure than a solution like Accumulo  Data.
Intro-Part 1 Introduction to Database Management: Ch 1 & 2.
Modern Databases NoSQL and NewSQL Willem Visser RW334.
OSG Area Coordinator’s Report: Workload Management February 9 th, 2011 Maxim Potekhin BNL
CCP4(i) Database Development Wanjuan (Wendy) Yang CCP4 Annual developer’s meeting March 28, 2006 York.
Changwon Nati Univ. ISIE 2001 CSCI5708 NoSQL looks to become the database of the Internet By Lawrence Latif Wed Dec Nhu Nguyen and Phai Hoang CSCI.
Windows Azure Conference 2014 Polyglot Persistence: NoSQL + Azure!
Methodological Foundations of Biomedical Informatics (BMSC-GA 4449) Himanshu Grover.
The τ - Synopses System Yossi Matias Leon Portman Tel Aviv University.
Analytics: SQL or NoSQL? Richard Taylor Chair Business Intelligence SIG.
File Processing Concepts – Field – combination of 1 or more characters that is the smallest unit of data to be accessed – Record – group of related fields.
1 Copyright © 2004, Oracle. All rights reserved. Introduction.
Spatial Tajo Supporting Spatial Queries on Apache Tajo Slideshare Shorten URL : goo.gl/j0VLXpgoo.gl/j0VLXp.
Development of Hybrid SQL/NoSQL PanDA Metadata Storage PanDA/ CERN IT-SDC meeting Dec 02, 2014 Marina Golosova and Maria Grigorieva BigData Technologies.
(1) Introduction to Models using the Play Framework Philip Johnson Collaborative Software Development Laboratory Information and Computer Sciences University.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
3 Copyright © 2009, Oracle. All rights reserved. Accessing Non-Oracle Sources.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
NOSQL Implementation and examples Maciej Matuszewski.
Modeling MongoDB with Relational Model Proposed by Christopher Polanco.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Metasolv-OCDM Connector Metasolv OCDM. What is the MSS Adapter for Oracle Communications Data Model? The Oracle Communications Metasolv and Solution Adapter.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
CERN 21 January 2005Piotr Nyczyk, CERN1 R-GMA Basics and key concepts Monitoring framework for computing Grids – developed by EGEE-JRA1-UK, currently used.
OSG Area Coordinator’s Report: Workload Management February 9 th, 2011 Maxim Potekhin BNL
M. Grigorieva, M. Golosova Laboratory of Big Data Technologies for mega-science projects.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
CS122B: Projects in Databases and Web Applications Winter 2017
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
So, what was this course about?
POW MND section.
Bridging SQL and NoSQL Rupal Dhillon U
Modern Databases NoSQL and NewSQL
Data Warehouse.
New Mexico State University
Establishing A Data Management Fabric For Grid Modernization At Exelon
NOSQL and CAP Theorem.
11/18/2018 2:14 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Grid Data Integration In the CMS Experiment
Entity Framework Core.
Introduction to FIS 318/618: Financial Systems & Databases
5 Azure Services Every .NET Developer Needs to Know
Introduction to NoSQL Database Systems
Storing and Processing Sensor Networks Data in Public Clouds
Comparison of SQL relational database document NoSQL database MongoDB
The Database World of Azure
Presentation transcript:

M. Grigorieva, M. Golosova

 Separates data access layer and visualization  Built around common key PanDA objects: jobs, resources, etc.  BigPanDAMon based on Framework  Runs on top of SQL DB backends  Modular and reusable monitoring: Goal Adapt BigPanDA Monitor to work with both SQL and NoSQL DB backends (SQL – operational data, NoSQL – historical data): Methods: 1.Enchance Django ORM to interact with both: SQL and NoSQL 2.Integration of Hybrid SQL/NoSQL Storage in BigPanDA Monitor Analyze job failures Monitor progress of analysis/activity of a PanDA resource Organize/visualize data

 Django-nonrel is an independent branch of Django that adds NoSQL database support to the ORM.  The long-term goal is to add NoSQL support to the official Django release.  Django-dbindexer  use SQL features on NoSQL databases and abstract the differences between NoSQL databases  denormalization, JOINs, and other important features  Currently, this project is in an early development stage.

1.Enchance Django ORM to interact with NoSQL 2.Use Cassandra database wrapper

settings.py SQL NoSQL (Cassandra) from cqlengine import columns from cqlengine.models import Model # main table class PandaJobArchived(Model): pandaid = columns.BigInt(primary_key=True) modificationtime = columns.DateTime() jobdefinitionid = columns.BigInt() schedulerid = columns.Text(max_length=384) pilotid = columns.Text(max_length=600) creationtime = columns.DateTime() creationhost = columns.Text(max_length=384) modificationhost = columns.Text(max_length=384) ……………………………………… # dependent tables class task_status(Model): task_id = columns.Integer(partition_key = True) job_status = columns.Text(partition_key = True) modification_time = columns.DateTime(primary_key = True) panda_id = columns.BigInt(primary_key = True) ………… models.py views.py def jobList(request, mode=None, param=None): jobs.extend(Jobsdefined4.objects.filter(**query).values()) jobs.extend(Jobsactive4.objects.filter(**query).values()) jobs.extend(Jobswaiting4.objects.filter(**query).values()) jobs.extend(Jobsarchived4.objects.filter(**query).values()) jobs.extend(Jobsarchived.objects.filter(**query).values()) #SQL # NoSQL (Cassandra) query jobs.extend(PandaJobArchived.objects.filter(**query).values()) INSTALLED_APPS = ('django_cassandra_engine',) + INSTALLED_APPS

3. Integration with Hybrid Storage API

 Database performance tests : SQL - NoSQL, NoSQL - NoSQL  Technology evaluation tests results for NoSQL databases: MongoDB, HBase, Cassandra, Dremel, CouchDB, MariaDB  Experience of using Hadoop/Spark/MapReduce in PanDA Infrasctucture. Use cases.  Foreseen performance and possible changes in PanDA Oracle archived database schema during/after the LHC Run2  Query routing strategy in BigPanDA applications (BigPanDA Monitor in particular)  How to implement cross database requests in heterogeneous architecture  Strategies of the data modelling for NoSQL databases