Database Monitoring Requirements Salvatore Di Guida (CERN) On behalf of the CMS DB group.

Slides:



Advertisements
Similar presentations
Database System Concepts and Architecture
Advertisements

Database Architectures and the Web
SQL Server Replication
1 Databases in ALICE L.Betev LCG Database Deployment and Persistency Workshop Geneva, October 17, 2005.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
Chapter 9: Moving to Design
.NET Mobile Application Development Introduction to Mobile and Distributed Applications.
Query Processing in Mobile Databases
SQL Server Replication By Karthick P.K Technical Lead, Microsoft SQL Server.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Database Architectures and the Web
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
DIRAC Web User Interface A.Casajus (Universitat de Barcelona) M.Sapunov (CPPM Marseille) On behalf of the LHCb DIRAC Team.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
HPS Online Software Discussion Jeremy McCormick, SLAC Status and Plans.
The protection of the DB against intentional or unintentional threats using computer-based or non- computer-based controls. Database Security – Part 2.
A Self-Manageable Infrastructure for Supporting Web-based Simulations Yingping Huang Xiaorong Xiang Gregory Madey Computer Science & Engineering University.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
2-Tier,3-Tier datawarehouse Submitted by Manisha Dubey & Akanksha Agrawal.
Web application for detailed real-time database transaction monitoring for CMS condition data ICCMSE 2009 The 7th International Conference of Computational.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Lecture # 3 & 4 Chapter # 2 Database System Concepts and Architecture Muhammad Emran Database Systems 1.
Chapter 1 Introduction to Databases. 1-2 Chapter Outline   Common uses of database systems   Meaning of basic terms   Database Applications  
ALICE, ATLAS, CMS & LHCb joint workshop on
ATLAS Detector Description Database Vakho Tsulaia University of Pittsburgh 3D workshop, CERN 14-Dec-2004.
Lesson Overview 3.1 Components of the DBMS 3.1 Components of the DBMS 3.2 Components of The Database Application 3.2 Components of The Database Application.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
Detector Diagnostics Calibration Analysis Ped/LED/Laser RadDam Analysis Detector Optimization Lumi Detector Performance Monitoring DQM On/Offline Prompt.
V.Sirotenko, July Status of Online Databases Currently there are 2 online Oracle Databases running on d0online cluster: 1.Production DB, d0onprd,
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Introduction CMS database workshop 23 rd to 25 th of February 2004 Frank Glege.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
What is a Package? A package is an Oracle object, which holds other objects within it. Objects commonly held within a package are procedures, functions,
Introduction Database integral part of our day to day life Collection of related database Database Management System : software managing and controlling.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle Real Application Clusters (RAC) Techniques for implementing & running robust.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
The Process Manager in the ATLAS DAQ System G. Avolio, M. Dobson, G. Lehmann Miotto, M. Wiesmann (CERN)
ITGS Network Architecture. ITGS Network architecture –The way computers are logically organized on a network, and the role each takes. Client/server network.
Michele de Gruttola 2008 Report: Online to Offline tool for non event data data transferring using database.
G.Govi CERN/IT-DB 1 September 26, 2003 POOL Integration, Testing and Release Procedure Integration  Packages structure  External dependencies  Configuration.
Database authentication in CORAL and COOL Database authentication in CORAL and COOL Giacomo Govi Giacomo Govi CERN IT/PSS CERN IT/PSS On behalf of the.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
Oct HPS Collaboration Meeting Jeremy McCormick (SLAC) HPS Web 2.0 OR Web Apps and Databases (Oh My!) Jeremy McCormick (SLAC)
Chapter 2 Database Environment.
November 1, 2004 ElizabethGallas -- D0 Luminosity Db 1 D0 Luminosity Database: Checklist for Production Elizabeth Gallas Fermilab Computing Division /
Maria del Carmen Barandela Pazos CERN CHEP 2-7 Sep 2007 Victoria LHCb Online Interface to the Conditions Database.
Replicazione e QoS nella gestione di database grid-oriented Barbara Martelli INFN - CNAF.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
Interstage BPM v11.2 1Copyright © 2010 FUJITSU LIMITED INTERSTAGE BPM ARCHITECTURE BPMS.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Time critical condition data handling in the CMS experiment during the first data taking period CHEP 2010 Software Engineering, Data Stores, and Databases.
The Database Project a starting work by Arnauld Albert, Cristiano Bozza.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
CT-PPS DB Info (Preliminary) DB design will be the same as currently used for CMS Pixels, HCAL, GEM, HGCAL databases DB is Oracle based A DB for a sub-detector.
Databases and DBMSs Todd S. Bacastow January 2005.
Business System Development
Database Replication and Monitoring
Database Architectures and the Web
CMS High Level Trigger Configuration Management
Database operations in CMS
Conditions Data access using FroNTier Squid cache Server
Database Architectures and the Web
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Data, Databases, and DBMSs
Mark Quirk Head of Technology Developer & Platform Group
SDMX IT Tools SDMX Registry
Presentation transcript:

Database Monitoring Requirements Salvatore Di Guida (CERN) On behalf of the CMS DB group

Outline CMS Database infrastructure and data flow. Data access patterns. Requirements coming from the hardware and software infrastructure: – DB safety and security; – DB monitoring for Conditions. Requirements to be fulfilled by front-end applications (web): – 3 tier architecture; – Authorization. Monitoring WorkshopSalvatore Di Guida2

CMS Database Infrastructure CMS has two production Oracle Real Application Clusters: – CMSONR, 6 nodes Oracle RAC located in the CMS experimental area: Only visible from the CMS online network, Hosting two databases: – OMDS stores data for sub-detectors, trigger, conditions (slow control, configuration, detector status), luminosity, monitoring, – ORCON stores conditions (detector status data and calibration data); – CMSR, 4 nodes ORACLE RAC located at CERN IT Only visible within GPN, Hosting one database (ORCOFF), storing conditions, luminosity, workflow management data (file transfer, data bookkeeping, jobs processing, authentication and authorization). Monitoring WorkshopSalvatore Di Guida3

Condition Drop-box ORCOFF (Offline Reconstruction Condition Database Offline System) Streaming OMDS (Online Master Database System) ORCON (Offline Reconstruction Condition Database Online System) CMS Compact Muon Solenoid PopCon Online network at IP5GPN CMSONR CMSR CMS Database Data Flow OMDS stores all online conditions coming from the different sub-detectors. A subset (summary) of condition data is read from OMDS, reformatted in order to be retrieved as C++ object (payload) and stored in ORCON: – Using Object Relational Access (ORA) design pattern; – Performed by applications based on a Common API integrated in CMSSW (PopCon). Oracle streams populate ORCOFF with data from OMDS and ORCON. Condition Dropbox exports automatically data processed offline in ORCON. Monitoring WorkshopSalvatore Di Guida4

Database monitoring tools Many tools already available thanks to IT DB services: – For developers, they allow to check the status of all services, and the usage of DB resources: Main page: – SLS monitoring for all services deployed, – Lemon for all hardware involved, – Session monitoring for each DB service and for each schema; – For experts, they allow to deeply monitor each component of the system: Streams availability, DB resource usage (plenty of history plots); – Automatic alarm notifications: service failures (invalid objects, streams failure), high loads on nodes (high CPU load, high network traffic…). Monitoring WorkshopSalvatore Di Guida5

Monitoring requirements for hardware and software Database safety and security. Hardware and service monitoring across different networks: – Complying with the security policy of the different clusters; – With different levels of monitoring and a corresponding alarm system. Monitoring WorkshopSalvatore Di Guida6

Data access patterns In general, access patterns depend on how an application exploits data stored in its backend: – Transactional data should be accessed by the application itself in update mode, and not visible from other users; – Bookkeeping and authentication information is static and read only, but can be huge; – Conditions are of two kinds: static and read-only (construction, equipment), varying with time and requiring frequent lookups (conditions, calibrations). Monitoring WorkshopSalvatore Di Guida7

Access patterns for conditions Condition data produced by the CMS detector are essential for running HLT, DQM and the offline reconstruction chain: – Managed by several groups within the collaboration; – Wide range of update frequency and data volume. The stability and the availability of the infrastructure must be ensured, and its performance must not be downgraded neither in write nor in read access: therefore, this requires to limit the access patterns for these data: – establishing a strict policy: NO DELETE, NO UPDATE, INSERT ONLY (append data to time-based sequences of validity ranges – IOV), – promoting the usage of a reduced number of applications, for both data insertion and data retrieval: PopCon and Condition DropBox, Framework modules reading conditions (grouped consistently via Global Tag); – Servers load in reading reduced using a caching mechanism (FroNTier). Monitoring WorkshopSalvatore Di Guida8

Retrieving conditions The HLT (running at P5), DQM (running at P5 and Tier0/CAF), offline reconstruction jobs (running at Tier0/Tier1s, ~20000 per day) and a subset of analysis jobs (running at Tier2s, ~50000 per day) can create a massive load when retrieving data (conditions, luminosity) from ORCON/ORCOFF. Frontier caches allow to minimize direct access to Oracle in read- only mode: – 2 services implemented: at P5 on ORCON for HLT and DQM, and at CERN on ORCOFF for Tier0/1/2, Dedicated instances for Tier0 express/prompt reconstruction, luminosity workflows, MonteCarlo simulation, – The cache refreshing policy can imply some latency in retrieving data. The system is reliable w.r.t. the current workflows, but a change to the current infrastructure must lead to severe loads on one or more nodes/services. Scalability is an issue. Monitoring WorkshopSalvatore Di Guida9

FroNTier Architecture Monitoring WorkshopSalvatore Di Guida10

FroNTier monitoring Each one of the FroNTier services is monitored: – Availability of CERN launchpads and all squids; – HTTP requests for CERN launchpads and all squids; – Network traffic of CERN launchpads and all squids; – Objects stored in cache for CERN launchpads and all squids (object = payload of FroNTier request). Monitoring WorkshopSalvatore Di Guida11

Database safety and security Definition of a clearer account policy, and improvement of user privileges’ granting: – Based on application and user roles, See Oracle® Database Security GuideOracle® Database Security Guide This policy is beneficial for monitoring the access to all DB schemas: – Each account can be easily associated to an application or a group of developers: Transactions can be easily tracked – Reduce access with schema owner privileges: Identify quickly accesses trying to perform unauthorized actions (e.g. creating or inserting values in a read-only table). Monitoring WorkshopSalvatore Di Guida12

Monitoring access to Conditions PopCon monitors all payload transfers to production database: – Using a DB account where the status of all transfers is logged in relation tables; – Exposing the logs to developers, managers, users via a web-based application; See Antonio’s presentation this afternoon. From the DB point of view, all transactions against production schemas performing DML statements are logged. Monitoring WorkshopSalvatore Di Guida13

Monitoring access to Conditions The creation/modification of ORA schemas (i.e. schemas where a mapping between tables and C++ data members is defined) is not yet monitored: – From the DB point of view, this means logging also DDL statements, together with DML statements storing the mapping in the dedicated tables. This new monitoring instance will help to identify quickly: – Access to production schemas with wrong privileges; – Users/applications trying to perform illegal actions; – Corrupted schemas, providing help to experts for troubleshooting; Monitoring WorkshopSalvatore Di Guida14

Plans for conditions in CMSSW The new account policy and the schema modification monitoring are going to be put in the Condition Core software package. All actions will be performed with the help of IT DBAs: – Validation of code and procedures; – Testing. Monitoring WorkshopSalvatore Di Guida15

Hardware & service configuration The hardware involved in DB operations is split in two networks: – CERN GPN: Only applications approved by CERN Security Team can be visible from the outside network! – CMS online network at IP5 has a very strict security policy and a very constrained data transfer design: Files cannot be copied from GPN to CMS network, but they must be pulled in the online cluster from the offline network, Files must be pushed by the online network to offline network, Transferring data from GPN to CMS network is not envisaged in the online network design. Some services are deployed in one network, but others (e.g. condition drop-box) use resources in both networks: – The communication between networks must be monitored too! Monitoring WorkshopSalvatore Di Guida16

Front-end applications The different monitoring instances for Database tools have a frontend application: – Retrieving monitoring data; – Aggregating them according to metrics based on different use-case models; – Publishing them. The DB group focuses on web based front-end applications: – The monitoring data are read directly from Oracle: Small data volume, Reduce latency as much as possible, Checking Oracle availability (if Oracle fails, the application fails and an alarm is raised). Monitoring WorkshopSalvatore Di Guida17

Multi-tier architecture The monitoring system is based on three tier architecture: – The presentation tier (frontend server) supports user interaction and data presentation; – The logic tier (backend server) handles information exchange between the database and the user interface; – The data tier supports the access to the data stored in the database. This architecture has many advantages from the CMS DB monitoring point of view: – encapsulates the functionality processing related to user interaction in an application separated from the client application; – provides the possibility to program efficiently connection strategies (such as clients requests queuing, database access control); – all the code related to database connection can be totally separated from the client application, no queries issued by the client (users!); – Enforces security of backend and DB using firewall protection of GPN. Monitoring WorkshopSalvatore Di Guida18

ORCOFF Database Backend server Frontend server WSGI POST XMLHTTP cx_Oracle Web Interface Logic tierData tier Presentation tier Monitoring WorkshopSalvatore Di Guida19

Authorization Database activity must be controlled. Not all monitoring data should be visible worldwide: – DB service names; – Account names. Authentication mechanism for all web based applications, deployed in the frontend servers: – For Drop-box: access to machine where the service is deployed; – For Web browsing: SSO, e-groups. Monitoring WorkshopSalvatore Di Guida20

Technology There are many technologies available on the market and within the Open Source community. DB web based monitoring must be visible not only on desktops and laptops, but also on modern mobile devices: – See Antonio’s slides where this item is discussed in detail. Monitoring WorkshopSalvatore Di Guida21