Query Operational data is collected by probes, administrative interfaces, log scraping, etc. Display Process Collect Query Metrics Correlation and Analysis.

Slides:



Advertisements
Similar presentations
Process Monitoring is only the first step in improving process efficiency.
Advertisements

Messaging and Data integration models are used in Phase I of the MCAS project to connect set of message enabled services into a workflow for efficient.
MCTS GUIDE TO MICROSOFT WINDOWS 7 Chapter 10 Performance Tuning.
ManageEngine TM Applications Manager 8 Monitoring Custom Applications.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 11: Monitoring Server Performance.
Packard BioScience. Packard BioScience What is ArrayInformatics?
Components and Architecture CS 543 – Data Warehousing.
Chapter 7 Managing Data Sources. ASP.NET 2.0, Third Edition2.
TRACK 3™ The ultimate process management software.
Maintaining and Updating Windows Server 2008
Overview of Search Engines
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
KPI Business Pack Christa Fine Sr. Product Manager, Information Delivery.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Concept demo System dashboard. Overview Dashboard use case General implementation ideas Use of MULE integration platform Collection Aggregation/Factorization.
MCTS Guide to Microsoft Windows 7
Using the SAS® Information Delivery Portal
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
To enhance learning, service, and research through an advanced information technology environment. Our Mission:To enhance learning, service,and research.
CERN - IT Department CH-1211 Genève 23 Switzerland Service Level Status (SLS) What it does The default SLS.
Marcel Casado NCAR/RAP WEATHER WARNING TOOL NCAR.
Introduction to Database Management. 1-2 Outline  Database characteristics  DBMS features  Architectures  Organizational roles.
Informix IDS Administration with the New Server Studio 4.0 By Lester Knutsen My experience with the beta of Server Studio and the new Informix database.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
Chapter 6 SAS ® OLAP Cube Studio. Section 6.1 SAS OLAP Cube Studio Architecture.
Real Time Monitor of Grid Job Executions Janusz Martyniak Imperial College London.
1 What’s the difference between DocuShare 3.1 and 4.0?
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
Messaging and Data integration models are used in Phase I of the MCAS project to connect a set of message- enabled services into a workflow for efficient.
1 Data Warehouses BUAD/American University Data Warehouses.
Instrumentation of the SAM-Grid Gabriele Garzoglio CSC 426 Research Proposal.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
The Glance Project ATLAS Management January 2012.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
A Brief Documentation.  Provides basic information about connection, server, and client.
Tracker data quality monitoring based on event display M.S. Mennea – G. Zito University & INFN Bari - Italy.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
Introduction to Enterprise Guide Jennifer Schmidt Rhonda Ellis Cassandra Hall.
DATABASE MANAGEMENT SYSTEMS CMAM301. Introduction to database management systems  What is Database?  What is Database Systems?  Types of Database.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
A university for the world real R © 2009, Chapter 9 The Runtime Environment Michael Adams.
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
Building Dashboards SharePoint and Business Intelligence.
0 / Database Management. 1 / Identify file maintenance techniques Discuss the terms character, field, record, and table Describe characteristics.
Creating SmartArt 1.Create a slide and select Insert > SmartArt. 2.Choose a SmartArt design and type your text. (Choose any format to start. You can change.
XROOTD AND FEDERATED STORAGE MONITORING CURRENT STATUS AND ISSUES A.Petrosyan, D.Oleynik, J.Andreeva Creating federated data stores for the LHC CC-IN2P3,
PerfSONAR-PS Working Group Aaron Brown/Jason Zurawski January 21, 2008 TIP 2008 – Honolulu, HI.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
03/09/2007http://pcalimonitor.cern.ch/1 Monitoring in ALICE Costin Grigoras 03/09/2007 WLCG Meeting, CHEP.
Metric Studio Cognos 8 BI. Objectives  In this module, we will examine:  Concepts and Overview  An Introduction to Metric Studio  Cognos 8 BI Integration.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
ConTZole Tomáš Kubeš, 2010 atlas-tz-monitoring.cern.ch An Interactive ATLAS Tier-0 Monitoring.
Keith Chadwick 1 Metric Analysis and Correlation Service. CD Seminar.
Maintaining and Updating Windows Server 2008 Lesson 8.
17 Copyright © 2006, Oracle. All rights reserved. Information Publisher.
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
9 Copyright © 2004, Oracle. All rights reserved. Getting Started with Oracle Migration Workbench.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
Simulation Production System
Data Warehouse.
Metric Correlation and Analysis Service (MCAS)
Metric Correlation and Analysis Service (MCAS)
Presentation transcript:

Query Operational data is collected by probes, administrative interfaces, log scraping, etc. Display Process Collect Query Metrics Correlation and Analysis Service A workflow-driven system for retrieval, correlation, analysis, and display of multisource heterogeneous operational data. Ted Hesselroth, Gabriele Garzoglio, Tanya Levshina, Parag Mhashilkar, Andrew Baranovski {tdh, garzoglio, tlevshin, Computing Division, Fermilab, Batavia, IL Sources from: HTML XML RRD database text images CollectStoreDisplay Process Query Format- specific Interface Utilities read data from probes and convert to a common format Mule ESB Workflow Mule reads and processes the system information Berkeley XML Database Mule posts the processed information to the database XQuery Data Mining Correlation and analysis are combined with custom views Javascript Visualization Widget Views are organized into a coherent dashboard DB Data feed endpoint Agg. Mule Service Component Data feed endpoint Transform Transform1 Data source URL Polling Data sink Mule Service Comp portlet aggregator async sync Aggregation Correlation Analysis Aggregation Correlation Analysis Storage administrators want to see the current health of multiple cells run by dCache. MCAS aggregates and correlates cell status then displays it as a bar graph. The bar graph shows the number of up/down gridftp and dcap doors as well as the number of up/down pools, aggregated by pool group. This allows the end user to quickly make generalizations about the data. Storage administrators want to see the status of each dCache cell, displayed in a sortable table. For example, this table allows the sorting of information in such a way that all offline cells appear on top. More sorting criteria can be added. The administrators of public dCache running at Fermilab include in the dashboard the health status of gridftp doors over time. MCAS can display this information as a heatmap, where the color of each bin represents the status of a particular door during some period of time, e.g., the percentage of the time the door was up or down. The administrators of the Minos experiment collect multiple graphs of various kinds (e.g number of running/idle jobs per user, read data rates on bluearc over time, etc). The MCAS image viewer portlet allows to organizing and displaying multiple graphs on one page as thumbnails. A end-user can enlarge a graph by clicking on it. Use Cases Triggered Workflows Data can be either pushed or pulled into the XML database of MCAS. In the push model, probes periodically post XML containing system information to a service endpoint provided by the Mule instance. Mule then processes the data in a workflow, for example, converting the schema or aggregating data from multiple sources. In the pull model, probe data may be acquired through periodic queries performed by Mule, or may be triggered from user actions in the graphical interface. Workflow Configuration The transformations that occur in workflows are based on models that are specific to each operational system. MCAS provides tools to manage data source and workflow characteristics. In particular, the characteristics of the workflow, including its dependencies, are specified as parameters in python objects. The workflow configuration tool maps these objects into runnable Mule workflow configurations. Database Maintenance Automatic rotation of database files is built into the software. When a database reaches a specified size or number of entries, a new instance of it is created. Previous data from rotated versions is still transparently accessible. This allows gradual deletion or archiving of the oldest, non-used data without affecting the functionality of the software. Automatic backup is also supported. The Minos Example Minos is one of the neutrino experiments at Fermilab from the High Energy Physics intensity frontier. It uses MCAS to display the principal elements of their computing production system. Information is organized in tabs dedicated to the major architectural components: Computing Cluster, Grid Computing Resources, Storage System, Local Disk. These pages display plots that summarize the status of the various components. Minos also relies on a summary page to provide a status "at-a-glance" of the entire computing infrastructure. The page displays aggregate plots from the information already available in detail at the other pages. Operator or system administrator views unified, coherent, customizable display of system state and history. Monitoring Dashboard Data Mining Monitoring Dashboard Data Mining MCAS Components The DZero Example DZero is one of the "RunII" experiments on the Fermilab Tevatron accelerator. MCAS shows relevant metrics for their computing infrastructure for both Monte Carlo production and data processing. The views are also organized in separate pages ("tabs"), displaying statistics of the production and status of the system. The CMS Example Fermilab is a Tier 1 site for the CMS experiment of the Large Hadron Collider. Proof-of-concept MCAS work (shown above) demonstrated displays for the Tier 1 computing infrastructure. In particular, the bargraph widget of MCAS shows an immediate view on the number of functioning data transport doors of their local storage. Alarm conditions based on the number of non-functioning nodes is shown in red. On the same page, there is an aggregated a view of several key nodes, each displaying a Ganglia plot pulled from the individual web pages of each system. Information is also displayed in tabular format. A Transforming Workflow Previously, operational data of various formats was generated by legacy utilities and read through independently created web pages. An overall picture of system state was difficult to obtain. MCAS allows the collection of operational data, transforming it into a common XML format which can be stored in a database. Visualization, correlation, and analysis of data from different sources can then be obtained by querying the database and displaying the data through various javascript portlets. A portlet server gives a unified, customizable view of the system state. Since the database stores operational data over a period of time, data mining techniques can be used for historical analyses of events. MCAS dCache Health BargraphCell Status TableGanglia Images Door Status Heatmap Operational DataUser Display The Fermi National Accelerator Laboratory is operated by Fermi Research Alliance, LLC under Contract No. DE-AC02-07CH11359 with the United States Department of Energy. Mule ESB Mule provides a lightweight Enterprise Service Bus architecture which connects input and output endpoints through customizable event-driven processing. RRD Integration MCAS has support for storing information in a Round Robin Database (RRD). Internally, MCAS can use JRobin as RRD. When used in the push model, probes periodically post Data in RRD format to the RRD service endpoint provided by Mule. Mule then processes the data in a workflow to store the data in one or more RRDs. In the pull model, probe data can be retrieved in RRD or XML format using appropriate queries. Users can create complex workflows that store the data in RRD databases and pull the data to create plots that are displayed using the Image Viewer portlet.