HPDC Report Domenico Vicinanza CERN IT-GD-OPS CERN, July 12 th weekly OPS section meeting.

Slides:



Advertisements
Similar presentations
1 From Grids to Service-Oriented Knowledge Utilities research challenges Thierry Priol.
Advertisements

GridPP July 2003Stefan StonjekSlide 1 SAM middleware components Stefan Stonjek University of Oxford 7 th GridPP Meeting 02 nd July 2003 Oxford.
The System Center Family Microsoft. Mobile Device Manager 2008.
Plateforme de Calcul pour les Sciences du Vivant SRB & gLite V. Breton.
EU-GRID Work Program Massimo Sgaravatto – INFN Padova Cristina Vistoli – INFN Cnaf as INFN members of the EU-GRID technical team.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
23/04/2008VLVnT08, Toulon, FR, April 2008, M. Stavrianakou, NESTOR-NOA 1 First thoughts for KM3Net on-shore data storage and distribution Facilities VLV.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
January, 23, 2006 Ilkay Altintas
OSG Operations and Interoperations Rob Quick Open Science Grid Operations Center - Indiana University EGEE Operations Meeting Stockholm, Sweden - 14 June.
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
INFSO-RI Enabling Grids for E-sciencE The US Federation Miron Livny Computer Sciences Department University of Wisconsin – Madison.
Publication and Protection of Site Sensitive Information in Grids Shreyas Cholia NERSC Division, Lawrence Berkeley Lab Open Source Grid.
HPDC 2007 / Grid Infrastructure Monitoring System Based on Nagios Grid Infrastructure Monitoring System Based on Nagios E. Imamagic, D. Dobrenic SRCE HPDC.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
E-science in the Netherlands Maria Heijne TU Delft Library Director / Chair Consortium of University Libraries and National Library.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Authors: Ronnie Julio Cole David
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
CERN IT Department CH-1211 Geneva 23 Switzerland t GDB CERN, 4 th March 2008 James Casey A Strategy for WLCG Monitoring.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
CERN IT Department CH-1211 Geneva 23 Switzerland t CF Computing Facilities Agile Infrastructure Monitoring CERN IT/CF.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
LCG workshop on Operational Issues CERN November, EGEE CIC activities (SA1) Accounting: current status
Site Manageability & Monitoring Issues for LCG Ian Bird IT Department, CERN LCG MB 24 th October 2006.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Agile Infrastructure Monitoring HEPiX Spring th April.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
David Foster LCG Project 12-March-02 Fabric Automation The Challenge of LHC Scale Fabrics LHC Computing Grid Workshop David Foster 12 th March 2002.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
Parag Mhashilkar Computing Division, Fermi National Accelerator Laboratory.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
Enterprise Requirements: Industry Workshops and OGF Robert Cohen, Area Director, Enterprise Requirements.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
HPDC Grid Monitoring Workshop June 25, 2007 Grid monitoring from the VO/user perspectives Shava Smallen.
Frascati, 2-3 July 2008 Slide 1 User Management compliance testing for G-POD HMA-T Phase 2 KO Meeting 2-3 July 2008, Frascati Andrew Woolf, STFC Rutherford.
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
Monitoring Windows Server 2012
James Casey, CERN IT-GD WLCG Workshop 1st September, 2007
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
(Prague, March 2009) Andrey Y Shevel
POW MND section.
Joseph JaJa, Mike Smorul, and Sangchul Song
Grid Service Monitoring Working Group
Recap: introduction to e-science
Cristina del Cano Novales STFC - RAL
LCG middleware and LHC experiments ARDA project
LCG Operations Workshop, e-IRG Workshop
GGF15 – Grids and Network Virtualization
Leigh Grundhoefer Indiana University
The Globus Toolkit™: Information Services
Large Scale Distributed Computing
Review of grid computing
Welcome to (HT)Condor Week #19 (year 34 of our project)
Presentation transcript:

HPDC Report Domenico Vicinanza CERN IT-GD-OPS CERN, July 12 th weekly OPS section meeting

HPDC '07 in a nutshell Held in Monterey (CA-USA), July Four parallel workshops: Grid Monitoring Workflows in Support of Large-Scale Science (WORKS07) Joint EGEE and OSG Workshop on Data Handling in Production Grids Challenges of Large Applications in Distributed Environment (CLADE 2007) Three days conference

Grid Monitoring WS Monitoring in Grids –Fabric monitoring Publishing on the Service Availability Information to the local fabric monitoring Nagios (integration with SAM) –Monitoring from the VO/User perspect. INCA (San Diego Supercomputing Center) – –Aiming to integrate (part of) their testing infrastructure within SAM framework

cont... Interoperability of the monitoring tools and OSG-LCG interoperability –Overview of the work done by the Grid Service Monitoring Working Group –Service Availability Monitor as one of the main components of the monitoring framework prototype for WLCG/EGEE infrastructure (SAM Team paper)

cont... Other monitoring tools: –RGMA (as general framework for information exchange on large scale distributed infrastructure) –GridICE –gLite LB –Centralized logging systems Syslog-NG (OSG) Splunk (Fermilab)

Syslog-NG New system logging utility used by OSG Can replace regular syslog daemon or can be used in parallel More powerful facilities for filtering, formatting, and redirecting log messages Open source license Administered by Php-MySQL tool

Syslog-NG facilities Can filter log messages based on log level, system host, facility, ip address or regular expressions Can reformat and modify messages using template facilities Inputs can be files or sockets Outputs can be other hosts, files, or sockets

Splunk Commercial software used to archive and query log messages Web interface allows log messages to be categorized and correlated Messages can be queried and sorted based on categorization and other parameters Used at Fermilab as well for internal logging collection

Other topics: Open Grids Open Grids –BOINC (Berkeley Open Infrastructure for Network Computing) improvements in load-balancing new check-pointing methods reliability issues RIDGE (kind of BOINC improvement) –observes the past behavior and estimates a reliability rating for worker nodes

Other topics: future improvements Provisioning models (modeling needs) –performance-cost optimization in grids –Genetic Algorithm formulation for provisioning resources for an application Condor extensions (Data-driven workflow planning Scalable I/O virtualization –dynamically manage virtualized components among multiple guest domains

Environment issues How HPDC is affecting the environment –warming –efforts to deliver energy –cooling system Role of the renewable sources of energies in the future of HPDC Solar energy to provide electric power to operate the computers and for cooling. Covering roofs with solar cells: –How much a house can compute?

Conclusions Well established SAM awareness Defining a common monitoring exchange format (Grid Monitoring WG) –started a growing network of monitoring tools integration/interaction –interest including/feeding SAM results from/to other tools (fabric mon) Importance of logs (and log analysis tool) Strong need for an improved modeling of –resources, needs, workflows

Bibliography SAM Team paper (PDF), minutes and slides: – onfId=18405