Simulation Production System

Presentation transcript:

Simulation Production System Science Advisory Committee Meeting UW-Madison March 1st-2nd 2007 Juan Carlos Díaz Vélez

outline
- introduction
- production tools
- job management & error handling
- production sites & computing
- production challenges
- data storage and transfer

introduction
- SimProd is written in Python and interfaces easily with IceTray through Boost
- IceTray configurations written in well-formatted XML are easy to store in a database
- daemons manage cluster job submission
- SOAP interface for the GUI client and for job monitoring
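The point about XML configurations being easy to store in a database can be illustrated with a minimal sketch. The XML layout, the module and parameter names, and the `parameters` table below are all hypothetical; the slides do not show the real IceTray schema or the SimProd database layout, and SQLite stands in for whatever database is actually used.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical configuration layout; the real IceTray XML schema is not shown in the slides.
EXAMPLE_CONFIG = """
<configuration>
  <module name="generator" class="ExampleGenerator">
    <parameter name="NEvents" value="1000"/>
    <parameter name="Seed" value="42"/>
  </module>
  <module name="writer" class="ExampleWriter">
    <parameter name="Filename" value="out.i3"/>
  </module>
</configuration>
"""


def parse_parameters(xml_text):
    """Return a list of (module, parameter, value) tuples found in the XML text."""
    root = ET.fromstring(xml_text)
    rows = []
    for module in root.findall("module"):
        for param in module.findall("parameter"):
            rows.append((module.get("name"), param.get("name"), param.get("value")))
    return rows


def store_parameters(conn, dataset_id, rows):
    """Store one row per configured module parameter (assumed table layout)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS parameters "
        "(dataset_id INTEGER, module TEXT, name TEXT, value TEXT)"
    )
    conn.executemany(
        "INSERT INTO parameters VALUES (?, ?, ?, ?)",
        [(dataset_id, module, name, value) for module, name, value in rows],
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
rows = parse_parameters(EXAMPLE_CONFIG)
store_parameters(conn, 1, rows)
```

Because every parameter becomes its own row, the production history of a dataset can later be queried per module or per parameter.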

production tools
- GUI Production Client (& TUI)
  - designed for IceTray configuration and job submission (simulation & offline processing)
- Production Database
  - stores production history, including all configured module parameters
  - provides information on configurable parameters for the client
- Production Server
  - accepts dataset requests from the client
  - provides job management, including error handling
  - separate daemons handle dataset submission, queue/job management & monitoring
- Queuing Plugin(s)
  - adaptable to different sites and batch systems
- Logging/monitoring Database
  - production status & troubleshooting
  - remote job management
  - unified monitoring for multiple clusters
- Web Interface (Ian Rae)
  - cluster/dataset/job monitoring
  - search engine for production db
  - dataset statistics
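One way to picture queuing plugins that adapt to different sites and batch systems is a small common interface with one subclass per batch system. This is only a sketch: the class names, the registry, and the choice of HTCondor and PBS as examples are assumptions, since the slides do not name the batch systems or show the plugin API.

```python
from abc import ABC, abstractmethod


class QueuePlugin(ABC):
    """Hypothetical plugin interface; not the actual SimProd API."""

    @abstractmethod
    def submit_command(self, submit_file):
        """Return the command line (as a list) that submits one job."""


class CondorPlugin(QueuePlugin):
    # Example only: HTCondor jobs are submitted with condor_submit.
    def submit_command(self, submit_file):
        return ["condor_submit", submit_file]


class PBSPlugin(QueuePlugin):
    # Example only: PBS jobs are submitted with qsub.
    def submit_command(self, submit_file):
        return ["qsub", submit_file]


# Assumed registry mapping a site's batch system name to its plugin class.
PLUGINS = {"condor": CondorPlugin, "pbs": PBSPlugin}


def get_plugin(batch_system):
    """Look up and instantiate the plugin for a site's batch system."""
    try:
        return PLUGINS[batch_system]()
    except KeyError:
        raise ValueError(f"no queue plugin for batch system {batch_system!r}")
```

With this shape, supporting a new site means adding one subclass and one registry entry, while the server code that calls `submit_command` stays unchanged.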

job management
- job goes through a series of states
- Web Interface (Ian Rae)
- job/server communication
- job eviction
- file transfer error
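The idea of a job moving through a series of states, with errors such as lost communication, eviction, or a failed file transfer, can be sketched as a small state machine. The state names, the retry limit, and the requeue-on-error policy are assumptions made for illustration; the slides do not list the actual states or the recovery rules.

```python
from enum import Enum


class State(Enum):
    # Hypothetical state names; the slides do not enumerate the real ones.
    WAITING = "waiting"
    QUEUED = "queued"
    RUNNING = "running"
    COPYING = "copying"
    DONE = "done"
    FAILED = "failed"


# Normal (error-free) path through the states.
NEXT_STATE = {
    State.WAITING: State.QUEUED,
    State.QUEUED: State.RUNNING,
    State.RUNNING: State.COPYING,
    State.COPYING: State.DONE,
}

# Error kinds named on the slide, treated here as recoverable (an assumption).
RECOVERABLE_ERRORS = {"communication", "eviction", "file_transfer"}
MAX_RETRIES = 3  # assumed retry budget


class Job:
    def __init__(self, job_id):
        self.job_id = job_id
        self.state = State.WAITING
        self.retries = 0

    def advance(self):
        """Move to the next state on the normal path."""
        if self.state not in NEXT_STATE:
            raise ValueError(
                f"job {self.job_id} is in terminal state {self.state.value}"
            )
        self.state = NEXT_STATE[self.state]
        return self.state

    def handle_error(self, kind):
        """Requeue on a recoverable error while retries remain, else fail."""
        if kind in RECOVERABLE_ERRORS and self.retries < MAX_RETRIES:
            self.retries += 1
            self.state = State.WAITING
        else:
            self.state = State.FAILED
        return self.state
```

A monitoring daemon could call `advance` as status reports arrive from the job and `handle_error` when a report indicates one of the listed failures.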

production sites
- current simulation production grid

computing
- different architectures, operating systems, and batch systems
- different policies
- each site provides a local contact person
  - work with local sys admin
  - maintain production
  - monitor runtime & completion
  - troubleshoot system
  - check data integrity

photonics
- photon interaction probability tables are produced with a detailed model of ice properties
- the full set of tables is ~14 GB (too large for memory on 32-bit systems)
- we sort events into zenith bins and process each bin separately
- current production clusters have tables pre-installed on nodes
- this limits our ability to add new clusters or large grids for simulation production
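Sorting events into zenith bins so that only one bin's tables need to be in memory at a time can be sketched as follows. The 10-degree bin width, the `zenith_deg` event field, and the `load_tables` and `simulate` callables are hypothetical placeholders; the slides do not give the real bin boundaries or the photonics table interface.

```python
import math
from collections import defaultdict


def zenith_bin(zenith_deg, bin_width_deg=10.0):
    """Map a zenith angle in [0, 180] degrees to a bin index."""
    n_bins = int(math.ceil(180.0 / bin_width_deg))
    index = int(zenith_deg // bin_width_deg)
    # Clamp so that exactly 180 degrees falls into the last bin.
    return min(max(index, 0), n_bins - 1)


def group_by_zenith(events, bin_width_deg=10.0):
    """Group events by zenith bin, returning bins in ascending order."""
    bins = defaultdict(list)
    for event in events:
        bins[zenith_bin(event["zenith_deg"], bin_width_deg)].append(event)
    return dict(sorted(bins.items()))


def process_by_bin(events, load_tables, simulate, bin_width_deg=10.0):
    """Load the tables for one bin at a time and process that bin's events."""
    results = []
    for index, batch in group_by_zenith(events, bin_width_deg).items():
        tables = load_tables(index)  # only this bin's subset is held in memory
        results.extend(simulate(event, tables) for event in batch)
    return results
```

Each table subset is loaded once per occupied bin rather than once per event, which is what keeps the memory footprint below the full table set.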

data storage
- archive documented through DIF metadata
  - DIF (Directory Interchange Format) adapted to astrophysics
  - SimProd automatically generates DIF from simulation parameters
  - pending: interface SimProd with the data warehouse ingest system
- files collected from sites and stored at UW
  - local site contacts manually transfer data to UW
  - pending: automatic data movement from sites (testing GridFTP)
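Generating DIF metadata automatically from simulation parameters might look roughly like the sketch below. It fills only three fields (`Entry_ID`, `Entry_Title`, `Summary`), and the ID scheme, the parameter names, and the way parameters are folded into the summary are assumptions; the astrophysics adaptation of DIF used by SimProd is not described in the slides.

```python
import xml.etree.ElementTree as ET


def build_dif(dataset_id, parameters):
    """Build a minimal DIF-style XML record for one simulation dataset.

    The field subset and the ID/title conventions are illustrative assumptions.
    """
    root = ET.Element("DIF")
    ET.SubElement(root, "Entry_ID").text = f"simprod-dataset-{dataset_id}"
    ET.SubElement(root, "Entry_Title").text = f"Simulation dataset {dataset_id}"
    # Fold the simulation parameters into the summary in a stable order.
    summary = "; ".join(f"{key}={value}" for key, value in sorted(parameters.items()))
    ET.SubElement(root, "Summary").text = summary
    return ET.tostring(root, encoding="unicode")


# Hypothetical parameter names and values, for illustration only.
record = build_dif(7, {"geometry": "example-geometry", "spectrum": "E^-2"})
```

Deriving the record from the same parameters that configure the job keeps the archive description consistent with what was actually simulated.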

work in progress
some features planned for SimProd:
- automatic file transfer (GridFTP)
- dynamic collection and reporting of detailed simulation statistics
- better user interface
- search engine that will let users display datasets based on different criteria (e.g. geometry, primary spectrum, grid site)

links
- Simulation Production Web Page: http://internal.icecube.wisc.edu/simulation
- sim-prod documentation and wiki: http://wiki.icecube.wisc.edu/index.php/SimProd