QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.

QCDgrid Technology
• Currently a 4-site data grid
  – provides reliable distributed data storage, including a searchable metadata catalogue
• A job submission system is also deployed
• Key technologies used
  – Globus Toolkit 2.4
  – European Data Grid
  – eXist XML database
• Custom QCDgrid software builds on all these technologies, adding extra functionality and providing a convenient user interface

The Data Grid
• The data storage grid has been up and running for several months and is currently managing a few hundred gigabytes of data
• Replication of files is managed by custom written software
  – built on Globus 2.4
• Central control thread
  – ensures there are always at least two copies of each file, stored at different sites
• Replica catalogue
  – maps logical filenames to actual physical locations
• Command line interface
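The replication rule on this slide can be illustrated with a small sketch. This is not the QCDgrid implementation, which is built on Globus 2.4. It is a minimal in-memory model, and the class name `ReplicaCatalogue`, the function `plan_replication`, the site names and the file names are all assumptions made for illustration.

```python
MIN_COPIES = 2  # the slide states at least two copies, at different sites


class ReplicaCatalogue:
    """Hypothetical catalogue: logical filename -> sites holding a copy."""

    def __init__(self):
        self._sites = {}

    def register(self, logical_name, site):
        # Record that `site` holds a physical copy of `logical_name`.
        self._sites.setdefault(logical_name, set()).add(site)

    def sites_for(self, logical_name):
        # Return a copy so callers cannot modify the catalogue by accident.
        return set(self._sites.get(logical_name, set()))

    def logical_names(self):
        return sorted(self._sites)


def plan_replication(catalogue, all_sites, min_copies=MIN_COPIES):
    """List the (logical_name, target_site) copies needed to reach min_copies."""
    plan = []
    for name in catalogue.logical_names():
        held = catalogue.sites_for(name)
        missing = max(0, min_copies - len(held))
        # Only sites that do not already hold the file are valid targets.
        candidates = [site for site in all_sites if site not in held]
        for target in candidates[:missing]:
            plan.append((name, target))
    return plan


catalogue = ReplicaCatalogue()
catalogue.register("config_0001.dat", "site_a")
catalogue.register("config_0001.dat", "site_b")
catalogue.register("config_0002.dat", "site_a")  # only one copy so far

sites = ["site_a", "site_b", "site_c", "site_d"]
plan = plan_replication(catalogue, sites)
print(plan)  # [('config_0002.dat', 'site_b')]
```

A control thread in this model would call `plan_replication` periodically and carry out each planned copy. How QCDgrid actually schedules and performs transfers is not described in the slides.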

Metadata
• Raw logical filenames may not be meaningful, so data could be hard to find
• Metadata catalogue associates some important information with each file, making it easy to search the grid
• Each file on the datagrid may have an associated XML metadata document
[Diagram: Application, Metadata Service, Data Grid; labels: Attributes of Desired Data, Logical File Names, Nearest file copy; steps (1) to (4)]
• These documents are stored in the eXist open source XML database, where they can be searched using the XPath query language
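As a rough illustration of searching metadata documents with XPath, the sketch below uses Python's standard `xml.etree.ElementTree`, which supports only a limited XPath subset, in place of eXist. The element names (`metadata`, `physics`, `beta`), the values and the file names are hypothetical and are not taken from the QCDgrid schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical metadata documents, keyed by logical filename.
DOCUMENTS = {
    "config_0001.dat": "<metadata><physics><beta>5.2</beta></physics></metadata>",
    "config_0002.dat": "<metadata><physics><beta>5.29</beta></physics></metadata>",
    "config_0003.dat": "<metadata><physics><beta>5.2</beta></physics></metadata>",
}


def find_files(documents, xpath):
    """Return the logical filenames whose metadata matches the XPath query."""
    matches = []
    for name, xml_text in sorted(documents.items()):
        root = ET.fromstring(xml_text)
        # findall returns a non-empty list when the document matches.
        if root.findall(xpath):
            matches.append(name)
    return matches


# Select files whose <physics> element has a <beta> child equal to "5.2".
matches = find_files(DOCUMENTS, "physics[beta='5.2']")
print(matches)  # ['config_0001.dat', 'config_0003.dat']
```

The logical filenames returned by such a query are what the application would then pass to the data grid to fetch the files.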

Metadata Browser
• A browser GUI written in Java provides a user friendly interface to the XML metadata
• Originally developed by OGSA-DAI, extended and now maintained by QCDgrid team

Metadata/Datagrid Integration
• Browser also integrates with lower level datagrid software through the Java Native Interface
• Data can be fetched from the grid simply using the GUI
• A simple GUI exists for data/metadata submission

Job Submission: Requirements
• Next stage of the project is to allow data generation/analysis jobs to be easily submitted to grid machines
• Integration with the existing datagrid is desirable
• Resource brokering is not particularly important: users normally know in advance on which machine a job should run, with a few exceptions
• Real-time job status monitoring would be useful
• Must work with a diverse range of machines, from normal (Linux) PCs to the QCDOC supercomputer
• User-friendly GUI or web portal if time permits

Job Submission: Technology
• As with datagrid, requirements dictate a combination of existing software and purpose-built middleware
  – Globus toolkit used for low level access to grid resources and data
  – European Data Grid software used for virtual organisation management and security
  – Batch systems such as PBS integrated with the system
  – QCDgrid job submission software builds on these components, providing the interface and features that users need

Job Submission: Status
• Job submission system was developed on a test grid before being deployed on the main datagrid
  – Jobs can now be submitted to grid resources using a command line tool
  – Input files can be fetched automatically from datagrid
  – Job output and input can be streamed to and from the user’s console, allowing jobs to be monitored, and even interactive jobs to run on grid resources (useful for debugging)
  – All output files generated by the job are automatically brought back to the user’s local machine, or optionally stored on the datagrid
  – Can also submit to machines with only Globus (no QCDgrid installation)
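The input and output handling listed above can be pictured with a toy simulation. It assumes the datagrid is a plain dictionary of file contents and a job is a Python function. The names `run_job` and `count_chars` and the file names are invented for illustration and do not correspond to the QCDgrid command line tool.

```python
def run_job(job, datagrid, store_on_grid=False):
    """Simulate stage-in, execution and stage-out for a single job."""
    # Stage in: fetch every requested input file from the datagrid.
    workdir = {name: datagrid[name] for name in job["inputs"]}
    # Execute: the job returns its output files as a name -> content mapping.
    outputs = job["run"](workdir)
    # Stage out: either store outputs on the datagrid or return them locally.
    if store_on_grid:
        datagrid.update(outputs)
        return {}
    return outputs


def count_chars(files):
    # Stand-in for an analysis job: report the size of its input file.
    text = files["config_0001.dat"]
    return {"result.txt": str(len(text))}


datagrid = {"config_0001.dat": "gauge field data"}
job = {"inputs": ["config_0001.dat"], "run": count_chars}

local_files = run_job(job, datagrid)
print(local_files)  # {'result.txt': '16'}
```

Calling `run_job(job, datagrid, store_on_grid=True)` instead leaves `result.txt` in the `datagrid` dictionary and returns nothing to the local side, mirroring the optional storage of outputs on the datagrid.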

QCDgrid 2
• A follow on project, QCDgrid 2, is beginning
  – part of the larger GridPP 2 collaboration
• Will build on what was created by QCDgrid
  – creating well defined web service interfaces to grid functionality
  – assisting international QCD collaboration efforts
  – strengthening the software to cope with increased loads in future
  – general maintenance and support for existing grid

Beyond UKQCD: ILDG
• ILDG stands for International Lattice DataGrid
  – A collaboration of scientists involved in lattice QCD from all over the world (UK, Japan, USA, France, Germany, Australia and other countries)
  – Working on standards to allow national datagrids to interoperate, for easier data sharing
  – Two working groups looking at different aspects of this goal: metadata and middleware
  – QCDgrid 2 has time specifically allocated for ILDG work

ILDG/QCDgrid 2 Technology
• ILDG is setting standards for interoperability between grids for QCD; the QCDgrid 2 project will implement them on the UK’s grid
  – web service interface to metadata catalogue functionality
  – web service interface to data storage grid functionality, possibly based on Storage Resource Manager (SRM)
  – common XML schema for metadata. A schema for describing gauge configuration metadata has already been defined and will be extended to other data types
• Security will be a bigger issue for international collaboration

Other QCDgrid 2 Work
• Maintain the existing QCDgrid software and provide support
• Make any necessary additions to cope with future changes in usage of the grid
  – particularly QCDOC coming online later this year, which will likely produce unprecedented volumes of data
  – a risk analysis is underway to identify possible limitations of the software and how to deal with them
• Provide tools to assist in the generation of metadata documents
  – a web-based form is the likely interface

Summary
• QCDgrid project has developed a grid for use by the UKQCD collaboration for storing data and performing computations
  – This consists of three software components: data grid, metadata catalogue and job submission tool
  – Software based on Globus toolkit and European Data Grid middleware
• QCDgrid 2 project starting
  – will focus on international standards and web service interfaces
  – as well as maintaining and improving what already exists
