ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.

Slides:



Advertisements
Similar presentations
30-31 Jan 2003J G Jensen, RAL/WP5 Storage Elephant Grid Access to Mass Storage.
Advertisements

QCDgrid User Interfaces James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
The Quantum Chromodynamics Grid James Perry, Andrew Jackson, Matthew Egbert, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
JLab Lattice Portal – Data Grid Web Service Ying Chen, Chip Watson Thomas Jefferson National Accelerator Facility.
ILDG File Format Chip Watson, for Middleware & MetaData Working Groups.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Software Frameworks for Acquisition and Control European PhD – 2009 Horácio Fernandes.
1 The World Wide Web. 2  Web Fundamentals  Pages are defined by the Hypertext Markup Language (HTML) and contain text, graphics, audio, video and software.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Talend 5.4 Architecture Adam Pemble Talend Professional Services.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
M. Taimoor Khan * Java Server Pages (JSP) is a server-side programming technology that enables the creation of dynamic,
ILDG5QCDgrid1 QCDgrid status report UKQCD data grid Chris Maynard.
The Japanese Virtual Observatory (JVO) Yuji Shirasaki National Astronomical Observatory of Japan.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
11/16/2012ISC329 Isabelle Bichindaritz1 Web Database Application Development.
ILDG Middleware Status Presented By: Bálint Joó, Jlab, USA Working Group Members: G. Beckett (EPCC, UK) T. Boku (CCS Tsukuba, Japan) D. Byrne (EPCC, UK)
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
Presentation: SOAP/WS in a distributed object framework, Application Servers & AXIS SOAP.
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Presentation: SOAP/WS in a distributed object framework, Application Servers & AXIS SOAP.
ILDG Middleware Status Bálint Joó UKQCD University of Edinburgh, School of Physics on behalf of ILDG Middleware Working Group alternative title: Report.
Lattice QCD Data Grid Middleware: status report M. Sato, CCS, University of Tsukuba ILDG6, May, 12, 2005.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
Operated by the Southeastern Universities Research Association for the U.S. Depart. Of Energy Thomas Jefferson National Accelerator Facility Andy Kowalski.
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
1 ILDG Status in Japan  Lattice QCD Archive(LQA) a gateway to ILDG Japan Grid  HEPNet-J/sc an infrastructure for Japan Lattice QCD Grid A. Ukawa Center.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Server-side Programming The combination of –HTML –JavaScript –DOM is sometimes referred to as Dynamic HTML (DHTML) Web pages that include scripting are.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
January 26, 2003Eric Hjort HRMs in STAR Eric Hjort, LBNL (STAR/PPDG Collaborations)
Lattice QCD Data Grid Middleware: Meta Data Catalog (MDC) -- CCS ( tsukuba) proposal -- M. Sato, for ILDG Middleware WG ILDG Workshop, May 2004.
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
USQCD regional grid Report to ILDG /28/09ILDG14, June 5, US Grid Usage  Growing usage of gauge configurations in ILDG file format.  Fermilab.
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
Bookkeeping Tutorial. 2 Bookkeeping content  Contains records of all “jobs” and all “files” that are produced by production jobs  Job:  In fact technically.
Query Health Technical WG Update 1/12/2011. Agenda TopicTime Slot Administrative stuff and reminders2:00 – 2:05 pm Specification Updates QRDA HQMF Query.
Grid Interoperability Update on GridFTP tests Gregor von Laszewski
LCG Distributed Databases Deployment – Kickoff Workshop Dec Database Lookup Service Kuba Zajączkowski Chi-Wei Wang.
USGS GRID Exploratory Status Review Stuart Doescher Mike Neiers USGS/EDC May
10 May 2001WP6 Testbed Meeting1 WP5 - Mass Storage Management Jean-Philippe Baud PDP/IT/CERN.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
IBM Express Runtime Quick Start Workshop © 2007 IBM Corporation Deploying a Solution.
1 DIRAC Data Management Components A.Tsaregorodtsev, CPPM, Marseille DIRAC review panel meeting, 15 November 2005, CERN.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Scientific Linux Inventory Project (SLIP) Troy Dawson Connie Sieh.
Node.js Express Web Applications
Data Bridge Solving diverse data access in scientific applications
Outline SOAP and Web Services in relation to Distributed Objects
Outline SOAP and Web Services in relation to Distributed Objects
LQCD Computing Operations
Patrick Dreher Research Scientist & Associate Director
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
Module 01 ETICS Overview ETICS Online Tutorials
Presentation transcript:

ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005

Outline Status: small changes from Dec 2005 Status: small changes from Dec 2005 Quick review of architecture Quick review of architecture Minimal implementation facts Minimal implementation facts Next steps Next steps

Status (quick look) Only a small amount of middleware work has been done in the last 6 months Only a small amount of middleware work has been done in the last 6 months –development of new metadata catalog prototype at Adelaide based on XML database –modifications to metadata catalog prototype at Fermilab to conform to new interface –small amount of work on replica catalog prototypes at several sites (JLab, Adelaide, Fermilab) Architecture remains unchanged Architecture remains unchanged

Architecture (review) Web Services Web Services –Metadata Catalog maps meta data to a global name –Replica Catalog maps a global name to one or more instances –Storage Resource Manager (optional) manages a disk, or disk + tape resource Draft schemas (WSDL) for these services exist

Architecture (review) File based directories contain... File based directories contain... –Master directory of all collaborations’ MDC, RC and membership lists, stored as XML files –Distributed group membership lists (XML) Initial version of schemas (XML) exist

Implementation View Master Directory contains for each collaboration: metadata catalog replica catalog group membership MDC for UKQCD MDC for USQCD MDC for Japan RC for UKQCD RC for USQCD RC for Japan Japan group file UKQCD group file USQCD group file subgroup A file subgroup B file file X

MetaData Catalog ILDG schema defines only a query interface ILDG schema defines only a query interface –multiple query languages (syntax) allowed for now (no clear winner yet) –queries map from physics metadata values to Global File Name (GFN) –proposed minor modification can also return the full physics metadata

Minimal Implementation Master XML directory to be held at Master XML directory to be held at For each collaboration, need at least these: For each collaboration, need at least these: –MetaDataCatalog (e.g. running at ) –trivial Replica Catalog (does 1:1 name mapping) –standard web or ftp server to serve files

Getting going... (or, what must a collaboration do?) First: Deploy a metadata catalog 1.choose an existing prototype & deploy 2.populate the catalog with qcdml v1.1 compliant documents, with ILDG compliant GFN’s (global file names) Note: names must have collaboration name as part of the string; this name matches the entry name in the master directory: gfn://collaboration/local-name Note: names must have collaboration name as part of the string; this name matches the entry name in the master directory: gfn://collaboration/local-name 3.request to add your MDC to the master directory on

Getting going... (or, what must a collaboration do?) Second: Deploy a replica catalog 1.(option 1) write a simple function which maps your collaboration’s GFN naming convention into a static URL pointing to the file (i.e., no database, just string shuffling) OR 2.(option 2) get / implement a true RC, with multiple instance tracking (a database) 3.request to add your RC to the master directory on Third: Serve the files (http, ftp, srm,...)

Nice things to also do... Deploy a real RC, which can track another collaboration’s copies of your files Deploy a real RC, which can track another collaboration’s copies of your files Populate a group membership file, to support group read/write access (otherwise your collaboration is relegated to “world” status) Populate a group membership file, to support group read/write access (otherwise your collaboration is relegated to “world” status) Deploy an SRM (with protocol negotiation) and also at least one file server that supports parallel streams (gridftp, bbftp,...) for higher performance file retrieval Deploy an SRM (with protocol negotiation) and also at least one file server that supports parallel streams (gridftp, bbftp,...) for higher performance file retrieval Implement a web interface to your metadata catalog Implement a web interface to your metadata catalog

Near Term Expectations Adelaide will deploy an MDC, RC within the next few months Adelaide will deploy an MDC, RC within the next few months USQCD will also try to match this within the next 6 months, but is currently distracted with getting machines into production USQCD will also try to match this within the next 6 months, but is currently distracted with getting machines into production others have not committed yet others have not committed yet

Australian ILDG Node Paul Coddington School of Computer Science, University of Adelaide South Australian Partnership for Advanced Computing May 2005

Overview A prototype ILDG node has been set up in Australia for data from the Centre for the Subatomic Structure of Matter (CSSM). We have developed a metadata catalog, replica catalog and web portal. Currently just allows searching, browsing and downloading of QCDML metadata –ability to download configuration files will be added later. Metadata for around 50 ensembles is currently available.

Metadata Catalog Ensemble and configuration QCDML metadata is generated as XML files which are loaded into Apache Xindice, an XML database. The metadata catalog web service was developed in Java using Xindice's implementation of the XML:DB API for XML databases. –So should work with other XML databases It (almost) conforms to the metadata catalog interface defined by the ILDG Middleware Working Group. –Added additional parameter to specify returning GFNs or XML XPath queries are passed directly to the XML database.

Other Components Replica catalog is a web service wrapper around the Replica Location Service for Globus Toolkit 3. –Plan to change this to GT4 RLS or something else. No mechanism for downloading files yet –Will initially generate wget script, like Japanese portal. –Then investigate using SRM. Web portal written using JSP. – All software will be made freely available after code is cleaned up and documented.

Middleware Working Group Near Term Task List Approve minor changes to MDC interface Approve minor changes to MDC interface Decide on the URL for, and deploy: Decide on the URL for, and deploy: master directory file master membership file Collect official CA certificates from all collaborations and post at for all to easily retrieve (for configuring servers for strongly authenticated operations) Collect official CA certificates from all collaborations and post at for all to easily retrieve (for configuring servers for strongly authenticated operations)

Most Significant Challenges Get data into ILDG compliant format Get data into ILDG compliant format –create or automate creation of metadata compliant with qcdml1.1 –write files in ILDG format (or write translation program for on-the-fly translation) will LQCD application developers do this? or will manpower need to be found for translation programs? Get the MDC operational and populated Get the MDC operational and populated (other tasks are comparatively easy)

Other Challenges Manpower to implement a nice user interface for browsing, and optionally retrieving files Manpower to implement a nice user interface for browsing, and optionally retrieving files (once per collaboration, or shared, even hosted at ?) Manpower to write some simple command line client tools to be used in workflow scripting Manpower to write some simple command line client tools to be used in workflow scripting Goal of reaching an operational status by June 2006 is still feasible!