San Diego Supercomputer Center, National Partnership for Advanced Computational Infrastructure

Presentation transcript:

San Diego Supercomputer Center, National Partnership for Advanced Computational Infrastructure, University of Florida
SRB-n-SRM for HEP Data Management (Possible architectures for brainstorming)
Arun swaran Jagatheesan, San Diego Supercomputer Center

Grid Physics Network (GriPhyN), University of Florida
Slide 2: A picture is worth a 1000 KB
First, options and possibilities in the overall architecture

Slide 3: Distributed SRMs
[Diagram: SRMs at three sites, a National Lab in US (A), a University in UK (B), and Organizations in Asia (C & D), holding files such as /…/text1.txt, /…//text2.txt, and /txt3.txt]

Slide 4: Global Logical Namespace Needed
[Diagram: the SRMs at the National Lab in US (A), the University in UK (B), and the Organizations in Asia (C & D), with physical files /…/text1.txt, /…//text2.txt, and /txt3.txt, presented as one logical collection]
/home/arun.sdsc/cms
/home/arun.sdsc/cms/text1.txt
/home/arun.sdsc/cms/text2.txt
/home/arun.sdsc/cms/text3.txt
Logical Namespace: organization of data, metadata, and storage. (This logical view need not be the same as the physical view of the data sources.)
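The idea on this slide, one logical path per file regardless of where it physically lives, can be illustrated with a minimal Python sketch. This is not the SRB or SRM API; the class names, the site labels, and the assignment of each file to a site are assumptions made for illustration. The physical paths are copied as they appear on the slide, elisions included.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Replica:
    site: str            # hypothetical site label (A, B, C from the slide)
    physical_path: str   # path as shown on the slide, elisions kept as-is


@dataclass
class LogicalNamespace:
    """Toy catalog mapping global logical paths to physical replicas."""
    entries: dict = field(default_factory=dict)

    def register(self, logical_path, replica):
        # A logical path may have several replicas at different sites.
        self.entries.setdefault(logical_path, []).append(replica)

    def resolve(self, logical_path):
        # Return a copy so callers cannot mutate the catalog.
        return list(self.entries.get(logical_path, []))

    def list_collection(self, collection):
        # List logical paths under a collection, independent of location.
        prefix = collection.rstrip("/") + "/"
        return sorted(p for p in self.entries if p.startswith(prefix))


ns = LogicalNamespace()
# Which site holds which file is assumed here; the slide does not say.
ns.register("/home/arun.sdsc/cms/text1.txt", Replica("A", "/…/text1.txt"))
ns.register("/home/arun.sdsc/cms/text2.txt", Replica("B", "/…//text2.txt"))
ns.register("/home/arun.sdsc/cms/text3.txt", Replica("C", "/txt3.txt"))

listing = ns.list_collection("/home/arun.sdsc/cms")
```

A user browsing /home/arun.sdsc/cms sees three files in one collection, while resolve() reveals that each lives at a different site under an unrelated physical name.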

Slide 5: Global Logical Namespace Needed
[Diagram: the SRMs at the National Lab in US (A), the University in UK (B), and the Organizations in Asia (C & D), with files /…/text1.txt, /…//text2.txt, and /txt3.txt]
If we use SRB for this Global Logical Namespace ... (next slides)

Slide 6: Using Zone SRBs
[Diagram: a Global Logical Namespace spanning an SRB Zone for the National Lab in US, one SRB Zone for all UK universities, and one or more SRB Zones in Asia, layered over the SRMs holding /…/text1.txt, /…//text2.txt, and /txt3.txt]

Slide 7: Possible Options for Zones
Each organization is a Zone with multiple SRMs
One or more organizations make a Zone based on geography and administrative factors
Each country is a Zone
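The three zoning options differ only in the key used to group organizations. A minimal Python sketch, assuming a hypothetical record per organization: the country and region values for C and D are placeholders, since the slides only say "Asia", and the policy names are invented for illustration.

```python
from collections import defaultdict

# Hypothetical organization records; only the labels A-D come from the slides.
ORGS = [
    {"name": "A", "country": "US", "region": "North America"},
    {"name": "B", "country": "UK", "region": "Europe"},
    {"name": "C", "country": "Asia-country-1", "region": "Asia"},
    {"name": "D", "country": "Asia-country-2", "region": "Asia"},
]

# Each policy picks the attribute that decides zone membership.
POLICIES = {
    "per_organization": lambda org: org["name"],
    "per_region": lambda org: org["region"],      # geography/administration
    "per_country": lambda org: org["country"],
}


def assign_zones(orgs, policy):
    """Group organization names into zones under the chosen policy."""
    key = POLICIES[policy]
    zones = defaultdict(list)
    for org in orgs:
        zones[key(org)].append(org["name"])
    return {zone: sorted(members) for zone, members in zones.items()}


by_org = assign_zones(ORGS, "per_organization")
by_region = assign_zones(ORGS, "per_region")
by_country = assign_zones(ORGS, "per_country")
```

Under the region policy C and D share one Asian zone, which matches the "one or more SRB Zones in Asia" wording of the previous slide; the other two policies give each of them its own zone.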

Slide 8: A picture is worth a 1000 KB
Second, access

Slide 9: Do we want XML/SOAP for control?
Services (SOAP-based) to manage and operate on the logical namespace of data and storage resources.
Of course, this is only for the control channel, not the data movement channel.
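To make the control-channel idea concrete, here is a hedged Python sketch that builds a SOAP 1.1 envelope for a namespace operation using only the standard library. The envelope namespace is the standard SOAP 1.1 one; the service namespace, the operation name ListCollection, and the logicalPath parameter are hypothetical and do not come from SRB, SRM, or any GGF specification. No file data travels in this message; data movement would use a separate channel.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope
CTRL_NS = "urn:example:datagrid-control"               # hypothetical service


def build_control_request(operation, params):
    """Serialize a control-channel request as a SOAP envelope (bytes)."""
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    op = ET.SubElement(body, f"{{{CTRL_NS}}}{operation}")
    for name, value in params.items():
        # Each parameter becomes a child element of the operation.
        ET.SubElement(op, f"{{{CTRL_NS}}}{name}").text = value
    return ET.tostring(envelope, encoding="utf-8")


request = build_control_request(
    "ListCollection", {"logicalPath": "/home/arun.sdsc/cms"}
)
```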

Slide 10: Access Option 1: SRM still the interface
[Diagram: SRM as the access interface on top of the Global Logical Namespace, which spans an SRB Zone for the National Lab in US, one SRB Zone for all UK universities, and one or more SRB Zones in Asia, over the SRMs holding /…/text1.txt, /…//text2.txt, and /txt3.txt]

Slide 11: Access Option 1: SRM still the interface
+ Existing applications in SRM can be preserved
+ HEP community is already aware of SRM
+ SRM will emerge as the Grid Storage Management standard in GGF (so it is useful for us to follow the standard)
– SRM interface does not deal with a global logical namespace
– Additional functionalities to manage or take advantage of this global logical namespace would be lost
  – Bulk operations are essential for data grid management
  – Zone to Zone (peer to peer)
  – Metadata
– SRM is NOT the Grid File System standard at GGF

Slide 12: Access Option 2: SRB as the interface
[Diagram: SRB as the access interface on top of the Global Logical Namespace, which spans an SRB Zone for the National Lab in US, one SRB Zone for all UK universities, and one or more SRB Zones in Asia, over the SRMs holding /…/text1.txt, /…//text2.txt, and /txt3.txt]

Slide 13: Access Option 2: SRB as the interface
– Existing applications might have to be modified
– HEP community has to be educated about the use and advantages of SRB (community resistance also)
– Does it directly support storage space management NOW?
+ SRB interface deals with a global logical namespace of logical resources and logical data
+ Additional functionalities to manage or take advantage of this global logical namespace are available
  + Bulk operations are essential for data grid management
  + Zone to Zone (peer to peer)
  + Metadata
+ SRB could influence the Grid File System standards at GGF
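The bulk-operation and metadata points can be illustrated with a small Python sketch: select logical paths by metadata, then apply one operation to the whole selection. This is not SRB code. The catalog layout, the attribute names (experiment, zone), and the helper functions are assumptions for illustration only.

```python
def bulk_select(catalog, **criteria):
    """Return logical paths whose metadata matches every criterion."""
    return sorted(
        path
        for path, meta in catalog.items()
        if all(meta.get(key) == value for key, value in criteria.items())
    )


def bulk_apply(paths, operation):
    """Apply one operation to many logical paths in a single call."""
    return {path: operation(path) for path in paths}


# Hypothetical metadata attached to the logical paths from slide 4.
catalog = {
    "/home/arun.sdsc/cms/text1.txt": {"experiment": "cms", "zone": "US"},
    "/home/arun.sdsc/cms/text2.txt": {"experiment": "cms", "zone": "UK"},
    "/home/arun.sdsc/cms/text3.txt": {"experiment": "cms", "zone": "Asia"},
}

all_cms = bulk_select(catalog, experiment="cms")
uk_only = bulk_select(catalog, experiment="cms", zone="UK")
# The operation is a stand-in; a real system would trigger a transfer here.
planned = bulk_apply(uk_only, lambda path: f"replicate {path}")
```

The selection works entirely on logical names and metadata, which is why an interface without a global logical namespace cannot offer it directly.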

Slide 14: Access Option 3: GGF Grid File System Interface
[Diagram: GFS as the access interface on top of the Global Logical Namespace, which spans an SRB Zone for the National Lab in US, one SRB Zone for all UK universities, and one or more SRB Zones in Asia, over the SRMs holding /…/text1.txt, /…//text2.txt, and /txt3.txt]

Slide 15: Access Option 3: GGF GFS as the interface
– DOES NOT EXIST. We can NOT wait.
– Our limited cycles might be used just working for the GGF standard, rather than making this interface work for us
– What if we cannot influence the standard creation (?), or it's irrelevant for us?
+ It is going to be the standard, so it would possibly end up having multiple implementations (commercial and academic)
+ This community of different organizations and countries could influence the standard creation based on real needs
+ Experts from other communities would also suggest or help in the design of this interface