NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.


1 NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE
SAN DIEGO SUPERCOMPUTER CENTER
Particle Physics Data Grid
PPDG Data Handling System
Reagan Moore, Bing Zhu, Arcot Rajasekar, Michael Wan, Wayne Schroeder
moore@sdsc.edu

2 PPDG Data Grid Requirements
  Access legacy systems
  Interface to local storage managers
  Manage replicas
  Interoperate with GSS API
  Interoperate with data grid manager

3 PPDG Support Tasks
  Software development
  Interface to LBNL Storage Manager
  Collection creation
  Data handling system installation
  Demonstration of replication

4 Data Handling System
  SDSC Storage Resource Broker
  Collection-based management of distributed data sets
  Designed to:
    Function over Wide Area Network
    Support access to archives, file systems, databases
    Work across administration domains
    Manage replicas, containers, metadata
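The goal of one access path to archives, file systems, and databases can be pictured as a common driver interface behind a broker. The sketch below is illustrative only: `StorageDriver`, `InMemoryDriver`, and `Broker` are hypothetical names rather than SRB classes, and an in-memory dictionary stands in for every real backend.

```python
from abc import ABC, abstractmethod


class StorageDriver(ABC):
    """Hypothetical common interface; not an actual SRB class."""

    @abstractmethod
    def read(self, path: str) -> bytes: ...

    @abstractmethod
    def write(self, path: str, data: bytes) -> None: ...


class InMemoryDriver(StorageDriver):
    """Stand-in for an archive, file system, or database backend."""

    def __init__(self) -> None:
        self._store = {}  # path -> bytes

    def read(self, path: str) -> bytes:
        return self._store[path]

    def write(self, path: str, data: bytes) -> None:
        self._store[path] = data


class Broker:
    """Routes requests to a named resource so callers never see the backend type."""

    def __init__(self) -> None:
        self._resources = {}  # resource name -> StorageDriver

    def add_resource(self, name: str, driver: StorageDriver) -> None:
        self._resources[name] = driver

    def put(self, resource: str, path: str, data: bytes) -> None:
        self._resources[resource].write(path, data)

    def get(self, resource: str, path: str) -> bytes:
        return self._resources[resource].read(path)


# Resource names and the file name are invented for the example.
broker = Broker()
broker.add_resource("archive", InMemoryDriver())
broker.add_resource("filesystem", InMemoryDriver())
broker.put("archive", "run1.dat", b"events")
broker.put("filesystem", "run1.dat", b"events")
```

The caller uses the same `put` and `get` calls regardless of which kind of resource holds the data.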

5 SDSC Storage Resource Broker & Meta-data Catalog
[Architecture diagram. Labels: Application; SRB; MCAT; ADSM; HPSS; DB2; Oracle; Unix; File SID; DBLobj SID; Obj SID; Dublin Core, Resource, User, and Application Meta-data; Remote Proxies; DataCutter; Third-party copy]

6 Digital Library Data Management
  Persistent identifiers
    Ability to move a data set without the name changing
  Data set replicas
    Management of multiple copies of a data set
  Archival backup of data sets
    Integration of disk data caches with archival storage
  Persistent archives
    Management of a collection through multiple cycles of technology evolution
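Persistent identifiers and replicas both come down to keeping the logical name separate from the physical locations. A minimal sketch, assuming a simple in-memory catalog: `ReplicaCatalog` is a hypothetical class, not the MCAT schema, and the hosts and paths are made up.

```python
class ReplicaCatalog:
    """Toy catalog mapping a logical name to its physical copies (hypothetical)."""

    def __init__(self):
        self._replicas = {}  # logical name -> list of (host, path)

    def register(self, logical_name, host, path):
        """Record one more physical copy under the same logical name."""
        self._replicas.setdefault(logical_name, []).append((host, path))

    def move(self, logical_name, old, new):
        """Relocate one physical copy; the logical name is untouched."""
        copies = self._replicas[logical_name]
        copies[copies.index(old)] = new

    def locate(self, logical_name):
        """Return every known physical copy of the data set."""
        return list(self._replicas[logical_name])


catalog = ReplicaCatalog()
name = "/collection/example.dat"  # invented logical name
catalog.register(name, "host-a.example.org", "/data/example.dat")
catalog.register(name, "host-b.example.org", "/data/example.dat")

# Move the first copy into archival storage; clients keep using `name`.
catalog.move(
    name,
    ("host-a.example.org", "/data/example.dat"),
    ("host-a.example.org", "/archive/example.dat"),
)
```

After the move, `catalog.locate(name)` still resolves through the unchanged logical name and returns both copies.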

7 Software Development
  Sstage - request to an HRM to pre-stage a file
  SfileStatus - check state
    File cached locally
    File being cached, time to complete is returned
    File in staging queue
    Query rejected, HRM down
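The four SfileStatus outcomes suggest a small state model that a client could poll after issuing a pre-stage request. The following is a sketch under stated assumptions: `FileState`, `FakeHRM`, and `wait_until_cached` are hypothetical names, the HRM is replaced by a scripted stub, and the real command's output format is not shown in the slides.

```python
from enum import Enum


class FileState(Enum):
    """The four states listed for SfileStatus (names are invented here)."""
    CACHED = "file cached locally"
    CACHING = "file being cached"
    QUEUED = "file in staging queue"
    REJECTED = "query rejected, HRM down"


class FakeHRM:
    """Scripted stand-in for an HRM; each status call returns the next reply."""

    def __init__(self, replies):
        self._replies = iter(replies)

    def file_status(self, path):
        return next(self._replies)


def wait_until_cached(hrm, path, max_polls=10):
    """Poll until the file is cached; return the list of states observed."""
    seen = []
    for _ in range(max_polls):
        state, eta_seconds = hrm.file_status(path)
        seen.append(state)
        if state is FileState.CACHED:
            return seen
        if state is FileState.REJECTED:
            raise RuntimeError("HRM rejected the query")
        # A real client would sleep here, e.g. for eta_seconds when CACHING.
    raise TimeoutError("file not cached after max_polls status checks")


hrm = FakeHRM([
    (FileState.QUEUED, None),
    (FileState.CACHING, 12),  # 12 s is an invented time-to-complete
    (FileState.CACHED, None),
])
states = wait_until_cached(hrm, "example.dat")
```

Only the CACHING reply carries a time to complete, matching the slide; the other replies carry `None` in this sketch.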

8 Software Development
  Sget - synchronous request to stage, transfer file, and purge the local cache
  Register a data set as a replica
    Allows data sets to be moved independently of the SRB, and then registered
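The three steps bundled into Sget (stage, transfer, purge) can be sketched as one blocking call. This is an illustration, not the SRB implementation: `FakeCache` and `sget` are hypothetical, storage is in-memory, and purging even when the transfer fails is an assumption made here rather than something the slides state.

```python
class FakeCache:
    """In-memory stand-in for the disk cache in front of archival storage."""

    def __init__(self, archive):
        self.archive = archive  # path -> bytes, plays the role of the archive
        self.cached = {}        # path -> bytes currently in the disk cache
        self.log = []           # ordered record of operations performed

    def stage(self, path):
        """Copy the file from the archive into the disk cache."""
        self.cached[path] = self.archive[path]
        self.log.append("stage")

    def transfer(self, path):
        """Return the cached bytes to the caller."""
        self.log.append("transfer")
        return self.cached[path]

    def purge(self, path):
        """Drop the file from the disk cache."""
        self.cached.pop(path, None)
        self.log.append("purge")


def sget(cache, path):
    """Synchronous get: stage, transfer, then purge (also on transfer failure)."""
    cache.stage(path)
    try:
        return cache.transfer(path)
    finally:
        cache.purge(path)


cache = FakeCache({"example.dat": b"events"})
data = sget(cache, "example.dat")
```

After the call, `cache.log` records the three steps in order and the disk cache is empty again.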

9 Collection Creation
  Assembled 4750 datasets into SRB collection
    /home/lblsrb.lbl/PPDG
    SRB server ‘unix-test2-lbl’
  Replicated 30 data sets
    From starsu00.nersc.edu
    To vulture.cs.wisc.edu

10 PPDG Data Grid
  Sites:
    LBNL - HRM interface
    LBNL - file system
    Wisconsin - file system
    CalTech - HPSS and file system
    Fermi Lab - file system
    Stanford - file system
    SDSC - HPSS and file system

11 Current Data Grid
[Diagram. Labels: Wisc Client 1; Wisc Client 2; S-Commands; SRB Server @Wisc; SRB Server @LBL; esrb.driver; esrb.server; IPC; HRM; Stage(); purge(); fileStatus(); File caching request; file caching; file purging; Disk cache; HPSS; FC]

12 PPDG FY2000 Tasks
  Upgrade to version 1.1.7
    Supports GSI authentication
  Revise SfileStatus to meet current design changes
  Integrate registering of replica into production system
  Support data subsetting

13 Data Set Management
  Model-Based Information Management
    Rule-based ontology mapping, conceptual-level mediation - CMIX
  Data Grid
    Data federation across multiple libraries - MIX
  Digital Library
    Interoperable services for information discovery and presentation - SDLIP
  Data Collection
    Tools for managing data set collections on databases - MCAT
  Data Handling
    Systems for data retrieval from remote storage - SRB
  Persistent Archives
    Storage of data collections for 30 years - HPSS

14 Further Information
  http://www.npaci.edu/DICE

