SAN DIEGO SUPERCOMPUTER CENTER By: Roman Olschanowsky An Introduction to the.

Slides:



Advertisements
Similar presentations
National University Community Research Institute (NUCRI) NU Community Research Institute (NUCRI) HASTAC (higher education)/HASS grid National School Board.
Advertisements

San Diego Supercomputer Center & National Partnership for Advance Computational Infrastructure Storage Resource Broker Reagan W. Moore San Diego Supercomputer.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
The Storage Resource Broker and.
The Storage Resource Broker and.
Overview of the SDSC Storage Resource Broker Wayne Schroeder (and other SRB team members) May, 2004 San Diego Supercomputer Center, University of California.
Peter Berrisford RAL – Data Management Group SRB Services.
SALSA HPC Group School of Informatics and Computing Indiana University.
Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids Reagan W. Moore San Diego Supercomputer Center.
SAN DIEGO SUPERCOMPUTER CENTER By: Roman Olschanowsky Scommands Tutorial.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids, Digital Libraries and Persistent Archives Reagan.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
San Diego Supercomputer Center, University of California at San Diego Grid Physics Network (GriPhyN) University of Florida A Data Storage Language for.
San Diego Supercomputer CenterNational Partnership for Advanced Computational Infrastructure1 Grid Based Solutions for Distributed Data Management Reagan.
SAN DIEGO SUPERCOMPUTER CENTER Choonhan Youn Viswanath Nandigam, Nancy Wilkins-Diehr, Chaitan Baru San Diego Supercomputer Center, University of California,
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
A Very Brief Introduction to iRODS
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
Applying Data Grids to Support Distributed Data Management Storage Resource Broker Reagan W. Moore Ian Fisk Bing Zhu University of California, San Diego.
Jean-Yves Nief, CC-IN2P3 Wilko Kroeger, SCCS/SLAC Adil Hasan, CCLRC/RAL HEPiX, SLAC October 11th – 13th, 2005 BaBar data distribution using the Storage.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
SAN DIEGO SUPERCOMPUTER CENTER Developing a CUAHSI HIS Data Node, as part of Cyberinfrastructure for the Hydrologic Sciences David Valentine Ilya Zaslavsky.
On Developing Data Grid Workflows using Storage Resource Broker (SRB) and Kepler Tim H. Wong - UC Davis Efrat Frank - SDSC Dr. Bertram Ludäscher - UC Davis.
By: Roman Olschanowsky An Introduction to the.
Core SRB Technology for 2005 NCOIC Workshop By Michael Wan And Wayne Schroeder SDSC SDSC/UCSD/NPACI.
San Diego Supercomputer Center SDSC Storage Resource Broker SRB as data grid solution (Chinese version) Arun Jagatheesan San Diego Supercomputer.
San Diego Supercomputer Center Grid Physics Network (GriPhyN) University of Florida Dataflows in SRB using SDSC Matrix Arun Jagatheesan Architect & Team.
Data R&D Issues for GTL Data and Knowledge Systems San Diego Supercomputer Center University of California, San Diego Bertram Ludäscher
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
San Diego Supercomputer Center SDSC Storage Resource Broker Data Grid Automation Arun Jagatheesan et al., San Diego Supercomputer Center University of.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for Advanced.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Grid Services/SRB/SRM & Practical Hai-Ning Wu Academia Sinica Grid Computing.
Grid tool integration within the eMinerals project Mark Calleja.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure SRB + Web Services = Datagrid Management System (DGMS) Arcot.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Archive for the NSDL Reagan W. Moore Charlie Cowart.
San Diego Supercomputer Center Grid Physics Network (GriPhyN) University of Florida DGL: The Assembly Language for Grid Computing Arun swaran Jagatheesan.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Wong11/29/ SHARING DATA USING THE STORAGE RESOURCE BROKER (SRB) Ken Wong The Applied Research Laboratory (ARL) and The Department of Computer Science.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation NEES TeraGrid.
Michael Doherty RAL UK e-Science AHM 2-4 September 2003 SRB in Action.
Center for Computational Visualization University of Texas, Austin Visualization and Graphics Research Group University of California, Davis Molecular.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for.
Capacity and Capability Computing using Legion Anand Natrajan ( ) The Legion Project, University of Virginia (
Biomedical Informatics Research Network The Storage Resource Broker & Integration with NMI Middleware Arcot Rajasekar, BIRN-CC SDSC October 9th 2002 BIRN.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
The Storage Resource Broker and.
1 GridFTP and SRB Guy Warner Training, Outreach and Education Team, Edinburgh e-Science.
The SMB Archive System: Data Backup Across the Web Kenneth R. Sharp Stanford Synchrotron Radiation Laboratory.
SAN DIEGO SUPERCOMPUTER CENTER Replication Policies for Federated Digital Repositories Robert H. McDonald Chronopolis Project Manager
HPC University Requirements Analysis Team Training Analysis Summary Meeting at PSC September Mary Ann Leung, Ph.D.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Introduction to iRODS Jean-Yves Nief.
Data services on the NGS
Recap: introduction to e-science
Introduction to Apache
VORB Virtual Object Ring Buffers
Presentation transcript:

SAN DIEGO SUPERCOMPUTER CENTER By: Roman Olschanowsky An Introduction to the

SAN DIEGO SUPERCOMPUTER CENTER Outline SDSC and History of SRB Example Project Introduction to SRB Discussion on SRB basics SRB Clients Overview of a Data Grid Infrastructure Topology

SAN DIEGO SUPERCOMPUTER CENTER Archival Systems 6 PB 10.4 TF DataStar IBM Power4 4.4 TF TeraGrid Linux Cluster (IA64) 600 TB Storage Area Network Disk Sun F15K Disk Server Networking Visualization Storage and Compute Resources Human infrastructure: Experienced multi- disciplinary staff support a broad spectrum of national science, engineering and technology projects Blue Gene/L (Due 12/04) 2.8/5.7 TF

SAN DIEGO SUPERCOMPUTER CENTER Sites Using the SRB

SAN DIEGO SUPERCOMPUTER CENTER SDSC SRB Projects (60 million,.5 PB ) Digital Libraries UCB, Umich, UCSB, Stanford,CDL NSF NSDL - UCAR / DLESE NASA Information Power Grid Astronomy National Virtual Observatory 2MASS Project (2 Micron All Sky Survey) Particle Physics Particle Physics Data Grid (DOE) GriPhyN SLAC Synchrotron Data Repository Medicine Digital Embryo (NLM) Earth Systems Sciences ESIPS LTER Persistent Archives NARA LOC Neuro Science & Molecular Science TeleScience/NCMIR, BIRN SLAC, AfCS, …

SAN DIEGO SUPERCOMPUTER CENTER The SCEC Project Southern California Earthquake Center 400 people, the best earthquake seismologists in the country (33 states) and several from abroad (9 countries). (Sep SCEC AHM attendees) Simulating a 7.7 earthquake in the L.A. basin 10 year effort 100+ TB of input data ( soil conditions, topography, grid coordinates, etc… ) 240 procs on SDSC Datastar cluster, 5 days, 1 TB RAM, 2GB/sec IO Thanks! SDSC, scientific applications group, with porting the code; parallelizing the calculation and the IO; and generalizing the code for scaling up to a large run. Offered invaluable insights regarding IO management. SRB, took care of draining the GPFS cache regularly, moving 43 TB of data safely to archive storage. That task was completed a mere 36 hours after the end of the calculation. The SRB was critical in this achievement.

SAN DIEGO SUPERCOMPUTER CENTER SDSC & SRB Example

SAN DIEGO SUPERCOMPUTER CENTER Storage Resource Broker (SRB) A distributed file system (Data Grid) Client-Server, Server-Server architecture. Abstracts physical SRB provides the ability to transparently share data across remote sites. Heterogeneous Resources Single sign on Single logical file hierarchy

SAN DIEGO SUPERCOMPUTER CENTER What we are familiar with

SAN DIEGO SUPERCOMPUTER CENTER What we are not familiar with, yet

SAN DIEGO SUPERCOMPUTER CENTER How do the file systems differ? Logical Abstraction Folders are NOT physical Files do NOT inherit physical location Everything is potentially distributed Access Control Permissions are NOT rwxrwxrwx Permissions ARE on a object by object basis Groups and permissions ARE more similar to NTFS Domains Geographical / logical grouping of users Namespace scalability: Also doubles as groups

SAN DIEGO SUPERCOMPUTER CENTER Interfaces to the Storage Resource Broker inQ– Windows Client Scommands– UNIX, DOS Command line Client Jargon– Java API and GUI components mySRB– Web Client Matrix– WSDL, Data Grid Workflows C, C++– C and C++ API Python– Python API Perl– Perl API

SAN DIEGO SUPERCOMPUTER CENTER Common Scommands (69 total) Sinit Senv Spwd Sls Scd Sget Sput Ssh Scp Smv (logical) Sphymove (physical) Srm Smkdir Srmdir Serror Schmod Sexit

SAN DIEGO SUPERCOMPUTER CENTER mySRB

SAN DIEGO SUPERCOMPUTER CENTER BIRN Portal (perl based)

SAN DIEGO SUPERCOMPUTER CENTER NEEScentral Portal (php based)

SAN DIEGO SUPERCOMPUTER CENTER Biomedical Informatics Research Network (BIRN) Major collaboration with SDSC, several of the projects’ Co-Investigators and Co-PIs are at SDSC. BIRN’s purpose is to provide it’s consortium of neuroscience laboratories the ability to share, compute, and collaborate. The Storage Resource Broker provides the ability to transparently share data across remote sites.

SAN DIEGO SUPERCOMPUTER CENTER The BIRN SRB Data Grid

SAN DIEGO SUPERCOMPUTER CENTER Doing this “Manually”

SAN DIEGO SUPERCOMPUTER CENTER The BIRN Data Grid

SAN DIEGO SUPERCOMPUTER CENTER The grid is in the details

SAN DIEGO SUPERCOMPUTER CENTER File Replication Sls /home/Demo/SRB-Tutorial/files-2: Doc.txt Sls -l /home/Demo/SRB-Tutorial/files-2: romanoly 0 z-ucsd-ncmir-nas Doc.txt romanoly 1 z-jhu-cis-nas Doc.txt romanoly 2 z-stanford-lucas-nas Doc.txt romanoly 3 z-umn-cmrr-nas Doc.txt romanoly 4 z-uci-bic-nas Doc.txt

SAN DIEGO SUPERCOMPUTER CENTER SRB “Location” or “Slave Server” SRB “Location” “Physical Resources” z-jhu-cis-nas0 “jhu-cis-nas” DRDR z-jhu-cis-nas1 z-jhu-cis-nas2 “Logical Resource”

SAN DIEGO SUPERCOMPUTER CENTER Pooling physical resources 0.7 TB 5.2 TB 0 TB 1.6 TB 0.8 TB 3.2 TB 0.8 TB 2.4 TB 0.8 TB 2.4 TB 1.6 TB 0.8 TB 5.0 TB 0.78 TB 0.08 TB

SAN DIEGO SUPERCOMPUTER CENTER Logical / Compound Resources SRB “My-Resource” “instant replication” “fast archival” “resource pooling”

SAN DIEGO SUPERCOMPUTER CENTER Logical Resources

SAN DIEGO SUPERCOMPUTER CENTER Thanks! SRB handles large data and provides the ability to share and collaborate on distributed heterogeneous resources. Questions?