Peter Berrisford RAL – Data Management Group SRB Services.

Slides:



Advertisements
Similar presentations
National University Community Research Institute (NUCRI) NU Community Research Institute (NUCRI) HASTAC (higher education)/HASS grid National School Board.
Advertisements

National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Jens G Jensen CCLRC e-Science Single Sign-on to the Grid Federated Access and Integrated Identity Management.
GOC resilience John Gordon, STFC GridPP 22 microtalk.
Jens G Jensen Atlas Petabyte store Supporting Multiple Interfaces to Mass Storage Providing Tape and Mass Storage to Diverse Scientific Communities.
The Storage Resource Broker and.
The Storage Resource Broker and.
ICS 434 Advanced Database Systems
Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
San Diego Supercomputer CenterNational Partnership for Advanced Computational Infrastructure1 Grid Based Solutions for Distributed Data Management Reagan.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
VL-e PoC Introduction Maurice Bouwhuis VL-e work shop, April 7 th, 2006.
OxGrid, A Campus Grid for the University of Oxford Dr. David Wallom.
Applying Data Grids to Support Distributed Data Management Storage Resource Broker Reagan W. Moore Ian Fisk Bing Zhu University of California, San Diego.
Jean-Yves Nief, CC-IN2P3 Wilko Kroeger, SCCS/SLAC Adil Hasan, CCLRC/RAL HEPiX, SLAC October 11th – 13th, 2005 BaBar data distribution using the Storage.
Data Grid: GRASP Mike Smorul. Grid Retrieval and Search Platform Based on concepts developed in the Earth Science Data Interface (ESDI) developed at the.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Overview of SQL Server Alka Arora.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
Jan Storage Resource Broker Managing Distributed Data in a Grid A discussion of a paper published by a group of researchers at the San Diego Supercomputer.
Rule-Based Data Management Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar {moore, schroede, mwan, {moore, schroede, mwan,
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Grid Services/SRB/SRM & Practical Hai-Ning Wu Academia Sinica Grid Computing.
SURENDER SARA 10GAS Building Corporate KPI’s
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
December 1, 2005HDF & HDF-EOS Workshop IX P eter Cao, NCSA December 1, 2005 Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration.
Grid tool integration within the eMinerals project Mark Calleja.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
Mainframe (Host) - Communications - User Interface - Business Logic - DBMS - Operating System - Storage (DB Files) Terminal (Display/Keyboard) Terminal.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Archive for the NSDL Reagan W. Moore Charlie Cowart.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Building the e-Minerals Minigrid Rik Tyer, Lisa Blanshard, Kerstin Kleese (Data Management Group) Rob Allan, Andrew Richards (Grid Technology Group)
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Michael Doherty RAL UK e-Science AHM 2-4 September 2003 SRB in Action.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
E-Curator: A Web-based Curatorial Tool Ian Brown, Mona Hess Sally MacDonald, Francesca Millar Yean-Hoon Ong, Stuart Robson Graeme Were UCL Museums & Collections.
Biomedical Informatics Research Network The Storage Resource Broker & Integration with NMI Middleware Arcot Rajasekar, BIRN-CC SDSC October 9th 2002 BIRN.
SDSC Storage Resource Broker & Meta-data Catalog SRB Archives HPSS, ADSM, UniTree, DMF Databases DB2, Oracle, Sybase File Systems Unix, NT, Mac OSX Application.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
ViaSQL Technical Overview. Viaserv, Inc. 2 ViaSQL Support for S/390 n Originally a VSE product n OS/390 version released in 1999 n Identical features.
ATLAS Database Access Library Local Area LCG3D Meeting Fermilab, Batavia, USA October 21, 2004 Alexandre Vaniachine (ANL)
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
The Storage Resource Broker and.
1 GridFTP and SRB Guy Warner Training, Outreach and Education Team, Edinburgh e-Science.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 22 February 2006.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
1 eScience Grid Environments th May 2004 NESC - Edinburgh Deployment of Storage Resource Broker at CCLRC for E-science Projects Ananta Manandhar.
Building Preservation Environments from Federated Data Grids Reagan W. Moore San Diego Supercomputer Center Storage.
2nd year Computer Science & Engineer
The Client/Server Database Environment
Data services on the NGS
CSC 480 Software Engineering
Arcot Rajasekar Michael Wan Reagan Moore (sekar, mwan,
Presentation transcript:

Peter Berrisford RAL – Data Management Group SRB Services

Peter Berrisford RAL Introduction An Overview of SRB CCLRC and SRB Case Study: e-Minerals Mini-Grid SRB Production Services Questions

Peter Berrisford RAL Managing Data Historically data has been STORED rather than MANAGED Problems arising from this include: –Scaling –Distribution –Access Control, Authentication, Security –Data Migration –Data Curation

Peter Berrisford RAL What is SRB? Storage Resource Broker (SRB) is a software product developed by the San Diego Supercomputing Centre (SDSC). Allows users to access files and database objects across a distributed environment. Actual physical location and way the data is stored is abstracted from the user Allows the user to add user defined metadata describing the scientific content of the information

Peter Berrisford RAL How SRB Works MCAT Database MCAT Server SRB A Server SRB B Server SRB Client a b cd e f g 4 major components: –The Metadata Catalogue (MCAT) –The MCAT-Enabled SRB Server –The SRB Storage Server –The SRB Client

Peter Berrisford RAL The MCAT Database The MCAT database is a metadata repository that provides a mechanism for storing information used by the SRB system. Includes both –Internal system data required for running the system –Application (user) metadata regarding data sets being brokered by SRB.

Peter Berrisford RAL The MCAT Server At least one SRB Server must be installed on the node that can access the MCAT database. This is known as the MCAT-Enabled SRB Server. MCAT SRB Server works directly against the MCAT database to provide SRB Services All other SRB Servers interact through the MCAT Server

Peter Berrisford RAL The SRB Server The SRB Server is a middleware application that accepts requests from clients and obtains/queries/manages the necessary data sets. It queries the MCAT SRB Server to gather information on datasets and supplies this back to the SRB client.

Peter Berrisford RAL SRB Client Tools Provide a user interface to send requests to the SRB server. 4 main interfaces: –Command line (S-Commands) –MS Windows (InQ) –Web based (MySRB). –Java (JARGON) Web Services (MATRIX)

Peter Berrisford RAL

Peter Berrisford RAL Concepts Location: A physical node running an SRB Server Physical Resource: A storage area managed by an SRB Server Logical Resource: One or more Physical Resources – can be distributed Collection – Data abstraction of resources

Peter Berrisford RAL SRB in Detail SRB Archives ADS, HPSS, ADSM,DMF Databases DB2, Oracle, PostgreSQL File Systems Unix, NT, Mac OSX Application C, C++, Linux I/O Unix Shell Resource, User Defined Application Meta-data Remote Proxies DataCutter Third-party copy Java, NT Browsers Web Prolog Python MCAT HRM

Peter Berrisford RAL Administration Users / Locations / Resources must be managed Two methods for doing this: –Java MCAT Admin Tool –Command line tools

Peter Berrisford RAL CCLRC and SRB The Data Management Group in CCLRC started working with SRB in November 2002 after a fact finding mission to the USA. There was an immediate requirement for a storage based product that allowed the addition of searchable metadata Generated lots of internal interest, which led to a number of projects with SRB

Peter Berrisford RAL SRB Example: CMS Largest project using CCLRC SRB services to date is the CERN CMS experiment. SRB chosen for Pre-Challenge Production, producing data for Data Challenge 2003/2004 (DC03/DC04) Need to prove data can be transferred, replicated and stored at LHC rates DC04 provided key input to SRB Version 3.2

Peter Berrisford RAL SRB Case Study: e-Minerals UK e-science project for modelling the atomistic processes involved in environmental issues

Peter Berrisford RAL e-Minerals Requirements Data Management Requirements –Scientists want to store input and output files from simulations in different locations –manage their own files/data via the web –give access to other project members –give temporary access to others

Peter Berrisford RAL Architecture Daresbury App Server Cambridge SRB Resource Reading SRB Resource Bath SRB Resource Eminerals MiniGrid Daresbury Database server MCAT SRB Server Oracle Client MySRB Web Browser Application server runs SRB software Database server holds locations of files

Peter Berrisford RAL Building on Experience - New Services CCLRC SRB Service –Initial service availability: October –Proposed Customers include: ISIS Facility, British Atmospheric Data Centre (BADC), AHDS –ADS interface (with Containers) –Test systems in place NGS SRB Service –e-Minerals, e-Materials, Integrative Biology

Peter Berrisford RAL SRB Services SRB version 3.2 –Performance, scalability and reliability Ongoing Service Enhancements –Automatic failover –Product Documentation and Training - Collaboration with SDSC

Peter Berrisford RAL CCLRC Service DB-Instance-1DB-Instance-2 MES ADS ADS-SRB Multiple Servers SRB Server SRB Server SRB Storage Servers SRB Server SRB Server SRB Storage Servers SRB Server SRB Server SRB Storage Servers App MCAT Server Schema1Schema2Schema3Schema4 Oracle RAC Database Server MCAT Database Oracle Client App Web Server SRB Server SRB Server SRB Storage Servers

Peter Berrisford RAL MCAT: CCLRC Database Service MCAT requires professionally run database Two IBM x440 clusters, one based at Daresbury Laboratory and the other at Rutherford Appleton Laboratory. The clusters connect to their own 1TB RAID 5 storage arrays via a independent fibre channel Storage Area Networks (SAN). Run Oracle Real Application Clusters software on Redhat Advanced Server for high availability/scalability RDBMS CMS MCAT hosted by 2 nodes Can load balance

Peter Berrisford RAL Atlas Datastore (ADS) Available as an SRB Resource CCLRC have written a custom SRB driver for the ADS

Peter Berrisford RAL ADS Driver for SRB Implemented Storage System Driver Implement (most) of the 16 standard calls that implement the driver layer such as copy, move, delete and create. Some functions have no equivalent in ADS

Peter Berrisford RAL Summary Links established with SRB community and SDSC Real SRB projects implemented Creating new generation of SRB Production Systems Can help community with: –SRB Test Systems –SRB Production Systems –SRB Training and Support Contributing to future versions

Peter Berrisford RAL Questions ?