PPDGLHC Computing ReviewNovember 15, 2000 PPDG The Particle Physics Data Grid Making today’s Grid software work for HENP experiments, Driving GRID science.

Slides:



Advertisements
Similar presentations
Jens G Jensen Atlas Petabyte store Supporting Multiple Interfaces to Mass Storage Providing Tape and Mass Storage to Diverse Scientific Communities.
Advertisements

Peter Berrisford RAL – Data Management Group SRB Services.
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
Foundations for an LHC Data Grid Stu Loken Berkeley Lab.
Aug Arie Shoshani Particle Physics Data Grid Request Management working group.
EU-GRID Work Program Massimo Sgaravatto – INFN Padova Cristina Vistoli – INFN Cnaf as INFN members of the EU-GRID technical team.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
GRID DATA MANAGEMENT PILOT (GDMP) Asad Samar (Caltech) ACAT 2000, Fermilab October , 2000.
Data Grids for Next Generation Experiments Harvey B Newman California Institute of Technology ACAT2000 Fermilab, October 19, 2000
Grid Collector: Enabling File-Transparent Object Access For Analysis Wei-Ming Zhang Kent State University John Wu, Alex Sim, Junmin Gu and Arie Shoshani.
The LHC Computing Grid Project Tomi Kauppi Timo Larjo.
Simo Niskala Teemu Pasanen
SAMGrid – A fully functional computing grid based on standard technologies Igor Terekhov for the JIM team FNAL/CD/CCF.
DataGrid Middleware: Enabling Big Science on Big Data One of the most demanding and important challenges that we face as we attempt to construct the distributed.
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
D0 SAM – status and needs Plagarized from: D0 Experiment SAM Project Fermilab Computing Division.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
1 Use of SRMs in Earth System Grid Arie Shoshani Alex Sim Lawrence Berkeley National Laboratory.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL U.S. ATLAS Physics and Computing Advisory Panel Review Argonne National Laboratory Oct 30, 2001.
File and Object Replication in Data Grids Chin-Yi Tsai.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Tier 1 Facility Status and Current Activities Rich Baker Brookhaven National Laboratory NSF/DOE Review of ATLAS Computing June 20, 2002.
PPDG and ATLAS Particle Physics Data Grid Ed May - ANL ATLAS Software Week LBNL May 12, 2000.
SAM and D0 Grid Computing Igor Terekhov, FNAL/CD.
Ruth Pordes, Fermilab CD, and A PPDG Coordinator Some Aspects of The Particle Physics Data Grid Collaboratory Pilot (PPDG) and The Grid Physics Network.
August 26, 1999: MONARC Regional Reps Meeting Harvey Newman (CIT) MONARC Second Regional Centre Representatives Meeting Harvey B. Newman (Caltech) CERN.
1 Grid Related Activities at Caltech Koen Holtman Caltech/CMS PPDG meeting, Argonne July 13-14, 2000.
Data Grid projects in HENP R. Pordes, Fermilab Many HENP projects are working on the infrastructure for global distributed simulated data production, data.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Major Grid Computing Initatives Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer Science The.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
NOVA Networked Object-based EnVironment for Analysis P. Nevski, A. Vaniachine, T. Wenaus NOVA is a project to develop distributed object oriented physics.
D0RACE: Testbed Session Lee Lueking D0 Remote Analysis Workshop February 12, 2002.
Operated by the Southeastern Universities Research Association for the U.S. Depart. Of Energy Thomas Jefferson National Accelerator Facility Andy Kowalski.
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
HEP-CCC Meeting, November 1999Grid Computing for HEP L. E. Price, ANL Grid Computing for HEP L. E. Price Argonne National Laboratory HEP-CCC Meeting CERN,
1 DØ Grid PP Plans – SAM, Grid, Ceiling Wax and Things Iain Bertram Lancaster University Monday 5 November 2001.
The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
Internet 2 Workshop (Nov. 1, 2000)Paul Avery (The GriPhyN Project)1 The GriPhyN Project (Grid Physics Network) Paul Avery University of Florida
PPDG update l We want to join PPDG l They want PHENIX to join NSF also wants this l Issue is to identify our goals/projects Ingredients: What we need/want.
Computing Sciences Directorate, L B N L 1 CHEP 2003 Standards For Storage Resource Management BOF Co-Chair: Arie Shoshani * Co-Chair: Peter Kunszt ** *
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
The Particle Physics Data Grid Collaboratory Pilot Richard P. Mount For the PPDG Collaboration DOE SciDAC PI Meeting January 15, 2002.
May 6, 2002Earth System Grid - Williams The Earth System Grid Presented by Dean N. Williams PI’s: Ian Foster (ANL); Don Middleton (NCAR); and Dean Williams.
DoE NGI Program PI Meeting, October 1999Particle Physics Data Grid Richard P. Mount, SLAC Particle Physics Data Grid Richard P. Mount SLAC Grid Workshop.
1 P.Kunszt LCGP Data Management on the GRID Peter Z. Kunszt CERN Database Group EU DataGrid – Data Management.
STAR Collaboration, July 2004 Grid Collector Wei-Ming Zhang Kent State University John Wu, Alex Sim, Junmin Gu and Arie Shoshani Lawrence Berkeley National.
US Grid Efforts Lee Lueking D0 Remote Analysis Workshop February 12, 2002.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Open Science Grid & its Security Technical Group ESCC22 Jul 2004 Bob Cowles
SLAC Status, Les CottrellESnet International Meeting, Kyoto July 24-25, 2000 SLAC Update Les Cottrell & Richard Mount July 24, 2000.
Biomedical Informatics Research Network The Storage Resource Broker & Integration with NMI Middleware Arcot Rajasekar, BIRN-CC SDSC October 9th 2002 BIRN.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Magda Distributed Data Manager Prototype Torre Wenaus BNL September 2001.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL DOE/NSF Review of US LHC Software and Computing Fermilab Nov 29, 2001.
D0 File Replication PPDG SLAC File replication workshop 9/20/00 Vicky White.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
DOE/NSF Quarterly review January 1999 Particle Physics Data Grid Applications David Malon Argonne National Laboratory
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
1 Efficient Data Access for Distributed Computing at RHIC A. Vaniachine Efficient Data Access for Distributed Computing at RHIC A. Vaniachine Lawrence.
1 Scientific Data Management Group LBNL SRM related demos SC 2002 DemosDemos Robust File Replication of Massive Datasets on the Grid GridFTP-HPSS access.
1 Particle Physics Data Grid (PPDG) project Les Cottrell – SLAC Presented at the NGI workshop, Berkeley, 7/21/99.
Magda Distributed Data Manager Torre Wenaus BNL October 2001.
Hall D Computing Facilities Ian Bird 16 March 2001.
Patrick Dreher Research Scientist & Associate Director
Presentation transcript:

PPDGLHC Computing ReviewNovember 15, 2000 PPDG The Particle Physics Data Grid Making today’s Grid software work for HENP experiments, Driving GRID science and technology. ( Richard P. Mount November 15, 2000

PPDGLHC Computing ReviewNovember 15, 2000 PPDG Who is involved? How is it funded? What has it achieved? How does it fit in to the big Grid picture? How is it relevant for LHC?

PPDGLHC Computing ReviewNovember 15, 2000 PPDG Collaborators

PPDGLHC Computing ReviewNovember 15, 2000 PPDG Collaborators Particle Accelerator Computer Physics Laboratory Science ANLXX LBNL XX BNLXXx CaltechXX FermilabXXx Jefferson LabXXx SLACXXx SDSCX WisconsinX

PPDGLHC Computing ReviewNovember 15, 2000 PPDG BaBar D0 CDF Nuclear Physics CMSAtlas Globus Users SRB Users Condor Users STAR BaBar Data Management CMS Data Management Nuclear Physics Data Management D0 Data Management CDF Data Management Atlas Data Management Globus Team Condor SRB Team STACS PPDG: A Coordination Challenge

PPDGLHC Computing ReviewNovember 15, 2000 PPDG Funding FY 1999: –PPDG NGI Project approved with $1.2M ($2M requested) from DoE Next Generation Internet program. FY 2000 –DoE NGI program not funded –$1.2M funded by DoE/OASCR/MICS ($470k) and HENP ($770k) FY –Proposal (to be written) for DoE/OASCR/MICS and HENP funding in SciDAC context. Likely total FY2001 request: ~$3M.

PPDGLHC Computing ReviewNovember 15, 2000 Initial PPDG Goals Implement and Run two services in support of the major physics experiments at BNL, Fermilab, JLAB, SLAC: –“High-Speed Site-to-Site File Replication Service”; Data replication up to 100 Mbytes/s –“Multi-Site Cached File Access Service”: Based on deployment of file-cataloging, and transparent cache-management and data movement middleware Using middleware components already developed by the collaborators.

PPDGLHC Computing ReviewNovember 15, 2000 PPDG Site-to-Site Replication Service SECONDARY SITE CPU, Disk, Tape Robot PRIMARY SITE Data Acquisition, CPU, Disk, Tape Robot

PPDGLHC Computing ReviewNovember 15, 2000 Progress: 100 Mbytes/s Site-to-Site Focus on SLAC – Caltech over NTON at OC48 (2.5 gigabits/s); Fibers in place; SLAC Cisco with OC48 and 2 × OC12 in place; Caltech Juniper M160 with OC48 installed; 990 Mbits/s achieved between SC2000 and SLAC.

PPDGLHC Computing ReviewNovember 15, 2000 Throughput from SC2000 to SLAC Up to 990 Mbits/s using two machines at each end plus multi-stream TCP with large windows

PPDGLHC Computing ReviewNovember 15, 2000 PPDG Multi-site Cached File Access System University CPU, Disk, Users PRIMARY SITE Data Acquisition, Tape, CPU, Disk, Robot Satellite Site Tape, CPU, Disk, Robot Satellite Site Tape, CPU, Disk, Robot University CPU, Disk, Users University Users Satellite Site Tape, CPU, Disk, Robot

PPDGLHC Computing ReviewNovember 15, 2000 PPDG Cached File Access Progress Demonstration of multi-site cached file access based mainly on SRB *. (LBNL, ANL, U.Wisconsin) Development of HRM storage management interface and implementation in SRB and SAM (D0 data management) * Storage Resource Broker (SDSC)

PPDGLHC Computing ReviewNovember 15, 2000 Test of PPDG Storage Management API (HRM) 2 separate Clients request and get files from: –SRB catalog and HPSS – LBL and Wisconsin –D0 SAM catalog, disk cache and Enstore storage system – Fermilab and Wisconsin. Demo’d at SC2000. Agreed on common Storage Resource Management interface. Next step – Client that requests and gets files from each/both storage management systems – goal to meet the PPDG “multi-site file caching file access” across 2 existing grid components.

PPDGLHC Computing ReviewNovember 15, 2000 PPDG: Initial Architecture

PPDGLHC Computing ReviewNovember 15, 2000 Initial PPDG “System” Components Middleware Components (Initial Choice): See PPDG Proposal Page 15 Object and File-Based Objectivity/DB (SLAC enhanced) Application Services GC Query Object, Event Iterator, Query Monitor FNAL SAM System Resource ManagementStart with Human Intervention (but begin to deploy resource discovery & mgmnt tools) File Access Service Components of OOFS (SLAC) Cache ManagerGC Cache Manager (LBNL) Mass Storage ManagerHPSS, Enstore, OSM (Site-dependent) Matchmaking Service Condor (U. Wisconsin) File Replication Index MCAT (SDSC) Transfer Cost Estimation ServiceGlobus (ANL) File Fetching ServiceComponents of OOFS File Movers(s) SRB (SDSC); Site specific End-to-end Network ServicesGlobus tools for QoS reservation Security and authenticationGlobus (ANL)

PPDGLHC Computing ReviewNovember 15, 2000 Request Interpreter Storage Access service Request Manager Cache Manager Request to move files {file: from,to} logical request (property predicates / event set) Local Site Manager To Network File Access service Fig 1: Architecture for the general scenario - needed APIs files to be retrieved {file:events} Logical Index service Storage Reservation service Request to reserve space {cache_location: # bytes} Matchmaking Service File Replica Catalog GLOBUS Services Layer Remote Services Resource Planner Application (data request) Client (file request) Local Resource Manager Cache Manager Properties, Events, Files Index

PPDGLHC Computing ReviewNovember 15, 2000 Current PPDG Focus: File Replication Service Use cases from BaBar, D0, CMS, etc. Typical target: BaBar SLAC-Lyon transfers (current low-tech approach absorbs about 2 FTE). Replica catalog distinct from Objectivity catalogs; GRIDftp transfer. Globus inter-site security.

PPDGLHC Computing ReviewNovember 15, 2000 The Big Grid Picture QoS, Reservations High-throughput IP Reliable Object Transfer Modeling Prototypes  Products Deployment in Experiments Security/Authentication Technology Security/Authentication Architecture Matchmaking Resource Policy Resource Discovery User SupportTestbeds Cost/Feasibility Estimation Distributed Transaction Management Distributed Replica Catalog Worldwide Grid Project Coordination Software Configuration Control Derived-Object Definition Database Mobile Agents Grid Architecture and Interface Definition Error Tracing Instrumentation

PPDGLHC Computing ReviewNovember 15, 2000 The Big Grid Picture Grid projects must become coordinated (in progress); Progress in the commercial world must be exploited;

PPDGLHC Computing ReviewNovember 15, 2000 PPDG in the Big Grid Picture Rapid deployment of Grid software in support of HENP experiments; Drive and contribute to Grid architecture: –Architecture must define interfaces between evolving components; Design and develop new Grid middleware components (deliverables to be defined in consultation with GriPhyN, EU-DataGrid …): –Focus on rapid delivery to HENP experiments (to validate concepts, get feedback and be useful).

PPDGLHC Computing ReviewNovember 15, 2000 PPDG and LHC? BaBar Example SLAC CCIN2P3 RAL CASPUR PPDG-SLAC-IN2P3-BaBar plan to implement Grid components allowing SLAC + CCIN2P3 + … to become an (adequately) integrated data analysis resource. Delivery of useful service: scheduled for end 2001

PPDGLHC Computing ReviewNovember 15, 2000 PPDG and LHC US LHC groups are strong participants in PPDG; Computer scientists in PPDG see the LHC challenge as the leading opportunity to advance the science of data-intensive Grids; PPDG, GriPhyN and EU-DataGrid are creating coordinated management and joint working groups: –Interoperable systems with consistent components.