The LIGO Scientific Collaboration Data Grid Client/Server Environment Junwei Cao MIT LIGO Laboratory For the LIGO Scientific Collaboration.

Slides:



Advertisements
Similar presentations
Test harness and reporting framework Shava Smallen San Diego Supercomputer Center Grid Performance Workshop 6/22/05.
Advertisements

11/12/2003LIGO Document G Z1 Data reduction for S3 I Leonor (UOregon), P Charlton (CIT), S Anderson (CIT), K Bayer (MIT), M Foster (PSU), S Grunewald.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
GRID workload management system and CMS fall production Massimo Sgaravatto INFN Padova.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
John Kewley e-Science Centre GIS and Grid Computing Workshop 13 th September 2005, Leeds Grid Middleware and GROWL John Kewley
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
GRID Workload Management System Massimo Sgaravatto INFN Padova.
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
Workload Management Massimo Sgaravatto INFN Padova.
LIGO- GXXXXXX-XX-X Advanced LIGO Data & Computing Material for the breakout session NSF review of the Advanced LIGO proposal Albert Lazzarini Caltech,
Minerva Infrastructure Meeting – October 04, 2011.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking.
Zach Miller Computer Sciences Department University of Wisconsin-Madison What’s New in Condor.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
LIGO-G E ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech Argonne, May 10, 2004.
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
G D LSC ITR2003, ANL, LIGO Scientific Collaboration ITR 2003 Advisory Committee Meeting K. Blackburn 1, P. Brady 2, S. Finn 3, E.
April 19, 2005Scott Koranda1 The LIGO Scientific Collaboration Data Grid Scott Koranda UW-Milwaukee On behalf of the LIGO Scientific Collaboration.
LSC Segment Database Duncan Brown Caltech LIGO-G Z.
LIGO-G9900XX-00-M LIGO and GriPhyN Carl Kesselman Ewa Deelman Sachin Chouksey Roy Williams Peter Shawhan Albert Lazzarini Kent Blackburn CaltechUSC/ISI.
National Center for Supercomputing Applications The Computational Chemistry Grid: Production Cyberinfrastructure for Computational Chemistry PI: John Connolly.
10/20/05 LIGO Scientific Collaboration 1 LIGO Data Grid: Making it Go Scott Koranda University of Wisconsin-Milwaukee.
HEP Experiment Integration within GriPhyN/PPDG/iVDGL Rick Cavanaugh University of Florida DataTAG/WP4 Meeting 23 May, 2002.
Patrick R Brady University of Wisconsin-Milwaukee
G Z LIGO Scientific Collaboration Grid Patrick Brady University of Wisconsin-Milwaukee LIGO Scientific Collaboration.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
Update on the LIGO Data Analysis System LIGO Scientific Collaboration Meeting LIGO Hanford Observatory August 19 th, 2002 Kent Blackburn Albert Lazzarini.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
TRASC Globus Application Launcher VPAC Development Team Sudarshan Ramachandran.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
1 The Roadmap to New Releases Todd Tannenbaum Department of Computer Sciences University of Wisconsin-Madison
LIGO-G9900XX-00-M ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech.
Rochester Institute of Technology Job Submission Andrew Pangborn & Myles Maxfield 10/19/2015Service Oriented Cyberinfrastructure Lab,
10/24/2015OSG at CANS1 Open Science Grid Ruth Pordes Fermilab
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
Review of Condor,SGE,LSF,PBS
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
LIGO-G E LIGO Scientific Collaboration Data Grid Status Albert Lazzarini Caltech LIGO Laboratory Trillium Steering Committee Meeting 20 May 2004.
LIGO-G Z LIGO at the start of continuous observation Prospects and Challenges Albert Lazzarini LIGO Scientific Collaboration Presentation at NSF.
Junwei Cao March LIGO Document ID: LIGO-G Computing Infrastructure for Gravitational Wave Data Analysis Junwei Cao.
Alain Roy Computer Sciences Department University of Wisconsin-Madison Condor & Middleware: NMI & VDT.
Jan 12, 2009LIGO-G Z1 DMT and NDS2 John Zweizig LIGO/Caltech Ligo PAC, Caltech, Jan 12, 2009.
Contact: Junwei Cao SC2005, Seattle, WA, November 12-18, 2005 The authors gratefully acknowledge the support of the United States National.
LIGO Plans for OSG J. Kent Blackburn LIGO Laboratory California Institute of Technology Open Science Grid Technical Meeting UCSD December 15-17, 2004.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
State of LSC Data Analysis and Software LSC Meeting LIGO Hanford Observatory November 11 th, 2003 Kent Blackburn, Stuart Anderson, Albert Lazzarini LIGO.
Scott Koranda, UWM & NCSA 14 January 2016www.griphyn.org Lightweight Data Replicator Scott Koranda University of Wisconsin-Milwaukee & National Center.
11/12/2003LIGO-G Z1 Data reduction for S3 P Charlton (CIT), I Leonor (UOregon), S Anderson (CIT), K Bayer (MIT), M Foster (PSU), S Grunewald (AEI),
May 29, 2006 GWADW, Elba, May 27 - June 21 LIGO-G0200XX-00-M Data Quality Monitoring at LIGO John Zweizig LIGO / Caltech.
Grid Compute Resources and Job Management. 2 Grid middleware - “glues” all pieces together Offers services that couple users with remote resources through.
LIGO-G W Use of Condor by the LIGO Scientific Collaboration Gregory Mendell, LIGO Hanford Observatory On behalf of the LIGO Scientific Collaboration.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
LIGO-G Z1 Using Condor for Large Scale Data Analysis within the LIGO Scientific Collaboration Duncan Brown California Institute of Technology.
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
PX444 – The Distant Universe Lecture 13 The History of Star Formation.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
HTCondor-CE. 2 The Open Science Grid OSG is a consortium of software, service and resource providers and researchers, from universities, national laboratories.
Gravitational Wave Data Analysis  GW detectors  Signal processing: preparation  Noise spectral density  Matched filtering  Probability and statistics.
Workload Management Workpackage
Gregory Mendell LIGO Hanford Observatory
Wide Area Workload Management Work Package DATAGRID project
Software Implementation
Presentation transcript:

The LIGO Scientific Collaboration Data Grid Client/Server Environment Junwei Cao MIT LIGO Laboratory For the LIGO Scientific Collaboration GridWorld 2006, Washington D.C., U.S.A.

Junwei Cao, GridWorld 2006, September Outline Gravitational Waves LIGO LSC LIGO Data LSC Data Grid (LDG) The LDG Client/Server Environment Case Study I – LDM Case Study II – LDGreport The Open Science Grid (OSG) Summary

Junwei Cao, GridWorld 2006, September Gravitational Waves Gravitational waves are prediction of Einstein’s General Theory of Relativity They are emitted when matter moves or changes its configuration They distort space itself: stretching one direction and squeezing the perpendicular in the first half period and vice versa in the second half A new window to observe the universe

Junwei Cao, GridWorld 2006, September LIGO & LSC LIGO: Laser Interferometer Gravitational wave Observatory Aiming at first direct detection of gravitational waves Led by Caltech and MIT, and Funded by NSF LSC: the LIGO Scientific Collaboration with over 500 scientists from over 40 institutes worldwide LIGO Hanford ObservatoryLIGO Livingston Observatory

Junwei Cao, GridWorld 2006, September International Network of Detectors LIGO Simultaneously detect signal (within msec) GEO Virgo TAMA AIGO

Junwei Cao, GridWorld 2006, September LIGO Data Continuous time series (1Hz, 128Hz … 16kHz), all written in an internationally agreed data format: frames Data are comprised of: »Gravitational Wave channel: 96 KB/s »Physical Environment Monitors (seismometers, microphones etc): ~1.2 MB/s »Internal Engineering Monitors (sensing, housekeeping, status etc): ~7.2 MB/s Multiple data products beyond raw data 1TB/day in 24/7 operation

Junwei Cao, GridWorld 2006, September The LSC Data Grid (LDG) AEI/Golm  Cardiff Birmingham

Junwei Cao, GridWorld 2006, September The LDG Software Stack Applications Infrastructures End users & applications Application enabling Middleware / Services Operating Systems and … FC4GCCAutotoolsPythonMySQL Job scheduling / Condor Condor-G Data transfer / GridFTP Worklfow management / Condor DAGman Grid security / Globus GSIResource management / Globus GRAM Resource location service / GlobusCatalog service / GlobusInformation service / Globus MDS VDS LSCcertUtils LSC Security management LSCdataFindLSCsegFindLDR LSC Data management LDM GlueOnasys LSC Job management LDASDMTLALApps LSC CA VOMS The LSC Data Grid Client/Server Environment Version 4.0 (based on VDT 1.3.9) Matlab LDGreport

Junwei Cao, GridWorld 2006, September The LDG 4.0 Client/Server Written in Pacman 3 Based on VDT Support LDG:Client and LDG:ClientPro Support multiple platforms: FC4, Solaris and Darwin at client and only FC4 at server Support both 32bit and 64bit machines Server includes client Online documentation for step-by-step installation …… platformGE( 'Linux' ); package( 'Client-Environment' ); cd( 'vdt' ); package( 'VDT_CACHE:Globus-Client' ); package( 'VDT_CACHE:CA-Certificates' ); package( 'VDT_CACHE:Condor' ); package( 'VDT_CACHE:Fault-Tolerant-Shell' ); package( 'VDT_CACHE:GSIOpenSSH' ); package( 'VDT_CACHE:KX509' ); package( 'VDT_CACHE:MyProxy' ); package( 'VDT_CACHE:PyGlobus' ); package( 'VDT_CACHE:PyGlobusURLCopy' ); package( 'VDT_CACHE:UberFTP' ); package( 'VDT_CACHE:EDG-Make-Gridmap' ); package( 'VDT_CACHE:Globus-RLS-Client' ); package( 'VDT_CACHE:VDS' ); package( 'VDT_CACHE:VOMS-Client' ); cd(); package( 'Client-FixSSH' ); package( 'Client-RLS-Python-Client' ); package( 'Client-Cert-Util' ); package( 'Client-LSC-CA' ); …… OR platformGE( 'Sun' ); package( 'SolarisPro' ); OR platformGE( 'MacOS' ); package( 'Mac' );

Junwei Cao, GridWorld 2006, September Data Monitoring Toolkit (DMT) SMP DMT Online Use Scenario – control-room type DMT Offline Use Scenario – standalone or grid enabled DMT Monitors Single data stream Name Server /data/node10/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node11/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node12/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node13/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node14/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node15/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node16/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node10/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node11/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node12/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node13/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node14/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node15/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node16/frame/S3/L3/LLO/L-RDS_R_L gwf Stdout Trigger files Alarm files Trend files …… Multiple data streams client lsmp lmsg Server gui base container sigp xml html xsilezcalib dmtenvevent trig …… MonServer frameio …… DMT Libraries

Junwei Cao, GridWorld 2006, September An Example DMT Monitor /data/node10/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node11/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node12/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node13/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node14/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node15/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node16/frame/S3/L3/LHO/H-RDS_R_L gwf /data/node10/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node11/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node12/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node13/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node14/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node15/frame/S3/L3/LLO/L-RDS_R_L gwf /data/node16/frame/S3/L3/LLO/L-RDS_R_L gwf rmon filelist1.txt filelist2.txt multilist.txt stride 16.0 channel_1 H1:LSC-AS_Q channel_2 L1:LSC-AS_Q opt filelist1.txt filelist2.txt rmon]$ export LD_LIBRARY_PATH=/opt/lscsoft/dol/lib rmon]$./rmon -opt opt -inlists multilist.txt Processing multi list file: multilist.txt Number of lists added: 2 Total data streams: 2 Processing frame list file: /home/jcao/rmon/filelist1.txt Number of files added: 1188 Total frame files: 1188 Processing frame list file: /home/jcao/rmon/filelist2.txt Number of files added: 1188 Total frame files: 1188 channel[1]=H1:LSC-AS_Q channel[2]=L1:LSC-AS_Q startgps= stride=16 r-statistic= startgps= stride=16 r-statistic= startgps= stride=16 r-statistic= …… standalone run of rmon DMT offline monitor

Junwei Cao, GridWorld 2006, September The LDM Modules and Flowchart QUEUED SCHEDULED LOCATING LOCATED RUNNING FINISHED ldm_agent ldm_locate_script ldm_exec_script ldm_submit RELEASED ldm_rm ldm_q REJECTED LSCdataFind condor_submit LDM Other tools client server LSCdataFind Server Globus Job Manager ldm_agent Condor [AGENT] RESOURCES = SITES = /home/jcao/ldm/etc/LDM_SITES EXEC = /home/jcao/ldm/bin/ldm_exec_script LOCATE = /home/jcao/ldm/bin/ldm_locate_script PID = /home/jcao/ldm/var/ldm.pid LOG = /home/jcao/ldm/var/ldm.log LDG = /home/jcao/ldg-3.0/ [MIT] lscdatafindserver = ldas-gridmon.mit.edu globusscheduler = ldas-grid.mit.edu/jobmanager-condor environment = LD_LIBRARY_PATH=/dso-test/home/jcao/dol/lib [CIT] lscdatafindserver = ldas-gridmon.ligo.caltech.edu globusscheduler = ldas-grid.ligo.caltech.edu/jobmanager-condor environment = LD_LIBRARY_PATH=/dso-test/jcao/dol/lib [LHO] lscdatafindserver = ldas-gridmon.ligo-wa.caltech.edu globusscheduler = ldas-grid.ligo-wa.caltech.edu/jobmanager-condor environment = LD_LIBRARY_PATH=/dso-test/jcao/dol/lib [LLO] lscdatafindserver = ldas-gridmon.ligo-la.caltech.edu globusscheduler = ldas-grid.ligo-la.caltech.edu/jobmanager-condor environment = LD_LIBRARY_PATH=/data2/jcao/dol/lib LDM_CONFIG LDM_SITES Modules developed or deployed Modules designed and underdeveloped condor_master

Junwei Cao, GridWorld 2006, September Users are interfaced with a LIGO friendly language. LIGO Data Monitoring Using LDM [job] id = test monitor = rmon args = -opt opt input = opt [data] observatory = type = start = end = ldm.sub ~]$ cd ldm ldm]$ source setup.sh ldm]$ cd../rmon rmon]$ ldm_agent rmon]$ ldm_submit ldm.sub Job test has been submitted. rmon]$ more ldm_test_condor.out Processing multi list file: ldm_test_CIT_multilist.txt Number of lists added: 2 Total data streams: 2 …… startgps= stride=16 r-statistic= …… grid-enabled run of rmon DMT offline monitor using LDM universe = globus globusscheduler = ldas-grid.ligo.caltech.edu/jobmanager-condor log = ldm_test_condor.log output = ldm_test_condor.out error = ldm_test_condor.err should_transfer_files = YES when_to_transfer_output = ON_EXIT transfer_input_files = ldm_test_CIT_multilist.txt, ldm_test_CIT_filelist1.txt, ldm_test_CIT_filelist2.txt, /home/jcao/rmon/opt arguments = -inlists ldm_test_CIT_multilist.txt -opt opt environment = LD_LIBRARY_PATH=/dso-test/jcao/dol/lib executable = /home/jcao/rmon/rmon Queue automatically generated Condor submission file Users do not bother with technical details of LSC data grid services. Data are located and file lists are generated automatically

Junwei Cao, GridWorld 2006, September LDGreport A command line tool to generate LDG cluster usage statistics Integrated with LDG cluster and LSC user information Using globusrun to call remote Condor utilities and retrieve user-based usage information Automatic chart generation ########################### ## LDG user information ## LSC working groups enums: ## 0 unknown ## 1 burst ## 2 inspiral ## 3 CW ## 4 stochastic ## 5 other(sysadmin/grid3/osg/...) ############################ [jcao] name = Junwei Cao affli = MIT group = 1 [jzweizig] name = John Zweizig affli = CIT group = 1 … user.conf [MIT] address = ldas-grid.mit.edu scheduler = condor starttime = , endtime = ,present nodenumber = 105,80 cpupernode = 1,2 cpuhz = 2,3 uptime = 0.962,0.962 [CIT] …… cluser.conf

Junwei Cao, GridWorld 2006, September The Open Science Grid Grid of Grids 20 thousand CPUs Petabytes of data storage High energy physics Bioinformatics Astroinfomatics ……

Junwei Cao, GridWorld 2006, September Summary The LSC Data Grid: A production level grid environment with professional maintenance and supports Over 10 grid sites in the USA and Europe; Thousands of CPUs and terabytes of data storage; Hundreds of users Producing real scientific computing results Future work: Enabling LIGO applications on LDG/OSG LDG/OSG Compatibility References: