PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003.

PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003

Abstract The vision of science grids allocating resources to analyze huge quantities of HENP data clearly depends on reliable network performance. Tools developed at SLAC in conjunction with the Internet2 PIPES project will help to ensure this. In this talk, these tools will be discussed and the procedure for publishing performance data, in particular using the Globus toolkit's MDS and web services will be reviewed. The subsequent analysis and trouble-shooting methodology will be discussed with real world examples from the particle physics data grid (PPDG) and the European data grid (EDG).

Overview What is the problem ? What is PIPES ?
Network performance monitoring Problem identification

Network Monitoring for the Grid
The Data Grid consists of many components that must interoperate Farm requestor Data Farm Data The Network Data Farm requestor Resource Broker

Allocate Resources The resource broker must be fully informed
Measurement is required ! Farm requestor Data 12% pkt loss Farm Data The Network 80% Utilization OC48 Data Farm requestor Resource Broker

What is PIPES ? Internet2 End-to-end performance initiative
PI Performance Evaluation System (PIPES) PIPES Monitoring Platform (PMP) Overlap with goals of HENP Tremendous resources

IEPM-BW Package developed at SLAC Measurement Engine Job Manager
Iperf, bbftp, bbcp, ping, traceroute Abwe, owamp, udpmon, gridftp Job Manager Data Storage and data server Analysis Engine

LANL KEK EDG CERN TRIUMF NIKHEF NERSC FNAL IN2P3 ANL PPDG/GriPhyN CHI CERN SNV ESnet ORNL RAL JLAB NY UCL SLAC ORNL UManc SLAC Imperial JAnet DL BNL NNW APAN Stanford RIKEN Stanford INFN-Roma APAN INFN-Padua CalREN Geant INFN-Milan Abilene SEA CESnet NY NASA WASH SNV Monitoring Site SOX DNVR HSTN ATL CLV IPLS UTAH SDSC UFL CALTECH I2 UTDallas Rice UMich NCSA

BaBar Grid NNW Manchester 10 Gbps TVN 622Mbps RAL Janet ESnet SWERN
SLAC Bristol Geant Stanford DFN Dresden Calren Abilene 1 Gbps 2.5 Gbps Renater IN2P3

Problem Identification
Typical Scenario User complains file transfer is slow Net admin runs ping, traceroute, iperf test Complain to upstream provider Proactive What do we mean by throughput? How do we know there was a performance hit? Our approach is diurnal changes

Alarms Too much to keep track of Rather not wait for complaints
Automated Alarms Rolling average à la RIPE-TT May not be the best approach AMP Automated Detection System

Limitations Could be over an hour before alarm is generated
More frequent measurements impact the network and measurements overlap Low impact tools allow finer grained measurement Use NWS multi-variate method Use SCIDAC ABwE tool Use PingER, OWAMP

Publishing Many monitoring projects, publish data to allow them to inter-operate MDS EDG NM Schema Web Services GLUE NE Schema GGF NMWG Hierarchy Doc Tools Doc ./get_data

Net Rat Alarm System Informant database
Multiple tools Multiple measurement points Trigger further measurements Cross reference off site stats Informant database No measurement is ‘authoritative’ Cannot even believe a measurement

Log 03/20/ :13:46 ALARM pcgiga throughput= ctresh= athresh=312.91 03/20/ :13:48 TRACE no change in route detected 03/20/ :16:07 CALM Throughput within acceptable limits. ALARM CANCELLED

Toward a Monitoring Infrastructure
MAGGIE Measurement and Analysis package built on NIMI/Akenti EDEE production-quality Data Grid for Europe

More Information IEPM Home Page IEPM-BW I2 E2E and PIPES RIPE-TT
AMP Automated Event Detection NWS ABWE

End This talk made possible by the IEPM team at SLAC (Les Cottrell, Connie Logg, Jiri Navratil, Jerrod Williams, Fabrizio Coccetti), and the many developers and maintainers around the world.

PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003.

Similar presentations

Presentation on theme: "PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003.

Similar presentations

Presentation on theme: "PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003."— Presentation transcript:

Similar presentations

About project

Feedback