Using Netflow data for forecasting

Slides:



Advertisements
Similar presentations
Web100 at SLAC Presented at the Web100 Workshop, Boulder, CO, August 2002.
Advertisements

A Flexible Model for Resource Management in Virtual Private Networks Presenter: Huang, Rigao Kang, Yuefang.
An Analysis of Bulk Data Movement Patterns in Large-scale Scientific Collaborations W. Wu, P. DeMar, A. Bobyshev Fermilab CHEP 2010, TAIPEI TAIWAN
1 High Performance Active End-to- end Network Monitoring Les Cottrell, Connie Logg, Warren Matthews, Jiri Navratil, Ajay Tirumala – SLAC Prepared for the.
1 Network Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
1 SLAC Internet Measurement Data Les Cottrell, Jerrod Williams, Connie Logg, Paola Grosso SLAC, for the ISMA Workshop, SDSC June,
1 Evaluation of Techniques to Detect Significant Performance Problems using End-to-end Active Network Measurements Les Cottrell, SLAC 2006 IEEE/IFIP Network.
Network Traffic Measurement and Modeling CSCI 780, Fall 2005.
Copyright © 2005 Department of Computer Science CPSC 641 Winter Network Traffic Measurement A focus of networking research for 20+ years Collect.
1 Tools for High Performance Network Monitoring Les Cottrell, Presented at the Internet2 Fall members Meeting, Philadelphia, Sep
Internet Bandwidth Measurement Techniques Muhammad Ali Dec 17 th 2005.
INTRUSION DETECTION SYSTEMS Tristan Walters Rayce West.
Internet Traffic Management Prafull Suryawanshi Roll No - 04IT6008.
Network Monitoring School of Electronics and Information Kyung Hee University. Choong Seon HONG Selected from ICAT 2003 Material of James W. K. Hong.
Internet Traffic Management. Basic Concept of Traffic Need of Traffic Management Measuring Traffic Traffic Control and Management Quality and Pricing.
POSTECH DP&NM Lab. Internet Traffic Monitoring and Analysis: Methods and Applications (1) 5. Passive Monitoring Techniques.
1 Using Netflow data for forecasting Les Cottrell SLAC and Fawad Nazir NIIT, Presented at the CHEP06 Meeting, Mumbai India, February
DoE SciDAC high-performance networking research project: INCITE INCITE.rice.edu 2004 Technical Challenges INCITE R. Baraniuk, E. Knightly, R. Nowak, R.
1 Overview of IEPM-BW - Bandwidth Testing of Bulk Data Transfer Tools Connie Logg & Les Cottrell – SLAC/Stanford University Presented at the Internet 2.
1 Network Measurement Summary ESCC, Feb Joe Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
1 Internet End-to-end Monitoring Project - Overview Les Cottrell – SLAC/Stanford University Partially funded by DOE/MICS Field Work Proposal on Internet.
1 High Performance Network Monitoring Challenges for Grids Les Cottrell, SLAC Presented at the International Symposium on Grid Computing 2006, Taiwan
1 Evaluating NGI performance Matt Mathis
1 Distributed Monitoring CERNET's experience Xing Li
DoE SciDAC high-performance networking research project: INCITE INCITE.rice.edu 2004 Technical Challenges INCITE R. Baraniuk, E. Knightly, R. Nowak, R.
Performance Limitations of ADSL Users: A Case Study Matti Siekkinen, University of Oslo Denis Collange, France Télécom R&D Guillaume Urvoy-Keller, Ernst.
1 Minneapolis‘ IETF IPFIX Aggregation draft-dressler-ipfix-aggregation-00.txt.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
1 WAN Monitoring Prepared by Les Cottrell, SLAC, for the Joint Engineering Taskforce Roadmap Workshop JLab April 13-15,
1 Lessons Learned Monitoring Les Cottrell, SLAC ESnet R&D Advisory Workshop April 23, 2007 Arlington, Virginia Partially funded by DOE and by Internet2.
BOF Discussion: Uploading IEPM-BW data to MonALISA Connie Logg SLAC Winter 2006 ESCC/Internet2 Joint Techs Workshop ESCCInternet2ESCCInternet2 February.
1 Terapaths: DWMI: Datagrid Wide Area Monitoring Infrastructure Les Cottrell, SLAC Presented at DoE PI Meeting BNL September
1 Performance Network Monitoring for the LHC Grid Les Cottrell, SLAC International ICFA Workshop on Grid Activities within Large Scale International Collaborations,
1 Network Measurement Challenges LHC E2E Network Research Meeting October 25 th 2006 Joe Metzger Version 1.1.
Toward a Measurement Infrastructure. Warren Matthews (SLAC) Presented at the e2e Workshop Miami, FL, February 2003.
Network Monitoring Sebastian Büttrich, NSRC / IT University of Copenhagen Last edit: February 2012, ICTP Trieste
1 High Performance Network Monitoring Challenges for Grids Les Cottrell, Presented at the Internation Symposium on Grid Computing 2006, Taiwan
Architecture and Algorithms for an IEEE 802
Data Center Network Architectures
Les Cottrell & Yee-Ting Li, SLAC
Lessons Learned Monitoring the WAN
Monitoring 10Gbps and beyond
Fast Pattern-Based Throughput Prediction for TCP Bulk Transfers
R. Hughes-Jones Manchester
Prepared by Les Cottrell & Hadrien Bullot, SLAC & EPFL, for the
BOF Discussion: Uploading IEPM-BW data to MonALISA
Tools for High Performance Network Monitoring
Terapaths: DWMI: Datagrid Wide Area Monitoring Infrastructure
High Speed File Replication
CPSC 641: Network Measurement
Connie Logg, Joint Techs Workshop February 4-9, 2006
Prepared by Les Cottrell & Hadrien Bullot, SLAC & EPFL, for the
Wide Area Networking at SLAC, Feb ‘03
High Performance Active End-to-end Network Monitoring
ABwE: Available Bandwidth Estimator Jiri Navratil R. Les
Connie Logg February 13 and 17, 2005
My Experiences, results and remarks to TCP BW and CT Measurements Tools Jiří Navrátil SLAC.
End-to-end Anomalous Event Detection in Production Networks
Evaluation of Techniques to Detect Significant Performance Problems using End-to-end Active Network Measurements Mahesh Chhaparia & Les Cottrell, SLAC.
High Performance Network Monitoring for UltraLight
High Performance Network Monitoring for UltraLight
Forecasting Network Performance
CapProbe Ling-Jyh Chen, M. Y. Sanadidi, Mario Gerla
Wide-Area Networking at SLAC
MAGGIE NIIT- SLAC On Going Projects
Prepared by Les Cottrell & Hadrien Bullot, SLAC & EPFL, for the
EE 122: Lecture 22 (Overlay Networks)
CPSC 641: Network Measurement
pathChirp Efficient Available Bandwidth Estimation
pathChirp Efficient Available Bandwidth Estimation
Presentation transcript:

Using Netflow data for forecasting Les Cottrell and Fawad Nazir, Presented at the CHEP06 Meeting, Mumbai India, February 2006 www.slac.stanford.edu/grp/scs/net/talk06/chep06.ppt Tools for High Performance Network Monitoring Les Cottrell, SLAC Data intensive sciences such as High Energy Physics have a need to move large amounts of data between major collaboration sites worldwide. This in turn requires advanced high-speed networking capabilities which need to be monitored. In this talk we will review and compare the most popular active end-to-end (E2E) Internet monitoring tools in use today, including those used for measurement of delays (ping, owamp), routes (traceroute), available bandwidth (pathchirp, pathload, ABwE) and achievable throughput (iperf, thrulay). We will illustrate how the results from some of the tools may be visualized (including by MonALISA), and used for forecasting, detecting problems and alerting for problems. We will also identify challenges in scaling the current active E2E tools to the emerging high speed networks (> 1 Gbits/s, often using layer 2 or 1 paths) showing how and why they may fail. We will then look at the advantages and drawbacks of using passive measurements to reduce the need for active E2E bandwidth and throughput measurements, and look at whether the passive measurements may be used successfully for forecasting. Partially funded by DOE/MICS for Internet End-to-end Performance Monitoring (IEPM)

Netflow Router/Switch identifies flow by sce/dst ports, protocol Cuts record for each flow: src, dst, ports, protocol, TOS, start, end time Collect records and analyze Can be a lot of data to collect each day, needs lot cpu Hundreds of MBytes to GBytes No intrusive traffic, real: traffic, collaborators, applications No accounts/pwds/certs/keys No reservations etc Characterize traffic: top talkers, applications, flow lengths etc. Internet 2 backbone http://netflow.internet2.edu/weekly/ SLAC: www.slac.stanford.edu/comp/net/slac-netflow/html/SLAC-netflow.html NetraMet, SCAMPI are a couple of non-commercial flow projects. IPFIX is an IETF standardization effort for Netflow type passive monitoring.

Typical day’s flows Very much work in progress Look at SLAC border >100KB flows ~ 28K flows/day ~ 75 sites with > 100KByte bulk-data flows Few hundred flows > GByte

Forecasting? Collect records for several weeks Filter 40 major collaborator sites, big (> 100KBytes) flows, bulk transport apps/ports (bbcp, bbftp, iperf, thrulay, scp, ftp Divide by remote site, aggregate parallel streams Fold data onto one week, see bands at known capacities and RTTs ~ 500K flows/mo

Netflow et. al. Peaks at known capacities and RTTs RTTs might suggest windows not optimized

How many sites have enough flows? In May ’05 found 15 sites at SLAC border with > 1440 (1/30 mins) flows Enough for time series forecasting for seasonal effects Three sites (Caltech, BNL, CERN) were actively monitored Rest were “free” Only 10% sites have big seasonal effects in active measurement Remainder need fewer flows So promising

Compare active with passive Predict flow throughputs from Netflow data for SLAC to Padova for May ’05 Compare with E2E active ABwE measurements

Netflow limitations Use of dynamic ports. GridFTP, bbcp, bbftp can use fixed ports P2P often uses dynamic ports Discriminate type of flow based on headers (not relying on ports) Types: bulk data, interactive … Discriminators: inter-arrival time, length of flow, packet length, volume of flow Use machine learning/neural nets to cluster flows E.g. http://www.pam2004.org/papers/166.pdf Aggregation of parallel flows (not difficult) SCAMPI/FFPF/MAPI allows more flexible flow definition See www.ist-scampi.org/ Use application logs (OK if small number) For FTP port 21 only used as a control channel.

More challenges Throughputs often depend on non-network factors: Host interface speeds (DSL, 10Mbps Enet, wireless) Configurations (window sizes, hosts) Applications (disk/file vs mem-to-mem) Looking at distributions by site, often multi-modal Predictions may have large standard deviations How much to report to application

Conclusions Traceroute dead for dedicated paths Some things continue to work Ping, owamp Iperf, thrulay, bbftp … but Packet pair dispersion needs work, its time may be over Passive looks promising with Netflow SNMP needs AS to make accessible Capture expensive ~$100K (Joerg Micheel) for OC192Mon

More information