© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice S 3 : A Scalable Sensing Service.

Slides:



Advertisements
Similar presentations
Scalable and Extensible Network Monitoring For Scalable and Extensible Network Monitoring For GENI GENI Sonia.
Advertisements

1 Scalability is King. 2 Internet: Scalability Rules Scalability is : a critical factor in every decision Ease of deployment and interconnection The intelligence.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Suphakit Awiphan, Takeshi Muto, Yu Wang, Zhou Su, Jiro Katto
Path Optimization in Computer Networks Roman Ciloci.
Streaming Video over the Internet: Approaches and Directions Dapeng Wu, Yiwei Thomas Hou et al. Presented by: Abhishek Gupta
Probabilistic Aggregation in Distributed Networks Ling Huang, Ben Zhao, Anthony Joseph and John Kubiatowicz {hling, ravenben, adj,
1 In VINI Veritas: Realistic and Controlled Network Experimentation Jennifer Rexford with Andy Bavier, Nick Feamster, Mark Huang, and Larry Peterson
IPlane: An Information Plane for Distributed Services Offence by: Anup Goyal Sagar Vemuri.
1 Ossama Younis and Sonia Fahmy Department of Computer Sciences Purdue University For slides, technical report, and implementation, please see:
Traffic Engineering With Traditional IP Routing Protocols
1 GENI: Global Environment for Network Innovations Jennifer Rexford Princeton University
An Overlay Data Plane for PlanetLab Andy Bavier, Mark Huang, and Larry Peterson Princeton University.
Software Engineering and Middleware: a Roadmap by Wolfgang Emmerich Ebru Dincel Sahitya Gupta.
Internet In A Slice Andy Bavier CS461 Lecture.
1 Routing as a Service Karthik Lakshminarayanan (with Ion Stoica and Scott Shenker) Sahara/i3 retreat, January 2004.
Presentation Date : 16 Nov Measuring Bandwidth between PlanetLab Nodes Sung-Ju Lee, Puneet Sharma, Sujata Banerjee, Sujoy Basu Hewlett-Packard Laboratories,
UCB Communication Networks: Big Picture Jean Walrand U.C. Berkeley
OceanStore/Tapestry Toward Global-Scale, Self-Repairing, Secure and Persistent Storage Anthony D. Joseph John Kubiatowicz Sahara Retreat, January 2003.
Chapter 9: Moving to Design
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
New Challenges in Cloud Datacenter Monitoring and Management
Network Planète Chadi Barakat
Network Support for Cloud Services Lixin Gao, UMass Amherst.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Enable Cloud with Virtual.
Distributed Real-Time Systems for the Intelligent Power Grid Prof. Vincenzo Liberatore.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
SCAN: a Scalable, Adaptive, Secure and Network-aware Content Distribution Network Yan Chen CS Department Northwestern University.
Monitoring Latency Sensitive Enterprise Applications on the Cloud Shankar Narayanan Ashiwan Sivakumar.
Information-Based Building Energy Management SEEDM Breakout Session #4.
An Integrated Instrumentation Architecture for NGI Applications Ian Foster, Darcy Quesnel, Steven Tuecke Argonne National Laboratory The University of.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. LogKV: Exploiting Key-Value.
BitTorrent enabled Ad Hoc Group 1  Garvit Singh( )  Nitin Sharma( )  Aashna Goyal( )  Radhika Medury( )
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul,
Salim Hariri HPDC Laboratory Enhanced General Switch Management Protocol Salim Hariri Department of Electrical and Computer.
AKARI New Generation Network Architecture SeungHo Lee.
1 Logistical Computing and Internetworking: Middleware for the Use of Storage in Communication Micah Beck Jack Dongarra Terry Moore James Plank University.
Distributed Monitoring of Mesh Networks Elizabeth Belding-Royer Mobility Management and Networking (MOMENT) Lab Dept. of Computer Science University of.
P.C. Rossin College of Engineering and Applied Science RESEARCH C O M P U T E R S C I E N C E & E N G I N E E R I N G C O M P U T E R S C I E N C E & E.
November 17, 2015Department of Computer Sciences, UT Austin1 SDIMS: A Scalable Distributed Information Management System Praveen Yalagandula Mike Dahlin.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Resilient Overlay Networks Robert Morris Frans Kaashoek and Hari Balakrishnan MIT LCS
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
Microsoft Management Seminar Series SMS 2003 Change Management.
1 AReNA: Adaptive Distributed Catalog Infrastructure Based On Relevance Networks Vladimir Zadorozhny, University of Pittsburgh, Pittsburgh, PA Avigdor.
Measurement in the Internet Measurement in the Internet Paul Barford University of Wisconsin - Madison Spring, 2001.
Marcelo R.N. Mendes. What is FINCoS? A set of tools for data generation, load submission, and performance measurement of CEP systems; Main Characteristics:
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Wireless Mesh Networks Myungchul Kim
Sponsored by the National Science Foundation Scalable, Extensible, and Safe Monitoring of GENI Spiral 2 Year-end Project Review PI: Sonia Fahmy, Purdue.
Challenges in the Next Generation Internet Xin Yuan Department of Computer Science Florida State University
Network Management Mechanisms Two major network management protocols: Simple Network Management Protocol (SNMP) Common Management Information Protocol.
Network Computing Laboratory Load Balancing and Stability Issues in Algorithms for Service Composition Bhaskaran Raman & Randy H.Katz U.C Berkeley INFOCOM.
Resolve today’s IT management dilemma Enable generalist operators to localize user perceptible connectivity problems Raise alerts prioritized by the amount.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
Proposal Presentation Inferring Geographic proximity of the Internet Node using a Compound Metric M. Kamran Nishat.
LACSI 2002, slide 1 Performance Prediction for Simple CPU and Network Sharing Shreenivasa Venkataramaiah Jaspal Subhlok University of Houston LACSI Symposium.
1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.
Interaction and Animation on Geolocalization Based Network Topology by Engin Arslan.
Chapter 1 Characterization of Distributed Systems
Accelerating Peer-to-Peer Networks for Video Streaming
University of Maryland College Park
Kris, Karthik, Ansley, Sean, Jeremy Dick, David K, Frans, Hari
Network Requirements Javier Orellana
Northwestern Lab for Internet and Security Technology (LIST) Yan Chen Department of Computer Science Northwestern University.
GGF15 – Grids and Network Virtualization
Overlay Networking Overview.
AWS Cloud Computing Masaki.
Presentation transcript:

© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice S 3 : A Scalable Sensing Service for Monitoring Large Networked Systems Mobile & Media Systems Lab, HP Labs Praveen Yalagandula Puneet Sharma Sujata Banerjee Sung-Ju Lee Sujoy Basu

211 June 2015 Motivation Traditional access to network state −Only network admins have access to probe router MIBs SNMP based data collection and analysis −On path sensing (e.g., TCP, RTP) Emerging distributed applications −Need comprehensive picture of the network in real time For reliability and better performance To meet QoS guarantees −Examples CHART: Next Generation Internet Control Plane Mobile Streaming Media Systems

311 June 2015 Client-end Daemon Overlay Link Physical Link P2P Overlay Network A B Adaptive routing under fine real-time control Constant, pervasive sensing of network conditions Rapid routing response to link failures and degradations CHART: Control for High-throughput Adaptive Resilient Transport GOAL: Improve end-to-end TCP/IP performance 10x under multiple communication link impairments DARPA funded joint project with UC Berkeley, Princeton, George Mason University, UC Santa Barbara, and Anagran 10% lossrate

411 June 2015 Mobile Streaming Media System client Streaming server A Streaming server B Automatic server selection depending on network conditions Imperceptible to end user New media services introduced mid- session, depending on need High cross traffic

511 June 2015 What does these applications need? Sensing feedback that is: −Responsive: Seconds instead of minutes −Scalable: Low overhead even for ubiquitous sensing −Shareable: Between different applications Between different components of an application −Robust: Adapt to infrastructure failures −Flexible and Extensible: Per-flow, per-application to meet multiple application requirements

611 June 2015 S 3 : A Scalable Sensing Service Goal: provide system state in real-time −Both individual network and node state Monitor actively and passively E2E but leverage network element info where possible Flexible interfaces −Configurable time scales to measure −Support complex queries To which node do I have large bandwidth? Which node with file “foo” is within 10ms latency? Share measurement info across applications −Eliminate redundant expensive measurements Scalable, Secure, and Reliable

711 June 2015 Outline Motivation for a sensing service S 3 : Scalable Sensing Service −Goals −Architecture Sensing Pods Backplane Scalable Inferencing Engines Deployment: All pair network metrics on PlanetLab Summary and Future work

811 June 2015 Sensor Pod Secure Web Interface Controller Latency Lossrate Bandwidth Load Capacity Memory Repositor y Configuration & Data API: query, control, and notification SNMP Agent

911 June 2015 Sensor Pods Web-Service (WS) enabled collection of sensors −Goal is to provide a flexible and extensible framework −Focus not on building new measurement tools Allows easy plugging of new sensors WS enables composition  aggregate sensors −E.g., Spruce needs capacity, use PathRate Archival of sensing information

1011 June 2015 Backplane Sensing Information Management Backplane

1111 June 2015 Sensing Backplane Programmable middleware −Aggregate data from end-points −Distribute configurations to end-points −Configurable and self-managing Exploring SDIMS [Yalagandula et al. SIGCOMM’04] −DHT based information management middleware −Scalable: with both nodes and attributes −Flexible: supports different aggregation strategies −Reliable: self-configuration in the face of failures −Adaptive: efficiently handle dynamic load patterns

1211 June 2015 Scalable Inference Engines Difficult to collect complete sensing information −Large overhead for probing and data exchange −E2E properties require gathering information from multiple sensors −Measurement/Monitoring failures Inference based on incomplete information −Exploit properties such as triangular inequality e.g. latency −Bayesian techniques e.g. hop counts Prediction based on archived information −E.g. network weather service A coarse estimate may suffice for many applications

1311 June 2015 Inferencing Latency and Proximity estimation −Co-ordinates: e.g., GNP, Vivaldi −Landmark based: e.g., Netvigator, GNP, Lighthouse −Others: e.g., Meridian, ID Maps, King Lossrate −Subset Path Selection: NetQuest Bandwidth: Non additive, highly dynamic  hard −Route sharing model: BRoute

1411 June 2015 S 3 : Architecture Sensor pods −Collection of sensors −Measure system state from a node’s view Backplane −Programmable fabric −Connects pods and aggregates measured system state Inference Engines −Infers O(n 2 ) E2E paths info by measuring few paths −Schedules measurements on pods −Aggregates data on backplane Applications

1511 June 2015 Outline Motivation for a sensing service S 3 : Scalable Sensing Service −Goals and Challenges −Architecture Sensing Pods Backplane Scalable Inferencing Deployment: All pair network metrics on PlanetLab Future work

1611 June 2015 Deployment: PlanetLab Early January 2006 All pair network metrics −Latency: Inferred by Netvigator −Lossrate: Measured using Tulip lossrate tool −Available Bandwidth: Measured using Spruce and PathChirp −Capacity: Measured using Pathrate Stats:~14GB raw data every day, ~1GB compressed

1711 June 2015 Screenshot: Hop by hop loss sensor

1811 June 2015 S 3 Screenshot

1911 June 2015 Network Visualizer – NOC on a Laptop

2011 June 2015 CHART: S 3 in Action

2111 June 2015 Ongoing work Security Model Admission control for S 3 −Limit network and node usage Efficient and accurate bandwidth inference Validation with more applications