The Plane distributed measurement infrastructure Overview, insights & hindsights Journée du Conseil Scientifique de l’Afnic, #JCSA2015, July 9 th 2015.

Slides:



Advertisements
Similar presentations
Research Challenges in the CarTel Mobile Sensor System Samuel Madden Associate Professor, MIT.
Advertisements

§ Project acronym: mPlane Project full title: “mPlane – an Intelligent Measurement Plane for Future Network and Application Management” Grant agreement.
MPlane: project and architecture The need of an intelligent measurement plane for the internet FIRE-GENI workshop May 5-6, Cambridge USA Disclaimer: I’m.
1/20 Cloud Computing SLAs in FP7 Bruxelles, May 27, 2013 mPlane – an Intelligent Measurement Plane for Future Network and Application Management MPLANE.
Logically Centralized Control Class 2. Types of Networks ISP Networks – Entity only owns the switches – Throughput: 100GB-10TB – Heterogeneous devices:
Multihoming in IPV6 Habib Naderi Department of Computer Science University of Auckland.
PlanetLab Operating System support* *a work in progress.
SDN and Openflow.
Open Government Vlora Ademi, Business Development Manager-Edu, Microsoft Macedonia &Kosovo
1 DCS860A Emerging Technology Physical layer transparency in Cloud Computing (rev )
Presentation 7 part 2: SOAP & WSDL. Ingeniørhøjskolen i Århus Slide 2 Outline Building blocks in Web Services SOA SOAP WSDL (UDDI)
IPlane: An Information Plane for Distributed Services Offence by: Anup Goyal Sagar Vemuri.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
Realising the Potential of Service Oriented Architecture Kris Horrocks Connected Systems Division Microsoft.
1 Routing as a Service Karthik Lakshminarayanan (with Ion Stoica and Scott Shenker) Sahara/i3 retreat, January 2004.
Distributed and Efficient Classifiers for Wireless Audio-Sensor Networks Baljeet Malhotra Ioanis Nikolaidis Mario A. Nascimento University of Alberta Canada.
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
Understanding Active Directory
Load Test Planning Especially with HP LoadRunner >>>>>>>>>>>>>>>>>>>>>>
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
MPlane – Building an Intelligent Measurement Plane for the Internet Maurizio Dusi – NEC Laboratories Europe NSF Workshop on perfSONAR.
MPlane – Building an Intelligent Measurement Plane for the Internet Alessandro D’Alconzo FTW Forschungszentrum Telekommunikation Wien, AT
MPlane – Building an Intelligent Measurement Plane for the Internet A quick overview.
S New Security Developments in DICOM Lawrence Tarbox, Ph.D Chair, DICOM WG 14 (Security) Siemens Corporate Research.
(jeez y) Where is the Internet? Answers from : (G. Whilikers) Out there. (Mike) the way I see it, the "internet" has to be somewhere. a router collects.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
Chapter 6: Packet Filtering
mPlane – Building an Intelligent Measurement Plane for the Internet
1 Controlling IP Spoofing via Inter-Domain Packet Filters Zhenhai Duan Department of Computer Science Florida State University.
Happy Network Administrators  Happy Packets  Happy Users WIRED Position Statement Aman Shaikh AT&T Labs – Research October 16,
Towards Street-Level Client- Independent IP Geolocation Yong Wang, UESTC/Northwestern Daniel Burgener, Northwestern Marcel Flores, Northwestern Aleksandar.
Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, SCAPE Scalable Preservation Environments.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Maximize Return on Engagement via Scalable Omni-Channel Online Services in the Cloud COMPANY PROFILE: XOMNI, INC. Founded in 2011 and headquartered in.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
An Approach To Automate a Process of Detecting Unauthorised Accesses M. Chmielewski, A. Gowdiak, N. Meyer, T. Ostwald, M. Stroiński
Tony McGregor RIPE NCC Visiting Researcher The University of Waikato DAR Active measurement in the large.
An Intelligent Measurement Plane for the Internet Pedro Casas – Senior FTW Vienna Traffic Monitoring & Analysis.
Russ Housley IETF Chair Internet2 Spring Member Meeting 28 April 2009 Successful Protocol Development.
Accada – Open Source EPC Network Prototyping Platform Christian Floerkemeier Christof Roduner SAP October 2006.
1 - GEC8, San Diego, July 20-22, 2010 Measurement Tools in PlanetLab Europe Tanja Zseby (Fraunhofer FOKUS, Berlin, Germany) (some slides from other OneLab.
Computer Security Workshops Networking 101. Reasons To Know Networking In Regard to Computer Security To understand the flow of information on the Internet.
1 Network Measurement Summary ESCC, Feb Joe Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
- 1 IPv6 Quality of Service Measurement Issues and Solutions Alessandro Bassi Hitachi Europe SAS RIPE 50 meeting Stockholm, 2 nd May 2005.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Protocols COM211 Communications and Networks CDA College Olga Pelekanou
MPlane – an Intelligent Measurement Plane for Future Network and Application Management Grant Agreement n Heidelberg mPlane – Demo.
Where are the anycasters?
Network Computing Laboratory 1 Vivaldi: A Decentralized Network Coordinate System Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT Published.
Census Processing Baku Training Module.  Discuss:  Processing Strategies  Processing operations  Quality Assurance for processing  Technology Issues.
Where are the anycasters, reloaded MAT WG Talk Grant Agreement n Dario Rossi Professor telecom-paristech.fr/~drossi/anycast.
1 workshop Barcelona, April 22, 2015.
Highly Available Internet Telephony Fact or Fiction? Manfred Reitenspiess Fujitsu Siemens Computers Munich, Germany
Library Online Resource Analysis (LORA) System Introduction Electronic information resources and databases have become an essential part of library collections.
REST Style Large Measurement Platform Protocol draft-liu-lmap-rest-00.txt Dapeng Liu(Presenter) Lingli Deng China Mobile Shihui Duan CATR Cathy Li China.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
A New Generation of Artificial Neural Networks.  Support Vector Machines (SVM) appeared in the early nineties in the COLT92 ACM Conference.  SVM have.
1. ABSTRACT Information access through Internet provides intruders various ways of attacking a computer system. Establishment of a safe and strong network.
Anycast enumeration and geolocation approaches Scuola Politecnica e delle Scienze di Base Corso di Laurea Magistrale in Ingegneria Informatica master’s.
St. Petersburg, 2016 Openstack Disk Storage vs Amazon Disk Storage Computing Clusters, Grids and Cloud Erasmus Mundus Master Program in PERCCOM Author:
CIS 700-5: The Design and Implementation of Cloud Networks
LMAP BoF 1. ISP use case 2. Framework
PERISCOPE: Standardizing and Orchestrating Looking Glass Querying
Donatella Castelli CNR-ISTI
Presented by Munezero Immaculee Joselyne PhD in Software Engineering
Dude, where’s that IP? Circumventing measurement-based geolocation
Continuous Automated Chatbot Testing
11/17/2018 9:32 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Phillipa Gill University of Toronto
Presentation transcript:

The Plane distributed measurement infrastructure Overview, insights & hindsights Journée du Conseil Scientifique de l’Afnic, #JCSA2015, July 9 th 2015 Grant Agreement n Dario Rossi Professor telecom-paristech.fr/~drossi

Today Internet Which ISP is the best in my area ? Why is not working? How does it work? How to do better? User viewpoint ISP viewpoint Cloud viewpoint How to optimize my network for users apps? Where is traffic coming from? MotivationsMeasurementArchitectureEcosystemExample of useSummary How to keep users happy & engaged? Researcher viewpoint

Today Internet “The Internet is the first thing that humanity has built that humanity doesn't understand, the largest experiment in anarchy that we have ever had.” Eric Schmidt – ex Google Exec. Chairman MotivationsMeasurementArchitectureEcosystemExample of useSummary 3

…. issing orchestration Internet measurement Shed light on the Internet operational obscurity Lot of measurement tools… Plane: avoid to reinvent the wheel & assist in building automated pilots! MotivationsMeasurementArchitectureEcosystemExample of useSummary Plane: 4

Two kinds of measurements Passive – Observe network traffic without interference – Similar to Aristotle’s observational method Active – Perturb the network & measure its reaction – Similar to Newton’s experimental method 5 Ἀριστοτέλης Sir All science is either practical, poetical or theoretical (Metaphysics) Every body continues in its state of rest, or of uniform motion in a right line, unless it is compelled to change that state by forces impressed upon it (Principia) MotivationsMeasurementArchitectureEcosystemExample of useSummary 5

Passive m easurements Deploy passive sensors Deploy passive sensors Extract meaningful analytics Post Process Passive sensor Raw (or half- cooked) data Data collection User traffic MotivationsMeasurementArchitectureEcosystemExample of useSummary 6

Active m easurements Raw (or half- cooked) data Data collection Actuator Control active sensors Control active sensors A Post Process Active probes Perform active measurements Extract meaningful analytics MotivationsMeasurementArchitectureEcosystemExample of useSummary 7

erging all together Active sensor Control Passive sensor Data Integration with existing monitoring frameworks Integration with existing monitoring frameworks A Active and passive analysis for iterative root-cause-analysis Active and passive analysis for iterative root-cause-analysis MotivationsMeasurementArchitectureEcosystemExample of useSummary 8

~~~Big data hype~~~ Traffic measurements orders of m agnitude – 40Gbps (157PB/yr) full-packet processing at each passive sensor, with up to O(10 7 ) traffic classifications per second… – O(10 9 ) active probes in an Internet anycast census… (more on that later if time allows) To be compared with To be compared with – The Large Hadron Collider (LHC), generates ~25PB/yr 8 ) collisions per second and O(10 8 ) collisions per second – Sloan Digital Sky Survey (SDSS), generates ~73TB/yr – Capacity of the Human genome with 2-bits bases ~ O(10 9 ) [Image from [courtesy of Sony Classical]

Note: simplified view, yet the devil hides in the details Plane architecture walkthrough A Repositories Probes S Supervisor Client/ Reasoner MotivationsMeasurementArchitectureEcosystemExample of useSummary 10

Plane architecture: entities A Repositories Probes S Supervisor Client/ Reasoner Orchestrate measurements; expert system (automatic pilot) Access control; delegation; boostrap Past sensor data; legacy databases; Big data processing Past sensor data; legacy databases; Big data processing Sensor data; Proxies for legacy tools/ platforms Sensor data; Proxies for legacy tools/ platforms MotivationsMeasurementArchitectureEcosystemExample of useSummary 11

Plane architecture: interfaces A Probes S Supervisor Client/ Reasoner Native probe Proxy probe/repo Native repo Repositories (avoid reinventing the wheel!) MotivationsMeasurementArchitectureEcosystemExample of useSummary 12

Plane architecture: messages A Repository Probe S Supervisor Client/ Reasoner Schema-centric definition, with a type registry Schema completely describes the measurement, JSON format C S R MotivationsMeasurementArchitectureEcosystemExample of useSummary 13

Note: simple ping example, for illustrational purposes Plane architecture: messages CSR { "capability": “measure”, "version": 0, "registry": "label": “ping-aggregate” "when": “now.. future / 1s” "parameters": {"source.ip4“: " ", "destination.ip4": "*"}, “results”: ["delay.twoway.icmp.us.min", "delay.twoway.icmp.us.mean", "delay.twoway.icmp.us.max", "delay.twoway.icmp.count"] } { “specification": “measure”, "version": 0, "registry": "label": “ping-aggregate-experiment1”, "when": “now + 30s / 1s”, "token" : "0f31c9033f8fce0c9be41d4944" "parameters": {"source.ip4“: " ", "destination.ip4": “ "}, “results”: ["delay.twoway.icmp.us.min", "delay.twoway.icmp.us.mean", "delay.twoway.icmp.us.max", "delay.twoway.icmp.count"] } { “specification": “measure”, "version": 0, "registry": "label": “ping-aggregate-experiment1”, "when": “ :30: […] /1s“, "token" : "0f31c9033f8fce0c9be41d4944“, "parameters": {"source.ip4“: " ", "destination.ip4": “ "}, “results”: ["delay.twoway.icmp.us.min", "delay.twoway.icmp.us.mean", "delay.twoway.icmp.us.max", "delay.twoway.icmp.count"] “resultsvalue” : [[ 2390,2983, 6600,30]] } *grin* If I'd known then that [ping] would be my most famous accomplishment in life, I might have worked on it another day or two and added some more options. Mike Muuss - US Army Research Laboratory MotivationsMeasurementArchitectureEcosystemExample of useSummary 14

Plane architecture: transport Repository Probe S Supervisor Client/ Reasoner mPlane over: HTTPS (HTTP over TLS) WSS (WebSockets over TLS) SSH (Secure Shell) Indirect export uses custom protocols: IPFIX, FTP, BitTorrent, Apache Flume, RFC1149, RFC6214, etc. MotivationsMeasurementArchitectureEcosystemExample of useSummary 15

Plane architecture: workflow Repositories Probes S Supervisor Client/ Reasoner 1. Send high level specification 2. Decompose specification 3. Send lowlevel specification 4. Measure 5. Indirect export of Big data 6.Analyze 7.Return results 8. Aggregate results 9.Return high-level results 10. Decide and iterate MotivationsMeasurementArchitectureEcosystemExample of useSummary 16

Plane architecture: Inter-domain Repositories Probes Supervisor S S S Client/ Reasoner MotivationsMeasurementArchitectureEcosystemExample of useSummary 17

The broader easurement ecosystem “From global measurements to local management” – 10 partners, 3.8 MEUR, FP7 STREP – More focused use case: Build a framework out of probes – Knowledge sharing (e.g., joint work, Dagsthul seminars, etc.) IETF Large-Scale Measurement of Broadband Performance (LMAP) – Defines the components, protocols, rules, etc., but does not specifically target adding “a brain” to the system – Common core set, Largely interoperable IETF IP Performance Metrics (IPPM) – Registry related, we use its vocabulary as much as possible IETF IP Flow Information Export (IPFIX) – Indirect export related, active contributors Others in scope – IETF DOCTORS; tcpm; ConEx; NETCONF; IRTF NMRG; ETSI STQ; ITU SG12 MotivationsMeasurementArchitectureEcosystemExample of useSummary 18

Plane consortium Marco Mellia POLITO Saverio Nicolini NEC Dina Papagiannaki Telefonica Ernst Biersack Eurecom Brian Trammell ETH Tivadar Szemethy NetVisor Dario Rossi ENST Fabrizio Invernizzi Telecom Italia Guy Leduc Univ. Liege Pietro Michiardi Eurecom Pedro Casas FTW Andrea Fregosi Fastweb 16 partners – 3 ISPs – 6 research centers – 5 universities – 2 SMEs FP7 IP – 11 MEUR – 3 years long – ends early 2016 MotivationsMeasurementArchitectureEcosystemExample of useSummary 19

Plane achievements so far Standardization &dissemination – 3 RFCs, 5 drafts – 80+ papers, including 4 Best paper awards and ACM SIGCOMM IMC & CoNEXT, IEEE INFOCOM Software and (Ready-to-use Virtual Machines under preparation) Check the demos on “Biased sampling”: Anycast geolocation as a showcase of mPlane achievements Note: hard to summarize! MotivationsMeasurementArchitectureEcosystemExample of useSummary 20

IP Anycast Anycast: set of equivalent replicated servers, traffic routed to closest replica (in BGP sense) Question: where are Google DNS servers ? Mountain View, CA (IP2Location) New York, NY (Geobytes) United States (Maxmind) ????? 1ms ≈ 100Km 2ms from EU to US ????? Speed of light violation !! Tools using distributed measurement aren’t better ! Time varying answers Unknown accuracy Commercial databases Distributed measurement MotivationsMeasurementArchitectureEcosystemExample of useSummary 21

Geolocation technique MotivationsMeasurementArchitectureEcosystemExample of useSummary Measure Latency Planetlab/RIPE Atlas Measure Latency Planetlab/RIPE Atlas Enumerate Solve MIS Optimum (brute force). 5-approx (greedy).Enumerate Solve MIS Optimum (brute force). 5-approx (greedy). GeolocateClassification Maximum likelihoodGeolocateClassification Detect Speed of light violations Detect Scalability Infrastructure ? Lightweight IterateFeedback Filter latency noise MeasureDetectEnumerateGeolocateIterate D. Cicalese, et al. “A fistful of Pings: Accurate and lightweight Anycast enumeration and Geolocation”, IEEE INFOCOM

measurement infrastructure RIPE Atlas vs PlanetLab Anecdotal understanding – For some deployments, RIPE includes PlanetLab – For others, the union is greater than the sum of the parts ! mPlane enables/simplifies systematic studies MotivationsMeasurementArchitectureEcosystemExample of useSummary Infrastructure ? FootprintVPsASesCountries RIPE Atlas7k2k150 PlanetLab~ (Microsoft IP/24 Example) 23

Anycast census MotivationsMeasurementArchitectureEcosystemExample of useSummary O(10 7 ) targets x O(10 2 ) active sensors O(10 3 ) targets /sensor /second O(10 3 ) targets are anycast – needle in the IPv4 haystack Scalable Lightweight 24

Su ary Overview – Measurements to shed light on Internet operational obscurity Insights – Measurement plane to facilitate expression of measurement capabilities and needs – Allow users to concentrate effort on hard problems – Avoiding time consuming tasks Hindsights – Crucial to foster adoption (GitHub, ready-to-use VMs) to reach a critical mass (incentive in proxying) MotivationsMeasurementArchitectureEcosystemExample of useSummary 25

any thanks ?? || // 26 MotivationsMeasurementArchitectureEcosystemExample of useSummary

Backup slides 27

Plane architecture: simplistic view Repository components Measurement components (aka Probes) mPlaneAPI A Reasoner Client S Supervisor Client proxy

S Plane architecture: gory details

Methodology overview MeasureLatency Planetlab/Ripe Enumerate Solve MIS Optimum (brute force). 5-approx (greedy). GeolocateClassification Maximum likelihood IterateFeedback Detect Speed of light violations

Measure PlanetLab – 300 vantage points – Geolocated with Spotter (ok for unicast) – Freedom in type/rate of measurement ICMP, DNS, TCP-3way delay, etc RIPE – 6000 vantage points – Geolocated with MaxMind (ok for unicast) – More constrained (ICMP, traceroute) In this talk: min over 10 ICMP samples MeasureLatency Planetlab/Ripe Enumerate Solve MIS Optimum (brute force). 5-approx (greedy). GeolocateClassification Maximum likelihood Iterate Feedbac k Detect Speed of light violations

Detect The vantage points p and q are referring to two different instances if: MeasureLatency Planetlab/Ripe Enumerate Solve MIS Optimum (brute force). 5-approx (greedy). GeolocateClassification Maximum likelihood Iterate Feedbac k Detect Speed of light violations Packets cannot travel faster than the speed of light

Enumerate Find a maximum independent set E – of discs such that: – Brute force (optimum) vs Greedily from smallest (5-approximation) MeasureLatency Planetlab/Ripe Enumerate Solve MIS Optimum (brute force). 5-approx (greedy). GeolocateClassification Maximum likelihood Iterate Feedbac k Detect Speed of light violations

Geolocate Classification task – Map each disk Dp to most likely city – Compute likelihood (p) of each city in disk based on: ci: Population of city i Ai: Location of ATA airport of city i d(x,y): Geodesic distance  : city vs distance weighting Output policy Proportional: Return all cities in D p with respective likelihoods Argmax: Pick city with highest likelihood Frankfurt p=0.30 Zurich p=0.10 Munich p=0.60 Munich MeasureLatency Planetlab/Ripe Enumerate Solve MIS Optimum (brute force). 5-approx (greedy). GeolocateClassification Maximum likelihood Iterate Feedbac k Detect Speed of light violations rationale: users lives in densely populated area; to serve users, servers are placed close to cities airports: simplifies validation against ground truth (DNS)

Iterate Collapse – Geolocated disks to city area Rerun – Enumeration on modified input set Increase recall of anycast replicas enumeration MeasureLatency Planetlab/Ripe Enumerate Solve MIS Optimum (brute force). 5-approx (greedy). GeolocateClassification Maximum likelihood Iterate Feedbac k Detect Speed of light violations

Performance at a glance – Protocol agnostic and lightweight Based on a handful of delay measurement O(100) VPs 1000x fewer VPs than state of the art – Enumeration iGreedy use75% of the probes (25% discarded due to overlap) Overall 50% recall (depends on VPs; stratification is promising) – Geolocation Correct geolocation for 78% of enumerated replicas 361 km mean geolocation error for all enumerated replicas (271 km median for erroneous classification)