Proximity service Main idea – provide “glue” between experiments and sonar topology – mainly map sonars to storages and vice versa – determine existing.

Slides:



Advertisements
Similar presentations
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Link-State Routing Protocols Routing Protocols and Concepts – Chapter.
Advertisements

Logically Centralized Control Class 2. Types of Networks ISP Networks – Entity only owns the switches – Throughput: 100GB-10TB – Heterogeneous devices:
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.1 Routing Working at a Small-to-Medium Business or ISP – Chapter 6.
Update on OSG/WLCG perfSONAR infrastructure Shawn McKee, Marian Babik HEPIX Spring Workshop, Oxford 23 rd - 27 th March 2015.
Integrating Network and Transfer Metrics to Optimize Transfer Efficiency and Experiment Workflows Shawn McKee, Marian Babik for the WLCG Network and Transfer.
Banker’s Algorithm Implementation in CPN Tools Michal Žarnay Department of Transportation Networks University of Žilina, Slovakia.
1 A survey of Internet Topology Discovery. 2 Outline Motivations Internet topology IP Interface Level Router Level AS Level PoP Level.
Internet Traffic Patterns Learning outcomes –Be aware of how information is transmitted on the Internet –Understand the concept of Internet traffic –Identify.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
King : Estimating latency between arbitrary Internet end hosts Krishna Gummadi, Stefan Saroiu Steven D. Gribble University of Washington Presented by:
1 Relates to Lab 4. This module covers link state routing and the Open Shortest Path First (OSPF) routing protocol. Dynamic Routing Protocols II OSPF.
PerfSONAR in ATLAS/WLCG Shawn McKee, Marian Babik ATLAS Jamboree / Network Section 3 rd December 2014.
Distance Indexing on Road Networks A summary Andrew Chiang CS 4440.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Business Portal Project Community Seminars & Workshops GEOG June 2011 W. Millen, C. Jhagroo, J. Pickering, F. Fahad & T. Banchuen.
Geek Night Nima Ben Tramchester & Graph Databases.
Systems analysis and design, 6th edition Dennis, wixom, and roth
Use Cases. Summary Define and understand slow transfers – Identify weak links, narrow down the source – Understand what perfSONAR measurements mean wrt.
1 1 ISyE 6203 Radical Tools Intro To GIS: MapPoint John H. Vande Vate Spring 2012.
Copyright 2010, The World Bank Group. All Rights Reserved. COVERAGE, FRAMES & GIS, Part 2 Quality assurance for census 1.
© Copyright 2007 Arbinet-thexchange, Inc. All Rights Reserved. Voice Peering Steve Heap Chief Technology Officer.
Network and Transfer WG Metrics Area Meeting Shawn McKee, Marian Babik Network and Transfer Metrics Kick-off Meeting 26 h November 2014.
Dashboard Creation Mapping SQL queries to Google Chart Images.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
Advanced Networking Lab. Given two IP addresses, the estimation algorithm for the path and latency between them is as follows: Step 1: Map IP addresses.
LexBIG Release Overview Aug 21, LexBIG Context Project Goals for Sept –Incremental point release of LexBIG infrastructure to support EVS activities.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik Network and Transfer Metrics WG Meeting 8 th April 2015.
1 Internet Routing. 2 Terminology Forwarding –Refers to datagram transfer –Performed by host or router –Uses routing table Routing –Refers to propagation.
1 Network Measurement Summary ESCC, Feb Joe Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
Towards a Global Service Registry for the World-Wide LHC Computing Grid Maria ALANDES, Laurence FIELD, Alessandro DI GIROLAMO CERN IT Department CHEP 2013.
Update on OSG/WLCG Network Services Shawn McKee, Marian Babik 2015 WLCG Collaboration Workshop 12 th April 2015.
Determining the Geographic Location of Internet Hosts Venkata N. Padmanabhan Microsoft Research Lakshminarayanan Subramanian University of California at.
Update on WLCG/OSG perfSONAR Infrastructure Shawn McKee, Marian Babik HEPiX Fall 2015 Meeting at BNL 13 October 2015.
Bug Localization with Association Rule Mining Wujie Zheng
WLCG Network and Transfer Metrics WG After One Year Shawn McKee, Marian Babik GDB 4 th November
Storage dashboard Status report A.Baranovski 12/10/07.
CIS 250 Advanced Computer Applications Database Management Systems.
An Open Source GIS Architecture Connected and Linked Data
Network and Transfer WG perfSONAR operations Shawn McKee, Marian Babik Network and Transfer Metrics WG Meeting 28 h January 2015.
PerfSONAR for LHCOPN/LHCONE Update Shawn McKee/University of Michigan LHCONE/LHCOPN Meeting Amsterdam, NL October 28 th, 2015.
Introduction to Databases Angela Clark University of South Alabama.
WLCG Latency Mesh Comments + – It can be done, works consistently and already provides useful data – Latency mesh stable, once configured sonars are stable.
Graph Database - Neo4j ISQS3358, Spring Graph Database A graph database is a database that uses graph structures for semantic queries with nodes,
BIG DATA/ Hadoop Interview Questions.
Campana (CERN-IT/SDC), McKee (Michigan) 16 October 2013 Deployment of a WLCG network monitoring infrastructure based on the perfSONAR-PS technology.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.1 Routing Working at a Small-to-Medium Business or ISP – Chapter 6.
The Grid Information System Maria Alandes Pradillo IT-SDC White Area Lecture, 4th June 2014.
1 Relates to Lab 4. This module covers link state routing and the Open Shortest Path First (OSPF) routing protocol. Dynamic Routing Protocols II OSPF.
Network Layer COMPUTER NETWORKS Networking Standards (Network LAYER)
Neo4j: GRAPH DATABASE 27 March, 2017
Gridpp37 – 31/08/2016 George Ryall David Meredith
Working at a Small-to-Medium Business or ISP – Chapter 6
Shawn McKee, Marian Babik for the
Naming for Mobile Systems
A Case Study in Building Layered DHT Applications
Fast Kernel-Density-Based Classification and Clustering Using P-Trees
Spark Presentation.
Routing Protocols and Concepts
COMP 3270 Computer Networks
Link-State Routing Protocols
Graph Database.
Alerting/Notifications (MadAlert)
David Ostrovsky | Couchbase
NOSQL databases and Big Data Storage Systems
ECE 544 Protocol Design Project 2016
Data Queries Raster & Vector Data Models
Link-State Routing Protocols
Link-State Routing Protocols
Working at a Small-to-Medium Business or ISP – Chapter 6
Polyglot Persistence: Graph Stores
Presentation transcript:

proximity service Main idea – provide “glue” between experiments and sonar topology – mainly map sonars to storages and vice versa – determine existing tools/technologies that can be used – 3 approaches Site-based – join based on common mapping to GOCDB/OIM sites – Already works for ATLAS – AGIS has such mapping automatically – There are caveats – each experiment has different site topology – each has different mapping to GOCDB/OIM – not all cases are one to one GeoIP – join based on geographical distance Traceroutes – join based on network distance (hops) Shawn McKee, Saul Youssef and M.B. are working on this

GeoIP The main idea: Determine geographical location for hosts from their IPs, use geographical distance (Great-circle, Vincenty) to calculate proximity Example – psum02.aglt2.org (perfSONAR) – long: /lat: – earth.crc.nd.edu (storage) – long: /lat: – Vincenty’s distance = km Determining location – perfSONARs contain their geo location as part of toolkit’s information – storages can be mapped using GeoIP service - DB which maps subnets to geo locations GeoLite-City – open source – accuracy at city level, falls back to country level Commercial provides, e.g. MaxMind– usually have better accuracy and provide more information (estimated accuracy/precision) – computed distance can be improved by subnet match – hosts on the same subnet (24) -> 0 distance

GeoIP prototype Prototype available at proximity.cern.ch Uses a custom GeoIP DB build from various sources – Contains only ATLAS (from AGIS) and CMS (from PhEDEx) storages Provides API/JSON – you can get nearest storages for a sonar and vice versa – Motivated by Shoal, returns n nearest records with geo information – Adds “origin” at the end You can tune – number of records returned (via count=7) – units (via unit=miles) – subnet match (via subnet=24) Examples – ount=5 ount=5 – se01.in2p3.fr&count=10

GeoIP summary Prototype algorithms and database are easy to port and can be integrated in the already existing tools (within experiment) – Few lines of codes, most of the effort spent in building up the DB Prototype available for testing/experimental purposes only (not a production service) – we can decide to evolve it to a production service if there is interest Establishing WLCG-wide GeoIP service shouldn’t be too complicated and can also have other benefits: – We already have many existing sources for this – GOCDB site locations, Shoal, sonars, LHCONE/LHCOPN routing – we can combine them to get a solid baseline – We could run a crowdsourcing campaign to fine tune the baseline (get additional subnets mapped) – Once established, DB would only require minor updates

Traces The main idea: Use traceroutes to create a graph model of the network, use graph traversal algorithms to determine proximity of nodes Graph model of the network has potential for other use cases such as path analysis, correlations with other metrics, auto-detection of weak links, visual debugging perfSONAR can be used as the source for the traceroutes – Currently runs full mesh traceroutes between 110 sonars (12K traces/hour), ideal for baseline, other sources can be added since traceroute/tracepath Graph databases such as Neo4j can be used to store and query the model – Designed to support graphs with billions of nodes and relationships – Runs 4M+ relationship traversals/second – Can import 100K-1M records/second

Graph database concepts Graph database stores data in a graph, graph contains nodes and relationships Nodes – The objects in the graph – Can have name-value properties – Can have labels (types) Relationships – Relate nodes by type and direction – Can have name-value properties Graph query language (cypher), e.g. – match (x:Person)-[DRIVES]->(y) return x,y

Traceroute representation

Properties

Proximity Via graph query, what is n hops away from given node – (match (x:SE {host: "se3.itep.ru"})-[:LINK*1..2]-(y:SONAR) return x, y)

Site links

Path analysis RAL Aachen ITEP QMUL

Path analysis DFN JANET GEANT latency, packet-loss, throughput RAL Aachen ITEP QMUL We can correlate paths with packet-loss/latency information We can simplify the graph by aggregating nodes that belong to same NREN (visual debugging)

Prototype Available at proximity.cern.ch – Graph with sonar traceroutes, ipwhois, geoip, ATLAS/CMS sites and SEs linked (via common subnets only) – Neo4j access on request at proximity.cern.ch (only from CERN) and at (from anywhere, but needs to be updated) – Currently only for testing There is also an initial version of API – grid.saske.sk&count=3 grid.saske.sk&count=3 – npt2.bu.edu&dst=heplnx130.pp.rl.ac.uk npt2.bu.edu&dst=heplnx130.pp.rl.ac.uk – npt2.bu.edu&dst=heplnx130.pp.rl.ac.uk npt2.bu.edu&dst=heplnx130.pp.rl.ac.uk – Needs more work to be useful

Traces - Summary Initial prototype for proximity based on traceroutes available – There is also Spatial Extension to Neo4j which would provide GeoIP functionality – needs to be tested - common model for both GeoIP and traces – Need to gain more experience in graph databases and evolve loaders (also add other sources), validate the current model, evaluate performance – Evolve API to support more complex queries Positive experience in using graph databases (Neo4j) – There are others available, usually not as mature as Neo4J, but they’re catching up fast (SPARK GraphX, Giraph, Blueprints, etc.) There is potential to offer more advanced and very useful functionality – Path analysis, aggregations,