Ofir Israel Guy Paskar. An Internet Tale Once upon a time.. Users unhappy (slow connection) ISPs unhappy (poor revenues) Then came Broadband access...

Slides:



Advertisements
Similar presentations
Ningning HuCarnegie Mellon University1 Optimizing Network Performance In Replicated Hosting Peter Steenkiste (CMU) with Ningning Hu (CMU), Oliver Spatscheck.
Advertisements

Rarest First and Choke Algorithms are Enough Arnaud LEGOUT INRIA, Sophia Antipolis France G. Urvoy-Keller and P. Michiardi Institut Eurecom France.
Incentives Build Robustness in BitTorrent Bram Cohen.
Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.
Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
The Structure of Networks with emphasis on information and social networks T-214-SINE Summer 2011 Chapter 8 Ýmir Vigfússon.
Resilient Peer-to-Peer Streaming Paper by: Venkata N. Padmanabhan Helen J. Wang Philip A. Chou Discussion Leader: Manfred Georg Presented by: Christoph.
1 Routing and Scheduling in Web Server Clusters. 2 Reference The State of the Art in Locally Distributed Web-server Systems Valeria Cardellini, Emiliano.
CompSci 356: Computer Network Architectures Lecture 21: Content Distribution Chapter 9.4 Xiaowei Yang
Efficient and Flexible Parallel Retrieval using Priority Encoded Transmission(2004) CMPT 886 Represented By: Lilong Shi.
One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
1 Internet Networking Spring 2004 Tutorial 13 LSNAT - Load Sharing NAT (RFC 2391)
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
IPlane: An Information Plane for Distributed Services Offence by: Anup Goyal Sagar Vemuri.
Traffic Engineering With Traditional IP Routing Protocols
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Improving ISP Locality in BitTorrent Traffic via Biased Neighbor Selection Ruchir Bindal, Pei Cao, William Chan Stanford University Jan Medved, George.
Peer-Assisted Content Distribution Networks: Techniques and Challenges Pei Cao Stanford University.
Service Differentiated Peer Selection An Incentive Mechanism for Peer-to-Peer Media Streaming Ahsan Habib, Member, IEEE, and John Chuang, Member, IEEE.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
A Trust Based Assess Control Framework for P2P File-Sharing System Speaker : Jia-Hui Huang Adviser : Kai-Wei Ke Date : 2004 / 3 / 15.
1 End-to-End Detection of Shared Bottlenecks Sridhar Machiraju and Weidong Cui Sahara Winter Retreat 2003.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
Wide Web Load Balancing Algorithm Design Yingfang Zhang.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #12 LSNAT - Load Sharing NAT (RFC 2391)
Tradeoffs in CDN Designs for Throughput Oriented Traffic Minlan Yu University of Southern California 1 Joint work with Wenjie Jiang, Haoyuan Li, and Ion.
The Structure of Networks with emphasis on information and social networks T-214-SINE Summer 2011 Chapter 8 Ýmir Vigfússon.
Maputo, Mozambique, April 2014 QoS Framework for Broadband Internet Service Country Case: Mauritius Roubee GADEVADOO, Engineer/Licensing Officer,
FIREWALL TECHNOLOGIES Tahani al jehani. Firewall benefits  A firewall functions as a choke point – all traffic in and out must pass through this single.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
P2P File Sharing Systems
Content Distribution March 8, : Application Layer1.
1 Proceeding the Second Exercises on Computer and Systems Engineering Professor OKAMURA Laboratory. Othman Othman M.M.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
BitTorrent. BitTorrent network  On the itinerary:  Introduction to BitTorrent  Basics & properties  3 Interesting analysis results.
1. 1.Charting the CDNs(locating all their content and DNS servers). 2.Assessing their server availability. 3.Quantifying their world-wide delay performance.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
1 One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,
BitTorrent Presentation by: NANO Surmi Chatterjee Nagakalyani Padakanti Sajitha Iqbal Reetu Sinha Fatemeh Marashi.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
1 ISP-aided Biased Query Search in P2P Systems Vinay Aggarwal and Anja Feldmann Deutsche Telekom Laboratories / TU Berlin Berlin,
Chapter 4. After completion of this chapter, you should be able to: Explain “what is the Internet? And how we connect to the Internet using an ISP. Explain.
Taming the Torrent: A Practical Approach to Reducing Cross-ISP Traffic in Peer-to-Peer Systems David R. Choffnes and Fabián E. Bustamante Speaker: Wally.
Lecture 8 Page 1 Advanced Network Security Review of Networking Basics: Internet Architecture, Routing, and Naming Advanced Network Security Peter Reiher.
Application-Layer Anycasting By Samarat Bhattacharjee et al. Presented by Matt Miller September 30, 2002.
1 BitHoc: BitTorrent for wireless ad hoc networks Jointly with: Chadi Barakat Jayeoung Choi Anwar Al Hamra Thierry Turletti EPI PLANETE 28/02/2008 MAESTRO/PLANETE.
CDN Brokering* Presented By Nick Arnold Authors Alexandros Biliris, et. Al.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
Aditya Akella The Performance Benefits of Multihoming Aditya Akella CMU With Bruce Maggs, Srini Seshan, Anees Shaikh and Ramesh Sitaraman.
Tony McGregor RIPE NCC Visiting Researcher The University of Waikato DAR Active measurement in the large.
1 Capacity Dimensioning Based on Traffic Measurement in the Internet Kazumine Osaka University Shingo Ata (Osaka City Univ.)
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
Performance Limitations of ADSL Users: A Case Study Matti Siekkinen, University of Oslo Denis Collange, France Télécom R&D Guillaume Urvoy-Keller, Ernst.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
Drafting Behind Akamai (Travelocity-Based Detouring) Ao-Jan Su, David R. Choffnes, Aleksandar Kuzmanovic and Fabián E. Bustamante Department of EECS Northwestern.
Taming the Torrent (Can’t P2P and ISPs just get along?)
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Content Distribution Networks
IFIP – Performance 2007 A Modeling Framework to Understand the Tussle between ISPs and Peer-to-Peer File Sharing Users Michele Garetto - unito.
Challenges with developing a Commercial P2P System
Presentation transcript:

Ofir Israel Guy Paskar

An Internet Tale Once upon a time.. Users unhappy (slow connection) ISPs unhappy (poor revenues) Then came Broadband access... And everybody were happy

The Villain arrives P2P File-Sharing Applications (Kazaa, eMule, BitTorrent, etc..) Users love it! Good and free content, overnight downloads ISPs hate it! Users using their entire link Internet link utilization gone wild More bandwidth costs more money!

But is it really a villain? Users love it Driving force for broadband adoption Increased revenues for ISPs What should the ISPs do??

Some Ideas User unfriendly ideas Increase subscription cost Volume-based pricing Block/shape P2P traffic (priority for non-P2P packets) User friendly ideas Acquire more BW Network caching

Today.. Generally understand the problem – DONE! (? ) Describe an analytical model to help us understand situation better Describe one practical solution and it’s empirical results (Hint: it works)

Research Goals Modeling framework to analyze interactions between P2P File-Sharing users and their ISPs Basic insight about system dynamics Used to evaluate different strategies to manage P2P traffic

Meet the Players User Generates queries (P2P application finds the object and retrieves it) Pays a subscription price, has QoS expectations What’s popular, what’s not ISP Goal: TO MAKE MONEY Sets subscription price Controls bandwidth Influences P2P app behavior

System Setting n users inside ISP N users in the world User-ISP ul., dl. bandwidt h ISP-ISP ul., dl. bandwidth User generates query Gets a response from within the ISP Or from a user in another ISP

The Simple System Model Average query rate Aggregate query rate Prob. P2P App. locates object Prob. Object is located inside ISP Unconstrained downloads from within the ISP Model for “Internet to ISP” link System throughput Object retrieval prob. (QoS):

User Utility Function Shape parameter Object retrieval prob. Subscription cost Users subscribe only if: Equivalently, if: is the minimal service level acceptable by user i Benefit Cost

ISP Utility Function Revenues from subscribers’ fee Cost per unit of BW Fixed cost ISP starts service only if: Benefit Cost

Traffic Locality Probability that there exists at least one internal replica of object replicated r times in the system Probability to download from internal replica Number of files inside ISP Number of files outside ISP Locality parameter

Minimum BW Reminder: So: Assuming We get

Minimum BW Non-linear behavior (on n) More users  more locality  less BW needed Can be zero if n large enough (self-sustainability) Dependant on multiple parameters Self-Sustainability

Simple(?!) Model

Impact of Object Replication (r) More replicas  Better locality  Lower B d needed B min has two roots: x 1 – No users, x 2 – Enough users for self- sustainability

Impact of external QoS ( ) Higher external QoS  More BW needed (because there are more replicas externally)

Impact of prob. to locate objects (q) Some ISPs drop queries. This graph shows them different.

Impact of prob. to retrieve objects internally (Gamma) Det. by the ability to find a local object given that it exists. Can be influenced by the ISP – this graph shows it should.

Model Refinements Simple Model Users’ access BW are unconstrained Object popularity is identical Users availability identical Refined Model Relax these assumptions Propose object popularity and replication model

Model Refinements We adopt a processor sharing model with rate limit b d to describe the sharing of B d Now each user is limited by it’s own BW. Queue Model

Model Refinements We introduce a new parameter: that describes user patience Denote b as the initial download rate, and assuming the decision to abort is made at the beginning then the prob. p g to continue the transfer is: Larger eta  user claims to get a rate close to what they paid for

B min as a function of b d =b u with different values of gamma Higher gamma  smaller b u needed for self-sustainability Optimal gamma is not gamma=1 !!! For b u < 250 the BW available inside the ISP is not enough to satisfy demanding users

Impact of asymmetric access BWs Cost for ISP increases as ratio increases (what about ADSL??) Larger b u  Better locality  lower B d

Conclusions Locality is good for the ISPs More replicas, larger querying probability, larger upload bandwidth for users’ access, larger probability to retrieve objects internally (gamma)  SELF SUSTAINABILITY == GOOD Reading slow leads to better understanding

Further Reading Original paper of course: Garetto et al, “A modeling framework to understand the tussle between ISPs and peer-to- peer file-sharing users” in Performance Evaluation, June 2007 Same as the original paper but talks about ISP-ISP connections: Wang et al, “Modeling the Peering and Routing Tussle between ISPs and P2P Applications” in the proceedings of IWQoS 2006

BREAK ?

Academic Work Oracle-based vs. non-Oracle-based (e.g., with ISP cooperation or without) Legality issues, reluctance issues Improvements via locality research Network location or Geographic location? Which method of network location? Improvements via redirection research Can we redirect traffic to inexpensive links? Many more

Part 2 Taming the Torrent - A Practical Approach to Reducing Cross-ISP Traffic in Peer-to-Peer Systems David R. Choffnes and Fabián E. Bustamante

The problem  Over 66% of P2P users & growing  But how do we know which peer to choose?  Which peers? Trackers provide a random subset of peers in the torrent  Random peer connections → growing ISP operation costs.  So, how do we know if a suggested peer is inside our Isp or outside?  We want to reduce cross isp transport. Meaning use the “closest” peers.  But, how can we do that?

The ISP Perspective P2P performance - key factor for service upgrade & selection by users A major engineering challenge for ISPs ≈ 70% of the Internet traffic But, a lot of cross isp,means a lot of cost for the Isp. What can the isp do in order to fight the p2p users?

Isp methods and its problems ISPs shape traffic directed to standard ports P2Ps move to dynamic, non-standard ports ISPs turn to deep-packet inspection to identify & shape P2P flows P2Ps encrypt their connections ISPs place caches and/or spoofs TCP RST msgs Legality issues. (Some ways to overcome this – in Israel!) So good solution must be agreed by the p2p users!

One solution: Oracles.  Suggestion – the isp’s itself will have to implement an oracle, this oracles will guide the user which peers to choose.  Help reduce cross-ISP traffic  This solution looks appealing But:  Assumes P2P users & ISPs trust each other  Misses incentive for user adoption  Therefore not so good after all 34

The authors suggestion  CDN’S – content distribution network. What is it?  CDNs attempt to improve web performance by redirecting requests to replica servers  The goal is to help content providers (i.e. CNN) to distribute content by redirecting requests to replica servers that are: Topologically proximate Provide lower-latency  But how do they do that?

How does CDN work? There are some ways that a CDN works by for example: Way 1 : I want to go to cnn.com  dns lookup,  directs me to the domain name of the CDN (cnn.akamai.com)  CDN sends me to the right replica. Way 2: I want to go to cnn.com  dns lookup  first page from original cnn.com,  directs me to CDN server  sends to right replica.

Reusing CDNs’ network views  Client’s request redirected to “nearby” server  Client gets web site’s DNS CNAME entry with domain name in CDN network  Hierarchy of CDN’s DNS servers direct client to nearby servers 37 Internet Web client Client requests translation for Yahoo Client gets CNAME entry with domain name in Akamai (3) Hierarchy of CDN DNS servers Customer DNS servers (1) (2) (4) (5) (6) LDNS Web replica servers Multiple redirections to find nearby edge servers Another web client Client is given 2 web replica servers (fault tolerance) Clients and replica servers are “nearby”]

The authors suggestion  So how do we use CDN?  We are going to recycle data that is already being collected by Content Distribution Networks, and use it.  But how?  By simply comparing DNS redirections.  Assumptions :  Links between “nearby” hosts cross few ISPs  If two hosts are close to the same CDN replica servers, they are close to each other

Reducing cross-ISP traffic So we can use the CDN’s data, what are the advantages for this recycilng? Does not requires any trust between isp and p2p users The infrastructure is already exist And most importantly reduces cross isp traffic without harming the p2p users (even improving)

An approach to reducing cross isp  Introducing “Ono” Extension (plugin) to Azureus BitTorrent client Will use CDN and ratio maps(?) to determine who is “closer” We will describe its implementation  Uses multiple CDN customers Only DNS resolution, no content download needed Adaptive lookup rates on CDN names Represent redirection behavior using ratio-maps

Ratio Maps A ratio map represents the frequency of redirecting to a specific replica Number of replicas is usually small (max 31) Keep a time window about 24 hours How does it looks?

Ratio maps represantation  The ratio map of a peer (a) is a set of (replica server, ratio)  for peer a  Specifically, if peer a is redirected toward replica server r1 75% of the time window, and toward replica server r2 25% of the time window, then the corresponding ratio- map is  The sum of all in a given ratio map equals one  For each peer there exist a ratio map  But what can we do with it?

choosing peers by ratio map  2 peers has close ratio map, than we say that they are close. ( possibly in the same network), and the ooposit  So, we need a calculation that will determine for 2 peers if they are “close” or not.  Than we can check for all available peers and choose the “closest” one  For that we define cosine similarity for 2 peers

Cosine-similarity  the cosine similarity of two maps will range from 0 to 1, since the term frequencies cannot be negative  If cos_sim(a,b) = 0, the vectors are orthogonal  if cos_sim(a,b) = 1 than they are the same  This is very close to dot product  And we determine a threshold currently 0.15, if cos_sim(a,b)>0.15 than we recommend these peers as close

Implementation Ono, an extension to Azureus client Performs periodic DNS lookups on popular CDN and create a ratio map  Periodically updates the ratio-maps Exchanging ratio maps for cos-sim(a,b): On Handshake From trackers  But how do we deal with peers not using Ono?  Ono also attempts to perform DNS lookups on behalf of other peers that it encounters, to determine their ratio maps  How?  Taken from Ono code : getting the other peer DNS server  And querying it

So what's now?  Get ratio information from other peers that got from tracker, and understand who is close When determine similar redirection behavior, attempts to bias traffic towards that peer by ensuring that the connection is always on  Sends Ono information to supporting trackers(in case of supporting trackers) But what is the cost? How much is our overhead? 18KB upstream, 36KB downstream per day Computation of cosine-similarity is easy

Important notes CDN names being used: Initialization of ratio map: DNS on each CDN name at most once every 30 sec. for 2 min. this gives basis ratio map After this phase If the redirection info for CDN name similar to prev. query  the interval between queries increases by 1 min. Otherwise the interval is halved(to a min. of 30 sec.)

Some statistics regarding Ono  Details for 2007 :  > 195,000 users worldwide  … collecting ~15GB of data per day

Empirical results Over 120,000 peers use Ono Ono collects extra network data Samples transfer rates for each connection every 5 sec. Get RTT for endpoint using pings Get Trace-route between end points Note : Not easy to determine cross-ISP hops IP hops is easy and gives some measure

Empirical resualts  So in practice Trace Route gives a router level views of path between hosts. BUT an ISP can contain many routers, we wish for a metric that is closely correspond to ISP hops.  How do we get this metric?  Autonomous systems, how?  Although there is no 1 to 1 correlation between AS and ISPs, the number of AS hops gives us an upper bound estimate on the number of cross-ISP hops  So in practice we generate AS level path info from our trace-routes using mapping that can be provided  Example :

Example

Empirical results Ono finds shorter paths Median in less than half More than 20% are only one hop away, via less than 2%

Reducing cross-ISP traffic Average number of AS hops to reach Ono- recommended/random peers 53 > 30% of paths to Ono-recommended peers do not leave the AS of origin Note BT curve includes all peers, either Ono-recommended or randomly selected

Finding nearby peers 54 Two orders of magnitude difference And, on average, 31% lower loss rates!

Improving transfer performance 55 Heavy Tail – Average performance improves by 31% Median difference is ~2KB/s Even when Ono doesn’t help, it allows BT to naturally select faster peers DSL in England -- 4/8Mbps down, only 768Kbps up ISP bandwidth allocation policy brings bottleneck to the access link

… with the right bandwidth allocation policy Romania: 50 Mb/s in metro-area, 4 Mb/s outside % median improvement

Helpful ISPs can help themselves 57

Duscussion Absolute network positioning system, and just throw away the “far” peers Problem – all peers must take a part in the service, in contrast to our method Use just AS numbers There are ISPs (like comcast) that have many AS numbers, so using these numbers can restrict cross-AS traffic that is not cross-ISP traffic

Conclusion Recycling network views collected by CDNs The method reduces cross-ISP traffic Performance of peers is not effected(we saw this) Scalable (the more clients adopt it, the more accurate the bias would get) Available easily and freely Therefore the method is good and can provide good results in reducing Cross-ISP traffic

Question TBD

Extras