Taming the Torrent (Can’t P2P and ISPs just get along?)

Slides:



Advertisements
Similar presentations
Incentives Build Robustness in BitTorrent Bram Cohen.
Advertisements

On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
CompSci 356: Computer Network Architectures Lecture 21: Content Distribution Chapter 9.4 Xiaowei Yang
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
1 Modeling and Emulation of Internet Paths Pramod Sanaga, Jonathon Duerig, Robert Ricci, Jay Lepreau University of Utah.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
Internet Traffic Patterns Learning outcomes –Be aware of how information is transmitted on the Internet –Understand the concept of Internet traffic –Identify.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Improving ISP Locality in BitTorrent Traffic via Biased Neighbor Selection Ruchir Bindal, Pei Cao, William Chan Stanford University Jan Medved, George.
Peer-Assisted Content Distribution Networks: Techniques and Challenges Pei Cao Stanford University.
Kyushu University Graduate School of Information Science and Electrical Engineering Department of Advanced Information Technology Supervisor: Professor.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
Adaptive Web Caching Lixia Zhang, Sally Floyd, and Van Jacob-son. In the 2nd Web Caching Workshop, Boulder, Colorado, April 25, System Laboratory,
Content Delivery Networks. History Early 1990s sees 100% growth in internet traffic per year 1994 o Netscape forms and releases their first browser.
Characterizing Residential Broadband Networks Marcel Dischinger †, Andreas Haeberlen †‡, Krishna P. Gummadi †, Stefan Saroiu* † MPI-SWS, ‡ Rice University,
Can Internet Video-on-Demand Be Profitable? SIGCOMM 2007 Cheng Huang (Microsoft Research), Jin Li (Microsoft Research), Keith W. Ross (Polytechnic University)
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
P2P File Sharing Systems
1 Proceeding the Second Exercises on Computer and Systems Engineering Professor OKAMURA Laboratory. Othman Othman M.M.
Ofir Israel Guy Paskar. An Internet Tale Once upon a time.. Users unhappy (slow connection) ISPs unhappy (poor revenues) Then came Broadband access...
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
Taming the Torrent: A Practical Approach to Reducing Cross-ISP Traffic in Peer-to-Peer Systems David R. Choffnes and Fabián E. Bustamante Speaker: Wally.
Network Technologies essentials Week 9: Distributed file sharing & multimedia Compilation made by Tim Moors, UNSW Australia Original slides by David Wetherall,
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
BitTorrent enabled Ad Hoc Group 1  Garvit Singh( )  Nitin Sharma( )  Aashna Goyal( )  Radhika Medury( )
MULTI-TORRENT: A PERFORMANCE STUDY Yan Yang, Alix L.H. Chow, Leana Golubchik Internet Multimedia Lab University of Southern California.
Othman Othman M.M., Koji Okamura Kyushu University 1.
Peer-Assisted Content Distribution Pablo Rodriguez Christos Gkantsidis.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
1 COMP 431 Internet Services & Protocols HTTP Persistence & Web Caching Jasleen Kaur February 11, 2016.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
1 Three ways to (ab)use Multipath Congestion Control Costin Raiciu University Politehnica of Bucharest.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
MICROSOFT AJAX CDN (CONTENT DELIVERY NETWORK) Make Your ASP.NET site faster to retrieve.
Drafting Behind Akamai (Travelocity-Based Detouring) Ao-Jan Su, David R. Choffnes, Aleksandar Kuzmanovic and Fabián E. Bustamante Department of EECS Northwestern.
John S. Otto, Mario A. Sanchez, David R. Choffnes*, Fabián E. Bustamante, Georgos Siganos** Northwestern, EECS * U.
25/09/2016 INASP: Effective Network Management Workshops Unit 6: Solving Network Problems.
1 Design and Implementation of a High-Performance Distributed Web Crawler Polytechnic University Vladislav Shkapenyuk, Torsten Suel 06/13/2006 석사 2 학기.
Federating Data in the ALICE Experiment
Accelerating Peer-to-Peer Networks for Video Streaming
The Hidden Locality in Swarms
P4P : Provider Portal for (P2P) Applications Haiyong Xie, Y
Content Distribution Networks
THE NEED FOR DNS DOMAIN NAME SYSTEM
Scale and Performance in the CoBlitz Large-File Distribution Service
RTP: A Transport Protocol for Real-Time Applications
The Internet.
Practical Censorship Evasion Leveraging Content Delivery Networks
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
NOMA Network Operator Measurement Activity
Web Caching? Web Caching:.
Working at a Small-to-Medium Business or ISP – Chapter 7
No Direction Home: The True cost of Routing Around Decoys
Co* Projects : CoDNS, CoDeploy, CoMon
Working at a Small-to-Medium Business or ISP – Chapter 7
ECE 671 – Lecture 16 Content Distribution Networks
Distributed Content in the Network: A Backbone View
Computer Communication & Networks
Internet Interconnection
Distributed Systems CS
Working at a Small-to-Medium Business or ISP – Chapter 7
Content Distribution Networks
Inter-domain Routing Outline Homework #3 solutions
CS 6290 Many-core & Interconnect
INTERNET APPLICATIONS
Content Delivery and Remote DNS services
Challenges with developing a Commercial P2P System
EE 122: Lecture 22 (Overlay Networks)
Presentation transcript:

Taming the Torrent (Can’t P2P and ISPs just get along?) David R. Choffnes Fabián E. Bustamante Northwestern University Mention the subtitle of the paper – paraphrase the paper’s subtitle SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Peer-to-Peer Systems Enable a range of important services End-system multicast, file sharing, content distribution, … Leverage everybody’s resources Disk, processing power, bandwidth Natural scalability, inherent robustness, high flexibility … Studies have shown that users upgrade their conneciton or change ISPs for better P2P performance David Choffnes, Taming the Torrent, SIGCOMM 2008

The BitTorrent protocol Over 66% of P2P users & growing Distribute files through piece exchanges between collaborating peers A file for download is described by a torrent Peers sharing content are connected to the same torrent Which peers? Trackers provide a random subset of peers in the torrent For a peer’s initial set of connections For new connections, as transfer progresses and other connections are dropped Stronger connection to previous slide Comment on BT taking over a good fraction of the multimedia flows David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 The ISP Perspective P2P performance - key factor for service upgrade & selection by users A major engineering challenge for ISPs ≈70% of the Internet traffic Random peer connections → growing ISP operation costs Internet Video to TV Internet Video to PC VoIP Video Gaming P2P Web/E-mail Users pay flat rate, but ISPs see increasing, expensive traffic volumes Make positive and negative clear To say :oblivious of net topology & link costs Source: Cisco David Choffnes, Taming the Torrent, SIGCOMM 2008

All fun and games until someone gets a subpoena ISPs shape traffic directed to standard ports P2Ps move to dynamic, non-standard ports ISPs turn to deep-packet inspection to identify & shape P2P flows P2Ps encrypt their connections ISPs place caches and/or spoofs TCP RST msgs Lawsuits + bad publicity First lesson An effective solution needs users to buy in Three class-action lawsuits filed against Comcast ars technica. June 6th, 2008 Wikipedia has a full page about how to defeat DPI Comcast is reading blogs to try to fix user perceptions David Choffnes, Taming the Torrent, SIGCOMM 2008

A straightforward approach ISPs guide peers when choosing neighbors Help reduce cross-ISP traffic First proposed by Bindal et al. (ICDCS 2006) and Aggarwal et al. (CCR 2007) More recently – Xie et al. P4P (SIGCOMM 2008) Two remaining issues Assumes P2P users & ISPs trust each other Misses incentive for user adoption

An alternative, practical approach CDNs are already collecting the necessary info Various CDNs, e.g. Akamai, Limelight Serve popular sites e.g. NYTimes, Apple, Yahoo … Move load from providers to edge of network How do they do it? Using thousands of replica servers worldwide Extensive network measurements to inform replica server selection Redirecting web-clients to web content replicas Reuse CDNs’ net views for peer selection How? By simply comparing DNS redirections Present time

Reusing CDNs’ network views Client’s request redirected to “nearby” server Client gets web site’s DNS CNAME entry with domain name in CDN network Hierarchy of CDN’s DNS servers direct client to nearby servers Multiple redirections to find nearby edge servers Internet (3) Hierarchy of CDN DNS servers Customer DNS servers (1) (2) (4) (5) (6) LDNS Web replica servers Clients and replica servers are “nearby” [SIGCOMM ’06] Another web client As we reported in SIGCOMM Client is given 2 web replica servers (fault tolerance) Client gets CNAME entry with domain name in Akamai Client requests translation for Yahoo Web client David Choffnes, Taming the Torrent, SIGCOMM 2008

Reusing CDNs’ network views (Cont) Hypotheses Links between “nearby” hosts cross few ISPs If two hosts are close to the same CDN replica servers, they are close to each other Advantages Requires no additional infrastructure Needs no topological information Avoids the trust issue between ISPs and P2P Reduces cross-ISP traffic … while improving users’ transfer performance Before this slide, don’t read the text as much, say things different David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Deployment Available as Ono, an extension to a popular BitTorrent client First released April 2007 Currently > 195,000 users worldwide … in hundreds of countries … observing > 2 million peers per day … with flows traversing ~ 10,000 ASs … collecting ~15GB of data per day David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Coverage Click image to load interactive version David Choffnes, Taming the Torrent, SIGCOMM 2008

CDN-based peer selection Implementation Ono Extension to Azureus/Vuze BitTorrent client (GPL) Now also part of a tracker CDN-based peer selection Uses multiple CDN customers (DNS names) Only DNS resolution, no content download needed Adaptive lookup rates on CDN names Exchanges ratio information w/ other Ono peers Sends Ono information to supporting trackers Over 12,000 SLOC (it’s Java) 3,000 for data collection, 3,000 for GUI Make point that tracker came after paper

Figures based on a 2-week study in December, 2007 Experimental dataset Types of data Per-connection sampled transfer rates (> 100 million per day) Ping RTTs (> 100 million per day) Traceroutes (> 2 million per day) Figures based on a 2-week study in December, 2007 Each data point is the average for a peer over a 6-hour interval Results of real, working system

Reducing cross-ISP traffic Average number of AS hops to reach Ono-recommended/random peers > 30% of paths to Ono-recommended peers do not leave the AS of origin Note BT curve includes all peers, either Ono-recommended or randomly selected We use AS hops as a proxy for ISP hops, median is 1 hop, that’s the case for less than 10% of the random one. This is the first study of the actual number of cross-ISP links, only 8.2% stay in same AS Median for Ono is 1 AS hop, 3 for random The quantization evident in the Ono curve is not from a lack of data (they are several thousands of data point there, > 5,000) . Rather, because each Ono peer sees a relatively small number of recommended Ono peers during an observation interval and because those peers are typically along short paths, the average of which takes on small range of values, often integers. Each point, the average of statistics recorded from one Ono peer over a six-hour interval. David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Finding nearby peers Two orders of magnitude difference And, on average, 31% lower loss rates! Not only does it reduce cross ISP traffic, but also finds better paths. Directly say the values below. Also say the value for traceroute medians, and about 60% that have 0 loss which median for random is 2.1% 6ms to 530ms; the large value of 1ms is ‘cause Windows returns only integer values and <1ms for those bellow 1. David Choffnes, Taming the Torrent, SIGCOMM 2008

Improving transfer performance Heavy Tail – Average performance improves by 31% DSL in England -- 4/8Mbps down, only 768Kbps up ISP bandwidth allocation policy brings bottleneck to the access link Median difference is ~2KB/s Even when Ono doesn’t help, it allows BT to naturally select faster peers To our surprise, this is what we found out … Of course 31% is good enough, we could leave it there. But this doesn’t seem to fit with the path qualities. Talk about why we saw this w/ BT getting good system-wide performance, but low per-connection rates per connection, bottleneck at the access link. If this is the case, then we should be able to find an ISP w/ the policy. What do you know, we have one. David Choffnes, Taming the Torrent, SIGCOMM 2008

… with the right bandwidth allocation policy Romania: 50 Mb/s in metro-area, 4 Mb/s outside 883% median improvement We have a user said this was great, Portugal sees this too David Choffnes, Taming the Torrent, SIGCOMM 2008

Helpful ISPs can help themselves What’s more interesting, the ISP that helps you, helps itself. Before was bandwidth allocation, now it’s ISPs with policy that reduces cross-ISP traffic, mutually assured benefits David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Discussion Scalability/Overhead No path probing to locate nearby peers Requires only periodic DNS lookups Max: 18KB up and 36KB down per day Practical No new infrastructure required Ready to use now, even a year ago No need for trust between ISPs & P2P Performance improvement is the best incentive for adoption! Cross-ISP traffic No comment about complaint Change to “Discussion”. Mention some ISPs want cross-ISP traffic David Choffnes, Taming the Torrent, SIGCOMM 2008

That’s all fine and good, but why not …? Get rid of peers that are “far away” (?) Latency is weakly correlated with path distance Requires everyone to measure Use AS numbers AS-level info can be too … Fine-grained - Cross-AS != Cross-ISP Coarse - Intra-AS could be coast-to-coast Ask your ISP Decisions guided by aggregated traffic policies But what about adoption/deployment cost/operation …? Requires extensive latency measurements (not N^2) Last one, say explicitly that you this is P4P Comcast has >40 different ASNs – Cross-AS != Cross-ISP AS7132 spreads over most of the continental US David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Summary Ono – A practical & effective way to reduce P2P networking costs Reuses CDNs’ network views An alternative approach for ISPs? Recommend Ono to your P2P users Change bandwidth allocation to favor in-network connections Part of 3R project - Reduce, Reuse, Recycle Reuse/recycle the views of long-running services in building large distributed systems Ono or Ono-like solution, don’t forget limwire David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Extra, Extra… Network Early Warning System (NEWS) Reuse P2P systems’ aggregated views and natural traffic Detect network anomalies locally Use corroboration to confirm them globally Join the 4,207 users who have already done so! http://aqualab.cs.northwestern.edu/projects/NEWS.html David Choffnes, Taming the Torrent, SIGCOMM 2008

David Choffnes, Taming the Torrent, SIGCOMM 2008 Fin David Choffnes, Taming the Torrent, SIGCOMM 2008