Multicast Pull Scheduling Kirk Pruhs. The Big Problem Movie Distribution Database Replication via Internet Harry Potter Book Download Software Download.

Slides:



Advertisements
Similar presentations
TU/e Service Discovery Mechanisms: two case studies / IC2002 Service Discovery Mechanisms: Two case studies Control point Device UPnP Terminal Domain Host.
Advertisements

Tight Bounds for Online Class- constrained Packing Hadas Shachnai Bell Labs and The Technion IIT Tami Tamir The Technion IIT.
Online Scheduling with Known Arrival Times Nicholas G Hall (Ohio State University) Marc E Posner (Ohio State University) Chris N Potts (University of Southampton)
1 Efficient and Robust Streaming Provisioning in VPNs Z. Morley Mao David Johnson Oliver Spatscheck Kobus van der Merwe Jia Wang.
Princeton University COS 423 Theory of Algorithms Spring 2001 Kevin Wayne Competitive Analysis.
Reliability on Web Services Presented by Pat Chan 17/10/2005.
1 Better Scalable Algorithms for Broadcast Scheduling Ravishankar Krishnaswamy Carnegie Mellon University Joint work with Nikhil Bansal and Viswanath Nagarajan.
June 3, 2015Windows Scheduling Problems for Broadcast System 1 Amotz Bar-Noy, and Richard E. Ladner Presented by Qiaosheng Shi.
Dynamic Internet Congestion with Bursts Stefan Schmid Roger Wattenhofer Distributed Computing Group, ETH Zurich 13th International Conference On High Performance.
REK’s adaptation of Prof. Claypool’s adaptation of
Online Algorithms Motivation and Definitions Paging Problem Competitive Analysis Online Load Balancing.
Tirgul 8 Universal Hashing Remarks on Programming Exercise 1 Solution to question 2 in theoretical homework 2.
Resource augmentation and on-line scheduling on multiprocessors Phillips, Stein, Torng, and Wein. Optimal time-critical scheduling via resource augmentation.
Computer Science Lecture 2, page 1 CS677: Distributed OS Last Class: Introduction Distributed Systems – A collection of independent computers that appears.
Flow Algorithms for Two Pipelined Filtering Problems Anne Condon, University of British Columbia Amol Deshpande, University of Maryland Lisa Hellerstein,
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
1 Scheduling on Heterogeneous Machines: Minimize Total Energy + Flowtime Ravishankar Krishnaswamy Carnegie Mellon University Joint work with Anupam Gupta.
1 Scheduling Jobs with Varying Parallelizability Ravishankar Krishnaswamy Carnegie Mellon University.
Recap Priorities task-level static job-level static dynamic Migration task-level fixed job-level fixed migratory Baker/ Oh (RTS98) Pfair scheduling This.
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
A General Approach to Online Network Optimization Problems Seffi Naor Computer Science Dept. Technion Haifa, Israel Joint work: Noga Alon, Yossi Azar,
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
Locality-Aware Content Distribution Danny Bickson, Dahlia Malkhi, David Rabinowitz.
Energy, Energy, Energy  Worldwide efforts to reduce energy consumption  People can conserve. Large percentage savings possible, but each individual has.
Communication Part IV Multicast Communication* *Referred to slides by Manhyung Han at Kyung Hee University and Hitesh Ballani at Cornell University.
Resource Placement and Assignment in Distributed Network Topologies Accepted to: INFOCOM 2013 Yuval Rochman, Hanoch Levy, Eli Brosh.
Server Load Balancing. Introduction Why is load balancing of servers needed? If there is only one web server responding to all the incoming HTTP requests.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Communication (II) Chapter 4
Design and Implement an Efficient Web Application Server Presented by Tai-Lin Han Date: 11/28/2000.
1 System Models. 2 Outline Introduction Architectural models Fundamental models Guideline.
Yossi Azar Tel Aviv University Joint work with Ilan Cohen Serving in the Dark 1.
Multicast Scheduling in Cellular Data Networks Katherine Guo, Arun Netravali, Krishan Sabnani Bell-Labs Research Hyungsuk Won, Han Cai, Do Young Eun, Injong.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
1 Pruhs, Woeginger, Uthaisombut 2004  Qos Objective: Minimize total flow time  Flow time f i of a job i is completion time C i – r i  Power Objective:
1 Server Scheduling in the L p norm Nikhil Bansal (CMU) Kirk Pruhs (Univ. of Pittsburgh)
Architectures of distributed systems Fundamental Models
A Maiden Analysis of Longest Wait First Jeff Edmonds York University Kirk Pruhs University of Pittsburgh.
CSC 413/513: Intro to Algorithms NP Completeness.
Overcast: Reliable Multicasting with an Overlay Network Paper authors: Jannotti, Gifford, Johnson, Kaashoek, O’Toole Jr. Slides by Chris Johnstone.
DISTRIBUTED COMPUTING Introduction Dr. Yingwu Zhu.
Distributed Computing Systems CSCI 4780/6780. Distributed System A distributed system is: A collection of independent computers that appears to its users.
Thomas Dreibholz Institute for Experimental Mathematics University of Duisburg-Essen, Germany University of Duisburg-Essen, Institute.
1 Job Scheduling for Grid Computing on Metacomputers Keqin Li Proceedings of the 19th IEEE International Parallel and Distributed Procession Symposium.
Networked Games Objectives – –Understand the types of human interaction that a network game may provide and how this influences game play. –Understand.
Motivation: Sorting is among the fundamental problems of computer science. Sorting of different datasets is present in most applications, ranging from.
Data Communications and Networks Chapter 9 – Distributed Systems ICT-BVF8.1- Data Communications and Network Trainer: Dr. Abbes Sebihi.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Minimizing Response time in Broadcast Scheduling Nikhil Bansal Don Coppersmith and Maxim Sviridenko.
Non-Preemptive Buffer Management for Latency Sensitive Packets Moran Feldman Technion Seffi Naor Technion.
Scheduling Techniques for Media-on-Demand Amotz Bar-Noy Brooklyn College Richard Ladner Tami Tamir University of Washington.
Ben Moseley Sungjin Im Goals of Talk  Make you aware of/interested in broadcast sched  Highlight known results, key open questions, some recent results.
Scalable video distribution techniques Laurentiu Barza PLANETE project presentation: Sophia Antipolis 12 October 2000.
1 Scheduling Techniques for Broadcasting Popular Media. Amotz Bar-Noy Brooklyn College Richard Ladner Tami Tamir University of Washington.
Scheduling Parallel DAG Jobs to Minimize the Average Flow Time K. Agrawal, J. Li, K. Lu, B. Moseley.
Last Class: Introduction
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Data Dissemination and Management (2) Lecture 10
Software Architecture in Practice
湖南大学-信息科学与工程学院-计算机与科学系
Distributed Systems Bina Ramamurthy 11/30/2018 B.Ramamurthy.
Architectures of distributed systems Fundamental Models
Replications in Multi-Region Peer-to-peer Systems
New Scheduling Algorithms: Improving Fairness and Quality of Service
Architectures of distributed systems Fundamental Models
University of Pittsburgh
University of Pittsburgh
Architectures of distributed systems
Architectures of distributed systems Fundamental Models
Data Dissemination and Management (2) Lecture 10
Presentation transcript:

Multicast Pull Scheduling Kirk Pruhs

The Big Problem Movie Distribution Database Replication via Internet Harry Potter Book Download Software Download Olympics Pay-Per-View Movies Today’s Internet Audience Size Content Richness

The Standard Centralized Unicast Pull Approach is Not Scalable Creates unnecessary network congestion Overloads the server

One Possible Solution: Multicast

Response Times for the 3 Different Multicast Distribution Methods Average response time Low load High load Multicast Push Unicast Pull Multicast Pull

Appropriate Distribution Method Depends on Popularity of Data Multicast Push Multicast Pull Unicast Pull

Another Application: (RODS) Realtime Outbreak Detection System Developed at Pitt Deployed in Utah for Winter Olympics Now collects information on 70% of doctors visits in Utah Since the anthrax attacks, RODS has received lots of funding

Project Goals Build a prototype data dissemination system that uses all three basic data dissemination methods appropriately Supported by an NSF grant from ANIR program Joint work with Panos Chysanthis and Vincenzo Liberatore Study the interesting data management problems that arise in such a system Supported by an NSF grant from CCR program

Middleware Architecture and Data Management Issues Application Layer Multicast Transport Layer, e.g. Java Reliable Multicast Document Selection Multicast Push Scheduling Indexing Caching Server Side Client Side Multicast Pull Scheduling Unicast Pull Scheduling

Rest of the Talk: Multicast Pull Scheduling From Newsweek magazine From

Simple Example Instance of Multicast Pull Scheduling Schedule Average response time = ( )/4 Input

Standard Worst-case Algorithm Analysis Technique Competitive ratio of algorithm A is max I A(I)/Opt(I)  A(I) is the average response time on input I using algorithm A  Opt(I) is the average response time for the optimal schedule For example, a 2-competitive algorithm A guarantees that it will produce a schedule with average response time at most twice of the optimal average response time

Warm-up Problem: Unit Sized Documents Obvious Algorithm? Most Requests First (MRF): Broadcast the document with the most requests Surprisingly, MRF has unbounded competitive ratio (proof next slide) Moral: Multicast pull scheduling is trickier than it might first appear

Most Requests First (MRF) is not an O(1)-competitive algorithm MRF Optimal Average response time ~ n Average response time ~ 1 Input n jobs n 2 jobs

There is no O(1)-competitive Online Server Scheduling Algorithm for Multicast Pull Input Online schedule Optimal schedule n jobs n 2 jobs Average response time ~ n Average response time ~ 1

Resource Augmentation Analysis Compare the limited (e.g. online) algorithm with more resources (e.g. a faster processor or more processors) to the optimal algorithm with less resources Online algorithm A is s-speed c-competitive if max I A s (I)/Opt 1 (I) < c Subscript denotes processor speed Example: A 2-speed 3-competitive algorithm equipped with a speed 2 processor guarantees an average response time at most 3 times the optimal average response time for a 1 speed processor

Classic Server QoS Curves Average response time Low load High load Fast processor Slow Processor Online Optimal Online is not O(1)-competitive Online is O(1)-speed O(1)-competitive

Old Chinese Saying: Three blind shoemakers are better than one politician

Most Requests First (MRF) is not an O(1)-competitive algorithm MRF Optimal Average response time ~ n Average response time ~ 1 O(1)-speed Input n jobs n 2 jobs

The Power of the Adversary in Multicast Pull Scheduling Recall general lower bound instance Intuition: The adversary forces the online algorithm to labor on sequential work

Definition of Parallel and Sequential Work Rate work is completed Rate work is completed Processing power devoted to the work low high low high Processing power devoted to the work Parallel work Sequential work

Another Application Where Sequential Work Arises: Scheduling Jobs on a Multi-Processor Sequential work Parallel work Input One Possible Optimal Schedule P1P1 P2P2 Average response time = ( )/3

The Main Result to Date (with Jeff Edmonds) A method to construct a multicast pull scheduling algorithm B from a nonclairvoyant unicast scheduling algorithm A. If algorithm A is an s-speed c-competitive algorithm when jobs have parallel and sequential components, then B is a (2+ ε )s-speed c- competitive Formalizes the surprising insight that the difficulty of multicast pull scheduling = the difficulty of unicast scheduling of jobs with parallel and sequential components

Constructing the Multicast Pull Algorithm B from the Unicast Algorithm A Multicast pull input Unicast input Algorithm A’s unicast schedule Algorithm B’s multicast pull schedule

Equipoise Algorithm for Unicast Scheduling of Jobs with Parallel and Sequential Components Equipoise (Round Robin) transmits each file at the same rate. Edmonds (1999) showed that the algorithm Equipoise is a (2+ ε )-speed O(1 + 1/ε)- competitive algorithm

The BEquipoise Multicast Pull Algorithm BEquipoise broadcasts each document at a rate proportional to the number of requests to that document The algorithm BEquipoise is a (4+ε)- speed O(1 + 1/ε)-competitive algorithm BEquipoise will work reasonably well if the server load < ¼ Bequipoise is not an 2-speed O(1)- competitive algorithm

Possible O(1)-competitive (1+ε)-speed Algorithms ? Unit sized files: Longest Wait First (LWF): Send out the document where the sum of the ages of the outstanding requests is maximized Arbitrary sized files: Longest Total Stretch First (LTSF): Send out the document where the sum of the ages of the outstanding requests, divided by the file size, is maximized Appear to be the current experimental champions (Acharya and Muthukrishnan)

Future Directions Are LWF and LTSF (1 + ε)-speed O(1 + 1/ε)-competitive algorithms for multicast pull scheduling? If not, is an (1 + ε)-speed O(1 + 1/ε)-competitive algorithm possible?  One possibility is to find an (1 + ε)-speed O(1 + 1/ε)- competitive algorithm for unicast scheduling of jobs with arbitrary speed-up curves, and to remove the factor of two in the speed in our reduction. Is there an O(1)-competitive polynomial-time offline algorithm for multicast pull scheduling? The problems are known to be NP-hard (Erlebach and Hall) Open for both the case of unit sized files and arbitrary sized files

Every talk has to have a Dilbert cartoon.