Instructor: Shengyu Zhang 1. Offline algorithms Almost all algorithms we encountered in this course assume that the entire input is given all at once.

Slides:

Advertisements

Similar presentations

A Primal-Dual Approach to Online Optimization Problems.

Advertisements

Lecturer: Moni Naor Algorithmic Game Theory Uri Feige Robi Krauthgamer Moni Naor Lecture 8: Regret Minimization.

Analysis of Algorithms

Prompt Mechanisms for Online Auctions Speaker: Shahar Dobzinski Joint work with Richard Cole and Lisa Fleischer.

1 Dynamic Programming Week #4. 2 Introduction Dynamic Programming (DP) –refers to a collection of algorithms –has a high computational complexity –assumes.

Course: Applications of Information Theory to Computer Science CSG195, Fall 2008 CCIS Department, Northeastern University Dimitrios Kanoulas.

Online Algorithms Amrinder Arora Permalink:

Online learning, minimizing regret, and combining expert advice

Follow the regularized leader

Machine Learning Theory Machine Learning Theory Maria Florina Balcan 04/29/10 Plan for today: - problem of “combining expert advice” - course retrospective.

Institute of Computer Science University of Wroclaw Geometric Aspects of Online Packet Buffering An Optimal Randomized Algorithm for Two Buffers Marcin.

A New Understanding of Prediction Markets via No-Regret Learning.

1 Fast Primal-Dual Strategies for MRF Optimization (Fast PD) Robot Perception Lab Taha Hamedani Aug 2014.

Self-Organizing Lists1 Self-Organizing Lists Competitive Analysis Lee Wee Sun SoC Modified by Orgad Keller Modified again by.

Online Algorithms, Linear Programming, and the k-Server Problem Seffi Naor Computer Science Dept. Technion Approximation Algorithms: The Last Decade and.

1 Ecole Polytechnque, Nov 7, 2007 Scheduling Unit Jobs to Maximize Throughput Jobs:  all have processing time (length) = 1  release time r j  deadline.

Matroids, Secretary Problems, and Online Mechanisms Nicole Immorlica, Microsoft Research Joint work with Robert Kleinberg and Moshe Babaioff.

Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.

Evaluation of Algorithms for the List Update Problem Suporn Pongnumkul R. Ravi Kedar Dhamdhere.

Competitive Analysis of Incentive Compatible On-Line Auctions Ron Lavi and Noam Nisan SISL/IST, Cal-Tech Hebrew University.

A General Approach to Online Network Optimization Problems Seffi Naor Computer Science Dept. Technion Haifa, Israel Joint work: Noga Alon, Yossi Azar,

Approximation Algorithms: Bristol Summer School 2008 Seffi Naor Computer Science Dept. Technion Haifa, Israel TexPoint fonts used in EMF. Read the TexPoint.

Online Function Tracking with Generalized Penalties Marcin Bieńkowski Institute of Computer Science, University of Wrocław, Poland Stefan Schmid Deutsche.

Experts Learning and The Minimax Theorem for Zero-Sum Games Maria Florina Balcan December 8th 2011.

Commitment without Regrets: Online Learning in Stackelberg Security Games Nika Haghtalab Carnegie Mellon University Joint work with Maria-Florina Balcan,

CSCI 3160 Design and Analysis of Algorithms Tutorial 12

1/24 Algorithms for Generalized Caching Nikhil Bansal IBM Research Niv Buchbinder Open Univ. Israel Seffi Naor Technion.

How the Experts Algorithm Can Help Solve LPs Online Marco Molinaro TU Delft Anupam Gupta Carnegie Mellon University.

Primal-Dual Meets Local Search: Approximating MST’s with Non-uniform Degree Bounds Author: Jochen Könemann R. Ravi From CMU CS 3150 Presentation by Dan.

Fair Real-time Traffic Scheduling over Wireless Local Area Networks Insik Shin Joint work with M. Adamou, S. Khanna, I. Lee, and S. Zhou Dept. of Computer.

Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.

The Multiplicative Weights Update Method Based on Arora, Hazan & Kale (2005) Mashor Housh Oded Cats Advanced simulation methods Prof. Rubinstein.

Why is bin packing interesting?

An Online Auction Framework for Dynamic Resource Provisioning in Cloud Computing Weijie Shi*, Linquan Zhang +, Chuan Wu*, Zongpeng Li +, Francis C.M. Lau*

Trust-Aware Optimal Crowdsourcing With Budget Constraint Xiangyang Liu 1, He He 2, and John S. Baras 1 1 Institute for Systems Research and Department.

1 Online algorithms Typically when we solve problems and design algorithms we assume that we know all the data a priori. However in many practical situations.

Auctions for Digital Goods Ali Echihabi University of Waterloo – Nov 2004.

. CLASSES RP AND ZPP By: SARIKA PAMMI. CONTENTS:  INTRODUCTION  RP  FACTS ABOUT RP  MONTE CARLO ALGORITHM  CO-RP  ZPP  FACTS ABOUT ZPP  RELATION.

1 Online algorithms Typically when we solve problems and design algorithms we assume that we know all the data a priori. However in many practical situations.

Data Stream Algorithms Ke Yi Hong Kong University of Science and Technology.

Jennifer Campbell November 30,  Problem Statement and Motivation  Analysis of previous work  Simple - competitive strategy  Near optimal deterministic.

Online Social Networks and Media

Connections between Learning Theory, Game Theory, and Optimization Maria Florina (Nina) Balcan Lecture 2, August 26 th 2010.

How the Experts Algorithm Can Help Solve LPs Online Marco Molinaro TU Delft Anupam Gupta Carnegie Mellon University.

A polylog competitive algorithm for the k-server problem Nikhil Bansal (IBM) Niv Buchbinder (Open Univ.) Aleksander Madry (MIT) Seffi Naor (Technion)

Frequency Capping in Online Advertising Moran Feldman Technion Joint work with: Niv Buchbinder,The Open University of Israel Arpita Ghosh,Yahoo! Research.

Instructor: Shengyu Zhang 1. Secretary hiring problem 2.

Experts and Multiplicative Weights slides from Avrim Blum.

Non-Preemptive Buffer Management for Latency Sensitive Packets Moran Feldman Technion Seffi Naor Technion.

Matroids, Secretary Problems, and Online Mechanisms Nicole Immorlica, Microsoft Research Joint work with Robert Kleinberg and Moshe Babaioff.

COMP 2208 Dr. Long Tran-Thanh University of Southampton Reinforcement Learning.

1 Online algorithms 叶德仕 2 Online & offline Traditional theoretical analysis is concerned with off-line problems where the complete.

REU 2009-Traffic Analysis of IP Networks Daniel S. Allen, Mentor: Dr. Rahul Tripathi Department of Computer Science & Engineering Data Streams Data streams.

Instructor: Shengyu Zhang 1. Optimization Very often we need to solve an optimization problem.  Maximize the utility/payoff/gain/…  Minimize the cost/penalty/loss/…

Approximation Algorithms based on linear programming.

On-Line Algorithms in Machine Learning By: WALEED ABDULWAHAB YAHYA AL-GOBI MUHAMMAD BURHAN HAFEZ KIM HYEONGCHEOL HE RUIDAN SHANG XINDI.

Unconstrained Submodular Maximization Moran Feldman The Open University of Israel Based On Maximizing Non-monotone Submodular Functions. Uriel Feige, Vahab.

Online Learning Model. Motivation Many situations involve online repeated decision making in an uncertain environment. Deciding how to invest your money.

Carnegie Mellon University

Randomness and Computation

Better Algorithms for Better Computers

Hidden Markov Models Part 2: Algorithms

Q4 : How does Netflix recommend movies?

Online Learning Kernels

Dynamic and Online Algorithms for Set Cover

The loss function, the normal equation,

Online Algorithms via Projections set cover, paging, k-server

Mathematical Foundations of BME Reza Shadmehr

Dynamic and Online Algorithms:

Presentation transcript:

Instructor: Shengyu Zhang 1

Offline algorithms Almost all algorithms we encountered in this course assume that the entire input is given all at once. An exception: Secretary problem.  The input is given gradually.  We need to respond to each candidate in time.  We care about our performance compared to the best one in hindsight. 2

Online algorithms The input is revealed in parts. An online algorithm needs to respond to each part (of the input) upon its arrival. The responding actions cannot be canceled/revoked later. We care about the competitive ratio, which compares the performance of an online algorithm to that of the best offline algorithm.  Offline: the entire input is given beforehand. 3

Ski rental 4

Offline algorithm 5

Deterministic algorithm 6

Randomized algorithm 7

Integer programming 8

Solution 9

Relaxation 10

The relaxation doesn’t lose anything 11

Dual LP 12

13

Property 1 14

Property 1 15

16

Property 2 17

18

Turning into an online algorithm 19 … rentbuy

Expected cost 20

21

22

Optimality: Both deterministic and randomized algorithms are optimal.  No better competitive ratio is possible. Reference: The design of competitive online algorithms via a primal dual approach, Niv Buchbinder and Joseph Naor, Foundations and Trends in Theoretical Computer Science, Vol. 3, pp , Next: Another learning algorithm 23

Stock market Simplification: Only consider up or down. 24

Which expert to follow? 25

Which expert to follow? Each day, stock market goes up or down. At the end of the day, we’ll see whether the market actually goes up or down. We lose 1 if our prediction was wrong. 26

After a year, we’ll see with hindsight that one expert is the best.  But, of course, we don’t know who in advance. We’ll think “If we had followed his advice…” Theorem: We have a method to perform close to the best expert!  We don’t assume anything about the experts. They may not know what they are talking about. They may even collaborate in any bad manner. 27

Method and intuition 28

Randomized Weighted Majority Decrease your weight! Probability is proportional to weight 29

Example (n=5, T=6, ε = 1/4) 12345ourreal 1 1, ↑ 1, ↓1, ↑1, ↓↑↑ 2 1, ↑1, ↓0.75, ↑1, ↑0.75, ↑↑↑ 3 1, ↑0.75, ↑0.75, ↓1, ↓0.75, ↑↓↓ , ↑0.75, ↓ , ↑↑↓ , ↓0.4219, ↑0.75, ↑0.75, ↓0.4219, ↓↓↑ , ↑ 0.75, ↓0.5625, ↑0.3164, ↑↓↓ loss Numbers: weight Arrows: predications. Red: wrong. 30

31

Proof 32

33

34

Extensions 35

Summary Online algorithms:  The input is revealed in parts.  We need to respond to each part upon its arrival.  The responding actions cannot be revoked later. competitive ratio: performance of an online algorithm vs. performance of the best offline algorithm. Primal-dual method. Multiplicative weight update method. 36