1 Pricing Cloud Bandwidth Reservations under Demand Uncertainty Di Niu, Chen Feng, Baochun Li Department of Electrical and Computer Engineering University.

Slides:



Advertisements
Similar presentations
Simultaneous Routing and Resource Allocation in Wireless Networks Mikael Johansson Signals, Sensors and Systems, KTH Joint work with Lin Xiao and Stephen.
Advertisements

Truthful Spectrum Auction Design for Secondary Networks Yuefei Zhu ∗, Baochun Li ∗ and Zongpeng Li † ∗ Electrical and Computer Engineering, University.
Hadi Goudarzi and Massoud Pedram
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Fast Algorithms For Hierarchical Range Histogram Constructions
A Prior-Free Revenue Maximizing Auction for Secondary Spectrum Access Ajay Gopinathan and Zongpeng Li IEEE INFOCOM 2011, Shanghai, China.
Dimensionality reduction. Outline From distances to points : – MultiDimensional Scaling (MDS) Dimensionality Reductions or data projections Random projections.
Dimensionality Reduction PCA -- SVD
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Price-based Resource Allocation in Wireless Ad Hoc Networks Yuan Xue, Baochun Li and Klara Nahrstedt University of Illinois at Urbana-Champaign University.
1 EL736 Communications Networks II: Design and Algorithms Class8: Networks with Shortest-Path Routing Yong Liu 10/31/2007.
Decomposable Optimisation Methods LCA Reading Group, 12/04/2011 Dan-Cristian Tomozei.
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
A Payment-based Incentive and Service Differentiation Mechanism for P2P Streaming Broadcast Guang Tan and Stephen A. Jarvis Department of Computer Science,
Tracking using the Kalman Filter. Point Tracking Estimate the location of a given point along a sequence of images. (x 0,y 0 ) (x n,y n )
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
Rethinking Internet Traffic Management: From Multiple Decompositions to a Practical Protocol Jiayue He Princeton University Joint work with Martin Suchara,
Online Auctions in IaaS Clouds: Welfare and Profit Maximization with Server Costs Xiaoxi Zhang 1, Zhiyi Huang 1, Chuan Wu 1, Zongpeng Li 2, Francis C.M.
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
Multipath Protocol for Delay-Sensitive Traffic Jennifer Rexford Princeton University Joint work with Umar Javed, Martin Suchara, and Jiayue He
Neural Networks Lecture 8: Two simple learning algorithms
Resource Placement and Assignment in Distributed Network Topologies Accepted to: INFOCOM 2013 Yuval Rochman, Hanoch Levy, Eli Brosh.
Presented By Wanchen Lu 2/25/2013
Bargaining Towards Maximized Resource Utilization in Video Streaming Datacenters Yuan Feng 1, Baochun Li 1, and Bo Li 2 1 Department of Electrical and.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jennifer Rexford Princeton University With Jiayue He, Rui Zhang-Shen, Ying Li,
1 IEEE Trans. on Smart Grid, 3(1), pp , Optimal Power Allocation Under Communication Network Externalities --M.G. Kallitsis, G. Michailidis.
NOBEL WP Szept Stockholm Game Theory in Inter-domain Routing LÓJA Krisztina - SZIGETI János - CINKLER Tibor BME TMIT Budapest,
A Distributed Framework for Correlated Data Gathering in Sensor Networks Kevin Yuen, Ben Liang, Baochun Li IEEE Transactions on Vehicular Technology 2008.
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
An Online Auction Framework for Dynamic Resource Provisioning in Cloud Computing Weijie Shi*, Linquan Zhang +, Chuan Wu*, Zongpeng Li +, Francis C.M. Lau*
1 Min-Cost Live Webcast under Joint Pricing of Data, Congestion and Virtualized Servers Rui Zhu 1, Di Niu1, Baochun Li 2 1 Department of Electrical and.
EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.
© 2009 IBM Corporation 1 Improving Consolidation of Virtual Machines with Risk-aware Bandwidth Oversubscription in Compute Clouds Amir Epstein Joint work.
1 Optimization Based Power Generation Scheduling Xiaohong Guan Tsinghua / Xian Jiaotong University.
MAP: Multi-Auctioneer Progressive Auction in Dynamic Spectrum Access Lin Gao, Youyun Xu, Xinbing Wang Shanghai Jiaotong University.
Scalable Multi-Class Traffic Management in Data Center Backbone Networks Amitabha Ghosh (UtopiaCompression) Sangtae Ha (Princeton) Edward Crabbe (Google)
EE 685 presentation Utility-Optimal Random-Access Control By Jang-Won Lee, Mung Chiang and A. Robert Calderbank.
A Study of Central Auction Based Wholesale Electricity Markets S. Ceppi and N. Gatti.
The Only Constant is Change: Incorporating Time-Varying Bandwidth Reservations in Data Centers Di Xie, Ning Ding, Y. Charlie Hu, Ramana Kompella 1.
Linear Programming Erasmus Mobility Program (24Apr2012) Pollack Mihály Engineering Faculty (PMMK) University of Pécs João Miranda
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jiayue He, Rui Zhang-Shen, Ying Li, Cheng-Yen Lee, Jennifer Rexford, and Mung.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
CIA Annual Meeting LOOKING BACK…focused on the future.
Overview of Optimization in Ag Economics Lecture 2.
2) Combinatorial Algorithms for Traditional Market Models Vijay V. Vazirani.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Lecture 2: Statistical learning primer for biologists
1  Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Review of statistical modeling and probability theory Alan Moses ML4bio.
Searching a Linear Subspace Lecture VI. Deriving Subspaces There are several ways to derive the nullspace matrix (or kernel matrix). ◦ The methodology.
Optimization-based Cross-Layer Design in Networked Control Systems Jia Bai, Emeka P. Eyisi Yuan Xue and Xenofon D. Koutsoukos.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
CS 9633 Machine Learning Support Vector Machines
LINEAR CLASSIFIERS The Problem: Consider a two class task with ω1, ω2.
Near-Optimal Spectrum Allocation for Cognitive Radios: A Frequency-Time Auction Perspective Xinyu Wang Department of Electronic Engineering Shanghai.
Deep Feedforward Networks
Probability Theory and Parameter Estimation I
Adaptive Cloud Computing Based Services for Mobile Users
Cross layer design is wireless multi-hop network
Clustering (3) Center-based algorithms Fuzzy k-means
Latent Variables, Mixture Models and EM
Elasticity Considerations for Optimal Pricing of Networks
The following slides are taken from:
Recursively Adapted Radial Basis Function Networks and its Relationship to Resource Allocating Networks and Online Kernel Learning Weifeng Liu, Puskal.
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
ACHIEVEMENT DESCRIPTION
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
Towards Predictable Datacenter Networks
Presentation transcript:

1 Pricing Cloud Bandwidth Reservations under Demand Uncertainty Di Niu, Chen Feng, Baochun Li Department of Electrical and Computer Engineering University of Toronto

2 Roadmap Part 1 A cloud bandwidth reservation model Part 2 Price such reservations Large-scale distributed optimization Part 3 Trace-driven simulations Part 1 A cloud bandwidth reservation model

3 Cloud Tenants WWWWWW Problem: No bandwidth guarantee Not good for Video-on-Demand, transaction processing web applications, etc.

4 Days012 Demand 10 Gbps Dedicated Network Amazon Cluster Compute Bandwidth Over-provision

5 H. Ballani, et al. Towards Predictable Datacenter Networks ACM SIGCOMM ‘11 C. Guo, et al. SecondNet: a Data Center Network Virtualization Architecture with Bandwidth Guarantees ACM CoNEXT ‘10 Good News: Bandwidth reservations are becoming feasible between a VM and the Internet

6 Reservation Days012 Bandwidth Demand reduces cost due to better utilization Dynamic Bandwidth Reservation Difficulty: tenants don’t really know their demand!

7 A New Bandwidth Reservation Service A tenant specifies a percentage of its bandwidth demand to be served with guaranteed performance; The remaining demand will be served with best effort Bandwidth Reservation Tenant CloudProvider DemandPrediction Workload history of the tenant GuaranteedPortion (e.g., 95%) QoSLevel repeated periodically

8 Tenant Demand Model Each tenant i has a random demand D i Assume D i is Gaussian, with mean μ i = E [ D i ] variance σ i 2 = var [ D i ] covariance matrix Σ = [σ ij ] Service Level Agreement: Outage w.p.

9 Roadmap Part 1 A cloud bandwidth reservation model Part 2 Price such reservations Large-scale distributed optimization Part 3 Trace-driven simulations

10 Objective 1: Pricing the reservations A reservation fee on top of the usage fee Objective 2: Resource Allocation Price affects demand, which affects price in turn Social Welfare Maximization Objectives

11 Tenant i can specify a guaranteed portion w i Tenant i ’s expected utility (revenue) Concave, twice differentiable, increasing Utility depends not only on demand, but also on the guaranteed portion! Tenant Utility (e.g., Netflix)

12 Bandwidth Reservation Given submitted guaranteed portions the cloud will guarantee the demands Non-multiplexing: Multiplexing: Service cost e.g. It needs to reserve a total bandwidth capacity

13 Cloud Objective: Social Welfare Maximization Social Welfare Impossible: the cloud does not know U i Surplus of tenant i Profit of the Cloud Provider Price

14 Surplus (Profit) Pricing function Under P i ( ⋅ ), tenant i will choose Price guaranteed portion, not absolute bandwidth! Example: Linear pricing Pricing Function

15 Pricing as a Distributed Solution Challenge: Cost not decomposable for multiplexing Cost not decomposable for multiplexing Surpluswhere Social Welfare Determine pricing policy to

A Simple Case: Non- Multiplexing Determine pricing policy to where Mean StdSince, for Gaussian

17 The General Case: Lagrange Dual Decomposition M. Chiang, S. Low, A. Calderbank, J. Doyle. Layering as optimization decomposition: A mathematical theory of network architectures. Proc. of IEEE 2007 Lagrange dual Dual problem Original problem

18 Lagrange multiplier k i as price: P i (w i ) := k i w i Lagrange dual Dual problem Subgradient Algorithm: a subgradient of For dual minimization, update price: decompose

19 Weakness of the Subgradient Method Social Welfare (SW) Surplus Tenant i Cloud Provider... Tenant 1 Tenant N Step size is a issue! Convergence is slow. 22 Price 11 Guaranteed Portion Update to increase

20 Our Algorithm: Equation Updates Tenant i Cloud Provider Set Solve KKT Conditions of Linear pricing P i (w i ) = k i w i suffices!

21 Theorem 1 (Convergence) Equation updates converge if for all i for all betweenand

22 Convergence: A Single Tenant (1-D) Subgradient method Equation Updates Not converging

23 The Case of Multiplexing Covariance matrix: symmetric, positive semi-definite is a cone centered at 0 Satisfies Theorem 1, algorithm converges. and is small is not zero if

24 Roadmap Part 1 A cloud bandwidth reservation model Part 2 Price such reservations Large-scale distributed optimization Part 3 Trace-driven simulations

25 Data Mining: VoD Demand Traces 200+ GB traces (binary) from UUSee Inc. reports from online users every 10 minutes Aggregate into video channels

26 Bandwidth (Mbps) Predict Expected Demand via Seasonal ARIMA Time periods (1 period = 10 minutes)

27 Time periods (1 period = 10 minutes) Mbps Predict Demand Variation via GARCH

28 Prediction Results Each tenant i has a random demand D i in each “10 minutes” D i is Gaussian, with mean μ i = E [ D i ] variance σ i 2 = var [ D i ] covariance matrix Σ = [σ ij ]

29 Dimension Reduction via PCA A channel’s demand = weighted sum of factors Find factors using Principal Component Analysis (PCA) Predict factors first, then each channel

30 Time periods (1 period = 10 minutes) Bandwidth (Mbps) 3 Biggest Channels of 452 Channels

31 Time periods (1 period = 10 minutes) Mbps The First 3 Principal Components

32 Number of principal components 98% 8 components Complexity Reduction: 452 channels 8 components Data Variance Explained

33 Pricing: Parameter Settings Utility of tenant i (conservative estimate) Linear revenue Reputation loss for demand not guaranteed Usage of tenant i: w.h.p.

34 CDF Convergence Iteration of the Last Tenant Mean = 6 rounds Mean = 158 rounds 100 tenants (channels), 81 time periods (81 x 10 Minutes)

35 Related Work Primal/Dual Decomposition [Chiang et al. 07] Contraction Mapping x := T(x) D. P. Bertsekas, J. Tsitsiklis, "Parallel and distributed computation: numerical methods" Game Theory [Kelly 97] Each user submits a price (bid), expects a payoff Equilibrium may or may not be social optimal Time Series Prediction HMM [Silva 12], PCA [Gürsun 11], ARIMA [Niu 11]

36 Conclusions A cloud bandwidth reservation model based on guaranteed portions Pricing for social welfare maximization Future work: new decomposition and iterative methods for very large-scale distributed optimization more general convergence conditions

37 Thank you Di Niu Department of Electrical and Computer Engineering University of Toronto

38

39 RMSE (Mbps) in Log Scale Channel Index Root mean squared errors (RMSEs) over 1.25 days

40 Optimal Pricing when each tenant requires w i ≡ 1 Correlation to the market, in [-1, 1] ExpectedDemand Demand Standard Deviation With multiplexing, Without multiplexing,

41 Histogram of Price Discounts due to Multiplexing Discounts of All Tenants in All Test Periods Counts mean discount 44% total cost saving 35% Risk neutralizers Majority

42 Aggregate bandwidth (Mbps) Video Channel: F190E Time periods (one period = 10 minutes)