Satisfaction Games in Graphical Multi-resource Allocation

Slides:

Advertisements

Similar presentations

Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.

Advertisements

Joint Strategy Fictitious Play Sherwin Doroudi. “Adapted” from J. R. Marden, G. Arslan, J. S. Shamma, “Joint strategy fictitious play with inertia for.

ECO290E: Game Theory Lecture 9 Subgame Perfect Equilibrium.

Strategic Network Formation and Group Formation Elliot Anshelevich Rensselaer Polytechnic Institute (RPI)

An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.

A Game Theoretic Approach to Provide Incentive and Service Differentiation in P2P Networks John C.S. Lui The Chinese University of Hong Kong Joint work.

Dynamic Spectrum Management: Optimization, game and equilibrium Tom Luo (Yinyu Ye) December 18, WINE 2008.

Beyond selfish routing: Network Formation Games. Network Formation Games NFGs model the various ways in which selfish agents might create/use networks.

Autonomous Target Assignment: A Game Theoretical Formulation Gurdal Arslan & Jeff Shamma Mechanical and Aerospace Engineering UCLA AFOSR / MURI.

Selfish Caching in Distributed Systems: A Game-Theoretic Analysis By Byung-Gon Chun et al. UC Berkeley PODC’04.

Nov 2003Group Meeting #2 Distributed Optimization of Power Allocation in Interference Channel Raul Etkin, Abhay Parekh, and David Tse Spectrum Sharing.

A Scalable Network Resource Allocation Mechanism With Bounded Efficiency Loss IEEE Journal on Selected Areas in Communications, 2006 Johari, R., Tsitsiklis,

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Congestion Games, Potential Games and Price of Anarchy Liad Blumrosen ©

1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:

A Projection Framework for Near- Potential Polynomial Games Nikolai Matni Control and Dynamical Systems, California.

By: Gang Zhou Computer Science Department University of Virginia 1 A Game-Theoretic Framework for Congestion Control in General Topology Networks SYS793.

Efficiency Loss in a Network Resource Allocation Game Paper by: Ramesh Johari, John N. Tsitsiklis [ Informs] Presented by: Gayatree Ganu.

Chapter 9 Games with Imperfect Information Bayesian Games.

NOBEL WP Szept Stockholm Game Theory in Inter-domain Routing LÓJA Krisztina - SZIGETI János - CINKLER Tibor BME TMIT Budapest,

Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.

Mohammadreza Ataei Instructor : Prof. J.Omidi. 2.

Transit price negotiation: repeated game approach Sogea 23 Mai 2007 Nancy, France D.Barth, J.Cohen, L.Echabbi and C.Hamlaoui

ECE559VV – Fall07 Course Project Presented by Guanfeng Liang Distributed Power Control and Spectrum Sharing in Wireless Networks.

Mazumdar Ne X tworking’03 June 23-25,2003, Chania, Crete, Greece The First COST-IST(EU)-NSF(USA) Workshop on EXCHANGES & TRENDS IN N ETWORKING 1 Non-convex.

Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.

Dynamic Games & The Extensive Form

MAP: Multi-Auctioneer Progressive Auction in Dynamic Spectrum Access Lin Gao, Youyun Xu, Xinbing Wang Shanghai Jiaotong University.

1 The Price of Defense M. Mavronicolas , V. Papadopoulou , L. Michael ¥, A. Philippou , P. Spirakis § University of Cyprus, Cyprus  University of Patras.

A Study of Central Auction Based Wholesale Electricity Markets S. Ceppi and N. Gatti.

Extensive Games with Imperfect Information

Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.

Ásbjörn H Kristbjörnsson1 The complexity of Finding Nash Equilibria Ásbjörn H Kristbjörnsson Algorithms, Logic and Complexity.

Information Theory for Mobile Ad-Hoc Networks (ITMANET): The FLoWS Project Competitive Scheduling in Wireless Networks with Correlated Channel State Ozan.

Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.

Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish users (i.e., players) strategically interact in using.

1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.

Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp ,

Hedonic Clustering Games Moran Feldman Joint work with: Seffi Naor and Liane Lewin-Eytan.

Multicast Recipient Maximization in IEEE j WiMAX Relay Networks Wen-Hsing Kuo † ( 郭文興 ) & Jeng-Farn Lee ‡ ( 李正帆 ) † Department of Electrical Engineering,

Chapter 6 Extensive Form Games With Perfect Information (Illustrations)

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

ECO290E: Game Theory Lecture 10 Examples of Dynamic Games.

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Game theory basics A Game describes situations of strategic interaction, where the payoff for one agent depends on its own actions as well as on the actions.

Near-Optimal Spectrum Allocation for Cognitive Radios: A Frequency-Time Auction Perspective Xinyu Wang Department of Electronic Engineering Shanghai.

Algorithmic Game Theory and Internet Computing

Communication Complexity as a Lower Bound for Learning in Games

Econ 805 Advanced Micro Theory 1

Aspiration-based Learning

Network Creation Game A. Fabrikant, A. Luthra, E. Maneva,

Structured Models for Multi-Agent Interactions

Unit 4 SOCIAL INTERACTIONS.

Effective Social Network Quarantine with Minimal Isolation Costs

Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 2 Bayesian Games Zhu Han, Dusit Niyato, Walid Saad, Tamer.

Game Theory Chapter 12.

Presented By Aaron Roth

Network Formation Games

Games with Imperfect Information Bayesian Games

Oliver Schulte Petra Berenbrink Simon Fraser University

Selfish Load Balancing

EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF INDUSTRIAL ENGINEERING IENG314 OPERATIONS RESEARCH II SAMIR SAMEER ABUYOUSSEF

Molly W. Dahl Georgetown University Econ 101 – Spring 2009

Computing a Nash Equilibrium of a Congestion Game: PLS-completeness

Network Formation Games

Markov Decision Processes

Markov Decision Processes

Lecture 8 Nash Equilibrium

Blockchain Mining Games

Presentation transcript:

Satisfaction Games in Graphical Multi-resource Allocation Sun Ruijia 5110309622

Outline Motivation System model Convergence of Nash Equilibria Simulation Conclusion

Outline Motivation System model Convergence of Nash Equilibria Simulation Conclusion

Motivation Communication system Resource allocation limited users In communication system, resource allocation is always the fundamental problem – the limited resources and the increasing demand. How to allocate the resources efficiently among so many users to maximize the total utility has always been on focus. Typically there are two different approaches. The first one is the centralized manner while the other depends on the autonomous users or entities, which is called the distributed manner. Centralized manner Distributed manner Maximize the total utility

Motivation Centralized manner Head operator entities resources If there is a leader, every thing will work much better. This is the advantages of centralized manner. In a centralized manner, there is a head operator controls the whole system. The operator collects almost all information about every entities within the system. It will analyze the current situation and comes out a best result. Then the operator will make plans and allocate the resources to each entities to reach his best result. This approach seems quite perfect, right? However in practice, centralized manner is often difficult to realize. Firstly, due to the heterogeneity of users, the operator needs gather massive amounts of information to perform the optimization. Secondly, finding the system-wide optimal solution is usually NP hard. Thus this approach is sometimes not suitable. Best results Massive amounts of information NP hard

Motivation Distributed manner users make decisions for itself Convergence Price of anarchy In a distributed manner, however, such problems can be avoided. In a distributed manner, each user makes the resource allocation decision locally to meet its own demand, while taking the network dynamics and other users’ actions into consideration. This approach is flexible and particularly suitable. But the distributed manner also has its own problem, that is, the convergence and the price of anarchy. To analyze these issue, models in game theory have been utilized, such as the congestion games, potential games. We have different game models for different situations. Game theory Congestion games, potential games…

Motivation QoS Satisfaction game Multi-resource allocation games Listen to music Read e-book Browse the webpages Watch the video … QoS Satisfaction game Multi-resource allocation games Object replication Nowadays everybody has a smartphone right? We may use it to listen to music, read e-books, browse the webpages or watch the videos. Our demand for QoS varies according to what we use it for, that is, the QoS for watching a video must be higher than the QoS for browsing a webpage. It is also obvious that we don’t need the QoS to be as high as possible. (For example, when listening to music, once the music can be played fluently, the listener will be satisfied and doesn’t need a higher QoS.) Thus, the model of satisfaction game is proposed in <Quality of Service Games for Spectrum Sharing>. In this paper, only single channel allocation is considered. But in practice, cases exist about multi-resource allocation problems. Such as object replication, where each node(entities) can has a storage capacity Ki which it uses to replicate objects locally. Thus we need to extend the original satisfaction games into multi-resource allocation model. Satisfaction multi-resource allocation games

Outline Motivation System model Convergence of Nash Equilibria Simulation Conclusion

System model N nodes (entities) want to compete for R resources. Nodes can be represented by an interference graph according to their locations and each node has a set of neighbors. The more nodes in neighbor set of node i choose the same resource, the less the QoS received by node i is. Each node has a demand QoS. Once the QoS is larger than or equal to the demand, the user is satisfied. Each node can collect more than one resource.

System model N nodes (entities) want to compete for R resources. Nodes can be represented by an interference graph according to their locations and each node has a set of neighbors. The more nodes in neighbor set of node i choose the same resource, the less the QoS received by node i is. Each node has a demand QoS. Once the QoS is larger than or equal to the demand, the user is satisfied. Each node can collect more than one resource.

System model (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾, 𝒜) 𝒦={𝐾(𝑛)|𝑛∈𝒩} , 𝐾(𝑖) is the number of resources node 𝑛 needs to allocate 𝐷 𝑛 ≥0 The QoS demand of node n ℛ= 𝟎,1,2,…𝑅 the set of resources (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾, 𝒜) 𝒢= 𝑁,𝐸 : interference graph QoS received by user n who has selected resource 𝑟 𝒩={1,2,3,…,𝑁} the set of nodes Strategy profile: 𝒜={ 𝐴 𝑛 |𝑛∈𝒩}, where 𝐴 𝑛 ={ 𝑎 𝑛 𝑟 |𝑟∈ℛ} Indicator 𝑎 𝑛 𝑟 = 1, if resource 𝑟 is allocated by node 𝑛 0, otherwise

System model (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) 𝒢= 𝑁,𝐸 (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) 𝒢= 𝑁,𝐸 𝑁 𝑛 : the neighbor of node 𝑛, who will interfere with node 𝑖 if they collect the same resource 2 1 3 4 Node 3 and 2 are the neighbors of node 1 while node 4 is not.

System model (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ Congestion level: the number of node in the set of n’s neighbors choosing the resource r QoS 𝐼 𝑛 𝑟

System model (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ Congestion level: the number of node in the set of n’s neighbors choosing the resource r QoS 𝐼 𝑛 𝑟 = 𝑛∈𝒩 𝑛 : 𝑎 𝑛 𝑟 =1 Entity n collects resource r ( 𝐷 𝑛 ) 𝑛∈𝒩 Threshold 𝑇 𝑛 𝑟 𝐼 𝑛 𝑟 ≤ 𝑇 𝑛 𝑟 : entity is satisfied 𝐼 𝑛 𝑟 > 𝑇 𝑛 𝑟 : entity is dissatisfied 𝑇 𝑛 𝑟 𝐼 𝑛 𝑟

System model (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) (𝒩, ℛ, ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ , ( 𝐷 𝑛 ) 𝑛∈𝒩 , 𝒢, 𝐾,𝒜) ( 𝑄 𝑛 𝑟 ) 𝑛∈𝒩,𝑟∈ℛ 𝐼 𝑛 𝑟 = 𝑛∈𝒩 𝑛 : 𝑥 𝑛 =𝑟 Threshold 𝑇 𝑛 𝑟 QoS 𝑈 𝑛 𝑟 = 1, if 𝐼 𝑛 𝑟 ≥ 𝑇 𝑛 𝑟 0, if 𝐼 𝑛 𝑟 < 𝑇 𝑛 𝑟 𝑈 𝑛 = 𝑟 𝑈 𝑛 𝑟 ( 𝐷 𝑛 ) 𝑛∈𝒩 (𝒩, ℛ, 𝑇 𝑛 𝑟 , 𝒢, 𝐾) 𝑇 𝑛 𝑟 𝐼 𝑛 𝑟

System model The evicted set: 𝐸 𝑛 𝑡 ={𝑟| 𝑎 𝑛 𝑟 0 =1∧ 𝑎 𝑛 𝑟 𝑡 =0} The inserted set: 𝐼 𝑛 𝑡 ={𝑟| 𝑎 𝑛 𝑟 0 =0∧ 𝑎 𝑛 𝑟 𝑡 =1} 1,2,3 3,5,6 𝐸 𝑛 𝑡 = 1,2 𝐼 𝑛 𝑡 ={5,6} Since our model is a multi-resource model, every time an entity update his strategy, he will evict several resources and insert the same amount of resources within his timeslot. Thus we define the evicted set and the inserted set. Before that we need to define anr

Outline Motivation System model Convergence of Nash Equilibria Simulation Conclusion

Convergence of NE Key concept in game theory Definition 1 (Better Reply Update). The event where a player n changes its choice of strategy from 𝑥 𝑛 to r is a better reply update if and only if 𝑈 𝑛 𝑟, 𝒙 −𝑛 > 𝑈 𝑛 𝑥 𝑛 , 𝒙 −𝑛 , where we write the argument of the function as 𝒙=( 𝑥 𝑛 , 𝑥 −𝑛 ) with 𝒙 −𝑛 =( 𝑥 1 ,…, 𝑥 𝑛−1 , 𝑥 𝑛+1 ,…, 𝑥 𝑁 ) representing the strategy profile of all users except player n. As stated before, the general better reply step may introduce loop in our improvement path. We introduce a modified better reply. Thus, we can represent our multi-resource update process with several single-resource update process.

Convergence of NE Definition 2 (Pure Nash Equilibrium). A strategy profile 𝒙 is a pure NE if no users at 𝒙 can perform a better reply update, i.e., 𝑈 𝑛 𝑟, 𝒙 −𝑛 > 𝑈 𝑛 𝑥 𝑛 , 𝒙 −𝑛 for any 𝑟∈ℛ 𝑎𝑛𝑑 𝑛∈𝒩. Definition 3 (Finite Improvement Property). A game has the finite improvement property if any asynchronous better reply update process terminates at a pure NE within a finite number of updates. As stated before, the general better reply step may introduce loop in our improvement path. We introduce a modified better reply. Thus, we can represent our multi-resource update process with several single-resource update process.

Convergence of NE Definition 4 (modified better reply) An modified better reply for multi-resource allocation problem is that for ∀𝑟∈ 𝐸 𝑛 (𝑡), 𝑈 𝑛 𝑟 𝑡 =0; and for ∀𝑟∈ 𝐼 𝑛 (𝑡), 𝑈 𝑛 𝑟 𝑡 =1. 𝐸 𝑛 𝑡 = 𝐼 𝑛 𝑡 ≤𝐾(𝑛). Time slot 1 for node n1 Time slot 1 for node n Time slot 1 for node nn … Update resource 1 Update resource 2 Update resource K(n) As stated before, the general better reply step may introduce loop in our improvement path. We introduce a modified better reply. Thus, we can represent our multi-resource update process with several single-resource update process.

Convergence of NE Theorem 1. In satisfaction game for graphical multi-resource allocation, FIP can be guaranteed under modified better reply. Proof: Potential function: A function Φ: × 𝑛 ( 𝒜 𝑛 )→ℝ is a generalized ordinal potential function for the game if the change of Φ is strictly positive if an arbitrary player n increases his utility by changing his strategy from 𝐴 𝑛 to 𝐴 𝑛 ′ . Formally, 𝑈 𝑛 𝐴 𝑛 ′ , 𝐴 −𝑛 > 𝑈 𝑛 𝐴 𝑛 , 𝐴 −𝑛 ⟹Φ 𝐴 𝑛 ′ , 𝐴 −𝑛 >Φ 𝐴 𝑛 , 𝐴 −𝑛 When Φ 𝐴 𝑛 , 𝐴 −𝑛 is the maximum value, then NE is reached.

Convergence of NE Theorem 1. In satisfaction game for graphical multi-resource allocation, FIP can be guaranteed under modified better reply. Proof: Φ 𝑨 = 𝑛 𝑟 𝐹 𝑛 𝑟 𝐴 𝑛 , 𝐴 −𝑛 𝐹 𝑛 𝑟 = 2 𝑇 𝑛 𝑟 − 𝐼 𝑛 𝑟 𝑨 , if 𝑎 𝑛 𝑟 =1 0, if 𝑎 𝑛 𝑟 =0 𝑟 𝑟 ′ Since we can represent the multi-resource allocation with the single-resource allocation model, here we just prove for a single-resource better reply, this function holds. dissatisfied satisfied 𝑇 𝑛 𝑟 ≤ 𝐼 𝑛 𝑟 −1 𝑇 𝑛 𝑟 ′ ≥ 𝐼 𝑛 𝑟 ′ 𝐹 𝑛 𝑟 ′ − 𝐹 𝑛 𝑟 ≥ 𝐼 𝑛 𝑟 ′ − 𝐼 𝑛 𝑟 +2

Convergence of NE Theorem 1. In satisfaction game for graphical multi-resource allocation, FIP can be guaranteed under modified better reply. Proof: Φ 𝑨 = 𝑛 𝑟 𝐹 𝑛 𝑟 𝐴 𝑛 , 𝐴 −𝑛 𝐹 𝑛 𝑟 = 2 𝑇 𝑛 𝑟 − 𝐼 𝑛 𝑟 𝑨 , if 𝑎 𝑛 𝑟 =1 0, if 𝑎 𝑛 𝑟 =0 Since we can represent the multi-resource allocation with the single-resource allocation model, here we just prove for a single-resource better reply, this function holds. Node N(n)1, …., N(n)n-1, n, N(n)n+1… Node N(n)1, …., N(n)n-1, N(n)n+1… −𝑛 𝐹 𝑖 𝑟 ′ − −𝑛 𝐹 𝑖 𝑟 = 𝐼 𝑛 𝑟 − 𝐼 𝑛 𝑟 ′

Convergence of NE Theorem 1. In satisfaction game for graphical multi-resource allocation, FIP can be guaranteed under modified better reply. Proof: Φ 𝑨 = 𝑛 𝑟 𝐹 𝑛 𝑟 𝐴 𝑛 , 𝐴 −𝑛 𝐹 𝑛 𝑟 = 2 𝑇 𝑛 𝑟 − 𝐼 𝑛 𝑟 𝑨 , if 𝑎 𝑛 𝑟 =1 0, if 𝑎 𝑛 𝑟 =0 Since we can represent the multi-resource allocation with the single-resource allocation model, here we just prove for a single-resource better reply, this function holds. 𝑛 𝐹 𝑛 𝑟 ′ − 𝑛 𝐹 𝑛 𝑟 ≥2 For a modified better reply by node n, Φ 𝑨 will increase at least 2

Convergence of NE Theorem 1. In satisfaction game for graphical multi-resource allocation, FIP can be guaranteed under modified better reply. Proof: Φ 𝑨 = 𝑛 𝑟 𝐹 𝑛 𝑟 𝐴 𝑛 , 𝐴 −𝑛 𝐹 𝑛 𝑟 = 2 𝑇 𝑛 𝑟 − 𝐼 𝑛 𝑟 𝑨 , if 𝑎 𝑛 𝑟 =1 0, if 𝑎 𝑛 𝑟 =0 −𝑁≤− 𝐼 𝑛 𝑟 < 𝐹 𝑛 𝑟 ≤2 𝑇 𝑛 𝑟 <2𝑁+2 1 2 3𝑁+2 𝑁 𝑛 𝐾(𝑛) Φ 𝑨 < 3𝑁+2 𝑁 𝑛 𝐾(𝑛)

Outline Motivation System model Convergence of Nash Equilibria Simulation Conclusion

Simulation N=6 R=10 Set the threshold 𝑇 𝑛 𝑟 , interference graph and the initial stage randomly

Simulation N=10 R=15

Simulation R=[7,15,25,35,45,53,65,75,83,92]; N=[10,20,30,40,50,60,70,80,90,100];

Outline Motivation System model Convergence of Nash Equilibria Simulation Conclusion

Conclusion We use the distributed manner to solve the resource allocation problems Instead of single resource allocation, we propose that every node can collect more than one resource Utilize the satisfaction game model to make the allocation problem more practical Using a modified better reply strategy guarantees that NE can be reached.

Conclusion Future work Find a faster way than asynchronous update for converge To collect more resources (so that we can reduce the demand for each resource) to guarantee that the node is satisfied with every resource he collects.

Q&A

Thank you!