1 ENS, June 21, 2007 Jean-Yves Le Boudec, EPFL joint work with David McDonald, U. of Ottawa and Jochen Mundinger, EPFL …or an Art ? Is Mean Field a Technology…

Slides:



Advertisements
Similar presentations
Boyce/DiPrima 9th ed, Ch 2.8: The Existence and Uniqueness Theorem Elementary Differential Equations and Boundary Value Problems, 9th edition, by William.
Advertisements

Situation Calculus for Action Descriptions We talked about STRIPS representations for actions. Another common representation is called the Situation Calculus.
Hidden Markov Models (1)  Brief review of discrete time finite Markov Chain  Hidden Markov Model  Examples of HMM in Bioinformatics  Estimations Basic.
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Tracking Unknown Dynamics - Combined State and Parameter Estimation Tracking Unknown Dynamics - Combined State and Parameter Estimation Presenters: Hongwei.
Dynamic Bayesian Networks (DBNs)
TCOM 501: Networking Theory & Fundamentals
Entropy Rates of a Stochastic Process
MEAN FIELD FOR MARKOV DECISION PROCESSES: FROM DISCRETE TO CONTINUOUS OPTIMIZATION Jean-Yves Le Boudec, Nicolas Gast, Bruno Gaujal July 26,
Infinite Horizon Problems
Planning under Uncertainty
Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.
Outline Formulation of Filtering Problem General Conditions for Filtering Equation Filtering Model for Reflecting Diffusions Wong-Zakai Approximation.
1 Introduction to Computability Theory Lecture11: Variants of Turing Machines Prof. Amos Israeli.
ODE and Discrete Simulation or Mean Field Methods for Computer and Communication Systems Jean-Yves Le Boudec EPFL MLQA, Aachen, September
Mean Field Methods for Computer and Communication Systems Jean-Yves Le Boudec EPFL July
Mean Field Methods for Computer and Communication Systems: A Tutorial Jean-Yves Le Boudec EPFL Performance 2010 Namur, November
1 A Generic Mean Field Convergence Result for Systems of Interacting Objects From Micro to Macro Jean-Yves Le Boudec, EPFL Joint work with David McDonald,
AWESOME: A General Multiagent Learning Algorithm that Converges in Self- Play and Learns a Best Response Against Stationary Opponents Vincent Conitzer.
1 A Class Of Mean Field Interaction Models for Computer and Communication Systems Jean-Yves Le Boudec EPFL – I&C – LCA Joint work with Michel Benaïm.
13. The Weak Law and the Strong Law of Large Numbers
Presenting: Assaf Tzabari
Parametric Inference.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
1 Mean Field Interaction Models for Computer and Communication Systems and the Decoupling Assumption Jean-Yves Le Boudec EPFL – I&C – LCA Joint work with.
1 A Generic Mean Field Convergence Result for Systems of Interacting Objects From Micro to Macro Jean-Yves Le Boudec, EPFL Joint work with David McDonald,
1 A Class Of Mean Field Interaction Models for Computer and Communication Systems Jean-Yves Le Boudec EPFL – I&C – LCA Joint work with Michel Benaïm.
1 Modeling and Simulating Networking Systems with Markov Processes Tools and Methods of Wide Applicability ? Jean-Yves Le Boudec
Maximum likelihood (ML)
Monte Carlo Methods in Partial Differential Equations.
1 10. Joint Moments and Joint Characteristic Functions Following section 6, in this section we shall introduce various parameters to compactly represent.
Mean Field Methods for Computer and Communication Systems Jean-Yves Le Boudec EPFL ACCESS Distinguished Lecture Series, Stockholm, May 28,
ECES 741: Stochastic Decision & Control Processes – Chapter 1: The DP Algorithm 1 Chapter 1: The DP Algorithm To do:  sequential decision-making  state.
Finite Buffer Fluid Networks with Overflows Yoni Nazarathy, Swinburne University of Technology, Melbourne. Stijn Fleuren and Erjen Lefeber, Eindhoven University.
Fluid Limits for Gossip Processes Vahideh Manshadi and Ramesh Johari DARPA ITMANET Meeting March 5-6, 2009 TexPoint fonts used in EMF. Read the TexPoint.
Random Sampling, Point Estimation and Maximum Likelihood.
General Principle of Monte Carlo Fall 2013 By Yaohang Li, Ph.D.
MATH 224 – Discrete Mathematics
Decision Making in Robots and Autonomous Agents Decision Making in Robots and Autonomous Agents The Markov Decision Process (MDP) model Subramanian Ramamoorthy.
MEIE811D Advanced Topics in Finance Optimum Consumption and Portfolio Rules in a Continuous-Time Model Yuna Rhee Seyong Park Robert C. Merton (1971) [Journal.
1 ECE-517 Reinforcement Learning in Artificial Intelligence Lecture 7: Finite Horizon MDPs, Dynamic Programming Dr. Itamar Arel College of Engineering.
Modern Navigation Thomas Herring
1 Sections 1.5 & 3.1 Methods of Proof / Proof Strategy.
Borgan and Henderson:. Event History Methodology
Queuing Networks Jean-Yves Le Boudec 1. Contents 1.The Class of Multi-Class Product Form Networks 2.The Elements of a Product-Form Network 3.The Product-Form.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Principles of Parameter Estimation.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
CS Statistical Machine learning Lecture 24
ECE-7000: Nonlinear Dynamical Systems Overfitting and model costs Overfitting  The more free parameters a model has, the better it can be adapted.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
CS433 Modeling and Simulation Lecture 07 – Part 01 Continuous Markov Chains Dr. Anis Koubâa 14 Dec 2008 Al-Imam.
Sampling and estimation Petter Mostad
Ch 10.6: Other Heat Conduction Problems
Asymptotic behaviour of blinking (stochastically switched) dynamical systems Vladimir Belykh Mathematics Department Volga State Academy Nizhny Novgorod.
Queuing Theory.  Queuing Theory deals with systems of the following type:  Typically we are interested in how much queuing occurs or in the delays at.
CS774. Markov Random Field : Theory and Application Lecture 15 Kyomin Jung KAIST Oct
1 On the Channel Capacity of Wireless Fading Channels C. D. Charalambous and S. Z. Denic School of Information Technology and Engineering, University of.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
Chebyshev’s Inequality Markov’s Inequality Proposition 2.1.
Kevin Stevenson AST 4762/5765. What is MCMC?  Random sampling algorithm  Estimates model parameters and their uncertainty  Only samples regions of.
Mean Field Methods for Computer and Communication Systems Jean-Yves Le Boudec EPFL Network Science Workshop Hong Kong July
MCMC Output & Metropolis-Hastings Algorithm Part I
Heuristic Optimization Methods
risk Sensitive control of the lifetime ruin problem
Hidden Markov Models Part 2: Algorithms
Jean-Yves Le Boudec EPFL – I&C – LCA Joint work with Michel Benaïm
Presentation transcript:

1 ENS, June 21, 2007 Jean-Yves Le Boudec, EPFL joint work with David McDonald, U. of Ottawa and Jochen Mundinger, EPFL …or an Art ? Is Mean Field a Technology…

2  Most of the material in this presentation is based on J.-Y. Le BoudecJ.-Y. Le Boudec, D. McDonald and J. Mundinger A Generic Mean Field Convergence Result for Systems of Interacting Objects 4th International Conference on the Quantitative Evaluation of SysTems (QEST) 2007D. McDonaldJ. Mundinger also available at  this slide show

3 Contents 1.Mean Field in communication and computer modeling 2.A Generic Model for a System of Interacting Objects 3.Convergence to the Mean Field 4.Fast Simulation 5.Full Scale Example: A Reputation System 6.Central Limit Theorem 7.Outlook

4 Mean Field: What is it ?  Introduced in Physics  Markov process, continuous time, represents states of particles  Mean field is a model where interaction between particles is via some quantity derived from distribution of states of all particle  Mean field is also an approximation for the state of one particle where we assume independence in the master equation ( equivalent to the forward equation, called Chapman-Kolmogorov in queuing theory textbooks) state of particle n rate of transition r effect of transition r on particle n distribution of states independence assumption

5 Mean Field: What is it ?  Mean field is also an approximation for the distribution of states where we assume determinism in the master equation  This is often called the “mean field” equation

6 Convergence to Mean Field  Proving these approximation is usually done in the sense of convergence when some parameter gets large (e.g. number of particles)  The techniques for that are: convergence of one Markov process towards another.  Sources of techniques are

7 A Few Examples Where Applied Never again ! E.L.

8 Generic Approach to show convergence to mean field  [Ethier and Kurtz] 1.Find a scaling variable N and a family of Markov processes M N indexed by N 2.Show that the family M N is relatively compact in the sense of topology of weak convergence 3.Find the limit of the martingale problem for M N and show that it has a unique solution

9 Contents 1.Mean Field in communication and computer modeling 2.A Generic Model for a System of Interacting Objects 3.Convergence to the Mean Field 4.Fast Simulation 5.Full Scale Example: A Reputation System 6.Central Limit Theorem 7.Outlook

10 Mean Field Interaction Model  A Generic Model, with generic results  Our first cut at making mean field a technology  Time is discrete  N objects  Every object has a state in.  Mean field approx reduces size of model from N £ S to S  Informally: object n evolves depending only on Its own state How many other objects are in each state

11  Model assumptions: X N n (t) : state of object n at time t M N i (t) = proportion of objects that are in state i M N is the “occupancy measure” R N (t) = “history” of occupancy measure Objects draws next state according to

12 Two Mild Hypotheses g() is continuous

13 Example 1: Robot Swarm  No memory: R N = M N  N objects = N robots  Robot has S = 2 possible states  Fits in framework: transitions for one robot depends on own state + how many are in search state  K N independent of N after proper re-scaling  Equation for mean field is  Mild hypotheses are true because K is continuous in M (in fact: linear)

14 Example 2: ECN/TCP Gateways  N objects = TCP connections  State = sending rate (discrete)  Memory R = buffer at ECN gateway  Mean field equation  q(r) = proba of negative feedback when R=r  Mild assumptions are true if q() is continuous ECN received no ECN received

15 Example 3: Heterogeneous (Multiclass Example)  Same as previous but introduce multiclass model  Aggressive connections, normal connection  State of an object = (c, i) c : class i : sending rate  Objects may change class or not  Also fits in our framework  Mean Field does not mean all objects are exchangeable !

16 Contents 1.Mean Field in communication and computer modeling 2.A Generic Model for a System of Interacting Objects 3.Convergence to the Mean Field 4.Fast Simulation 5.Full Scale Example: A Reputation System 6.Central Limit Theorem 7.Outlook

17  A slightly weaker form was proven in many of the references mentioned earlier, in particular  A close, continuous time cousin is in

18 Practical Application  This replaces the stochastic system by a deterministic, dynamical system  This justifies the mean field equation in the large N regime  This can be used as a “fluid approximation” N i (t) = number of objects in state i

19 Proof of Theorem  Based on The next theorem (fast simulation) A coupling argument An ad-hoc version of the strong law of large numbers The Glivenko Cantelli lemma

20 Contents 1.Mean Field in communication and computer modeling 2.A Generic Model for a System of Interacting Objects 3.Convergence to the Mean Field 4.Fast Simulation 5.Full Scale Example: A Reputation System 6.Central Limit Theorem 7.Outlook

21 Fast Simulation / Analysis of One Object  Assume we are interested in one object in particular E.g. distribution of time until a TCP connection reaches maximum rate  For large N, since mean field convergence holds, one may forget the details of the states of all other objects and replace them by the deterministic dynamical system  The next theorem says that, essentially, this is valid

22 Fast Simulation Algorithm Returns next state for one object When transition matrix is K State of one specific object This is the mean field independence approximation Replace true value by deterministic limit

23 Fast Simulation Result

24 Practical Application  This justifies the mean field approximation (based on the independence assumption) for the stochastic state of one object as a large N asymptotic  Gives a method for fast simulation or analysis The state space for Y 1 has S states, instead of N £ S

25 Contents 1.Mean Field in communication and computer modeling 2.A Generic Model for a System of Interacting Objects 3.Convergence to the Mean Field 4.Fast Simulation 5.Full Scale Example: A Reputation System 6.Central Limit Theorem 7.Outlook

26 A Reputation System  My original motivation for this work  Illustrates the complete set of steps  System N objects = N peers Peers observe one subject and rate it Rating is a number in (0,1) Direct observations and spreading of reputation Confirmation bias + forgetting

27 Operation of Reputation System: Forgetting  Z n (t) = reputation rating held by peer n  Subject is perceived as positive (with proba  ) or negative (with proba 1-  )  In case of direct positive observation  In case of direct negative observation  w is the forgetting factor, close to 1 (0.9 in next slides)

28 Confirmation Bias  Peer also read other peer ratings  If overheard rating is z:   is the threshold of the confirmation bias

29 Initially: peers have Z=0, 0.5 or 1  = 0.9 Every time step: direct obs p=0.01, meet liar proba 0.30, meet honest proba 0.69 Example of simulation: N=100 peers with maximal liars (always say Z=0)

30 3 particular peers, one of each type  = 0.9

31 The problem fits in our framework…  Assume discrete time  At every time step a peer Makes a direct observation Or overhears a liar Or overhears some honest peer Or does nothing  Object = honest peer  Transition depends on Own state Distribution of states of all other peers => Fits in our framework with memory R = occupancy measure M

32 We would like to apply the mean field convergence result to analyze very large N  But model is continuous state space  Discretize reputation ratings ! Quantize Z n on L bits; replace Z n by X n = 2 L Z N, constrained to be integer Random rounding to avoid side effects; replace by where RANDROUND(2.7) = 2 with proba 0.3 and 3 with proba 0.7 E(RANDROUND(x)) = x

33 Transition Matrix  Is deduced from the rules  The hypotheses apply (almost trivially; K is linear in M) Proba of meeting a honest peer other than self Converges uniformly in M to M N k (t) for large N

34 Limiting reputation ratings: 0.9 and 0.1

35 Different Parameters (few liars) Few liars Final ratings converge to true value Phase transition

36 Different Initial Conditions

37 The Model can serve different Liar Strategies  Strategy 1: liars always say Z= 0  Strategy 2: liar guesses your rating based on past experience Similar to strategy 1, except memory R = memory occupancy at step t-1  Strategy 3: liars know your rating and is as negative as you accept Similar to strategy 1, memory = occupancy measure M

38 Liar Strategy 2 (infer) Liar Strategy 3 (side information) Peers starting at time 512

39 Contents 1.Mean Field in communication and computer modeling 2.A Generic Model for a System of Interacting Objects 3.Convergence to the Mean Field 4.Fast Simulation 5.Full Scale Example: A Reputation System 6.Central Limit Theorem 7.Outlook

40 A Central Limit Theorem Similar to Van Kampen’s Method  Case without memory and assuming K is differentiable:

41 Contents 1.Mean Field in communication and computer modeling 2.A Generic Model for a System of Interacting Objects 3.Convergence to the Mean Field 4.Fast Simulation 5.Full Scale Example: A Reputation System 6.Central Limit Theorem 7.Outlook

42 For the cases that fit in this framework…  We are closer to …  than to …

43 Extend the modelling framework to: birth and death of objects transitions that affect several objects simultaneously enumerable but infinite set of states Further work E. L.