Download presentation
Presentation is loading. Please wait.
Published byJames Gilbert Modified over 9 years ago
1
Chen Cai, Benjamin Heydecker Presentation for the 4th CREST Open Workshop Operation Research for Software Engineering Methods, London, 2010 Approximate Dynamic Programming & Adaptive Traffic Control
2
Contents Dynamic Programming Curse of Dimensionality Approximate Dynamic Programming Adaptive Traffic Signal Control
3
1. Dynamic Programming
4
What it does? –Sequential decision-making for discrete systems –Iterative computing rather than enumeration –Global optimality t0t0 t1t1 t2t2 t3t3 t m-2 tmtm t m-1 Stage 0 Stage m-1 tt
5
1. Dynamic Programming How it does? –DP decomposes a complex problem to a group of sub-problems called stage; and by recursively finding optimal solution at each stage, its solution converges to global optimality. –It can be mathematically interpreted as: By recursively computing for all i t at stage t.
6
2. Curse of Dimensionality
7
State Space –i t = ( i t (1), i t (2), …, i t (K) ) is K-dimensional, each i t (n) takes one of M i possible values, the total number of states at each step t is M i K Decision Space –u t = ( u t (0), u t (1), …, u t (N) ) is N-dimensional, each u t (n) may take M u possible values, the total number of eligible decision is M u N. Information Space –w t = ( w t (1), w t (2), …, w t (L) ) is L-dimensional, each w t (n) takes one of M w possible values, the size of information space is M w L
8
2. Curse of Dimensionality Three curses of dimensionality Computational demand is In the case that K=10, L=5, and N=5, and M i K = M w L = M u N =10, the total computational demand is state information decision
9
3. Approximate Dynamic Programming
10
What it does? –Reduce computational demand How it does? –Model approximation Models describe system dynamics. Complex system is hard to model and may be partially observable –Policy approximation Parameterisation that captures the relationship between control policy and state variables –Function approximation Parameterisation of value function
11
3. Approximate Dynamic Programming Approximatio n Equation Parameterisation of value function
12
3. Approximate Dynamic Programming Progressive update of approximation function
13
4. ADP in Adaptive Traffic Signal Control
14
4. Adaptive Traffic Signals Adaptive traffic signal control is a complex problem Real-time dynamic decision-making reduces vehicle delays and stops substantially
15
4. Adaptive Traffic Signals Sensing Control Real world
16
4. Adaptive Traffic Signals Numerical example LinkL1L2L6L7 Flow rate (vehicles per hour) 350382440382 DownstreamL3 L5L8L4L8 Turning ratio100%25%75%100%25%75%
17
4. Adaptive Traffic Signals Signal sequences Link 7 Signal 5 Link 6 Signal 4 Link 8 Signal 6 4505 450 0 451 5 453 1 452 1 453 4 456 8 455 9 454 7 454 4 463 4 462 2 461 5 460 2 459 5 458 5 458 1 461 1 455 5 455 1 454 0 453 8 452 7 452 5 451 1 450 9 460 6 459 1 458 9 457 7 457 2 463 0 462 6 452 4 452 0 451 0 450 2 453 7 459 6 457 8 456 2 454 7 463 0 461 8 460 3 464 2 Link 1 Signa l 1 Link 2 Signa l 2 Link 3 Signa l 3 463 0 464 2 452 4 452 0 451 0 450 2 453 7 459 6 457 8 456 2 454 7 461 8 460 3 4505 450 0 451 5 453 1 452 1 453 4 456 8 455 9 454 7 454 4 463 4 462 2 461 5 460 2 459 5 458 5 458 1 453 3 451 6 452 8 451 4 455 1 460 7 459 2 458 1 455 8 463 4 461 4 463 8
18
4. Adaptive Traffic Signals Up to 60% reduction in vehicle delays in comparison with optimised fixed-time plans Fully adaptive and applicable to distributed network control Computation demand manageable by real-time systems
19
5. Conclusion Dynamic programming is the only exact solution to sequential decision-making for discrete systems DP is difficult for real-time control because of computational demand Approximation to DP can reduce dimensionality and therefore make problem-solving tractable ADP is a general framework in which various approximation architectures and machine learning techniques can be used Adaptive traffic signal controller using ADP demonstrated promising results in reducing vehicle delays
20
From imagination to impact
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.