Optimal Control and Reachability with Competing Inputs

Slides:



Advertisements
Similar presentations
1 1 Slide Chapter 1 & Lecture Slide Body of Knowledge n Management science Is an approach to decision making based on the scientific method Is.
Advertisements

Supervisory Control of Hybrid Systems Written by X. D. Koutsoukos et al. Presented by Wu, Jian 04/16/2002.
Model Checker In-The-Loop Flavio Lerda, Edmund M. Clarke Computer Science Department Jim Kapinski, Bruce H. Krogh Electrical & Computer Engineering MURI.
Models for Control and Verification Ian Mitchell Department of Computer Science The University of British Columbia research supported by National Science.
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
AA278A: Supplement to Lecture Notes 10. Controller Synthesis for Hybrid Systems Claire J. Tomlin Department of Aeronautics and Astronautics Department.
1 Nonlinear Control Design for LDIs via Convex Hull Quadratic Lyapunov Functions Tingshu Hu University of Massachusetts, Lowell.
Workshop on Stochastic Differential Equations and Statistical Inference for Markov Processes Day 1: January 19 th, Day 2: January 28 th Lahore University.
Planning under Uncertainty
Robust Hybrid and Embedded Systems Design Jerry Ding, Jeremy Gillula, Haomiao Huang, Michael Vitus, and Claire Tomlin MURI Review Meeting Frameworks and.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
1 Collision Avoidance Systems: Computing Controllers which Prevent Collisions By Adam Cataldo Advisor: Edward Lee Committee: Shankar Sastry, Pravin Varaiya,
Topologically Adaptive Stochastic Search I.E. Lagaris & C. Voglis Department of Computer Science University of Ioannina - GREECE IOANNINA ATHENS THESSALONIKI.
Chapter 13 Stochastic Optimal Control The state of the system is represented by a controlled stochastic process. Section 13.2 formulates a stochastic optimal.
Computational Stochastic Optimization: Bridging communities October 25, 2012 Warren Powell CASTLE Laboratory Princeton University
General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning Duke University Machine Learning Group Discussion Leader: Kai Ni June 17, 2005.
Markov Decision Processes1 Definitions; Stationary policies; Value improvement algorithm, Policy improvement algorithm, and linear programming for discounted.
1 ECE-517 Reinforcement Learning in Artificial Intelligence Lecture 7: Finite Horizon MDPs, Dynamic Programming Dr. Itamar Arel College of Engineering.
Lecture 6 : Level Set Method
Communication System A communication system can be represented as in Figure. A message W, drawn from the index set {1, 2,..., M}, results in the signal.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
Chapter 1 Introduction n Introduction: Problem Solving and Decision Making n Quantitative Analysis and Decision Making n Quantitative Analysis n Model.
ECE-7000: Nonlinear Dynamical Systems Overfitting and model costs Overfitting  The more free parameters a model has, the better it can be adapted.
1 Optimization Techniques Constrained Optimization by Linear Programming updated NTU SY-521-N SMU EMIS 5300/7300 Systems Analysis Methods Dr.
Planning Under Uncertainty. Sensing error Partial observability Unpredictable dynamics Other agents.
ECE-7000: Nonlinear Dynamical Systems 3. Phase Space Methods 3.1 Determinism: Uniqueness in phase space We Assume that the system is linear stochastic.
Use or disclosure of the information contained herein is subject to specific written CIRA approval 1 PURSUIT – EVASION GAMES GAME THEORY AND ANALYSIS OF.
Introducing Information into RM to Model Market Behavior INFORMS 6th RM and Pricing Conference, Columbia University, NY Darius Walczak June 5, 2006.
Hypothesis testing and statistical decision theory
Game Theory Just last week:
12. Principles of Parameter Estimation
State Space Representation
risk Sensitive control of the lifetime ruin problem
Theory of Capital Markets
水分子不時受撞,跳格子(c.p. 車行) 投骰子 (最穩定) 股票 (價格是不穏定,但報酬過程略穩定) 地震的次數 (不穩定)
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 3
CS b659: Intelligent Robotics
Making complex decisions
Recursive Identification of Switched ARX Hybrid Models: Exponential Convergence and Persistence of Excitation René Vidal National ICT Australia Brian D.O.Anderson.
Task: It is necessary to choose the most suitable variant from some set of objects by those or other criteria.
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 3
Communication Complexity as a Lower Bound for Learning in Games
Dynamical Statistical Shape Priors for Level Set Based Tracking
Scheduling Design and Verification for Open Soft Real-time Systems
Artificial Intelligence
Machine Learning Today: Reading: Maria Florina Balcan
Brownian Motion for Financial Engineers
Hidden Markov Models Part 2: Algorithms
Announcements Homework 3 due today (grace period through Friday)
Some Aspects of Continuous-time Sliding Mode Control
Quantum One.
Alternating tree Automata and Parity games
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Artificial Intelligence
State Space Analysis UNIT-V.
Stability Analysis of Linear Systems
Support Vector Machines
Maths for Signals and Systems Linear Algebra in Engineering Lectures 10-12, Tuesday 1st and Friday 4th November2016 DR TANIA STATHAKI READER (ASSOCIATE.
Biointelligence Laboratory, Seoul National University
Information Theoretical Analysis of Digital Watermarking
12. Principles of Parameter Estimation
Berlin Chen Department of Computer Science & Information Engineering
Berlin Chen Department of Computer Science & Information Engineering
SVMs for Document Ranking
Berlin Chen Department of Computer Science & Information Engineering
Markov Decision Processes
ECE 576 POWER SYSTEM DYNAMICS AND STABILITY
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 3
Presentation transcript:

Optimal Control and Reachability with Competing Inputs Ian Mitchell Department of Computer Science The University of British Columbia research supported by National Science and Engineering Research Council of Canada Good afternoon. I’ll be talking to you today about level set methods for analyzing, verifying and controlling continuous and hybrid systems. So obviously, I’ll have to explain what I mean by reachability, and what is a hybrid system. I’ll show how reachability can be used to analyze, verify and create controllers for various types of systems. This is joint work with Professor Claire Tomlin in Aero/Astro, Professor Ron Fedkiw in Computer Science, and with students in their labs at Stanford.

Ian Mitchell (UBC Computer Science) Competing Inputs What do we do when there are multiple parameters, some of which we can choose but some of which have unknown and uncontrolled value “Control” input denoted by u (or a) “Disturbance” input denoted by d (or b) Choose control input as before to optimize trajectory or achieve safety Due to disturbance input, system remains nondeterministic even if control signal is fixed First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Two Treaments of Disturbance Stochastic perturbations d(t) ~ D Discrete state Poisson processes Markov decision processes Stochastic differential equations Bounded value inputs d(t) 2 D Robust reach sets Two player zero sum games First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Markov Decision Process Discrete time, discrete state model with probabilistic transitions Typically specified by Alternatively specified by x(t+1) = d(t) where d(t) is drawn from the Bernoulli Scheme with For discrete time systems, many distributions are supported First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Stochastic Differential Equations (SDEs) Two mathematical frameworks: Ito and Stratonovich Conversions exist between the two SDE is ODE with a Brownian motion (Wiener process) perturbation Restrictive class of distributions eg: cannot guarantee bound on stochastic term Equivalent of Hamilton-Jacobi equation for SDEs is the Fokker-Planck equation First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Continuous Backward Reachable Tubes Set of all states from which trajectories can reach some given target state For example, what states can reach G(0)? Continuous System Dynamics Target Set G(0) Backward Reachable Set G(t) March 2008 Ian Mitchell (UBC Computer Science)

Reachable Tubes (controlled input) For most of our examples, target set is unsafe If we can control the input, choose it to avoid the target set Backward reachable set is unsafe no matter what we do “Minimal” backward reach tube Continuous System Dynamics March 2008 Ian Mitchell (UBC Computer Science)

Reachable Tubes (uncontrolled input) Sometimes we have no control over input signal noise, actions of other agents, unknown system parameters It is safest to assume the worst case “Maximal” backward reach tube Continuous System Dynamics March 2008 Ian Mitchell (UBC Computer Science)

Ian Mitchell (UBC Computer Science) Two Competing Inputs For some systems there are two classes of inputs  = (u,d) Controllable inputs u  U Uncontrollable (disturbance) inputs d  D Equivalent to a zero sum differential game formulation If there is an advantage to input ordering, give it to disturbances Continuous System Dynamics March 2008 Ian Mitchell (UBC Computer Science)

Ian Mitchell (UBC Computer Science) Objective Function Extends in obvious way to the additional input eg: discrete time discounted with fixed finite horizon tf eg: continuous time no discount with target set T First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Ian Mitchell (UBC Computer Science) Who Goes First? One input is chosen to maximize and the other to minimize the objective But what knowledge is available when choosing an input? Current state? Other input? “Non-anticipative strategies” One player gets to know the other player’s input value (as well as current state) However, that player must declare their strategy (reaction to every input) in advance First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Zero Sum Game Value Function Value function is then defined as optimization over appropriate strategy and input signal pair “Lower” value function, since disturbance (minimizer) has the advantage Parallel “upper” value function can be defined If inputs are independent, optimal strategy will ignore additional information about the other input Upper and lower value functions will be equal First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Competing Inputs: Final Comments Feedback control is more realistic implementation If order of input decision is irrelevant (upper and lower value functions are equal), then nonanticipative strategy results will be equivalent to feedback results For robustness, give advantage (eg strategy) to the disturbance input if it matters (potentially pessimistic) Input signals still drawn from set of measureable functions Two player concepts have been extended to viability theory and set-valued analysis First, let’s look at how I represent sets. The implicit surface representation has attracted lots of attention over the past few decades in computer science and engineering, because it handles general sets so gracefully. We just define a scalar function J such that J is negative inside the set and positive outside. Constructive geometry is easily accomplished on such sets; for example, the union of two sets is described implicitly by the taking minimum of their two implicit functions at each point in the state space. I give an example of an oddly shaped set and one possible implicit surface representation of that set at the bottom of the slide. Implicit surface functions like J are not unique, but if we place the additional restriction on J that its gradient have magnitude one, then we can deduce two additional bits of information from J at each point in the state space: the distance and direction of the nearest point on the boundary of the set. This information proves useful in the context of synthesizing safe control signals, such as the video game that I showed earlier. March 2008 Ian Mitchell (UBC Computer Science)

Optimal Control and Reachability with Competing Inputs For more information contact Ian Mitchell Department of Computer Science The University of British Columbia mitchell@cs.ubc.ca http://www.cs.ubc.ca/~mitchell Good afternoon. I’ll be talking to you today about level set methods for analyzing, verifying and controlling continuous and hybrid systems. So obviously, I’ll have to explain what I mean by reachability, and what is a hybrid system. I’ll show how reachability can be used to analyze, verify and create controllers for various types of systems. This is joint work with Professor Claire Tomlin in Aero/Astro, Professor Ron Fedkiw in Computer Science, and with students in their labs at Stanford.