Dimitrios Hristu-Varsakelis Mechanical Engineering and Institute for Systems Research University of Maryland, College Park

Slides:



Advertisements
Similar presentations
Reactive and Potential Field Planners
Advertisements

Lecture 24 MAS 714 Hartmut Klauck
Lecture 8: Three-Level Architectures CS 344R: Robotics Benjamin Kuipers.
Probabilistic Roadmaps. The complexity of the robot’s free space is overwhelming.
Dynamic Bayesian Networks (DBNs)
COORDINATION and NETWORKING of GROUPS OF MOBILE AUTONOMOUS AGENTS.
Lab 2 Lab 3 Homework Labs 4-6 Final Project Late No Videos Write up
Bio-Inspired Optimization. Our Journey – For the remainder of the course A brief review of classical optimization methods The basics of several stochastic.
1 Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University joint work with Tzu-Kuo Huang, Le.
Centre for Autonomous Systems Petter ÖgrenCAS talk1 A Control Lyapunov Function Approach to Multi Agent Coordination P. Ögren, M. Egerstedt * and X. Hu.
1/14 Ad Hoc Networking, Eli M. Gafni and Dimitri P. Bertsekas Distributed Algorithm for Generating Loop-free Routes in Networks With Frequently.
CPSC 322, Lecture 37Slide 1 Finish Markov Decision Processes Last Class Computer Science cpsc322, Lecture 37 (Textbook Chpt 9.5) April, 8, 2009.
1 Introduction to Computability Theory Lecture12: Decidable Languages Prof. Amos Israeli.
Introduction to Computability Theory
1 Complexity of Network Synchronization Raeda Naamnieh.
1 Solving problems by searching Chapter 3. 2 Why Search? To achieve goals or to maximize our utility we need to predict what the result of our actions.
1 Lecture 8: Genetic Algorithms Contents : Miming nature The steps of the algorithm –Coosing parents –Reproduction –Mutation Deeper in GA –Stochastic Universal.
1 Motion Planning Algorithms : BUG-family. 2 To plan a path  find a continuous trajectory leading from initial position of the automaton (a mobile robot)
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
Minimum Classification Error Networks Based on book chapter 9, by Shigeru Katagiri Jaakko Peltonen, 28 th February, 2002.
P. Ögren (KTH) N. Leonard (Princeton University)
Dimitrios Hristu-Varsakelis Mechanical Engineering and Institute for Systems Research University of Maryland, College Park
1 Efficient Placement and Dispatch of Sensors in a Wireless Sensor Network Prof. Yu-Chee Tseng Department of Computer Science National Chiao-Tung University.
Copyright N. Friedman, M. Ninio. I. Pe’er, and T. Pupko. 2001RECOMB, April 2001 Structural EM for Phylogentic Inference Nir Friedman Computer Science &
Solving problems by searching
Decision Making Under Uncertainty Russell and Norvig: ch 16, 17 CMSC421 – Fall 2003 material from Jean-Claude Latombe, and Daphne Koller.
CS Reinforcement Learning1 Reinforcement Learning Variation on Supervised Learning Exact target outputs are not given Some variation of reward is.
June 12, 2001 Jeong-Su Han An Autonomous Vehicle for People with Motor Disabilities by G. Bourhis, O.Horn, O.Habert and A. Pruski Paper Review.
Collaborative Filtering Matrix Factorization Approach
MAKING COMPLEX DEClSlONS
online convex optimization (with partial information)
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
1 Solving problems by searching This Lecture Chapters 3.1 to 3.4 Next Lecture Chapter 3.5 to 3.7 (Please read lecture topic material before and after each.
TKK | Automation Technology Laboratory Partially Observable Markov Decision Process (Chapter 15 & 16) José Luis Peralta.
Maximum Network Lifetime in Wireless Sensor Networks with Adjustable Sensing Ranges Cardei, M.; Jie Wu; Mingming Lu; Pervaiz, M.O.; Wireless And Mobile.
Robust localization algorithms for an autonomous campus tour guide Richard Thrapp Christian Westbrook Devika Subramanian Rice University Presented at ICRA.
University of Amsterdam Search, Navigate, and Actuate - Qualitative Navigation Arnoud Visser 1 Search, Navigate, and Actuate Qualitative Navigation.
1 Distributed and Optimal Motion Planning for Multiple Mobile Robots Yi Guo and Lynne Parker Center for Engineering Science Advanced Research Computer.
Mobile Agent Migration Problem Yingyue Xu. Energy efficiency requirement of sensor networks Mobile agent computing paradigm Data fusion, distributed processing.
Value Function Approximation on Non-linear Manifolds for Robot Motor Control Masashi Sugiyama1)2) Hirotaka Hachiya1)2) Christopher Towell2) Sethu.
Graph Colouring L09: Oct 10. This Lecture Graph coloring is another important problem in graph theory. It also has many applications, including the famous.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Deciding Under Probabilistic Uncertainty Russell and Norvig: Sect ,Chap. 17 CS121 – Winter 2003.
Administration Feedback on assignment Late Policy
Covering Points of Interest with Mobile Sensors Milan Erdelj, Tahiry Razafindralambo and David Simplot-Ryl INRIA Lille - Nord Europe IEEE Transactions on.
AN INTELLIGENT ASSISTANT FOR NAVIGATION OF VISUALLY IMPAIRED PEOPLE N.G. Bourbakis*# and D. Kavraki # #AIIS Inc., Vestal, NY, *WSU,
Behavior-based Multirobot Architectures. Why Behavior Based Control for Multi-Robot Teams? Multi-Robot control naturally grew out of single robot control.
1 Energy-Efficient Mobile Robot Exploration Yongguo Mei, Yung-Hsiang Lu Purdue University ICRA06.
Navigation Strategies for Exploring Indoor Environments Hector H Gonzalez-Banos and Jean-Claude Latombe The International Journal of Robotics Research.
Problem Description: One line explanation of the problem to be solved Problem Description: One line explanation of the problem to be solved Proposed Solution:
Planning Under Uncertainty. Sensing error Partial observability Unpredictable dynamics Other agents.
1 Travel Times from Mobile Sensors Ram Rajagopal, Raffi Sevlian and Pravin Varaiya University of California, Berkeley Singapore Road Traffic Control TexPoint.
Linear Discriminant Functions Chapter 5 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis.
Vision-Guided Humanoid Footstep Planning for Dynamic Environments
Prof. Yu-Chee Tseng Department of Computer Science
COGNITIVE APPROACH TO ROBOT SPATIAL MAPPING
CS b659: Intelligent Robotics
Lecture 9: Behavior Languages
Collaborative Filtering Matrix Factorization Approach
Distributed Sensing, Control, and Uncertainty
Market-based Dynamic Task Allocation in Mobile Surveillance Systems
Russell and Norvig: Chapter 3, Sections 3.1 – 3.3
Solving problems by searching
Chapter 4 . Trajectory planning and Inverse kinematics
Robotics meet Computer Science
Deciding Under Probabilistic Uncertainty
Planning.
Solving problems by searching
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 3
Presentation transcript:

Dimitrios Hristu-Varsakelis Mechanical Engineering and Institute for Systems Research University of Maryland, College Park Joint work with: M. Egerstedt, S. B. Andersson, C. Shao. P.R. Kumar, P. S. Krishnaprasad, Cooperative Optimization and Navigation Problems

Outline Ensembles of autonomous vehicles operating on “expansive” terrain. Bio-inspired trajectory optimization Language-based navigation Report on Progress – Event-driven communication

Examples from biology (bees, ants, fish etc.) Ensembles can accomplish tasks that are impossible for an individual. Coordination requires thinking about control/communication interactions. Ensembles of Autonomous Systems

V1V1 VnVn vehicle obstacle control station target start Trajectory optimization without a map A group of vehicles traveling between a fixed pair of locations Terrain is unknown - no “global” map. On-board sensing provides local information about vehicle’s immediate surroundings PROBLEM: Given an initial path between a pair of “start” and “target” locations, find the optimal path connecting that pair, using “local” interactions between vehicles. V n-1

V1V1 VnVn vehicle obstacle control station target start Trajectory optimization without a map A group of vehicles traveling between a fixed pair of locations Terrain is unknown - no “global” map. On-board sensing provides local information about vehicle’s immediate surroundings PROBLEM: Given an initial path between a pair of “start” and “target” locations, find the optimal path connecting that pair, using “local” interactions between vehicles. V n-1

Local pursuit: A biologically-inspired algorithm Theorem (on ): The iterated paths converge to a straight line as... K+2 k+1 k Start Target : Initial path : path followed by the k-th vehicle,... (on a smooth manifold M): If vehicle separation is sufficiently small, then the iterated paths converge to a geodesic. [Bruckstein, 92]

Experimental results: with Euclidean metric Initial path length ~7m Vehicle separation ~1.5m A collection of mobile robots with: Wireless communication between neighbors Sonar and odometry sensors TARGET START

: Minimum-length geodesic connecting to M : location of k-th vehicle Local Pursuit Idea: Find optimal trajectory to leader and follow it momentarily.

M : Minimum-length geodesic connecting a to b : location of k-th vehicle Pursuit decreases vehicle separation

The k-th vehicle moves as follows: Wait at until t=Δ(κ+1) At time t, “follow the optimal trajectory” from to Local pursuit for more general optimal control problems Let Given an initial trajectory with converge to a local min. for Assumptions: uniqueness, smoothness Find that minimizes s.t. As, iterated trajectories

Simulation: pursuit on 5m trajectory 0.7m separation

A sub-Riemannian example 5m trajectory 1.5m separation fixed

Pursuit in vehicles with drift (minimum time problem)

Summary and Work in Progress A biologically-inspired trajectory optimization algorithm - local pursuit forms a “string” of vehicles - each vehicle uses local information and communicates with its closest neighbors Target state and optimal trajectory are unknown Local convergence Experiments Escaping local minima Comparison with gradient descent methods

Control in a reasonably complex world The problem of specifying control tasks (e.g. “go to the refrigerator and get the milk”) Solving motion control problems of adequate complexity Many interesting systems evolve in environments that are not smooth, simply connected, etc. Using language primitives to navigate: Specify control policies Represent the environment (what parts do we ignore?)

Def: MDLe is the formal language defined by the context free grammar with production rules: Motion Description Languages Evaluate Evolve under until Concatenate, encapsulate atoms to form complex strings (plans), e.g. N: nonterminals T: terminals S: start symbol ε: empty string Fact: MDLe is context free but not regular Atom:

Symbolic Navigation Keep only “interesting” details about how to navigate the world Landmark: L = (M,x)M: map “patch”, x: coordinates Sensor signature: L = L i if s(t) = s i (t) for t in [t 0,T] Navigation –Local navigation: on a given landmark L i –Global navigation: between landmarks M x World

A directed graph representation of a map Represent only “interesting parts” of the world. G = {L,E} L i : landmarks E ij : {i,j,  ij } Γ ij : an MDLe program E ij  E ji Idea: Replace details locally by a feedback program

Experiment: indoor navigation Lab 1 Lab 2 Office Partial floor plan of 2 nd floor A.V. Williams

Experiment, cont’d Goal: Navigate between three landmarks Front of labRear of labOffice

Experiment: Example MDLe plans {Lab2toLab1Plan (bumper) (Atom (atIsection 0100) (goAvoid )) (Atom (atIsection 0010) (go )) (Atom (wait ) (align 7 9)) (Atom (atIsection 1000) (goAvoid )) (Atom (atIsection 0100) (go )) (Atom (wait ) (align 3 5)) (Atom (wait 7) (goAvoid )) (Atom (atIsection 1000) (goAvoid )) } {Lab1toOfficePlan (bumper) (Atom (atIsection 1001) (goAvoid )) (Atom (atIsection 0011) (go )) (Atom (wait ) (align 11 13)) (Atom (atIsection 0100) (goAvoid )) (Atom (wait 10) (rotate -90)) }

Experiment: A typical run

Controllers (and MDLe plans) are not always successful. Environmental factors (moving obstacles) System uncertainty (e.g. actuator noise) Associate a probability density function with an MDLe plan Enumerate the MDLe strings associated with an environment graph G = {L,E}, Define Prob. of arriving at by executing from Assumptions: G is a “good” description of the world Sensor model: Incorporating Uncertainty

A prototype navigation problem How do I get to a given landmark ? Prob. density at time k, given observations up to time k. Probability after evaluating plan and making a new observation: Maximize probability of arriving at a desired landmark in N “steps” Maximize probability of arriving at desired landmark in N “steps” Information at “time” k Maximize prob. of arrival at a desired landmark with minimum of “steps”

A navigation example

Example - data Example: L2 to L3 (syntax: (ξ,u)) with N(0,0.01) actuator noise

Example: steer to a landmark in N “steps” X 0 =L 1, X F =L 2, N=3 P 0|0 =[1/3, 1/3, 1/3] Evolution of probability density on G Desired success probability set to 95%

True and observed landmarks Example: steer to a landmark in N “steps”

Executed Plans Example: steer to a landmark in N “steps”

Summary and Ongoing Work Language-based Control The motion description language MDLe “Landmark+instruction”-based descriptions of the world Optimal navigation via dynamic-programming Obtaining “nominal” densities for navigation Software

References: S. Andersson and D. Hristu-Varsakelis, “Stochastic Language-based Motion Control”, to appear, CDC D. Hristu-Varsakelis, M. Egerstedt, P. S. Krishnaprasad, “On the Structural Complexity of the Motion Description Language MDLe”, to appear, CDC D. Hristu-Varsakelis and P. R. Kumar, “Interrupt-based feedback control over a shared communication medium”, IEEE CDC M. Egerstedt and D. Hristu-Varsakelis, “Observability and Policy Optimization for Mobile Robots”, CDC 2002

This page intentionally left blank

Event-Based Stabilization of Ensembles-Users of a Shared Network

Dynamical systems as users of a shared “network” Control of collections of systems with limited communication A prototype problem in “divided attention”. N=number of systems in the ensemble n=max. number of feedback loops that can be closed at any time How much communication time must be devoted to each system to guarantee that the collection remains stable? Can the ensemble be stabilized? G 1 (s ) … K1K1 G 2 (s) K2K2 G N (s) KNKN controller plant shared medium

A feedback communication policy We would like to avoid having to specify the communication policy in advance (thus the need for memory, clocks) How much information is needed to implement an event-driven policy? Let’s define a simple rule for deciding which system(s) should be allowed to use the “network”. Idea: Close loops corresponding to states that are “furthest” from the origin 0... Ex.: N=3, n=2

0... A feedback communication policy Ex.: N=3, n=2 (feedback loop closed) (feedback loop open) Let’s define a simple rule for deciding which system(s) should be allowed to use the “network”. Idea: Close loops corresponding to states that are “furthest” from the origin Definition: An ensemble is δ-captured if for all i after some time For each system, find a Lyapunov function V( ) such that:

Some possibilities for interrupt-based communication (special case n=1) Policy: (sampled CLS-  ): 2’. When, set. Policy: (CLS-  ): Let 1. At time, close the loop of the system, 2. When, set. 3. Policy: (open loop CLS): 1’’. At time, close the loop of the system, 2’’. When, set. feedback

A test run 

A “least conservative” feedback communication policy Policy: (MACLS  1. At time t, close the loop of the system where 2. When, repeat from step 1. Theorem: the ensemble is captured using CLS-  for  large enough, if where otherwise there exists a choice of dynamics with the same for which there is no stabilizing communication sequence. Policy: (Control Zone  Pick  1. At time t, close the loop of the system where 2. When, repeat from step 1. An alternative communication policy:

A simulation example (N=3, n=1,  =1,  =0.2)

Experiment: stabilizing a pair of pendulums Lengths: 20cm, 45cm Communication: 115Kbps ρ=0.8

Experiment: stabilizing a pair of pendulums

Event-based feedback control - Summary and Work in Progress A class of feedback communication policies - sampled Lyapunov functions - continuously monitored Lyapunov functions - continuously monitored state norms Sufficient condition for stability Stochasticity Performance analysis Effects of delays in the feedback loop