David Shim Omid Shakernia

Slides:

Advertisements

Similar presentations

GRASP University of Pennsylvania NRL logo? Autonomous Network of Aerial and Ground Vehicles Vijay Kumar GRASP Laboratory University of Pennsylvania Ron.

Advertisements

A Survey on Tracking Methods for a Wireless Sensor Network Taylor Flagg, Beau Hollis & Francisco J. Garcia-Ascanio.

Simbeeotic: A Simulator and Testbed for Micro-Aerial Vehicle Swarm Experiments Bryan Kate, Jason Waterman, Karthik Dantu and Matt Welsh Presented By: Mostafa.

1 Stochastic Event Capture Using Mobile Sensors Subject to a Quality Metric Nabhendra Bisnik, Alhussein A. Abouzeid, and Volkan Isler Rensselaer Polytechnic.

NUS CS5247 A Visibility-Based Pursuit-Evasion Problem Leonidas J.Guibas, Jean-Claude Latombe, Steven M. LaValle, David Lin, Rajeev Motwani. Computer Science.

Uncertain Multiagent Systems: Games and Learning H. Jin Kim, Songhwai Oh and Shankar Sastry University of California, Berkeley July 17, 2002 Decision-Making.

A Robotic Wheelchair for Crowded Public Environments Choi Jung-Yi EE887 Special Topics in Robotics Paper Review E. Prassler, J. Scholz, and.

Control and Decision Making in Uncertain Multiagent Hierarchical Systems June 10 th, 2002 H. Jin Kim and Shankar Sastry University of California, Berkeley.

Development of NEST Challenge Application: Distributed Pursuit Evasion Games (DPEGs) Bruno Sinopoli, Luca Schenato, Shawn Shaffert and Shankar Sastry With.

Chess Review May 11, 2005 Berkeley, CA Tracking Multiple Objects using Sensor Networks and Camera Networks Songhwai Oh EECS, UC Berkeley

P. Ögren (KTH) N. Leonard (Princeton University)

Pursuit and Evasion CS326A: Motion Planning Spring 2003 Final Project Eric Ng Huy Nguyen.

PEG Breakout Mike, Sarah, Thomas, Rob S., Joe, Paul, Luca, Bruno, Alec.

Continuum Crowds Adrien Treuille, Siggraph 王上文.

Pursuit Evasion Games (PEGs) Using a Sensor Network Luca Schenato, Bruno Sinopoli Robotics and Intelligent Machines Laboratory UC Berkeley

Dr. Shankar Sastry, Chair Electrical Engineering & Computer Sciences University of California, Berkeley.

An experiment on squad navigation of human and robots IARP/EURON Workshop on Robotics for Risky Interventions and Environmental Surveillance January 7th-8th,

Finding an Unpredictable Target in a Workspace with Obstacles LaValle, Lin, Guibas, Latombe, and Motwani, 1997 CS326 Presentation by David Black-Schaffer.

POLI di MI tecnicolano VISION-AUGMENTED INERTIAL NAVIGATION BY SENSOR FUSION FOR AN AUTONOMOUS ROTORCRAFT VEHICLE C.L. Bottasso, D. Leonello Politecnico.

Multi-vehicle Cooperative Control Raffaello D’Andrea Mechanical & Aerospace Engineering Cornell University u Progress on RoboFlag Test-bed u MLD approach.

Jason Li Jeremy Fowers Ground Target Following for Unmanned Aerial Vehicles.

June 12, 2001 Jeong-Su Han An Autonomous Vehicle for People with Motor Disabilities by G. Bourhis, O.Horn, O.Habert and A. Pruski Paper Review.

1 DARPA TMR Program Collaborative Mobile Robots for High-Risk Urban Missions Second Quarterly IPR Meeting January 13, 1999 P. I.s: Leonidas J. Guibas and.

Fuzzy control of a mobile robot Implementation using a MATLAB-based rapid prototyping system.

Multiple Autonomous Ground/Air Robot Coordination Exploration of AI techniques for implementing incremental learning. Development of a robot controller.

Vision-based Landing of an Unmanned Air Vehicle

Mapping and Localization with RFID Technology Matthai Philipose, Kenneth P Fishkin, Dieter Fox, Dirk Hahnel, Wolfram Burgard Presenter: Aniket Shah.

Forward-Scan Sonar Tomographic Reconstruction PHD Filter Multiple Target Tracking Bayesian Multiple Target Tracking in Forward Scan Sonar.

1 S ystems Analysis Laboratory Helsinki University of Technology Kai Virtanen, Tuomas Raivio and Raimo P. Hämäläinen Systems Analysis Laboratory Helsinki.

Game-Theoretic Analysis of Mobile Network Coverage David K.Y. Yau.

1 Distributed and Optimal Motion Planning for Multiple Mobile Robots Yi Guo and Lynne Parker Center for Engineering Science Advanced Research Computer.

1 S ystems Analysis Laboratory Helsinki University of Technology Kai Virtanen, Janne Karelahti, Tuomas Raivio, and Raimo P. Hämäläinen Systems Analysis.

Intelligence Surveillance and Reconnaissance System for California Wildfire Detection Presented by- Shashank Tamaskar Purdue University

Control and Decision Making in Uncertain Multi-agent Hierarchical Systems A Case Study in Learning and Approximate Dynamic Programming PI Meeting August.

Probabilistic Smart Terrain Dr. John R. Sullins Youngstown State University.

Multi-Player Pursuit Evasion Games, Learning, and Sensor Webs Shankar Sastry University of California, Berkeley ATO Novel Approaches to Information Assurance.

Path Planning Based on Ant Colony Algorithm and Distributed Local Navigation for Multi-Robot Systems International Conference on Mechatronics and Automation.

Planning Tracking Motions for an Intelligent Virtual Camera Tsai-Yen Li & Tzong-Hann Yu Presented by Chris Varma May 22, 2002.

Heterogeneous Teams of Modular Robots for Mapping and Exploration by Grabowski et. al.

University of Pennsylvania 1 GRASP Control of Multiple Autonomous Robot Systems Vijay Kumar Camillo Taylor Aveek Das Guilherme Pereira John Spletzer GRASP.

4/22/20031/28. 4/22/20031/28 Presentation Outline  Multiple Agents – An Introduction  How to build an ant robot  Self-Organization of Multiple Agents.

Understanding Complex Systems May 15, 2007 Javier Alcazar, Ph.D.

VEMANA INSTITUTE OF TECHNOLOGY,BANGALORE

Optimal Acceleration and Braking Sequences for Vehicles in the Presence of Moving Obstacles Jeff Johnson, Kris Hauser School of Informatics and Computing.

COGNITIVE APPROACH TO ROBOT SPATIAL MAPPING

CS b659: Intelligent Robotics

A Vision System for Landing an Unmanned Aerial Vehicle

Pursuit Evasion Games and Multiple View Geometry

Wireless Sensor Network Architectures

Berkeley UAV / UGV Testbed

Pursuit-Evasion Games with UGVs and UAVs

Segmentation of Dynamic Scenes

Vision Based Motion Estimation for UAV Landing

Probabilistic Pursuit-Evasion Games with UGVs and UAVs

Pursuit Evasion Games and Multiple View Geometry

Omnidirectional Vision-Based Formation Control

Formation Control of Nonholonomic Mobile Robots with Omnidirectional Visual Servoing and Motion Segmentation René Vidal Omid Shakernia Shankar.

Path Curvature Sensing Methods for a Car-like Robot

Timothy Boger and Mike Korostelev

Networks of Autonomous Unmanned Vehicles

Vision based automated steering

Distributed Control Applications Within Sensor Networks

In the land of the blind, the one eyed man is king

Robot Intelligence Kevin Warwick.

Jose-Luis Blanco, Javier González, Juan-Antonio Fernández-Madrigal

Market-based Dynamic Task Allocation in Mobile Surveillance Systems

Lecture 3: Environs and Algorithms

Configuration Space of an Articulated Robot

Area Coverage Problem Optimization by (local) Search

Presentation transcript:

David Shim Omid Shakernia A Hierarchical Approach to Probabilistic Pursuit-Evasion Games with Unmanned Ground and Aerial Vehicles Jin Kim René Vidal David Shim Omid Shakernia Shankar Sastry UC Berkeley

Outline Pursuit Evasion Game Scenario Previous Work Hierarchical Control Architecture Implementation on Ground/Air Vehicles Experiment/Simulation Platform Evaluation of Game Strategies Speed, Sensing, Intelligence Experimental & Simulation Results Conclusions and Current Research I will begin the talk by describing the pursuit-evasion scenario, and the previous work that has led up to our current contribution. Next I’ll describe a hierarchical control architecture that we proposed to implement the PEG on real UAV/UGVs. I’ll briefly described our fleet of robots, and the control architecture on these robots. Also, I’ll explain a novel experiment/simulation platform that allows us to perform PEG experiments with real robots, simulations with pure software as well as hardware in the loop simulations of PEG Our simulation/experiment platform enabled us to evaluate the performance of different pursuit policies, and how the performance of high level strategies vary with the speed, intelligence and sensing capabilies of the players in the game. Finaly present conclusions and directions for research ---------------------------- When we implemented the original pursuit strategies on real robots that have nontrivial sensing (vision) and complex dynamics (helicopter, nonholonomic robot, etc), we found many theoritical issues that were not considered with the original theoretical formulation. That formulation had a simplified discrete jump model of robots, and simplified sensing model.

Scenario Evade! This is scenario: There is an open field, out doors, with an unknown terrain and unknown number of trees and other obstacles. There are a group of unmanned ground vehicles, as well as a group of unmanned aerial vehicles. These ground and air vehicles can communicate with each other and form a TEAM of PURSUERS. The mission of the pursuers is to build a map of this environment and to capture another team of EVADERS The evaders can be actively trying to avoid being captured, (for example, by hiding behind obstacles, trees, etc. This is way we solve the problem: Divide the game arena into disjoint number of cells or an OCCUPANCY GRID Compute a probabilistic map of the arena, such that each cell has an associated probability of containing an evader or obstacle Given this probabilistic map, compute a PURSUIT STRATEGY which guides the pursers to be locations in the arena to maximize the probability of capturing an evader Evade!

Probabilistic Map Building Measurements Step sensor model Prediction step: evader motion model Hespanha, et. al. [CDC ’99, CDC ‘00] Optimal pursuit policies computationally infeasible Greedy Pursuit / random evader A few more details on probabilistic map building on the occupancy grid It is a recursive Bayesian approach, which builds up probabilities of an evader or obstacle occupying each cell in the grid, based on possibly noisy measurements of the pursuers. Given a probabilistic map, a the PREVIOUS time instant Use the measurements and sensor model to compute probabilities of evaders occupying cells at the current time instant Next, based on a evader motion model, predict the locations of evaders at the next time instant In Hespanha [cdc99]: First to proposed the approach of combining probabilistic map building and pursuit-policies This was an abstract theoretical work, which considered discrete world, with discrete-jump dynamics of robots, and highly simplified sensing model It was shown that the optimal pursuit policy, which minimizes the expected capture time of the evaders is in general infeasible to compute in real time. Proposed a greedy pursuit policy, where each pursuer maximized probability of capturing evader at next time instant Very nice theoretical results for this greedy policy Probability of having a finite capture time is 1 Expected value of capture time is finite Hespanha [cdc00] One step nash equilibium game theoretic approach

PEG on UAVs and UGVs Vidal et. al. [ICRA ‘01] Hierarchical architecture Implement regulation layer control Kim et. al. [CDC ’01] Implement high level strategies Global-max pursuit Intelligent evader Evaluate Pursuit Policy All this theory is nice, but is at a very abstract level, Many issues remain when trying to implement on real mobile robots Motion of evader is discrete Dynamics of agents not included Oversimplifies sensor model (false positives/negatives) When you implement in real systems, new theoretical issues appear: Modeling different cameras Continuous dynamics of agents, dynamics Communication issues Sensing model in the original theory was greatly simplified So, for the last year and a half our research thrust has been to implement PEGs on real UAVs and UGVS In Vidal et. Al [icra01] We began to implement the PEG on mobile robots, etc Implementation of test-bed on mobile robots: UAVs and UGVs Proposed a Hierarchical approach Implemented Low level regulation on UAVs and UGVs Implemented sensing elements, INS, GPS, integration, Computer vision, etc In Kim et. Al [CDC01] Implement high level strategy planner Perform full probabilistic PEG on real UAVs and UGVs Evaluate performance of pursuit policies in terms of: Dynamics of mobile robots (speed, maneuverability, etc) Sensing capabilities of pursuers (type of vision sensor, range, field of view, etc) In particular, we got some interesting results on the performance of pursuit policies with respect to different types vision systems, that agree with the vision systems we see in predators and pray in the animal kingdom Predators tend to have narrow field of view, forward looking eyes Prey have wide field of view omni-directional vision

Hierarchical Architecture map builder terrain evader control signals [4n] evaders detected obstacles pursuers positions Desired pursuers state of helicopter & height over terrain obstacles detected tactical planner & regulation actuator lin. accel. & ang. vel. [6n] inertial [3n] height over terrain [n] evaders detected vehicle-level sensor fusion communications network tactical planner trajectory regulation position of evader(s) position of obstacles strategy planner position of pursuers agent dynamics encoders INS GPS ultrasonic altimeter vision Exogenous disturbances The proposed hierarchical architecture for PEG comes from the theory of hierarchical hybrid systems. This architecture has been successfully applied to controlling platoons of cars in automated highway systems, as well as for conflict resolution in air traffic management systems, and flight vehicle management systems for controlling UAVs. The idea is to partition is large and complex control problem into various layers of abstraction. Strategic planner / Map Building: pursuit policy computation Communication layer tactical planner, and sensor fusion path planning, obstacle avoidance, position estimation of evader, Regulation layer real-time control, GPS, vision system

Experiment/Simulation Platform UAV MATLAB/ Simulink Vision, Communication, Path Planning Real-time Control Tactical Planner Navigation Computer Strategic Planner Map Builder UGV Tactical Planner Robot Controller Now I’ll described the unified Experiment/Simulation platform which we have built to perform the pursuit-evasion game REAL-TIME CONTROL At the lowest level of the hierarchy, is the navigation and real-time control for our fleet of UAVs and UGVs. The real-time control of the UAV was part of the PhD work of David Shim.l The UAV regulation layer provides services such as hover, lateral motion, pirouette, etc Also manages the Inertial navigation system and GPS to know exactly the position of UAV The UGV is a Pioneer2AT robot, which comes with software for simple motion control. We have also integrated a GPS system and compass similar to the UAV TACTICAL and SENSOR FUSION The UAV and UGV have integrated Vision systems which they use for sensing the position of obstacles and evaders Currently the vision-based evader position estimation is based on COLOR TRACKING Simple sonar-based obstacle avoidance behaviors for UGVs STRATEGY and MAP BUILDING This is implemented in MATLAB and SIMULINK, and they communicate with the UAVs and UGVs through TCP sockets Strategy is sheltered from details of dynamics of robots: we can apply a GREEDY policy that was designed in that abstract theory directly to any number of UAVs and UGVs

Experiment/Simulation Platform UAV model MATLAB/ Simulink System ID model, Camera model, INS/GPS model UAV Simulator Strategic Planner Map Builder UGV model Robot model, Camera model, Dead reckoning Pioneer Simulator A further benefit is that we can use the same STRATEGY PLANNER and MAP BUILDER in a SIMULATION by just replacing the actual robots with software simulation models of the robots We have a UAV model that was obtained by system ID (same model one used build the real-time controller) The pioneers come with a simulation model that performs dead reckoning We made a simple camera model, where we compute each pursuers region of visibility based on its position and field of view of its camera. In the simulation, if an evader or obstacle is within the region of visibility, then we a probability of false positives and false negatives of detection. Further, this is very flexible platform where we can purform full hardware experiments, full software simulations, and any mixture of hardware in the loop simulations.

PEG Experiment PEG with four UGVs Global-Max pursuit policy Simulated camera view (radius 7.5m with 50o FOV) Pursuer=0.3m/s Evader=0.1m/s

Pursuit Policy: Sensing, Intelligence, Speed Greedy Global-max Visibility Region Forward View Omni-directional View Evasion Policy Random Global-min Evader speed Evaluated policies against different vision capabilities Trapezoidal (narrow FOV) vs. OMNI-directional (wide FOV) Both vision systems covered same number of cells Narrow field of view Can see farther into distance Can sweep a larger area simply by rotation Omni-directional (wide angle) Can see in all angles Rotation does not help see more

Pursuit Policy vs. Vision System Why global max outperforms greedy: Before we did real implementation, both policies were performing about the same Now, with all the dynamics and sensing, global-max 3 times better Reason Greedy policy: more often gives change of direction, pursuers have to spend more time rotating, which effectively reduces the translational speed (in practice you cannot rotate and move at full speed at same time) Global max: changes less frequently: pursuers don’t spend as much time changing directions, effectively faster Why trapezoidal better than omni-directional? Omni camera and trapezoidal camera cover same number of cells By rotating in place pursuers can effectively see a larger area Pursuers see further into distance Agrees with predator/prey situations we find in nature

Evader Speed vs. Policy Next we took the best performing pursuit policy, global-max, and the best vision system (forward view), and studying how changing the speed of the evader, and the intelligence of the evader affected capture time. The the case of an intelligent evader, the evader also builds a probabilistic map of pursuers and evaders, and follows a GLOBAL-MIN policy, where it tries to go to the location in the map with minim probability of being captured. Also, we kept the pursuers speed constant at 0.3m/s and did experiments where evaders were slow (0.1m/s) or faster (0.5m/s). It is intuitive to see that a FAST INTELLIGENT evader takes longer to be captured than a SLOW INTELLIGENT evader What is interesting to notice is a FAST RANDOM takes LESS time to capture than a SLOW RANDOM evader This was actually predicted in the original work of Hespanha, where in the extreme case that the evader is staying in place, the probability of capturing in finite time is less that 1 One important point is that the total capture time is some combination of exploration time, where the pursuers are building a map of the environment, as well as pursuit time ------------------------------------ * There must be some U shaped curve with the optimal speed for each pursuer

PEG: 4 UGVs and 1 UAV

Conclusions Conclusions Current Research Hierarchical architecture applied to control multiple agents for pursuit evasion scenario Evaluated strategies vs. speed, sensing and intelligence Global-max outperforms greedy in a real scenario Forward view outperforms Omni-view Vision Agrees with biological predator/prey vision systems Current Research Multi-Body Structure from Motion for Pursuit-Evasion Games [submitted IFAC ’02] Collision Avoidance and UAV Path Planning Monte Carlo based learning of Pursuit Policies In practice, color tracking is not feasible, Need vision system that is able to identify and track each evader using features different than color Whole body of computer vision literature for one moving object: currently generalizing that theory for multiple moving objects

THE END