GaTAC: A Scalable and Realistic Testbed for Multiagent Decision Making Ekhlas Sonu, Prashant Doshi Dept. of Computer Science University of Georgia Athens,

Slides:



Advertisements
Similar presentations
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Advertisements

Formal Modelling of Reactive Agents as an aggregation of Simple Behaviours P.Kefalas Dept. of Computer Science 13 Tsimiski Str Thessaloniki Greece.
Intelligent Agents Russell and Norvig: 2
Design of Attitude and Path Tracking Controllers for Quad-Rotor Robots using Reinforcement Learning Sérgio Ronaldo Barros dos Santos Cairo Lúcio Nascimento.
Technical Advisor : Mr. Roni Stern Academic Advisor : Dr. Meir Kalech Team members :  Amit Ofer  Liron Katav Project Homepage :
Zach Ramaekers Computer Science University of Nebraska at Omaha Advisor: Dr. Raj Dasgupta 1.
Multiple Marine Vehile Deconflicted Path Planning with Currents and Communication Constraints A. Häusler 1, R. Ghabcheloo 2, A. Pascoal 1, A. Aguiar 1.
1 Autonomous Controller Design for Unmanned Aerial Vehicles using Multi-objective Genetic Programming Choong K. Oh and Gregory J. Barlow U.S. Naval Research.
Automatic Control & Systems Engineering Autonomous Systems Research Mini-UAV for Urban Environments Autonomous Control of Multi-UAV Platforms Future uninhabited.
Integrating POMDP and RL for a Two Layer Simulated Robot Architecture Presented by Alp Sardağ.
Design of Autonomous Navigation Controllers for Unmanned Aerial Vehicles using Multi-objective Genetic Programming Gregory J. Barlow March 19, 2004.
8/22/20061 Maintaining a Linked Network Chain Utilizing Decentralized Mobility Control AIAA GNC Conference & Exhibit Aug. 21, 2006 Cory Dixon and Eric.
INTEGRATED PROGRAMME IN AERONAUTICAL ENGINEERING Coordinated Control, Integrated Control and Condition Monitoring in Uninhabited Air-Vehicles Ian Postlethwaite,
Annual SERC Research Review - Student Presentation, October 5-6, Extending Model Based System Engineering to Utilize 3D Virtual Environments Peter.
DCT 1123 PROBLEM SOLVING & ALGORITHMS INTRODUCTION TO PROGRAMMING.
Sérgio Ronaldo Barros dos Santos, Cairo Lúcio Nascimento Júnior,
Špindlerův Mlýn, Czech Republic, SOFSEM Semantically-aided Data-aware Service Workflow Composition Ondrej Habala, Marek Paralič,
Multi-Agent Model to Multi-Process Transformation A Housing Market Case Study Gerhard Zimmermann Informatik University of Kaiserslautern.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
I. Pribela, M. Ivanović Neum, Content Automated assessment Testovid system Test generator Module generators Conclusion.
Controlling and Configuring Large UAV Teams Paul Scerri, Yang Xu, Jumpol Polvichai, Katia Sycara and Mike Lewis Carnegie Mellon University and University.
Sérgio Ronaldo Barros dos Santos (ITA-Brazil) Sidney Nascimento Givigi Júnior (RMC-Canada) Cairo Lúcio Nascimento Júnior (ITA-Brazil) Autonomous Construction.
Adapted for STLP Camp ‘09 Mobile Robots Why do robots need to move?
COMP 410 & Sky.NET May 2 nd, What is COMP 410? Forming an independent company The customer The planning Learning teamwork.
Zhiyong Wang In cooperation with Sisi Zlatanova
Approach Participants controlled a UAV, flying over the Bagram Air Force Base, and were tasked with the goal of reaching a designated target. Success was.
Leslie Luyt Supervisor: Dr. Karen Bradshaw 2 November 2009.
Generalized and Bounded Policy Iteration for Finitely Nested Interactive POMDPs: Scaling Up Ekhlas Sonu, Prashant Doshi Dept. of Computer Science University.
Introduction To System Analysis and Design
Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and David H.C. Du Dept. of.
Object Management Group (OMG) Specifies open standards for every aspect of distributed computing Multiplatform Model Driven Architecture (MDA)
Symbiotic Simulation of Unmanned Aircraft Systems (UAS)
COBXXXX EXPERIMENTAL FRAMEWORK FOR EVALUATION OF GUIDANCE AND CONTROL ALGORITHMS FOR UAVS Sérgio Ronaldo Barros dos Santos,
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
Haley: A Hierarchical Framework for Logical Composition of Web Services Haibo Zhao, Prashant Doshi LSDIS Lab, Dept. of Computer Science, University of.
Using Reinforcement Learning to Model True Team Behavior in Uncertain Multiagent Settings in Interactive DIDs Muthukumaran Chandrasekaran THINC Lab, CS.
Modelling and Open Loop Simulation of Reentry Trajectory for RLV Missions Ashok Joshi and K. Sivan Department of Aerospace Engineering Indian Institute.
University of Windsor School of Computer Science Topics in Artificial Intelligence Fall 2008 Sept 11, 2008.
Service Service metadata what Service is who responsible for service constraints service creation service maintenance service deployment rules rules processing.
Network UAV C3 Stage 1 Final Briefing Timothy X Brown University of Colorado at Boulder Interdisciplinary Telecommunications Program Electrical and Computer.
Institute for Computer Science VI Autonomous Intelligent Systems
End-to-End Efficiency (E 3 ) Integrating Project of the EC 7 th Framework Programme General View of the E3 Prototyping Environment for Cognitive and Self-x.
Chapter 3 System Performance and Models Introduction A system is the part of the real world under study. Composed of a set of entities interacting.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
The set of SE models include s those that are BE. It further includes models that include identical distributions over the subject agent’s action observation.
A Roadmap towards Machine Intelligence
1 Multiagent Teamwork: Analyzing the Optimality and Complexity of Key Theories and Models David V. Pynadath and Milind Tambe Information Sciences Institute.
Learning Team Behavior Using Individual Decision Making in Multiagent Settings Using Interactive DIDs Muthukumaran Chandrasekaran THINC Lab, CS Department.
Thrust IIB: Dynamic Task Allocation in Remote Multi-robot HRI Jon How (lead) Nick Roy MURI 8 Kickoff Meeting 2007.
Learning for Physically Diverse Robot Teams Robot Teams - Chapter 7 CS8803 Autonomous Multi-Robot Systems 10/3/02.
Generalized Point Based Value Iteration for Interactive POMDPs Prashant Doshi Dept. of Computer Science and AI Institute University of Georgia
Distributed cooperation and coordination using the Max-Sum algorithm
Presented by Yu-Shun Wang Advisor: Frank, Yeong-Sung Lin Near Optimal Defense Strategies to Minimize Attackers’ Success Probabilities for networks of Honeypots.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
On the Difficulty of Achieving Equilibrium in Interactive POMDPs Prashant Doshi Dept. of Computer Science University of Georgia Athens, GA Twenty.
Learning Procedural Knowledge through Observation -Michael van Lent, John E. Laird – 인터넷 기술 전공 022ITI02 성유진.
KNU RTLAB A Real-Time Linux System For Autonomous Navigation And Flight Attitude Control Of An Uninhabited Aerial Vehicle Charles E. Hall, Jr. Mechanical.
Students: Yossi Turgeman Avi Deri Self-Stabilizing and Efficient Robust Uncertainty Management Instructor: Prof Michel Segal.
Marilyn Wolf1 With contributions from:
Enhancements for Voltaire’s InfiniBand simulator
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
Chapter 1: Introduction to Systems Analysis and Design
John Backes, Rockwell Collins Dan DaCosta, Rockwell Collins
Sérgio Ronaldo Barros dos Santos Cairo Lúcio Nascimento Júnior
COMP60621 Fundamentals of Parallel and Distributed Systems
Market-based Dynamic Task Allocation in Mobile Surveillance Systems
John Backes, Rockwell Collins Dan DaCosta, Rockwell Collins
COMP60611 Fundamentals of Parallel and Distributed Systems
Chapter 1: Introduction to Systems Analysis and Design
Presentation transcript:

GaTAC: A Scalable and Realistic Testbed for Multiagent Decision Making Ekhlas Sonu, Prashant Doshi Dept. of Computer Science University of Georgia Athens, GA, 30602, USA Demo for AAMAS-2012

Objective To design and implement a realistic testbed to evaluate the performance of decision making algorithms in a problem domain that is: Relevant in cooperative, competitive and mixed settings i.e. across different frameworks such as Dec-POMDP, I- POMDP, etc. Scalable in problem size No. of Physical States Flexible in agent capabilities Number and type of actions and observations Extensible in number of agents and adaptable to agent types

Motivation Recently, there have been substantial development in multi-agent decision making algorithms that has driven researchers to go beyond the traditional toy problem domains such as the Tiger Problem, Machine Maintenance Problem, Grid meeting, etc. Some larger problem domains include Cooperative Box-Pushing, Mars Rover, etc.: Applied in cooperative settings

A Desirable Problem Domain A desirable problem domain for multi-agent decision making must be: Scalable in physical states Flexible in agent capabilities actions & observations Extensible in number of agents Relevant to cooperative, competitive and mixed settings Able to produce solutions rich in structure Realistic with a popular appeal

Proposed Scenario: Autonomous Unmanned Aerial Vehicles Application: Law enforcement [Murphy, Cycon; 1998] Fighting forest fires [Casbeer, et.al.; 2005] Border surveillance [Haddal, Gertler; 2010] Wartime reconnaissance Uncertainty in AUAVs due to: Uncertainty about physical state Noisy actuators and sensors Added Complexity: Presence of other agents May be cooperative or competitive Related Research Focuses on formulating flight trajectories [R. Bernard, et.al.,2002, S.M. Li, et.al 2002]

An example decision making scenario with AUAVs  We propose a problem domain involving a Autonomous Uninhabited Aerial Vehicles  The operating theatre may be divided into various sectors (as is a common practice) and may be represented as a grid of a predetermined size

An example decision making scenario with AUAVs  An example UAV recon problem may involve a UAV (I) (or a team of UAVs) trying to apprehend a target (T) (or a team of moving targets) while another team of UAVs (J) tries to help the target(s) escape to a safe house  Of course the exact problem description is flexible S.H.

GaTAC: Overview Georgia Testbed for Autonomous Control of vehicles (GaTAC): computer simulation framework for evaluating solution to a UAV reconnaissance problem. It provides: Hyperrealistic 3D rendering of AUAV acting in real world scenario Scalability in problem size and number of agents Flexibility in designing actions and observations of each agent Input: Agent control function (policies) for all agents generated by any (multi-agent) decision making algorithm Output: Simulation of policies on a flight simulator Results of simulations may be compared for policies generated by different algorithms using metrics such as number of captures, cumulative reward, etc.

GaTAC: Workflow We begin with a formal description for any UAV decision making problem Formulate problem as.dpomdp/.ipomdp file Configure GaTAC for simulation (i.e. setup environment) GaTAC

GaTAC: Workflow.dpomdp/.ipomdp GaTAC Solve using algo. of choice Obtain policies Policies for each agent are fed to GaTAC to be simulated and evaluated

GaTAC: Workflow.dpomdp/.ipomdp GaTAC Solve Simulate policies and evaluate results using metrics such as number of success, cumulative rewards, etc.

GaTAC Components Each instance of GaTAC has three components: Flight Simulator Off-shelf open source flight simulator on which policies are simulated One instance of flight simulator for each agent Autonomous Control Module Control each aircraft and make it behave according to the policy on the flight simulator Communication Module Send aircraft behavior from ACM to flight simulator Communicate with other agents (if required) GaTAC instances may run on different machines Connected using communication module

Architecture Flight Simulator Communication Module Autonomous Control Module Communication between agents

Flight Simulator FlightGear: Open-source (written in C++) Multi-platform Hyperrealistic 3D graphics 3D virtual map Flexible with choices of Multiple models of aircrafts Locations to act as operating environment Weather condition, time of day, etc. 6 DOF flight dynamics model Simulates effects of airflow on different parts of aircraft

FlightGear in Operating Scenario FG utilizes realistic 3D scenery available from TerraGear Provides multiple view of the flying aircraft Cockpit view, tail view, etc. Multiple instances of FG may be linked together through external servers- ideal for multi-agent settings

Autonomous Control Module Low Level Actions Control Rudder, Throttle, Aileron, Roll, Pitch, etc. High Level Actions Takeoff, Fly-Straight, Turn, Change Altitude Agent Actions on Grid Used to algorithmically control the aircraft and make it behave according to policy: 3 levels of hierarchy Perform low level actions to control aircraft by adjusting parameters along the 6DoF Perform simple tasks that represent simple aircraft behaviors Actions constructed using high level actions to represent actions of agents in the problem at hand

Communication Module Establish a communication channel between: Autonomous Control Module and FlightGear Between each agent (if required e.g. in team settings) Communication channels use UDP, httpd and XML Communicate low-level flight control data from an instance of autonomous control module to respective instance of FlightGear Communicate aircraft position to all other instances of GaTAC in real time (used to formulate observations)

Communication Module Functions Send control data from ACM to FG May adjust flight parameters such as controlling thrust, rudder, aileron, altitude, etc. Receive the aircraft’s flight dynamics in real time from FG and send to ACM for path correction Position, aircraft orientation on 6 DoF, flight speed, altitude, etc. May be used to pass messages between GaTAC instances (when communication between agents is required)

GaTAC Control Algorithm Read policy from file Start FlightGear Fly according to policy Get Observations/ Next Action Mission Accomplished Observation =Successful? Yes No Obtain action to perform from the policy Agent action systematically broken down into high-level and then low-level actions to control the aircraft algorithmically Repeat until termination condition reached Next action may be obtained from policy using the observation

Conclusions GaTAC: Can act as an open-source testbed for decision theoretic agents May be used to compare different algorithms irrespective of decision making framework (Dec-POMDP, I-POMDP, MTDP, etc.) Is extensible: no upper bound on size of problem No. of physical states, no. of agents, no. and types of actions & observations Facilitates deployment of decision theoretic agents in hyper-realistic real world settings (cooperative, competitive, or mixed) Easily configurable for simulating any UAV problem Provides for communication between agents May be extended to include choice of locations and aircrafts

Demo