Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Review.

Slides:

Advertisements

Similar presentations

IMA 2.5: Software Architecture and Development Environment Roberto Olivares M.S. Electrical Engineering Vanderbilt University, Spring 2003.

Advertisements

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California

ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS

Autonomic Systems Justin Moles, Winter 2006 Enabling autonomic behavior in systems software with hot swapping Paper by: J. Appavoo, et al. Presentation.

A Component Based Programming Framework for Autonomic Applications Hua Liu, Manish Parashar, and Salim Hariri ICAC ‘04 John Otto Wi06 CS 395/495 Autonomic.

AuRA: Principles and Practice in Review

ProActive Task Manager Component for SEGL Parameter Sweeping Natalia Currle-Linde and Wasseim Alzouabi High Performance Computing Center Stuttgart (HLRS),

Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems Georgia Tech College of Computing Georgia Tech Research.

SSP Re-hosting System Development: CLBM Overview and Module Recognition SSP Team Department of ECE Stevens Institute of Technology Presented by Hongbing.

Introduction to HCC and HCM. Human Centered Computing Philosophical-humanistic position regarding the ethics and aesthetics of a workplace Any system.

Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.

1 Quality Objects: Advanced Middleware for Wide Area Distributed Applications Rick Schantz Quality Objects: Advanced Middleware for Large Scale Wide Area.

REES: Reasoning Engines Evaluation Shell version 3.0 Automated Reasoning Lab University of California, Irvine.

Integrating POMDP and RL for a Two Layer Simulated Robot Architecture Presented by Alp Sardağ.

Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 7: Expert Systems and Artificial Intelligence Decision Support.

CASE Tools CIS 376 Bruce R. Maxim UM-Dearborn. Prerequisites to Software Tool Use Collection of useful tools that help in every step of building a product.

Multi-Agent System for Online Auctions Jose Manuel Valladares Pernas CIS Master Project - Fall 2004 Advisor: Dr. Haiping Xu.

System Engineering Instructor: Dr. Jerry Gao. System Engineering Jerry Gao, Ph.D. Jan System Engineering Hierarchy - System Modeling - Information.

1212 Management and Communication of Distributed Conceptual Design Knowledge in the Building and Construction Industry Dr.ir. Jos van Leeuwen Eindhoven.

8/9/20151 DARPA-MARS Kickoff Adaptive Intelligent Mobile Robots Leslie Pack Kaelbling Artificial Intelligence Laboratory MIT.

Annual SERC Research Review - Student Presentation, October 5-6, Extending Model Based System Engineering to Utilize 3D Virtual Environments Peter.

Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Review.

The 2nd International Conference of e-Learning and Distance Education, 21 to 23 February 2011, Riyadh, Saudi Arabia Prof. Dr. Torky Sultan Faculty of Computers.

©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 18 Slide 1 Software Reuse.

Software Engineering Muhammad Fahad Khan

©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.

Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.

Controlling and Configuring Large UAV Teams Paul Scerri, Yang Xu, Jumpol Polvichai, Katia Sycara and Mike Lewis Carnegie Mellon University and University.

Software Agents: An Overview by Hyacinth S. Nwana and Designing Behaviors for Information Agents by Keith Decker, Anandeep Pannu, Katia Sycara and Mike.

COP 4620 / 5625 Programming Language Translation / Compiler Writing Fall 2003 Lecture 10, 10/30/2003 Prof. Roy Levow.

DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.

Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 20 Object-Oriented.

Providing Policy Control Over Object Operations in a Mach Based System By Abhilash Chouksey

Odyssey A Reuse Environment based on Domain Models Prepared By: Mahmud Gabareen Eliad Cohen.

Session 2a, 10th June 2008 ICT-MobileSummit 2008 Copyright E3 project, BUPT Autonomic Joint Session Admission Control using Reinforcement Learning.

Model-Driven Analysis Frameworks for Embedded Systems George Edwards USC Center for Systems and Software Engineering

Object-oriented Analysis and Design Stages in a Software Project Requirements Writing Analysis Design Implementation System Integration and Testing Maintenance.

Max Ong University of Sheffield, UK. AHM 2004 Session 2.3: Workflow Composition, Wednesday 1 st September 2004, 4pm. Workflow Advisor in DAME Abstract.

Boundary Assertion in Behavior-Based Robotics Stephen Cohorn - Dept. of Math, Physics & Engineering, Tarleton State University Mentor: Dr. Mircea Agapie.

MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.

EIS'2007 (Salamanca, Spain, March 22-24, 2007) 1 Towards an Extended Model of User Interface Adaptation: the ISATINE framework 1 Víctor M. López Jaquero,

Model-Driven Engineering of Behaviors in User Interfaces Efrem Mbaki & Jean Vanderdonckt Université catholique de Louvain (UCL) Louvain School of Management.

Spatio-Temporal Case-Based Reasoning for Behavioral Selection Maxim Likhachev and Ronald Arkin Mobile Robot Laboratory Georgia Tech.

CEN5011, Fall CEN5011 Software Engineering Dr. Yi Deng ECS359, (305)

Evaluation of Agent Building Tools and Implementation of a Prototype for Information Gathering Leif M. Koch University of Waterloo August 2001.

DARPA ITO/MARS Project Update Vanderbilt University A Software Architecture and Tools for Autonomous Robots that Learn on Mission K. Kawamura, M. Wilkes,

Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Kickoff.

Memory Hierarchy Adaptivity An Architectural Perspective Alex Veidenbaum AMRM Project sponsored by DARPA/ITO.

MODEL-BASED SOFTWARE ARCHITECTURES.  Models of software are used in an increasing number of projects to handle the complexity of application domains.

August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.

Review of Parnas’ Criteria for Decomposing Systems into Modules Zheng Wang, Yuan Zhang Michigan State University 04/19/2002.

Selection of Behavioral Parameters: Integration of Case-Based Reasoning with Learning Momentum Brian Lee, Maxim Likhachev, and Ronald C. Arkin Mobile Robot.

Learning Momentum: Integration and Experimentation Brian Lee and Ronald C. Arkin Mobile Robot Laboratory Georgia Tech Atlanta, GA.

Unclassified//For Official Use Only 1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime.

From Use Cases to Implementation 1. Structural and Behavioral Aspects of Collaborations  Two aspects of Collaborations Structural – specifies the static.

T EST T OOLS U NIT VI This unit contains the overview of the test tools. Also prerequisites for applying these tools, tools selection and implementation.

Learning Behavioral Parameterization Using Spatio-Temporal Case-Based Reasoning Maxim Likhachev, Michael Kaess, and Ronald C. Arkin Mobile Robot Laboratory.

Control-Theoretic Approaches for Dynamic Information Assurance George Vachtsevanos Georgia Tech Working Meeting U. C. Berkeley February 5, 2003.

Software Architecture Architecture represents different things from use cases –Use cases deal primarily with functional properties –Architecture deals.

From Use Cases to Implementation 1. Mapping Requirements Directly to Design and Code  For many, if not most, of our requirements it is relatively easy.

CLASSIFICATION OF DESIGN PATTERNS Hladchuk Maksym.

Genetic Algorithms. Solution Search in Problem Space.

University of Pennsylvania 1 GRASP Control of Multiple Autonomous Robot Systems Vijay Kumar Camillo Taylor Aveek Das Guilherme Pereira John Spletzer GRASP.

1 CASE Computer Aided Software Engineering. 2 What is CASE ? A good workshop for any craftsperson has three primary characteristics 1.A collection of.

Introduction to Machine Learning, its potential usage in network area,

Introduction to Design Patterns

Model-Driven Analysis Frameworks for Embedded Systems

SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION

Presentation transcript:

Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Review Meeting - January 2000 Approved for public release: distribution unlimited

Georgia Tech / Mobile Intelligence 2 Personnel n Georgia Tech –College of Computing < Prof. Ron Arkin < Prof. Chris Atkeson < Prof. Sven Koenig –Georgia Tech Research Institute < Dr. Tom Collins n Mobile Intelligence Inc. < Dr. Doug MacKenzie n Students –Amin Atrash –Bhaskar Dutt –Brian Ellenberger –Mel Eriksen –Max Likachev –Brian Lee –Sapan Mehta

Georgia Tech / Mobile Intelligence 3 Adaptation and Learning Methods n Case-based Reasoning for: –deliberative guidance (“wizardry”) –reactive situational- dependent behavioral configuration n Reinforcement learning for: –run-time behavioral adjustment –behavioral assemblage selection n Probabilistic behavioral transitions –gentler context switching –experience-based planning guidance Available Robots and MissionLab Console

Georgia Tech / Mobile Intelligence 4 1. Learning Momentum n Reactive learning via dynamic gain alteration (parametric adjustment) n Continuous adaptation based on recent experience n Situational analyses required n In a nutshell: If it works, keep doing it a bit harder; if it doesn’t, try something different

Georgia Tech / Mobile Intelligence 5 Learning Momentum - Design n Integrated into MissionLab in CNL Library n Works with MOVE_TO_GOAL, COOP, and AVOID_OBSTACLES n Has not yet been extended to all behaviors

Georgia Tech / Mobile Intelligence 6 Simple Example

Georgia Tech / Mobile Intelligence 7 Learning Momentum - Future Work n Extension to additional CNL behaviors n Make thresholds for state determination rules accessible from cfgedit n Integrate with CBR and RL

Georgia Tech / Mobile Intelligence 8 2. CBR for Behavioral Selection n Another form of reactive learning n Previous systems include: ACBARR and SINS n Discontinuous behavioral switching

Georgia Tech / Mobile Intelligence 9 Case-Based Reasoning for Behavioral Selection - Current Design n The CBR Module is designed as a stand-alone module n A hard-coded library of eight cases for MoveToGoal tasks n Case - a set of parameters for each primitive behavior in the current assemblage and index into the library

Georgia Tech / Mobile Intelligence 10 Case-Based Reasoning for Behavioral Selection - Current Results n On the Left - MoveToGoal without CBR Module n On the Right - MoveToGoal with CBR Module

Georgia Tech / Mobile Intelligence 11 Case-Based Reasoning for Behavioral Selection - Future Plans n Two levels of operation: choosing and adapting parameters for selected behavior assemblages as well as choosing and adapting the whole new behavior assemblages n Automatic learning and modification of cases through experience n Improvement of case/index/feature selection and adaptation n Integration with Q-learning and Momentum Learning n Identification of relevant task domain case libraries

Georgia Tech / Mobile Intelligence Reinforcement learning for Behavioral Assemblage Selection n Reinforcement learning at coarse granularity (behavioral assemblage selection) n State space tractable n Operates at level above learning momentum (selection as opposed to adjustment) n Have added the ability to dynamically choose which behavioral assemblage to execute n Ability to learn which assemblage to choose using wide variety of Reinforcement Learning methods: Q-learning, Value Iteration, (Policy Iteration in near future)

Georgia Tech / Mobile Intelligence 13 Selecting Behavioral Assemblages - Specifics Replace the FSA with an interface allowing user to specify the environmental and behavioral states Agent learns transitions between behavior states Learning algorithm is implemented as an abstract module and different learning algorithms can be swapped in and out as desired. CNL function interfaces robot executable and learning algorithm

Georgia Tech / Mobile Intelligence 14 Integrated System

Georgia Tech / Mobile Intelligence 15 Architecture Learning Algorithm (Qlearning) Cfgedit CNL function Behavioral States Environmental States CDL code MissionLab

Georgia Tech / Mobile Intelligence 16 RL - Next Steps Change implementation of Behavioral Assemblages in Missionlab from simply being statically compiled into the CDL code to a more dynamic representation. Create relevant scenarios and test Missionlab’s ability to learn good solutions Look at new learning algorithms to exploit the advantages of Behavioral Assemblages selection Conduct extensive simulation studies then implement on robot platforms

Georgia Tech / Mobile Intelligence CBR “Wizardry” n Experience-driven assistance in mission specification n At deliberative level above existing plan representation (FSA) n Provides mission planning support in context

Georgia Tech / Mobile Intelligence 18 CBR Wizardry / Usability Improvements n Current Methods: Using GUI to construct FSA - may be difficult for inexperienced users. n Goal: Automate plan creation as much as possible while providing unobtrusive support to user.

Georgia Tech / Mobile Intelligence 19 Tentative Insertion of FSA Elements: A user support mechanism currently being worked on n Some FSA elements very often occur together. n Statistical data on this can be gathered. n When user places a state, a trigger and state that follow this state often enough can be tentatively inserted into the FSA. n Comparable to URL completion features in web browsers. State A State C Trigger B Statistical Data Tentative Additions User places State A

Georgia Tech / Mobile Intelligence 20 Recording Plan Creation Process n Pinpointing where user has trouble during plan creation is important prerequisite to improving software usability. n There was no way to record plan creation process in MissionLab. n Module now created that records user’s actions as (s)he creates the plan. This recording can later be played back and points where the user stumbled can thus be identified. The Creation of a Plan

Georgia Tech / Mobile Intelligence 21 Wizardry - Future Work n Use of plan creation recordings during usability studies to identify stumbling blocks in process. n Creation of plan templates (frameworks of some commonly used plan types e.g. reconnaissance missions) n Collection of library of plans which can be placed at different points in “plan creation tree”. This can then be used in a plan creation wizard. Plan 1 Plan 2Plan 3Plan 4 Plan 5Plan 6Plan 7Plan 8 Plan Creation Tree

Georgia Tech / Mobile Intelligence Probabilistic Planning and Execution n “Softer, kinder” method for matching situations and their perceptual triggers n Expectations generated based on situational probabilities regarding behavioral performance (e.g., obstacle densities and traversability), using them at planning stages for behavioral selection n Markov Decision Process, Dempster-Shafer, and Bayesian methods to be investigated

Georgia Tech / Mobile Intelligence 23 Probabilistic Planning and Execution - Concept n Find the optimal plan despite sensor uncertainty about the current environment Mission Editor POMDP Solver POMDP Specification MissionLab.cdl FSA

Georgia Tech / Mobile Intelligence 24 Probabilistic Methods: Current Status mine no mine clear mine move scan move POMDP FSA MissionLab (current work) clear mine P(detect mine|mine) = 0.8 P(detect mine|no mine) = 0

Georgia Tech / Mobile Intelligence 25 Varying Costs Different Plans mine no mine clear mine move scan move POMDP MissionLab (current work) clear mine P(detect mine|mine) = 0.8 P(detect mine|no mine) = 0 FSA

Georgia Tech / Mobile Intelligence 26 MIC’s Role n Develop conceptual plan for integrating learning algorithms into MissionLab n Guide students performing integration n Assist in designing usability studies to evaluate integrated system n Guide performance and evaluation of usability studies n Identify key technologies in MissionLab which could be commercialized n Support technology transfer to a designated company for commercialization

Georgia Tech / Mobile Intelligence 27 Schedule