Jaroslaw Kutylowski 1 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität An Overview of Robot Behavior Control with insight into.

Slides:



Advertisements
Similar presentations
Friedhelm Meyer auf der Heide 1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Algorithmic Aspects of Dynamic Intelligent Systems.
Advertisements

Friedhelm Meyer auf der Heide 1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Energy-Efficient Local Strategies for Robotic.
Reactive and Potential Field Planners
ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Using Parallel Genetic Algorithm in a Predictive Job Scheduling
RL for Large State Spaces: Value Function Approximation
Lecture 24 Coping with NPC and Unsolvable problems. When a problem is unsolvable, that's generally very bad news: it means there is no general algorithm.
Jaroslaw Kutylowski 1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Reliable Broadcasting without Collision Detection… … in.
Algorithms + L. Grewe.
The AGILO Autonomous Robot Soccer Team: Computational Principles, Experiences, and Perspectives Michael Beetz, Sebastian Buck, Robert Hanek, Thorsten Schmitt,
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
A Fairy Tale of Greedy Algorithms Yuli Ye Joint work with Allan Borodin, University of Toronto.
Train DEPOT PROBLEM USING PERMUTATION GRAPHS
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Planning under Uncertainty
Brent Dingle Marco A. Morales Texas A&M University, Spring 2002
Tirgul 8 Universal Hashing Remarks on Programming Exercise 1 Solution to question 2 in theoretical homework 2.
2-Layer Crossing Minimisation Johan van Rooij. Overview Problem definitions NP-Hardness proof Heuristics & Performance Practical Computation One layer:
NP-complete and NP-hard problems. Decision problems vs. optimization problems The problems we are trying to solve are basically of two kinds. In decision.
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
Experts and Boosting Algorithms. Experts: Motivation Given a set of experts –No prior information –No consistent behavior –Goal: Predict as the best expert.
Universität Dortmund  P. Marwedel, Univ. Dortmund, Informatik 12, 2003 Hardware/software partitioning  Functionality to be implemented in software.
Adapting Environment-Mediated Self-Organizing Emergent Systems by Exception Rules Holger Kasinger, Bernhard Bauer, Jörg Denzinger and Tom Holvoet.
Teaching Deliberative Navigation Using the LEGO RCX and Standard LEGO Components Gary R. Mayer, Dr. Jerry Weinberg, Dr. Xudong Yu
High level & Low level language High level programming languages are more structured, are closer to spoken language and are more intuitive than low level.
DARPA Mobile Autonomous Robot SoftwareLeslie Pack Kaelbling; March Adaptive Intelligent Mobile Robotics Leslie Pack Kaelbling Artificial Intelligence.
Robotica Lecture 3. 2 Robot Control Robot control is the mean by which the sensing and action of a robot are coordinated The infinitely many possible.
Stochastic Algorithms Some of the fastest known algorithms for certain tasks rely on chance Stochastic/Randomized Algorithms Two common variations – Monte.
Swarm Computing Applications in Software Engineering By Chaitanya.
Advanced Algorithm Design and Analysis (Lecture 13) SW5 fall 2004 Simonas Šaltenis E1-215b
K. J. O’Hara AMRS: Behavior Recognition and Opponent Modeling Oct Behavior Recognition and Opponent Modeling in Autonomous Multi-Robot Systems.
Graph Algorithms. Definitions and Representation An undirected graph G is a pair (V,E), where V is a finite set of points called vertices and E is a finite.
1 Online algorithms Typically when we solve problems and design algorithms we assume that we know all the data a priori. However in many practical situations.
1 ECE-517 Reinforcement Learning in Artificial Intelligence Lecture 7: Finite Horizon MDPs, Dynamic Programming Dr. Itamar Arel College of Engineering.
AI in games Roger Crawfis/Eric Fosler-Lussier CSE 786 Game Design.
Robotica Lecture 3. 2 Robot Control Robot control is the mean by which the sensing and action of a robot are coordinated The infinitely many possible.
Scheduling policies for real- time embedded systems.
Introduction Complex and large SW. SW crises Expensive HW. Custom SW. Batch execution Structured programming Product SW.
Analysis of algorithms Analysis of algorithms is the branch of computer science that studies the performance of algorithms, especially their run time.
Online Algorithms By: Sean Keith. An online algorithm is an algorithm that receives its input over time, where knowledge of the entire input is not available.
1 Online algorithms Typically when we solve problems and design algorithms we assume that we know all the data a priori. However in many practical situations.
HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Online Routing in Faulty Meshes with Sub-linear Comparative Time and Traffic.
Honors Track: Competitive Programming & Problem Solving Optimization Problems Kevin Verbeek.
University of Windsor School of Computer Science Topics in Artificial Intelligence Fall 2008 Sept 11, 2008.
Behaviour-Based Control in Mobile Robotics
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
Institute for Computer Science VI Autonomous Intelligent Systems
Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)
Algorithmic, Game-theoretic and Logical Foundations
Introduction to Artificial Intelligence CS 438 Spring 2008 Today –AIMA, Ch. 25 –Robotics Thursday –Robotics continued Home Work due next Tuesday –Ch. 13:
Distributed Models for Decision Support Jose Cuena & Sascha Ossowski Pesented by: Gal Moshitch & Rica Gonen.
CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.
Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.
CSE 589 Part V One of the symptoms of an approaching nervous breakdown is the belief that one’s work is terribly important. Bertrand Russell.
CS6045: Advanced Algorithms NP Completeness. NP-Completeness Some problems are intractable: as they grow large, we are unable to solve them in reasonable.
Jaroslaw Kutylowski 1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Maintaining Communication Between an Explorer and a Base.
Path Planning Based on Ant Colony Algorithm and Distributed Local Navigation for Multi-Robot Systems International Conference on Mechatronics and Automation.
Learning for Physically Diverse Robot Teams Robot Teams - Chapter 7 CS8803 Autonomous Multi-Robot Systems 10/3/02.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
Approximation Algorithms based on linear programming.
Victoria Ibarra Mat:  Generally, Computer hardware is divided into four main functional areas. These are:  Input devices Input devices  Output.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
COGNITIVE APPROACH TO ROBOT SPATIAL MAPPING
CS b659: Intelligent Robotics
Hidden Markov Models Part 2: Algorithms
CIS16 Application Development – Programming with Visual Basic
CIS 488/588 Bruce R. Maxim UM-Dearborn
Market-based Dynamic Task Allocation in Mobile Surveillance Systems
CS 416 Artificial Intelligence
Presentation transcript:

Jaroslaw Kutylowski 1 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität An Overview of Robot Behavior Control with insight into AI-based and algorithm-based approaches

Jaroslaw Kutylowski 2 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Agenda What is this talk going to cover? What is behavior? Behavior control -Basic control strategies -Advantages and disadvantages of these strategies -Hybrid strategies Behavior-based control Deliberation-based control Hybrid strategies Final remarks

Jaroslaw Kutylowski 3 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität What is behavior? every robot has a goal how to accomplish this goal? good readings from sensors and good control of movement do not suffice  we need proper decision-making

Jaroslaw Kutylowski 4 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior control High-level behavioral algorithms Low-level basic algorithms Physics What is behavior? movement control sensor control what to do on sensor input how to coordinate with teammates navigation exploration etc.

Jaroslaw Kutylowski 5 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität What is behavior? Behavior control Input: sensory data history of behavior information from teammates information about opponent Output: what to do next? – where to go – where to look – what to send to teammates Behavior control High-level behavioral algorithms Low-level basic algorithms Physics  We will look at different methods for decision making and following these decisions

Jaroslaw Kutylowski 6 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität What is behavior? High-level behavioral algorithms Most prominent problems navigation to a point, with obstacles exploring unknown terrain task allocation Behavior control High-level behavioral algorithms Low-level basic algorithms Physics  Many research done in this area  We will review some of the results

Jaroslaw Kutylowski 7 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität What is behavior? Low-level basic algorithms Behavior control High-level behavioral algorithms Low-level basic algorithms Physics Typical problems how to walk how to read sensor input how to evaluate visual sensory input  These are problems which we won’t discuss

Jaroslaw Kutylowski 8 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior control basic strategies Two main approaches to behavior control: Behavior-based control (reactive) –“world is the world’s best model” –simple actions as reactions to environment –complex behaviors emerge from simple ones –stateless –no communication between teammates, only observation –inspired biologically –emerging from the AI community Deliberation-based control –careful planning of actions –maintaining state and synchronizing it with the environment –complex behaviors planned in advance –communication with teammates –prediction of opponent’s behavior –emerging from the algorithmic community

Jaroslaw Kutylowski 9 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior control discussion on behavior-based control Advantages: Simple controller, suitable for architectures with low performance Easy implementation leading to rapid development Easy to test and debug Should adapt well to changing environmental conditions Fast reaction time, well suited for dynamically changing situations e.g. (e.g. robot-soccer) Provable low-level properties (collision-avoidance etc.) Disadvantages: Emergent behavior is impossible to predict No provable properties about emergent behavior Not suitable very well to less dynamic situations where goals are achieved in a long term (e.g. UGV navigation)

Jaroslaw Kutylowski 10 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior control discussion on deliberation-based control Advantages: Possibility to plan in advance for long term behavior Complex behaviors are precisely defined and provable Can take advantage of communication with mates Possibility of learning and thus predicting the moves of the opponent Disadvantages: High hardware requirements (computationally intensive algorithms) Possibility of loss of synchronization between internal state and environment Problems hard to solve and implement Can react too slow in very dynamic situations (e.g. robot-soccer)

Jaroslaw Kutylowski 11 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior control hybrid strategies The two basic strategies can be combined to hybrid ones Basic behavior controlled by behavior-based strategies (low-level) Deliberation-based methods define a high-level strategy Advantages of both strategies can be combined

Jaroslaw Kutylowski 12 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control agenda Typical methods: –Simple state machines and how to define them –Potential fields method –Formation control

Jaroslaw Kutylowski 13 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control simple state-machines Most popular method of behavior control in dynamical systems Used by GermanTeam 2002 and later Sample definition: Goalie Goalie-before -kickoff Goalie -playing Return-to goal Position-inside -goal stand go-to-point kick go-to-ball

Jaroslaw Kutylowski 14 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control XABSL for defining behavior rules Instead of defining behavioral aspects of software in plain code, usage of meta-languages Software engineering defines UML, Petri-nets, high-level scripting etc. for modeling of behavior XABSL (extensible agent behavior specification language) is defined by the German team Syntax based on XML Defines a state-automaton Language constructs typical for a structural language (if, conditions) Constructs for easy operation on the state-automaton (transitions) Basic behaviors like “go-to-ball” defined in low-level language

Jaroslaw Kutylowski 15 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control XABSL for defining behavior rules XABSL is transformed into Intermediate Code, which is executed on the AIBO by a low-level virtual machine AIBO behaves according to the definitions given in XABSL, acting as a state-automaton

Jaroslaw Kutylowski 16 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control decisions in state machines Sometimes decisions between certain behavior options must be made These are based on evaluating utility functions for possible options These utility functions can be influenced by non-determinism

Jaroslaw Kutylowski 17 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control Potential fields method Objects either attract or repulse the robot These forces constitute the potential field Forces in the field are summed according to physical rules, so that one obtains the resultant force The resultant force indicates the movement direction of the robot, optionally the force strength determines the movement speed Advantages: Smooth movement Elegant solution, very easy to describe

Jaroslaw Kutylowski 18 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control Potential fields method Ball Robot Opponent Repulsive force induced by opponent robot Attractive induced by ball Calculated resultant force, direction of movement

Jaroslaw Kutylowski 19 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control Potential fields method – problems Local minima Ball Robot Ball Robot

Jaroslaw Kutylowski 20 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control Potential fields method – problems No passage between close objects Ball Robot

Jaroslaw Kutylowski 21 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control Potential fields method – problems Oscillations Ball Robot Ball Robot

Jaroslaw Kutylowski 22 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control formation control Formation control is important for terrain traversal, soccer … Four robots travel in a predefined formation ColumnLineDiamond Robots compute position and positions of others Own formation position is calculated basing on leader position neighbor position unit-center position

Jaroslaw Kutylowski 23 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Behavior-based control formation control Robot tries to maintain formation, by staying inside of the dead zone Inside of dead zone  no additional formation maintaining performed Inside of controlled zone  speed vector into dead zone linearly dependent on distance from dead zone If obstacles occur, the avoidance gains priority  As soon obstacle is surrounded, the robot tries to get into formation  Can be realized using potential field, with dead zone attracting and obstacles repelling Dead zone Controlled zone

Jaroslaw Kutylowski 24 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based control agenda Typical methods: –Case based reasoning –Hidden Markov Models Algorithmic approaches –task allocation –navigation to a point, with obstacles

Jaroslaw Kutylowski 25 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based control case based reasoning During soccer play similar situations can occur quite often Case based reasoning allows a player to store the behavior of opponents and use it when a similar situation occurs once again Sample: Goal Robot with ball Opponent

Jaroslaw Kutylowski 26 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based control case based reasoning Advantages: Opponent behavior can be analyzed and player can adapt to its strategies Disadvantages: If opponent uses similar techniques, than the two CBR instances fight against each other, returning improper forecasts No provable results Highly memory and computational intensive Learning process is needed  May be advantageous against simple opponents, but has no provable properties and fails against “intelligent” opponents

Jaroslaw Kutylowski 27 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based control Hidden Markov Model method As in CBR, the goal is to predict the behavior of the opponent The HMM method: assume that the opponent has a state machine and uses a set of common behaviors, like go-to-ball, intercept-ball … for each behavior we define a model, which is a state machine with probabilities for transition from state to state for every possible observation the model contains a probability that it occurs in a certain state we cannot directly observe the state of the opponent  so we instantiate HMM behavior models and look whether their execution matches the observations thus we obtain probabilities that the opponent is in a certain state of a certain behavior

Jaroslaw Kutylowski 28 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based control Hidden Markov Model method Observations are Distance of robot to ball Robot ball manipulation Distance of robot to goal … The most interesting question is about the value of  Knowing the probability, we can derive some information about future behavior of opponent

Jaroslaw Kutylowski 29 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods task allocation Task allocation is important when coordination of robots is needed With robot soccer task allocation is mostly reduced to role assignment (first forward, supporting forward, defender) Lot of research on multiprocessor task scheduling and similar assignments, which can be often translated to multi-robot scenarios Models utilized for robot task scheduling: Robots are heterogeneous Tasks require specific skills Tasks appear online Communication is expensive and thus must be minimized Computation power is sparse

Jaroslaw Kutylowski 30 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods task allocation We look for efficient, online and distributed approximations for task- allocation Taxonomy of task allocation problems: ST-SR – single-task robots, single-robot tasks ST-MR – single-task robots, multi-robot tasks MT-SR – multi-task robots, single-robot tasks MT-MR – multi-task robots, multi-robot tasks uncommon

Jaroslaw Kutylowski 31 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods task allocation – ST-SR setting Model Set M of workers, s. t. |M| = m Set N of jobs, s. t. |N| = n jobs, with a weight w j for each job skill rating, which defines the fitness of a worker for a job: We want to find such an assignment of workers to jobs, s. t. a sum of the combination of utility function and job weight is maximized  Centralized ILP solvable by Hungarian Method gives runtime of O(mn 2 ), but needs about n 2 messages to be exchanged  Distributed auction mechanisms achieve the same task with only O(n) messages

Jaroslaw Kutylowski 32 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods task allocation – online ST-SR setting The previous model assumed an offline-setting In reality the online version is much more likely to occur BLE algorithm: If any robot is unassigned, find the robot-task pair with highest utility and weight Assign this robot to this task Go on  This greedy strategy is 2-competitive to the optimal offline algorithm

Jaroslaw Kutylowski 33 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods task allocation – ST-MR setting Also known as coalition formation Now each job might require a specific skill which is possessed only by some robots Transforming the coalition formation problem to SPP: Let E be a set of all tasks and robots Let F be a family of all robot-task pairs u(f), where f is a set from F, is the utility for robot-task pair SPP Finite set E Family F of subsets of E Utility function u: F→R +  Find a maximum-utility family X of elements in F, s.t. X is a partition of E

Jaroslaw Kutylowski 34 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods task allocation – ST-MR setting SPP is NP-complete But there are heuristics and approximations which give good practical results Unfortunately these methods do not have a guaranteed approximation ratio, they only report how far the constructed solution is from the optimum for a particular problem instance

Jaroslaw Kutylowski 35 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods navigation to a point Model: The robot should get from a source position to a target position traveling the smallest possible distance There are obstacles with unknown position and size Different assumptions about the abilities of sensors may be made Visual sensors Touch sensors Important measures: Ratio of distance obtained by algorithm and the optimum Distance taking into account the sizes of obstacles

Jaroslaw Kutylowski 36 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods navigation – D* algorithm Model Finite undirected graph G(V,E), most often a grid Edge blocking The edge blocking is unknown to the algorithm The blocked edges cannot be traverse Blocked edges can be detected only at adjacent vertices D* algorithm Assume that all the unknown terrain contains no blocked edges Find shortest path Try to go on this path On blocked edges  update terrain map, calculate new path EB 

Jaroslaw Kutylowski 37 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods navigation – D* algorithm Sample edge-blocked graph S E

Jaroslaw Kutylowski 38 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Deliberation-based methods navigation – D* algorithm Performance of D* Lower bound on competitive ratio Upper bound on competitive ratio Lower bound construction

Jaroslaw Kutylowski 39 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Hybrid strategies Two layers of execution The lower runs with reactive behavior-based methods The upper runs with deliberative methods The lower layer assures fast reactions, obstacle avoidance etc. and can basically function without the help of the upper layer The upper layer provides additional support to the lower layer, by analyzing the situation (e.g. case based reasoning) and giving “hints” to the lower layer The hints are only supportive for the working of the behavior-based methods, i.e. they can be (partially) ignored The hint can be modeled as a slight influence on a utility function of executing an option

Jaroslaw Kutylowski 40 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Final remarks What you should remember Two basic strategies for behavior control No clear indication which one is best Many research in both areas, with deliberative having more strict proofs and behavior-based having more practical realization  Today’s results aren’t great  Practical realizations more often use simpler methods – there is a gap between the theoretical results and their implementation

Jaroslaw Kutylowski 41 HEINZ NIXDORF INSTITUT Universität Paderborn Algorithmen und Komplexität Jaroslaw Kutylowski Heinz Nixdorf Institut & Institut für Informatik Universität Paderborn Fürstenallee Paderborn Tel.: / Fax: / Thank you for your attention!