R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 1 Emerging behavior of autonomous agents Ralf Der, Michael.

Slides:

Advertisements

Similar presentations

Lecture 2 Operational Amplifiers

Advertisements

Multi-Layer Perceptron (MLP)

Institut für Regelungs- und Automatisierungstechnik 1 Michael Hofbaur Modular Machines Modular Machines - reconfigurable mobile robots for research and.

Computational Intelligence Winter Term 2009/10 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.

Software Testing Technique. Introduction Software Testing is the process of executing a program or system with the intent of finding errors. It involves.

Mobile Robot ApplicationsMobile Robot Applications Textbook: –T. Bräunl Embedded Robotics, Springer 2003 Recommended Reading: 1. J. Jones, A. Flynn: Mobile.

Leadership Science 1 Parsing the Influential Increment in the Language of Complexity Academy of Management Annual Meeting 2007 Jim Hazy Adelphi University.

Cognitive Systems, ICANN panel, Q1 What is machine intelligence, as beyond pattern matching, classification and prediction. What is machine intelligence,

5.9 + = 10 a)3.6 b)4.1 c)5.3 Question 1: Good Answer!! Well Done!! = 10 Question 1:

12 th International Fall Workshop VISION, MODELING, AND VISUALIZATION 2007 November 7-9, 2007 Saarbrücken, Germany Estimating Natural Activity by Fitting.

Computational Intelligence Winter Term 2011/12 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.

Computational Intelligence Winter Term 2014/15 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.

ICS-171:Notes 8: 1 Notes 8: Uncertainty, Probability and Optimal Decision-Making ICS 171, Winter 2001.

Behavioral Theories of Motor Control

Higher Coordination with Less Control – A Result of Information Maximization in the Sensorimotor Loop Keyan Zahedi, Nihat Ay, Ralf Der (Published on: May.

NW Computational Intelligence Laboratory Experience-Based Surface-Discernment by a Quadruped Robot by Lars Holmstrom, Drew Toland, and George Lendaris.

Wait, sound sensor >70, Port 2 Flowchart – Heartbeat 1 Start Motor A, Move Backward, 1/3 Rotation, Power 20 Wait, 1 Second Sound Sensor (Port 2) Less than.

Robot Programming. Programming Behaviors Behaviors describe the actions and decisions of your robot.

Pedro Nunes1 Sensor Fusion Applied to the RoboCup Simulation League.

Jennifer Goodall, Nick Webb, Katy DeCorah

An Introduction to Machine Learning In the area of AI (earlier) machine learning took a back seat to Expert Systems Expert system development usually consists.

Organic Computing CS PhD Seminar Mar 3, 2003 Christoph von der Malsburg Computer Science and Biology Departments University of Southern California and.

1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.

Behavior- Based Approaches Behavior- Based Approaches.

Computational aspects of motor control and motor learning Michael I. Jordan* Mark J. Buller (mbuller) 21 February 2007 *In H. Heuer & S. Keele, (Eds.),

RoboCup: The Robot World Cup Initiative Based on Wikipedia and presentations by Mariya Miteva, Kevin Lam, Paul Marlow.

Distributed Constraint Optimization * some slides courtesy of P. Modi

D D L ynamic aboratory esign 5-Nov-04Group Meeting Accelerometer Based Handwheel State Estimation For Force Feedback in Steer-By-Wire Vehicles Joshua P.

Optimality in Motor Control By : Shahab Vahdat Seminar of Human Motor Control Spring 2007.

Institute of Perception, Action and Behaviour (IPAB) Director: Prof. Sethu Vijayakumar.

Welcome to a before and after coherent noise removal series. There is a lot to explain about what is going on here, so I am using this otherwise wasted.

Brian Renzenbrink Jeff Robble Object Tracking Using the Extended Kalman Particle Filter.

 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.

Professor : Chi-Jo Wang Student’s name : Nguyen Van Binh Student ID: MA02B203 Two Wheels Self Balancing Robot 1 Southern Taiwan University Department of.

Study on Genetic Network Programming (GNP) with Learning and Evolution Hirasawa laboratory, Artificial Intelligence section Information architecture field.

K. J. O’Hara AMRS: Behavior Recognition and Opponent Modeling Oct Behavior Recognition and Opponent Modeling in Autonomous Multi-Robot Systems.

The aim of GRASP is the design of a cognitive system capable of performing grasping and manipulation tasks in open-ended environments, dealing with novelty,

Probabilistic Reasoning for Robust Plan Execution Steve Schaffer, Brad Clement, Steve Chien Artificial Intelligence.

2 3  A machine  Built to help us  Autonomous (not remote control)  If we want robots to do things for us, we have.

FRE 2672 TFG Self-Organization - 01/07/2004 Engineering Self-Organization in MAS Complex adaptive systems using situated MAS Salima Hassas LIRIS-CNRS Lyon.

ACM 97 Semiconductors Carver Mead Gordon and Betty Moore Professor of Engineering and Applied Science, California Institute of Technology.

Progress in identification of damping: Energy-based method with incomplete and noisy data Marco Prandina University of Liverpool.

Motor Control. Beyond babbling Three problems with motor babbling: –Random exploration is slow –Error-based learning algorithms are faster but error signals.

Abstract This presentation questions the need for reinforcement learning and related paradigms from machine-learning, when trying to optimise the behavior.

Purposive Behavior Acquisition for a real robot by vision based Reinforcement Learning Minuru Asada,Shoichi Noda, Sukoya Tawarasudia, Koh Hosoda Presented.

IVR 30/10/2009 Herrmann1 IVR: Control Theory Overview: PID control Steady-state error and the integral method Overshoot and ringing in system with time.

Lecture 25: Implementation Complicating factors Control design without a model Implementation of control algorithms ME 431, Lecture 25.

Robot Programming. Programming Behaviors Behaviors describe the actions and decisions of your robot.

Behavior-based Multirobot Architectures. Why Behavior Based Control for Multi-Robot Teams? Multi-Robot control naturally grew out of single robot control.

Lecture 5 Neural Control

Forward Modeling Image Analysis The forward modeling analysis approach (sometimes called “analysis by synthesis”) involves optimizing the parameters of.

ACM 97 Semiconductors Carver Mead Gordon and Betty Moore Professor of Engineering and Applied Science, California Institute of Technology.

G. Casalino, E. Zereik, E. Simetti, A. Turetta, S. Torelli and A. Sperindè EUCASS 2011 – 4-8 July, St. Petersburg, Russia.

Robust Localization Kalman Filter & LADAR Scans

Lecture 22 The Spherical Bicycle 1. 2 Some relative dimensions with the wheel radius and mass as unity sphere radius 2, mass 50 fork length 4, radius.

Robot Intelligence Technology Lab. Evolution of simple navigation Chapter 4 of Evolutionary Robotics Jan. 12, 2007 YongDuk Kim.

RoboCup: The Robot World Cup Initiative

Lecture 4: Numerical Stability

IPAB Research Areas and Strengths

Done Done Course Overview What is AI? What are the Major Challenges?

PID Controllers Jordan smallwood.

Oliviero Giannini, Ute Gauger, Michael Hanss

Web-Mining Agents Cooperating Agents for Information Retrieval

Joelle Pineau: General info

Overview of Machine Learning

Capabilities of Threshold Neurons

MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING

MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING

The Organization and Planning of Movement Ch

Computer Vision Lecture 19: Object Recognition III

Presentation transcript:

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 1 Emerging behavior of autonomous agents Ralf Der, Michael Herrmann * Institut für Informatik – Universität Leipzig Abteilung Intelligente Systeme AG Neuroinformatik und Robotik * Michael Herrmann now: U Göttingen, Institute of Nonlinear DynamicsGöttingennstitute of Nonlinear Dynamics

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 2 Aims Creation of machines with emerging capabilities by exploiting the general principles of self-organization as known from natural sciences. In the RoboCup domain: Exploration of cluttered environments and playing ball as emerging phenomena.

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 3 Emergence of new spatio-temporal modes of behavior by spontaneous symmetry breaking The nature of self organization Our aim: Behavior of artificial beings as spatio-temporal modes of a self-organizing system Physics: The conflict between the global constraints of a system + self-amplification of constraint breaking fluctuations Physics: The conflict between the global constraints of a system + self-amplification of constraint breaking fluctuations

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 4 Self-organization by way of autonomous learning Autonomous learning: Supervised learning of the controller with learning signals generated by principles completely internal to the agent. Our approach: Minimize the error between model and true behavior. Behavior model is an adaptive model of the dynamics of the sensorimotor loop.

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 5 The sensorimotor loop | | | time t motor activities sensor values x_new x_old Controller World

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 6 Modeling forward in time x_new world x_old model controller x_new = x_model + noise time x_model E = (x_new - x_model) 2 Learning signal for both model and controller Most favorite behavior: Do nothing

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 7 x_new The time inverted sensor-motor loop motor activities controller model x_old ( reconstructed )

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 8 Modeling backward in time – The time loop x_old x_new world model controller x_new = x_model + noise time x_ reconstructed E = (x_ old - x_ reconstr.) 2 The evaluation instance is set back in time. It sees the actual instant of physical time as a future event. The model is used in order to propagate the future sensor values back to the conscious now.

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin 03 9 The time loop effekt drives self-organization The time loop effect. In the linear approximation the full dynamics (left branch of the time loop) can be viewed as the superpositon of noise events which are propagated from the time t n of the event to the current time t by the model dynamics. Then the result is propagated back from t to the reference time t by the model dynamics again. Since the model dynamics forward and backward in time cancel the net effect is a propagation backward in time from t n to t (dashed arrow).

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin Behavior from minimizing the time loop error Backward in time the noise is damped if behavior is instable so that the learning dynamics generates activity in the sensorimotor loop by self-amplification of fluctuations. On the other hand the noise is the larger the more irregular the behavior. The balance of the two effects fosters the emergence of domain related activities of the agent. model(x) + noise ( E = ( model -1 (noise)) 2 Learning signal for both model and controller

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin Sensorimotor loop in the robot domain camera preprocessing motor activity wheel velocity robot 1 neuron still done by hand Maze 3Maze 2 Maze 1

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin The time loop effect enforces sensorimotor integration motor activity wheel velocity motor model 1 neuron vision model learning signal new sensor values

R. Der, R. Liebscher, M. Herrmann - Institut für Informatik - Universität Leipzig St. Augustin Further steps Self-organized sensor fusion (second phase of the project). Straightforward since the time loop effect enforces sensorimotor integration. Combination with the distal learning (Jordan and Rumelhart) paradigm. Combination with reinforcement learning and evolution strategies. These approaches based on the fact that the present behavior explores the sensorimotor capabilities of the agent so that a wealth of behavioral primitives related to specific domain related situations are emerging. By cataloguing these behavior primitves behavior space is built on which more complicated intentional (conscious) behaviors may be grown.