Curious Reconfigurable Robots Kathryn Merrick ITEE Seminar – Friday 29 th August Collaborators: Dayne Schmidt, Elanor Huntington.

Slides:



Advertisements
Similar presentations
IMA 2.5: Software Architecture and Development Environment Roberto Olivares M.S. Electrical Engineering Vanderbilt University, Spring 2003.
Advertisements

Sazalinsyah Razali (syah)‏ Dr. Q. Meng (Supervisor)‏
Modelling CGFs for tactical air-to-air combat training
CESG, Fall 2011, 5 th November 2011 Stuart Fowell, SciSys Device Virtualisation and Electronic Data Sheets.
Ontologies: Dynamic Networks of Formally Represented Meaning Dieter Fensel: Ontologies: Dynamic Networks of Formally Represented Meaning, 2001 SW Portal.
Artificial Curiosity Tyler Streeter
Dealing with Complexity Robert Love, Venkat Jayaraman July 24, 2008 SSTP Seminar – Lecture 10.
Yiannis Demiris and Anthony Dearden By James Gilbert.
Constructing the Future with Intelligent Agents Raju Pathmeswaran Dr Vian Ahmed Prof Ghassan Aouad.
Zach Ramaekers Computer Science University of Nebraska at Omaha Advisor: Dr. Raj Dasgupta 1.
© Franz Kurfess Project Topics 1 Topics for Master’s Projects and Theses -- Winter Franz J. Kurfess Computer Science Department Cal Poly.
Brent Dingle Marco A. Morales Texas A&M University, Spring 2002
1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.
Integrating POMDP and RL for a Two Layer Simulated Robot Architecture Presented by Alp Sardağ.
Mental Development and Representation Building through Motivated Learning Janusz A. Starzyk, Ohio University, USA, Pawel Raif, Silesian University of Technology,
IJCNN, International Joint Conference on Neural Networks, San Jose 2011 Pawel Raif Silesian University of Technology, Poland, Janusz A. Starzyk Ohio University,
Holistic Scheduling and Analysis of Mixed Time/Event-Triggered Distributed Embedded System Traian Pop, Petru Eles, Zebo Peng EE249 Discussion Paper Review.
Behavior- Based Approaches Behavior- Based Approaches.
Modelling Motivation for Experience-Based Attention Focus in Reinforcement Learning Candidate Kathryn Merrick School of Information Technologies University.
Intelligent Agents: an Overview. 2 Definitions Rational behavior: to achieve a goal minimizing the cost and maximizing the satisfaction. Rational agent:
Chuang-Hue Moh Spring Embodied Intelligence: Final Project.
Curious Characters in Multiuser Games: A Study in Motivated Reinforcement Learning for Creative Behavior Policies * Mary Lou Maher University of Sydney.
Biointelligence Laboratory School of Computer Science and Engineering Seoul National University Cognitive Robots © 2014, SNU CSE Biointelligence Lab.,
Leroy Garcia 1.  Artificial Intelligence is the branch of computer science that is concerned with the automation of intelligent behavior (Luger, 2008).
“Curious Places” October, 2007 Key Centre of Design Computing and Cognition, University of Sydney A Room that Adapts using Curiosity and Supervised Learning.
August 19 th, 2006 Computational Neuroscience Group, LCE Helsinki University of Technology Computational neuroscience group Laboratory of computational.
1 From individuals to social and vice-versa André Campos Utrecht University, The Netherlands Universidade Federal do Rio Grande do Norte, Brazil Frank.
Chapter 8 Prediction Algorithms for Smart Environments
What is AIBO ? AIBO is produced by SONY Marketed as an entertainment robot Has sensors for input Has LEDs and sound for output as well as motors for movement.
An Architecture for Empathic Agents. Abstract Architecture Planning + Coping Deliberated Actions Agent in the World Body Speech Facial expressions Effectors.
Approaches to Artificial Evolution Verena Hamburger.
DESC9180 Designing Virtual Worlds Week 12 Future of Virtual Worlds 6pm – 9pm Tuesday, October 16 th, 2007 Kathryn Merrick and Owen Macindoe DESC9180 Designing.
Generic Distributed Algorithms for Self-Reconfiguring Robots Keith Kotay and Daniela Rus MIT Computer Science and Artificial Intelligence Laboratory.
Distributed Computation in MANets Robot swarm developed by James Rice University.
Hybrid AI & Machine Learning Systems Using Ne ural Networks and Subsumption Architecture By Logan Kearsley.
Towards Cognitive Robotics Biointelligence Laboratory School of Computer Science and Engineering Seoul National University Christian.
Beyond Gazing, Pointing, and Reaching A Survey of Developmental Robotics Authors: Max Lungarella, Giorgio Metta.
Page 1 Reconfigurable Communications Processor Principal Investigator: Chris Papachristou Task Number: NAG Electrical Engineering & Computer Science.
Parallel and Distributed Simulation Introduction and Motivation.
Chapter 2.2 Game Design. CS Overview This introduction covers: –Terms –Concepts –Approach All from a workaday viewpoint.
Introduction to Reinforcement Learning Dr Kathryn Merrick 2008 Spring School on Optimisation, Learning and Complexity Friday 7 th.
Designing a multimedia interactive to support shared learning experiences Steve DiPaola & Caitlin Akai Simon Fraser University Steve DiPaola & Caitlin.
Overview of Machine Learning RPI Robotics Lab Spring 2011 Kane Hadley.
Artificial Intelligence 2005/06 Partially Ordered Plans - or: "How Do You Put Your Shoes On?"
MAPLD 2005/254C. Papachristou 1 Reconfigurable and Evolvable Hardware Fabric Chris Papachristou, Frank Wolff Robert Ewing Electrical Engineering & Computer.
Behaviour-Based Control in Mobile Robotics
Curiosity-Driven Exploration with Planning Trajectories Tyler Streeter PhD Student, Human Computer Interaction Iowa State University
Subsumption Architecture and Nouvelle AI Arpit Maheshwari Nihit Gupta Saransh Gupta Swapnil Srivastava.
Abstract A Structured Approach for Modular Design: A Plug and Play Middleware for Sensory Modules, Actuation Platforms, Task Descriptions and Implementations.
Chapter 1. Cognitive Systems Introduction in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans Park, Sae-Rom Lee, Woo-Jin Statistical.
Generating Software Documentation in Use Case Maps from Filtered Execution Traces Edna Braun, Daniel Amyot, Timothy Lethbridge University of Ottawa, Canada.
Tangible User Interfaces and Reinforcement Learning (Smart Toys) An honours thesis presentation by… Trent Apted Supervised by A/Prof Bob Kummerfeld Smart.
Using IR For Maze Navigation Kyle W. Lawton and Liz Shrecengost.
Towards the autonomous navigation of intelligent robots for risky interventions Janusz Bedkowski, Grzegorz Kowalski, Zbigniew Borkowicz, Andrzej Masłowski.
ROB SAUNDERS, JOHN S. GERO
Chapter 15. Cognitive Adequacy in Brain- Like Intelligence in Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning from Humans Cinarel, Ceyda.
MODULAR SELF -RECONFIGURABLE MULTI-FUNCTIONAL ROBOTIC SYSTEM.
NTT-MIT Collaboration Meeting, 2001Leslie Pack Kaelbling 1 Learning in Worlds with Objects Leslie Pack Kaelbling MIT Artificial Intelligence Laboratory.
Done by Fazlun Satya Saradhi. INTRODUCTION The main concept is to use different types of agent models which would help create a better dynamic and adaptive.
Introduction to Machine Learning, its potential usage in network area,
INTRODUCTION TO WIRELESS SENSOR NETWORKS
User Interference Effect on Routing of Cognitive Radio Ad-Hoc Networks
Build Intelligence from the bottom up!
Build Intelligence from the bottom up!
Reinforcement Learning for Intelligent Control Part 2 Presented at Chinese Youth Automation Conference National Academy of Science, Beijing, 8/22/05.
© James D. Skrentny from notes by C. Dyer, et. al.
Anne Pratoomtong ECE734, Spring2002
Announcements Homework 3 due today (grace period through Friday)
Build Intelligence from the bottom up!
Purpose of the research:
Presentation transcript:

Curious Reconfigurable Robots Kathryn Merrick ITEE Seminar – Friday 29 th August Collaborators: Dayne Schmidt, Elanor Huntington

Overview Why build curious robots? Background Curious robots for creative play Strengths and limitations Current and future work

Why Build Curious Robots? Explore the scalability of existing curiosity algorithms Build adaptive robots: –Fault tolerant –Able to use tools –Reconfigurable –Creative

Adaptable Robots Big Dog (Boston Dynamics, 2008) (Bongard et al, 2006)

Self-Reconfigurable Robots Research focuses on: –Hardware –Communication –Navigation (University of Pennsylvania, 2008)

Curious, Developmental Robots Curious Aibo (Sony CSL, ) Research focus is on behaviour Static robot configuration

Curious, Reconfigurable Robots Assume a changing robot configuration Focus on behavioural algorithms

Perceiving the World S   | ε  ( : )  A unique identifier  Real number A   | ε  A unique identifier World is represented as a variable length string from a grammar, rather than a fixed length vector World comprises states and actions

Modelling Curiosity Curiosity is modelled as a function of novelty and interest

Curious Reinforcement Learning Agent learns a mapping of states to actions and utility values Agent selects best action most of the time and a random action some of the time A1A1 A2A2 A3A3 S1S S2S S3S Reward = Curiosity value

Architecture for a Curious, Reconfigurable Robot Device Manager Resource Manager Curious Agent PerceptionCuriosityLearningActivation Abstract Sensor Abstract Actuator Memory Agent Layer Device Layer

Curious Robots for Creative Play Explores relationship between structure and behaviour Creative thinking spiral (Resnick, 2007) Imagination, creativity, play, sharing and reflection

Strengths Robot is reconfigurable – sensors and effectors can be added or removed New behaviour emerges for new structure Relationship between structure and behaviour revealed

Limitations Not exhibiting the same cyclic behaviour seen in simulated agents Cyclic behaviour harder to measure Learning takes a long time S1S1 S2S2 S3S3 (a) A1 A1 A2A2 A3A3 (b) T1 T1 T2T2 T3T3 (c)

Curious Social Force Models for Reconfigurable Robots CDF Project: Dayne Schmidt

Measuring the Performance of Curious Robots Characterising attention focus using bifurcation diagrams With Elanor Huntington 1A(move fwd port 4) 2A(move fwd port 6) 3A(move bkd port 4) 4A(move bkd port 6) 5A(stop port 4) 6A(stop port 6)

Measuring a Curious, Reconfigurable Robot Focus of attention shifts as robot is reconfigured 1A(move fwd port 4) 2A(move bkd port 4) 3A(stop port 4) 4A(move fwd port 6) 5A(move bkd port 6) 6A(stop port 6)

Learning Approaches for Curious Robots Representing the state-action table as a neural network reduces memory requirements Attention focus limited to simple tasks A1A1 A2A2 A3A3 s1s1 s2s2 s3s3 s4s4 s5s5 w 11 w 21 w 31 w 41 w 51

Alternatives to Curiosity? Modelling behaviour cycles as intrinsic reward for learning In natural systems behaviour cycles occur at biological, cognitive and social levels –Sleep-wake cycle, seasonal, tidal or lunar cycles –Learning cycle, habituation and recovery –Fashion cycles, sociological cycles (Ahlgren and Halberg, 1990)

Behaviour Cycles as Intrinsic Reward Advantages in natural systems include: –Anticipation, efficiency, competition, navigation –Creativity through exploration –Social self-advancement Also potential advantages for artificial systems

Conclusions We have created a curious, reconfigurable robot: –Sensors and effectors can be added or removed –New behaviour emerges for new structure Ongoing work for: –Understanding curiosity in complex environments –Learning speed and representation –Measurement of emergent behaviour –Alternatives to curiosity as motivation for reconfigurable robots