Purpose of the research:

Slides:

Advertisements

Similar presentations

Machine Learning in Computer Games Learning in Computer Games By: Marc Ponsen.

Advertisements

November 5, 2009Introduction to Cognitive Science Lecture 16: Symbolic vs. Connectionist AI 1 Symbolism vs. Connectionism There is another major division.

P. Ögren (KTH) N. Leonard (Princeton University)

Integrating POMDP and RL for a Two Layer Simulated Robot Architecture Presented by Alp Sardağ.

LEARNING THEORY OBSERVATIONAL LEARNING. Observational learning is learning through observation. Observational learning is learning through observation.

Curious Reconfigurable Robots Kathryn Merrick ITEE Seminar – Friday 29 th August Collaborators: Dayne Schmidt, Elanor Huntington.

Neural Networks Primer Dr Bernie Domanski The City University of New York / CSI 2800 Victory Blvd 1N-215 Staten Island, New York 10314

Lead Black Slide. © 2001 Business & Information Systems 2/e2 Chapter 11 Management Decision Making.

Fourth International Symposium on Neural Networks (ISNN) June 3-7, 2007, Nanjing, China Online Dynamic Value System for Machine Learning Haibo He, Stevens.

Cmpt-225 Simulation. Application: Simulation Simulation  A technique for modeling the behavior of both natural and human-made systems  Goal Generate.

Rohit Ray ESE 251. What are Artificial Neural Networks? ANN are inspired by models of the biological nervous systems such as the brain Novel structure.

IMPLEMENTATION ISSUES REGARDING A 3D ROBOT – BASED LASER SCANNING SYSTEM Theodor Borangiu, Anamaria Dogar, Alexandru Dumitrache University Politehnica.

Artificial Intelligence By Ryan Shoultes & Jeremy Creighton.

Behavior Based Robotics: A Wall Following Behavior Arun Mahendra - Dept. of Math, Physics & Engineering, Tarleton State University Mentor: Dr. Mircea Agapie.

Nuttapon Boonpinon Advisor Dr. Attawith Sudsang Department of Computer Engineering,Chulalongkorn University Pattern Formation for Heterogeneous.

COLLABORATIVE SPECTRUM MANAGEMENT FOR RELIABILITY AND SCALABILITY Heather Zheng Dept. of Computer Science University of California, Santa Barbara.

Introduction GAM 376 Robin Burke Winter Outline Introductions Syllabus.

Hybrid AI & Machine Learning Systems Using Ne ural Network and Subsumption Architecture Libraries By Logan Kearsley.

 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.

Neural Networks AI – Week 23 Sub-symbolic AI Multi-Layer Neural Networks Lee McCluskey, room 3/10

Chapter 1. Introduction in Creating Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning from Humans Jo, HwiYeol Biointelligence Laboratory.

Study on Genetic Network Programming (GNP) with Learning and Evolution Hirasawa laboratory, Artificial Intelligence section Information architecture field.

Chapter 16. Basal Ganglia Models for Autonomous Behavior Learning in Creating Brain-Like Intelligence, Sendhoff et al. Course: Robots Learning from Humans.

Bayesian goal inference in action observation by co-operating agents EU-IST-FP6 Proj. nr Raymond H. Cuijpers Project: Joint-Action Science and.

Intelligent Systems in the Gambling Industry Kieran O’Neill 25/03/10.

CSC Intro. to Computing Lecture 22: Artificial Intelligence.

Prepared By :. CONTENTS 1~ INTRODUCTION 2~ WHAT IS BLUE BRAIN 3~ WHAT IS VIRTUAL BRAIN 4~ FUNCTION OF NATURAL BRAIN 5~ BRAIN SIMULATION 6~ CURRENT RESEARCH.

Neural Network Basics Anns are analytical systems that address problems whose solutions have not been explicitly formulated Structure in which multiple.

DARPA Mobile Autonomous Robot SoftwareLeslie Pack Kaelbling; January Adaptive Intelligent Mobile Robotics Leslie Pack Kaelbling Artificial Intelligence.

Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.

A Roadmap towards Machine Intelligence

Cognitive Modular Neural Architecture

Hot Topics in Artificial Intelligence Corin Anderson Tessa Lau Steve Wolfman

Adopting and adapting teaching and learning styles Neil Denby.

MIT Artificial Intelligence Laboratory — Research Directions Intelligent Agents that Learn Leslie Pack Kaelbling.

Robot Intelligence Technology Lab. 10. Complex Hardware Morphologies: Walking Machines Presented by In-Won Park

Artificial Intelligence Skepticism by Josh Pippin.

Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.

Artificial Neural Networks By: Steve Kidos. Outline Artificial Neural Networks: An Introduction Frank Rosenblatt’s Perceptron Multi-layer Perceptron Dot.

Overview of Artificial Intelligence (1) Artificial intelligence (AI) Computers with the ability to mimic or duplicate the functions of the human brain.

Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.

Break-Away Strategy Game A turn based strategy game used to simulate Break-Away Brought to you by Team 33.

Brief Intro to Machine Learning CS539

General-Purpose Learning Machine

All about squash By Dean All about squash by Dean.

A BRIEF INTRO TO: COBOTS, LSM AND A.I. APIs

Done Done Course Overview What is AI? What are the Major Challenges?

Table 1. Advantages and Disadvantages of Traditional DM/ML Methods

ReinforcementLearning: A package for replicating human behavior in R

Technologies for Learning Chapter Two Lonnie Redning

A Network Virtual Machine for Real-Time Coordination Services

Intelligent Adaptive Mobile Robots

Joost N. Kok Universiteit Leiden

CIS 488/588 Bruce R. Maxim UM-Dearborn

FUNDAMENTAL CONCEPT OF ARTIFICIAL NETWORKS

Smeagol and Gollum.

Management Oversight of the Primate Training Program

Treatments for substance misuse

Orphaned Children Morrison and Ellwood (2000):

National 5 Physical Education

Emre O. Neftci iScience Volume 5, Pages (July 2018) DOI: /j.isci

Educational Data Mining Success Stories

Arduino Joy Stick and Game

Contributed by Louise Matthews

Learning to Communicate

HIGH - SPEED TRAIN TEAM

EXPLICIT RULES: INPUT-OUTPUT FORMULAS

Testing Slides adopted from John Jannotti, Brown University

Presentation transcript:

Policy learning using online reinforcement learning for a adaptive Liquid State Machine Purpose of the research: - Use human based network and learning to study how humans acquire cognitive functions. (for instance cooperation!) Goal for the summer school: - Implement an artificial spiking neuron network for policy learning. - Collaborate with the EFAA group to teach iCub© the game of Pong© *J. Baraglia et al., “Action Understanding using an Adaptive Liquid State Machine based on Environmental Ambiguity”, ICDLEpiRob 2013

Model for the learning iCub/iCubSim InD aLSM* Next hand position. Control module (Coordinator + YARP) Ball’s end position predictor Output selection　 Ball position Reward Next action iCub/iCubSim *J. Baraglia et al., “Action Understanding using an Adaptive Liquid State Machine based on Environmental Ambiguity”, ICDLEpiRob 2013

Policy learning using online reinforcement learning for a adaptive Liquid State Machine How pong can be learn? Using reinforcement learning!! State: Missed ball!!! Reward: Negative. Learning speed: MAX! State: Got ball!!! Reward: Positive. Learning speed: MAX! State: Missed ball!!! Reward: Positive. Learning speed: MIN! If the system get positive reward, the same reaction for similar inputs is more likely to occur again. (Thorn-like effect) Else, if the system get negative reward, the same reaction is less likely to be reproduced.

Implemented but not really tested Example In Simulation In Simulation On iCub simulator On iCub simulator On iCub On iCub Implemented but not really tested

Conclusion Positive: - The learning function is able to learn pong© without explicit rules. - The learning is very fast and robust! - Worked on simulation and relatively good on iCubSIM! Negative: - We couldn’t teach the real robot yet. - The learning complexity is n2  hardly scalable to big problems. Future work: - The network should be improved to lower the complexity. - Teach the real robot to play the game!

Thank you!! Especially to: VVV13’s organizers. The EFAA team. The IIT staff that took care of the iCub.