Presentation is loading. Please wait.

Presentation is loading. Please wait.

Purpose of the research:

Similar presentations


Presentation on theme: "Purpose of the research:"— Presentation transcript:

1 Policy learning using online reinforcement learning for a adaptive Liquid State Machine
Purpose of the research: - Use human based network and learning to study how humans acquire cognitive functions. (for instance cooperation!) Goal for the summer school: - Implement an artificial spiking neuron network for policy learning. - Collaborate with the EFAA group to teach iCub© the game of Pong© *J. Baraglia et al., “Action Understanding using an Adaptive Liquid State Machine based on Environmental Ambiguity”, ICDLEpiRob 2013

2 Model for the learning iCub/iCubSim InD aLSM*
Next hand position. Control module (Coordinator + YARP) Ball’s end position predictor Output selection  Ball position Reward Next action iCub/iCubSim *J. Baraglia et al., “Action Understanding using an Adaptive Liquid State Machine based on Environmental Ambiguity”, ICDLEpiRob 2013

3 Policy learning using online reinforcement learning for a adaptive Liquid State Machine
How pong can be learn? Using reinforcement learning!! State: Missed ball!!! Reward: Negative. Learning speed: MAX! State: Got ball!!! Reward: Positive. Learning speed: MAX! State: Missed ball!!! Reward: Positive. Learning speed: MIN! If the system get positive reward, the same reaction for similar inputs is more likely to occur again. (Thorn-like effect) Else, if the system get negative reward, the same reaction is less likely to be reproduced.

4 Implemented but not really tested
Example In Simulation In Simulation On iCub simulator On iCub simulator On iCub On iCub Implemented but not really tested

5 Conclusion Positive: - The learning function is able to learn pong© without explicit rules. - The learning is very fast and robust! - Worked on simulation and relatively good on iCubSIM! Negative: - We couldn’t teach the real robot yet. - The learning complexity is n2  hardly scalable to big problems. Future work: - The network should be improved to lower the complexity. - Teach the real robot to play the game!

6 Thank you!! Especially to: VVV13’s organizers. The EFAA team.
The IIT staff that took care of the iCub.


Download ppt "Purpose of the research:"

Similar presentations


Ads by Google