Presentation is loading. Please wait.

Presentation is loading. Please wait.

Transfer and Multi-Task Learning in Reinforcement Learning Alessandro LAZARIC “Machine Learning with Interdependent and Non-identically Distributed Data”

Similar presentations


Presentation on theme: "Transfer and Multi-Task Learning in Reinforcement Learning Alessandro LAZARIC “Machine Learning with Interdependent and Non-identically Distributed Data”"— Presentation transcript:

1 Transfer and Multi-Task Learning in Reinforcement Learning Alessandro LAZARIC “Machine Learning with Interdependent and Non-identically Distributed Data” Seminar @Dagstuhl SequeL Inria Lille – Nord Europe April 7-10, 2015

2 Reinforcement Learning April 7-10, 2015 A. LAZARIC – Transfer in RL- 2 agent environment critic delay <position, speed><handlebar, pedals><new position, new speed>, advancement Value Function Control Policy

3 Transfer in Reinforcement Learning April 7-10, 2015 A. LAZARIC – Transfer in RL- 3 agent environment critic delay transfer of knowledge

4 Transfer in RL is not trivial April 7-10, 2015 A. LAZARIC – Transfer in RL- 4 Techniques developed in supervised learning cannot be always re-used in RL: Many different “objects” that can be transferred (eg, policies, value functions, samples) Tasks may be similar in many different ways Samples are often non-iid “Unsupervised” samples are not well defined Different objectives (eg, exploration-exploitation)

5 My research (present and future): transfer for exploration-exploitation April 7-10, 2015 A. LAZARIC – Transfer in RL- 5 Motivating problems Intelligent tutoring systems Recommendation systems Computer games Attempted (successful) approaches in multi-armed bandit Identification of finite set of models Transfer of samples Open questions Estimation of the bias for selective transfer Appropriate measure of similarity Exploration vs exploitation vs transfer

6 Thanks!! Inria Lille – Nord Europe www.inria.fr


Download ppt "Transfer and Multi-Task Learning in Reinforcement Learning Alessandro LAZARIC “Machine Learning with Interdependent and Non-identically Distributed Data”"

Similar presentations


Ads by Google