Towards robotic assistants in nursing homes: challenges and results Joelle Pineau Michael Montemerlo Martha Pollack * Nicholas Roy Sebastian Thrun Carnegie.

Slides:

Advertisements

Similar presentations

A Decision-Theoretic Model of Assistance - Evaluation, Extension and Open Problems Sriraam Natarajan, Kshitij Judah, Prasad Tadepalli and Alan Fern School.

Advertisements

Technology to Support Individuals with Cognitive Impairment Martha E. Pollack Computer Science & Engineering University of Michigan.

11 Changing Demographics (US Census Dept, 2005). 22.

Dialogue Policy Optimisation

Manuela Veloso, Anthony Stentz, Alexander Rudnicky Brett Browning, M. Bernardine Dias Faculty Thomas Harris, Brenna Argall, Gil Jones Satanjeev Banerjee.

SA-1 Probabilistic Robotics Planning and Control: Partially Observable Markov Decision Processes.

CSE-573 Artificial Intelligence Partially-Observable MDPS (POMDPs)

Technology to Support Individuals with Cognitive Impairment Martha E. Pollack Computer Science & Engineering University of Michigan.

Fast approximate POMDP planning: Overcoming the curse of history! Joelle Pineau, Geoff Gordon and Sebastian Thrun, CMU Point-based value iteration: an.

Meeting 3 POMDP (Partial Observability MDP) 資工四阮鶴鳴李運寰 Advisor: 李琳山教授.

Sebastian Thrun Carnegie Mellon & Stanford Wolfram Burgard University of Freiburg and Dieter Fox University of Washington Probabilistic Algorithms for.

MDP Presentation CS594 Automated Optimal Decision Making Sohail M Yousof Advanced Artificial Intelligence.

Markovito’s Team (INAOE, Puebla, Mexico). Team members.

High-level robot behavior control using POMDPs Joelle Pineau and Sebastian Thrun Carnegie Mellon University.

What Are Partially Observable Markov Decision Processes and Why Might You Care? Bob Wall CS 536.

Bayes Filters Pieter Abbeel UC Berkeley EECS Many slides adapted from Thrun, Burgard and Fox, Probabilistic Robotics TexPoint fonts used in EMF. Read the.

1 Reasoning Under Uncertainty Over Time CS 486/686: Introduction to Artificial Intelligence Fall 2013.

1 The INRIA Robotics Teams Propose a Large-Scale Initiative Action “Personally Assisted Living” March 18, 2009.

Probabilistic Control of Human Robot Interaction: Experiments with a Robotic Assistant for Nursing Homes Joelle Pineau Michael Montemerlo Martha Pollack.

Sebastian Thrun Carnegie Mellon & Stanford Wolfram Burgard University of Freiburg and Dieter Fox University of Washington Probabilistic Algorithms for.

Sebastian Thrun Carnegie Mellon University Statistical Learning in Robotics State-of-the-Art, Challenges and Opportunities.

Approximate Solutions for Partially Observable Stochastic Games with Common Payoffs Rosemary Emery-Montemerlo joint work with Geoff Gordon, Jeff Schneider.

Sebastian Thrun Carnegie Mellon University University of Pittsburgh Particle Filters In Robotics or: How the World Became To Be One Big Bayes Network.

© sebastian thrun, CMU, C Statistical Techniques In Robotics Sebastian Thrun and Geoffrey Gordon Carnegie Mellon University

Probabilistic Robotics

A Probabilistic Approach to Collaborative Multi-robot Localization Dieter Fox, Wolfram Burgard, Hannes Kruppa, Sebastin Thrun Presented by Rajkumar Parthasarathy.

© sebastian thrun, CMU, CS226 Statistical Techniques In Robotics Sebastian Thrun (Instructor) and Josh Bao (TA)

Markov Decision Processes

Hierarchical POMDP Planning and Execution Joelle Pineau Machine Learning Lunch November 20, 2000.

Department of Computer Science Undergraduate Events More

CS 188: Artificial Intelligence Fall 2009 Lecture 19: Hidden Markov Models 11/3/2009 Dan Klein – UC Berkeley.

The SmartWheeler platform Collaboration between McGill, U.Montreal, Ecole Polytechnique Montreal + 2 clinical rehab centers. Standard commercial power.

Decision-Making on Robots Using POMDPs and Answer Set Programming Introduction Robots are an integral part of many sectors such as medicine, disaster rescue.

Planning and Verification for Stochastic Processes with Asynchronous Events Håkan L. S. Younes Carnegie Mellon University.

DARPA Mobile Autonomous Robot SoftwareLeslie Pack Kaelbling; March Adaptive Intelligent Mobile Robotics Leslie Pack Kaelbling Artificial Intelligence.

Planning and Execution with Phase Transitions Håkan L. S. Younes Carnegie Mellon University Follow-up paper to Younes & Simmons’ “Solving Generalized Semi-Markov.

Probabilistic Robotics: Monte Carlo Localization

Efficient Interaction Strategies for Adaptive Reminding Julie S. Weber & Martha E. Pollack Adaptive Reminder Generation SignalingIntended Approach Learning.

K. J. O’Hara AMRS: Behavior Recognition and Opponent Modeling Oct Behavior Recognition and Opponent Modeling in Autonomous Multi-Robot Systems.

CSE-573 Reinforcement Learning POMDPs. Planning What action next? PerceptsActions Environment Static vs. Dynamic Fully vs. Partially Observable Perfect.

TKK | Automation Technology Laboratory Partially Observable Markov Decision Process (Chapter 15 & 16) José Luis Peralta.

Artificial Intelligence 2005/06 Partially Ordered Plans - or: "How Do You Put Your Shoes On?"

Solving POMDPs through Macro Decomposition

A Tutorial on the Partially Observable Markov Decision Process and Its Applications Lawrence Carin June 7,2006.

Tractable Planning for Real-World Robotics: The promises and challenges of dealing with uncertainty Joelle Pineau Robotics Institute Carnegie Mellon University.

Confidence Based Autonomy: Policy Learning by Demonstration Manuela M. Veloso Thanks to Sonia Chernova Computer Science Department Carnegie Mellon University.

Transfer in Variable - Reward Hierarchical Reinforcement Learning Hui Li March 31, 2006.

Probabilistic approaches to reasoning and control: Towards autonomous interactive mobile robots Joelle Pineau Carnegie Mellon University TAMALE Seminar.

Transfer Learning in Sequential Decision Problems: A Hierarchical Bayesian Approach Aaron Wilson, Alan Fern, Prasad Tadepalli School of EECS Oregon State.

Thrust IIB: Dynamic Task Allocation in Remote Multi-robot HRI Jon How (lead) Nick Roy MURI 8 Kickoff Meeting 2007.

Reinforcement Learning Guest Lecturer: Chengxiang Zhai Machine Learning December 6, 2001.

10-1 Probabilistic Robotics: FastSLAM Slide credits: Wolfram Burgard, Dieter Fox, Cyrill Stachniss, Giorgio Grisetti, Maren Bennewitz, Christian Plagemann,

Partial Observability “Planning and acting in partially observable stochastic domains” Leslie Pack Kaelbling, Michael L. Littman, Anthony R. Cassandra;

CS 541: Artificial Intelligence Lecture X: Markov Decision Process Slides Credit: Peter Norvig and Sebastian Thrun.

Partially Observable Markov Decision Process and RL

Engineering Societies in the Agents World Workshop 2003

CS b659: Intelligent Robotics

Thrust IC: Action Selection in Joint-Human-Robot Teams

Joelle Pineau Robotics Institute Carnegie Mellon University

Probabilistic Robotics: Historgam Localization

Markov Decision Processes

Joelle Pineau: General info

Markov Decision Processes

Course Logistics CS533: Intelligent Agents and Decision Making

Hierarchical POMDP Solutions

CIS 488/588 Bruce R. Maxim UM-Dearborn

High-level robot behavior control using POMDPs

Approximate POMDP planning: Overcoming the curse of history!

CS 416 Artificial Intelligence

Reinforcement Nisheeth 18th January 2019.

Presentation transcript:

Towards robotic assistants in nursing homes: challenges and results Joelle Pineau Michael Montemerlo Martha Pollack * Nicholas Roy Sebastian Thrun Carnegie Mellon University * University of Michigan

Joelle Pineau The Nursebot Project Introducing Pearl – A mobile robotic assistant for elderly people and nurses cameras sonars handle bars mobile base carrying tray LCD mouth touchscreen microphone & speakers laser ROLE: Moving things around Moving things around Management support of ADLs Management support of ADLs Providing physical assistance Providing physical assistance Remote health services Remote health services Supporting communication Supporting communication Calling for help in emergencies Calling for help in emergencies Monitoring Rx adherence & safety Monitoring Rx adherence & safety Providing info (TV, weather) Providing info (TV, weather) Reminding to eat, drink, take meds Reminding to eat, drink, take meds Linking caregiver and resources Linking caregiver and resources

Joelle Pineau The Nursebot Project The Nursebot project in its early days

Joelle Pineau The Nursebot Project Architecture Cognitive supportNavigationCommunication High-level controller

Joelle Pineau The Nursebot Project Localization and map building (Burgard et al., 1999) People detection and tracking (Montemerlo et al., 2002) Architecture Cognitive supportNavigationCommunication High-level controller

Joelle Pineau The Nursebot Project Autominder system (Pollack et al., 2002) Architecture Cognitive supportNavigationCommunication High-level controller

Joelle Pineau The Nursebot Project Speech recognition: Sphinx system (Ravishankar, 1996) Speech synthesis: Festival system (Black et al., 1999) Architecture Cognitive supportNavigationCommunication High-level controller

Joelle Pineau The Nursebot Project The role of the top-level controller Établir les priorités parmi les objectifs des différents modules Négocier entre plusieurs objectifs ayant des coûts/gains variés Négocier entre l’acquisition d’information et la rencontre des objectifs Passer d’une tâche à l’autre en partageant l’information sensorielle Planifier malgré la présence d’incertitude Cognitive supportNavigationCommunication ACTION SELECTION - based on the trade-off between: - goals from different modules; - goals with varying costs / rewards; - reducing uncertainty versus accomplishing goals. High-level controller

Joelle Pineau The Nursebot Project Speech recognition with Sphinx

Joelle Pineau The Nursebot Project Robot control under uncertainty Belief State P(s t =weather-today)=0.5 P(s t =appointment-today )=0.5 USER Action={say-weather, update-appointment, clarify-query} Speech=“today” State weather-today

Joelle Pineau The Nursebot Project Partially Observable Markov Decision Processes Robot control using Partially Observable Markov Decision Processes (POMDPs) Belief state USER + ENVIRONMENT + WORLD Actions Observations Costs / Rewards State Problem: Which action allows the robot to maximize its reward? P(s 1 ) P(s 2 )

Joelle Pineau The Nursebot Project Methods to solve POMDPs Objective: Find a policy,  (b), which maximizes reward. Complexity Performance QMDP MDP FIB UMDP AMDP O(S 2 A) O(S 2 A T )O(S 2 A O ) O(S 2 AB) T POMDP New methods?

Joelle Pineau The Nursebot Project New approach: A hierarchy of POMDPs Idea: Exploit domain knowledge to divide one POMDP into many smaller ones. Motivation: Complexity of POMDP solving grows exponentially with # of actions. Assumption: We are given POMDP M = {S,A, ,b,T,O,R} and hierarchy H Act ExamineHealth Navigate Move VerifyPulse ClarifyGoal NorthSouthEastWest VerifyMeds subtask abstract action primitive action

Joelle Pineau The Nursebot Project PolCA: Planning with a hierarchy of POMDPs Step 1: Select the action set Navigate Move ClarifyGoal SouthEast West North A Move = {N,S,E,W} ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds

Joelle Pineau The Nursebot Project PolCA: Planning with a hierarchy of POMDPs Step 1: Select the action set Step 2: Minimize the state set STATE FEATURES X-position Y-position X-goal Y-goal HealthStatus STATE FEATURES X-position Y-position X-goal Y-goal HealthStatus Navigate Move ClarifyGoal SouthEast West North A Move = {N,S,E,W} S Move = {X,Y} ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds

Joelle Pineau The Nursebot Project PolCA: Planning with a hierarchy of POMDPs Step 1: Select the action set Step 2: Minimize the state set Step 3: Choose parameters STATE FEATURES X-position Y-position X-goal Y-goal HealthStatus STATE FEATURES X-position Y-position X-goal Y-goal HealthStatus Navigate Move ClarifyGoal SouthEast West North A Move = {N,S,E,W} S Move = {X,Y} ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds PARAMETERS {b h,T h,O h,R h } PARAMETERS {b h,T h,O h,R h }

Joelle Pineau The Nursebot Project PolCA: Planning with a hierarchy of POMDPs Step 1: Select the action set Step 2: Minimize the state set Step 3: Choose parameters Step 4: Plan task h STATE FEATURES X-position Y-position X-goal Y-goal HealthStatus STATE FEATURES X-position Y-position X-goal Y-goal HealthStatus Navigate Move ClarifyGoal SouthEast West North A Move = {N,S,E,W} S Move = {X,Y} ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds ACTIONS North South East West ClarifyGoal VerifyPulse VerifyMeds PLAN  h PLAN  h PARAMETERS {b h,T h,O h,R h } PARAMETERS {b h,T h,O h,R h }

Joelle Pineau The Nursebot Project First study in simulation 20-questions domain: Agent: “Is it an animal?” User: “No.” Agent: “Is it a vegetable?” User: “Yes.” Agent: “Is it green?” User: “No.” Agent: “...?” Agent: “Is it an animal?” User: “No.” Agent: “Is it a vegetable?” User: “Yes.” Agent: “Is it green?” User: “No.” Agent: “...?” Actions Objective : Plan a sequence of questions allowing the agent to identify the chosen object. Small complication…. the user can change objects at any time ( Pr = 0.1 ), without telling the agent. Observations

Joelle Pineau The Nursebot Project The hierarchy Animals Begin Vegetable? VegetablesMinerals Mineral?Animal? Mammal?RABBITHerbivore?TURTLE…? ……

Joelle Pineau The Nursebot Project Methods to solve POMDPs Complexity Performance POMDP QMDP MDP FIB UMDP AMDP PolCA

Joelle Pineau The Nursebot Project Results Domain: |S|=1 2, |A|=20, |O|=3

Joelle Pineau The Nursebot Project A new decomposition Animals Begin Vegetable? VegetablesMinerals Mineral?Animal? …… …

Joelle Pineau The Nursebot Project A new decomposition FruitsPlants …… Animals Begin Vegetable? VegetablesMinerals Mineral?Animal? … Fruit? …

Joelle Pineau The Nursebot Project Results Domain: |S|=1 2, |A|=20, |O|=3

Joelle Pineau The Nursebot Project Methods to solve POMDPs Complexity Performance POMDP QMDP MDP FIB UMDP AMDP PolCA

Joelle Pineau The Nursebot Project PolCA in the Nursebot domain Goal: A robot is deployed in a nursing home, where it provides reminders to elderly users and accompanies them to appointments. Domain : |S|=512, |A|=20, |O|=19 Hierarchy:

Joelle Pineau The Nursebot Project Sample scenario

Joelle Pineau The Nursebot Project Results for dialogue system POMDP policy MDP policy

Joelle Pineau The Nursebot Project Summary We have developed a first prototype robot able to serve as a mobile nursing assistant for elderly people. The top-level controller uses a hierarchical variant of POMDPs to select actions. This allows it to acquire necessary information and successfully complete assigned tasks. Probabilistic techniques have been found to be very useful to flexibly model and track individuals.

Joelle Pineau The Nursebot Project For more details: The Nursebot team CMU - Robotics: Greg Armstrong Michael Montemerlo Joelle Pineau Nicholas Roy Jamie Schulte Sebastian Thrun CMU - HCI/Design: Francine Gemperle Jennifer Goetz Sarah Kiesler Aaron Powers U. of Pittsburgh - Nursing: Jacqueline Dunbar-Jacobs Sandra Engberg Judith Matthews U. of Pittsburgh - CS: Don Chiarulli Colleen McCarthy U. of Freiburg - CS: Maren Bennewitz Wolfram Burgard Dirk Schulz U. of Michigan - CS: Laura Brown Dirk Colbry Cheryl Orosz Bart Peintner Martha Pollack Sailesh Ramakrishnan Standard Robotics: Greg Baltus

Joelle Pineau The Nursebot Project Our vision of robotic healthcare Moving things around Moving things around Enabling use of remote health services Enabling use of remote health services Supporting inter-personal communication Supporting inter-personal communication Calling for help in emergencies Calling for help in emergencies Monitoring Rx adherence & safety Monitoring Rx adherence & safety Providing information (TV, weather) Providing information (TV, weather) Management support of ADLs Management support of ADLs Reminding to eat, drink, & take meds Reminding to eat, drink, & take meds Providing physical assistance Providing physical assistance Linking the caregiver to resources Linking the caregiver to resources

Joelle Pineau The Nursebot Project Profile of an aging population  450,000 nurses to recruit before 2008 (USA)

Joelle Pineau The Nursebot Project Localization and map building

Joelle Pineau The Nursebot Project People tracking

Joelle Pineau The Nursebot Project Analysis of movements

Joelle Pineau The Nursebot Project Autominder System

Joelle Pineau The Nursebot Project The family of Markov models Markov Chain Hidden Markov Model (HMM) Markov Decision Process (MDP) Partially Observable Markov Decision Process (POMDP) State ambiguity? noyes Choice of action? yes no

Joelle Pineau The Nursebot Project The POMDP is a septuple { S, A, , b, T, O, R } Problem: Which action allows the robot to maximize its reward? b t-1 btbt a t-1 otot s t-1 stst... ?? o t-1... r t-1 rtrt The POMDP model Belief: State: Action:

Joelle Pineau The Nursebot Project Execution with a hierarchy of POMDPs At each time step, traverse the hierarchy from top to bottom. For each subtask, consult the policy a t   h (b t ) If a t is an internal node > move to that subtask. Act ExamineHealth Navigate Move VerifyPulse ClarifyGoal NorthSouthEastWest VerifyMeds

Joelle Pineau The Nursebot Project How to choose parameters? Consider for example T Navigate (s,a,s’) Case #1: a is a primitive action. T Navigate (s,ClarifyGoal,s’)  T(s,ClarifyGoal,s’) Navigate Move ClarifyGoal SouthEast West North S S’ South S’ North S’ ClarifyGoal

Joelle Pineau The Nursebot Project How to choose parameters? Consider for example T Navigate (s,a,s’) Case #1: a is a primitive action. T Navigate (s,ClarifyGoal,s’)  T(s,ClarifyGoal,s’) Case #2: a is an abstract action. T Navigate (s,Move,s’)  T(s,  Move (s),s’) Navigate Move ClarifyGoal SouthEast West North S S’ South S’ North S’ ClarifyGoal

Joelle Pineau The Nursebot Project States:1 per object (e.g. tomato, cucumber, rabbit, turtle, tulip, ….) Observations:“yes”, “no”, “  ” Actions:1 guess per objet Rewards:question= -1 + many questionscorrect guess = +5 incorrect guess= -20 The details Animals Begin Vegetable? VegetablesMinerals Mineral?Animal? Mammal?RABBITHerbivore?TURTLE…? ……