National University of Singapore

Slides:

Advertisements

Similar presentations

University of Karlsruhe September 30th, 2004 Masayuki Fujita

Advertisements

Bayesian Belief Propagation

Visual Servo Control Tutorial Part 1: Basic Approaches Chayatat Ratanasawanya December 2, 2009 Ref: Article by Francois Chaumette & Seth Hutchinson.

Delft University of TechnologyDelft Centre for Mechatronics and Microsystems Introduction Factory robots use trajectory control; the desired angles of.

Benjamin Stephens Carnegie Mellon University 9 th IEEE-RAS International Conference on Humanoid Robots December 8, 2009 Modeling and Control of Periodic.

Automotive Research Center Robotics and Mechatronics A Nonlinear Tracking Controller for a Haptic Interface Steer-by-Wire Systems A Nonlinear Tracking.

INTRODUCTION TO DYNAMICS ANALYSIS OF ROBOTS (Part 6)

Study of the periodic time-varying nonlinear iterative learning control ECE 6330: Nonlinear and Adaptive Control FISP Hyo-Sung Ahn Dept of Electrical and.

Introduction to Control: How Its Done In Robotics R. Lindeke, Ph. D. ME 4135.

Quantifying Generalization from Trial-by-Trial Behavior in Reaching Movement Dan Liu Natural Computation Group Cognitive Science Department, UCSD March,

Model Predictive Control for Humanoid Balance and Locomotion Benjamin Stephens Robotics Institute.

CH24 in Robotics Handbook Presented by Wen Li Ph.D. student Texas A&M University.

Motion Tracking. Image Processing and Computer Vision: 82 Introduction Finding how objects have moved in an image sequence Movement in space Movement.

A De-coupled Sliding Mode Controller and Observer for Satellite Attitude Control Ronald Fenton.

The City College of New York 1 Jizhong Xiao Department of Electrical Engineering City College of New York Manipulator Control Introduction.

Adaptive Rao-Blackwellized Particle Filter and It’s Evaluation for Tracking in Surveillance Xinyu Xu and Baoxin Li, Senior Member, IEEE.

Introduction to Boosting Aristotelis Tsirigos SCLT seminar - NYU Computer Science.

CSE 425: Industrial Process Control 1. About the course Lect.TuLabTotal Semester work 80Final 125Total Grading Scheme Course webpage:

Muhammad Moeen YaqoobPage 1 Moment-Matching Trackers for Difficult Targets Muhammad Moeen Yaqoob Supervisor: Professor Richard Vinter.

COMPLEXITY SCIENCE WORKSHOP 18, 19 June 2015 Systems & Control Research Centre School of Mathematics, Computer Science and Engineering CITY UNIVERSITY.

Optimization-Based Full Body Control for the DARPA Robotics Challenge Siyuan Feng Mar

Ch. 6 Single Variable Control

A Framework for Distributed Model Predictive Control

20/10/2009 IVR Herrmann IVR: Introduction to Control OVERVIEW Control systems Transformations Simple control algorithms.

Towards Cognitive Robotics Biointelligence Laboratory School of Computer Science and Engineering Seoul National University Christian.

1 Final Conference, 19th – 23rd January 2015 Geneva, Switzerland RP 15 Force estimation based on proprioceptive sensors for teleoperation in radioactive.

Department of Electrical Engineering, Southern Taiwan University Robotic Interaction Learning Lab 1 The optimization of the application of fuzzy ant colony.

T. Bajd, M. Mihelj, J. Lenarčič, A. Stanovnik, M. Munih, Robotics, Springer, 2010 ROBOT CONTROL T. Bajd and M. Mihelj.

Time-Varying Angular Rate Sensing for a MEMS Z-Axis Gyroscope Mohammad Salah †, Michael McIntyre †, Darren Dawson †, and John Wagner ‡ Mohammad Salah †,

To clarify the statements, we present the following simple, closed-loop system where x(t) is a tracking error signal, is an unknown nonlinear function,

M. Zareinejad 1.  fundamentally, instability has the potential to occur because real-world interactions are only approximated in the virtual world 

20/10/2009 IVR Herrmann IVR:Control Theory OVERVIEW Control problems Kinematics Examples of control in a physical system A simple approach to kinematic.

Introduction to ROBOTICS

Visual SLAM Visual SLAM SPL Seminar (Fri) Young Ki Baik Computer Vision Lab.

Haptic Interfaces and Force-Control Robotic Application in Medical and Industrial Contexts Applicants Prof. Doo Yong Lee, KAIST Prof. Rolf Johansson,

1 Distributed and Optimal Motion Planning for Multiple Mobile Robots Yi Guo and Lynne Parker Center for Engineering Science Advanced Research Computer.

Low Level Control. Control System Components The main components of a control system are The plant, or the process that is being controlled The controller,

Control of Robot Manipulators

Benjamin Stephens Carnegie Mellon University Monday June 29, 2009 The Linear Biped Model and Application to Humanoid Estimation and Control.

Real-Time Simultaneous Localization and Mapping with a Single Camera (Mono SLAM) Young Ki Baik Computer Vision Lab. Seoul National University.

Speed-Sensorless Estimation for Induction motors using Extended Kalman Filters 教授：龔應時學生：楊政達 Murat Barut; Seta Bogosyan; Metin Gokasan; Industrial.

Robot Velocity Based Path Planning Along Bezier Curve Path Gil Jin Yang, Byoung Wook Choi * Dept. of Electrical and Information Engineering Seoul National.

Disturbance rejection control method

Smart Sleeping Policies for Wireless Sensor Networks Venu Veeravalli ECE Department & Coordinated Science Lab University of Illinois at Urbana-Champaign.

Path Planning Based on Ant Colony Algorithm and Distributed Local Navigation for Multi-Robot Systems International Conference on Mechatronics and Automation.

1 Lu LIU and Jie HUANG Department of Mechanics & Automation Engineering The Chinese University of Hong Kong 9 December, Systems Workshop on Autonomous.

Project Goals: The Hardware: The Problem: Results to Date:

Chapter 1: Overview of Control

Ryszard Gessing Silesian Technical University Gliwice, Poland

OVERVIEW Impact of Modelling and simulation in Mechatronics system

Automation as the Subject of Mechanical Engineer’s interest

CS b659: Intelligent Robotics

Direct Manipulator Kinematics

PETRA 2014 An Interactive Learning and Adaptation Framework for Socially Assistive Robotics: An Interactive Reinforcement Learning Approach Konstantinos.

GESTURE CONTROLLED ROBOTIC ARM

Course: Autonomous Machine Learning

Linear Control Systems

EHPV Technology Auto-Calibration and Control Applied to Electro-Hydraulic Valves by Patrick Opdenbosch GOALS Development of a general formulation for control.

Ning Wang¹, Chenguang Yang², Michael R. Lyu¹, and Zhijun Li³

Root-Locus Analysis (1)

An Adaptive Middleware for Supporting Time-Critical Event Response

Overview of Control System

LINEAR CONTROL SYSTEMS

Market-based Dynamic Task Allocation in Mobile Surveillance Systems

CONTROL SYSTEM AN INTRODUCTION.

NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION

Chapter 1. Introduction to Control System

NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION

Chapter 4 . Trajectory planning and Inverse kinematics

Chapter 7 Inverse Dynamics Control

Presentation transcript:

National University of Singapore IEEE International Conference on Robotics and Automation (ICRA 2017) Singapore, May 29 - June 3, 2017 Learning and Control of Robots in Interacting with Unknown Environments Yanan Li PhD BSc, Imperial College London Shuzhi Sam Ge FIEEE, FIFAC, FIET, FSAE, PhD, DIC, BSc National University of Singapore http://robotics.nus.edu.sg

Vision of Social Robots Social robots: Robots that are able to interact and communicate among themselves, with humans, and with the environment, within the social and cultural structure attached to their roles. -International Journal of Social Robotics More robots out of the laboratory into schools, homes and clinic places… Interacting with people in their own space Changing everyday lifestyle

International Journal of Social Robotics Editor in Chief: Shuzhi Sam Ge, NUS Co-Editor in Chief: Oussama Khatib, Stanford University, USA Aims and scope: Provide a platform for presenting findings and latest developments in social robotics, covering relevant advances in engineering, computing, arts and social sciences. Provide an overview of the current state of the social robotics scene. 1 Volume(-s) with 5 issue(-s) per annual subscription IF: 1.407

Outline Introduction Impedance Learning Intention Estimation Zero Force Regulation Conclusion and Future Work

Control: Industry vs Social Accuracy Fast response Fixed environment Interaction Safety Compliant behavior Unknown environment

Intelligent Control Human adjusts their limb impedance and trajectory when catching a ball. It is possible to apply impedance and trajectory learning skills to robot control. http://cnx.org/content/m42183/latest/?collection=col11406/latest http://www.rocketsticks.com/science Human Limb Robot Arm

Intelligent Control Robot will act as a load to the human partner if it plays a follower role. Robot’s trajectory is adapted according to human’s intention. Human-Robot Collaboration Mass-Damping-Stiffness System Central Nervous System: Motion Intention

Intelligent Control Hybrid Position/Force Control: controlling force and position in a non-conflicting way Impedance Control [N. Hogan, 1985]: developing a relationship between force and position Passive Robot Environment Stable [J. E. Colgate, 1988] Passive systems include arbitrary combinations of masses, springs and dampers, linear or nonlinear. require the decomposition of two directions

Problems and Solutions How to impose a target impedance model on a robot arm, in the presence of uncertainties and unknown robot dynamics? Impedance control design (available in the literature) How to find a target impedance model? How to determine the impedance parameters, subject to unknown environment dynamics? Impedance learning Impedance adaptation How to determine the rest position/desired trajectory in the impedance model? Intention Estimation Zero Force Regulation

Outline Introduction Impedance Learning Intention Estimation Zero Force Regulation Conclusion and Future Work

Related Works Damping control Known environment dynamics Passivity assumption Modest performance [S. P. Buerger] Known environment dynamics Stability Optimal performance [R. Johansson; M. Matinfar] Unknown environment dynamics Environment identification and modeling Impedance learning and adaptation Earlier works: simple tasks State-of-the-art: reinforcement learning – complex computation [J. Buchli] Still an open problem

Human Learning Skill A new door/ball is an unknown environment. Human beings learn to adjust their limb impedance iteratively when opening a door/catching a ball. It is possible to apply impedance learning skill to robot control. Source: http://www.chumpysclipart.com/illustration/3943/picture_of_an_angry_man_trying_to_open_a_locked_door

Problem Formulation The dynamics of a robot arm follow an impedance model: Consider that the environment is unknown and dynamically changing, and it is described by Problem: How to determine so that a desired interaction performance is guaranteed? Define a cost function to measure the interaction performance:

Impedance Learning 1. According to gradient-following scheme: 2. The target impedance model: 3. How to obtain ?

Impedance Learning Consider the environment dynamics: Choose states The environment is described as Define The environment dynamics become which is a time-varying system.

Betterment Scheme Theorem [S. Arimoto, 1984]: Consider the linear time-varying system described by The control input is iteratively updated in the following manner The betterment process is convergent in the sense that as Convergence guarantee:

Impedance Learning According to the betterment scheme, we have By employing gradient-following and betterment schemes, impedance parameters are updated as:

Experiment Study ATI 6 axis force/torque sensor Scenario: In each iteration with a period of 18s, the interaction starts at t = 5s and ends at t = 16s. Unknown environment: human hand Initial impedance parameters: Learning rate:

The First Case Cost function: Cost Function Stiffness

The First Case Cost function: Tracking Error Interaction Force

The Second Case Cost function: Cost Function Stiffness

The Second Case Cost function: Tracking Error Interaction Force

Discussion By choosing different cost functions, it is determined that the control objective can be trajectory tracking, integral force tracking or the combination/compromise of these two. The proposed impedance learning guarantees that the defined cost function becomes smaller iteratively, subject to unknown dynamic environments, and the expected interaction performance has been achieved.

Discussion The advantage of the proposed impedance learning over impedance control with fixed impedance parameters lies in: a modest performance can be obtained if a good set of fixed impedance parameters is predefined (when k = 0), and a better performance can be obtained only with variant impedance parameters because the environments are dynamically changing. While impedance parameters have been obtained through iterative learning, how to determine the rest position/desired trajectory in a target impedance model?

Outline Introduction Impedance Learning Intention Estimation Zero Force Regulation Conclusion and Future Work

Scenario: Human-Robot Collaboration Most tasks that are either too complex to automate or too heavy to manipulate manually are impractical and even impossible to be solely taken by either fully automated robots or human beings. Robots and human beings may share the same workspace and they have complementary advantages.

Scenario: Human-Robot Collaboration The robot will act as a load to the human partner if x0 is far away from x. x0 is supposed to be adapted according to human partner’s motion intention. Trajectory adaptation is another human beings’ learning skill which can be applied to robot control.

Related Works Intention estimation Force regulation Motion characteristics of the human limb: minimum jerk model [B. Corteville, 2007] Motion intention state: hidden Markov model [Z. Wang, 2009] Intentional walking direction: Kalman filter [K. Wakita, 2011] Robot and human partner: separately analyzed Force regulation Force regulation under impedance control [K. Lee, 2008] Existing works: constant rest position and only stiffness in the environment (human limb)

Problem Formulation Human limb model [M. M. Rahman 2002]: motion intention planned in the central nervous system Assumption: In a typical collaborative task, the motion intention of the human partner (in each direction) is determined by the interaction force, position and velocity at the interaction point (in the corresponding direction) of the human limb and robot arm. intention estimation

Intention Estimation NN is employed to estimate human partner’s motion intention, as below: Define a cost function: Updating law:

Intention Estimation

Experiment Study Scenario: The left wrist of Nancy is moved by human being’s hand. Note: The actual trajectory of the robot arm cannot be compared with the motion intention directly. Impedance parameters: NN setting:

Point-to-Point Movement Joint angle

Point-to-Point Movement External torque

Time-Varying Trajectory Joint angle

Time-Varying Trajectory External torque

Discussion The motion intention of the human partner has been observed by employing the human limb model. With the proposed trajectory adaptation, the robot arm is able to “actively” follow its human partner. Human partner and robot are considered to be two separated subsystems, and the performance of the whole coupled interaction system is yet to be rigorously analyzed.

Outline Introduction Impedance Learning Intention Estimation Zero Force Regulation Conclusion and Future Work

Problem Formulation Human limb model [M. M. Rahman 2002]: Control objective: design x0 in so that . Three cases: Point-to-point movement Periodic trajectory Arbitrary continuous trajectory

Zero Force Regulation Trajectory adaptation: Point-to-point movement: Periodic trajectory: Arbitrary continuous trajectory: Similar to the intention estimation method!

Main Result Theorem: With the above trajectory adaptations, the following performance of the closed-loop interaction system can be guaranteed: The robot is able to “actively” follow its human partner. Lemma: The above result will not be affected by the transient performance of the inner position control loop, if it is asymptotically stable.

Experiment Study Scenario: The left wrist of Nancy is moved by human being’s hand. Note: The actual trajectory of the robot arm cannot be compared with the motion intention directly. Impedance parameters:

Point-to-Point Movement Joint angle

Point-to-Point Movement External torque

Periodic Trajectory Joint angle

Periodic Trajectory External torque

Arbitrary Continuous Trajectory Joint angle

Arbitrary Continuous Trajectory External torque

Discussion Zero force regulation has been investigated for human-robot collaboration, so that the robot is able to “actively” follow its human partner. Adaptive control has been proposed to deal with the point-to-point movement, and learning control and NN control have been developed to generate periodic and non-periodic trajectories, respectively.

Outline Introduction Impedance Learning Intention Estimation Zero Force Regulation Conclusion and Future Work

Conclusion Impedance learning is developed to obtain desired impedance parameters when robots interact with unknown and dynamically changing environments. Trajectory adaptation is investigated for human-robot collaboration, so that robot is able to “actively” follow its human partner.

Future Work Simultaneous learning/adaptation of impedance and trajectory will be studied. Interaction control with other sensory information (e.g., image) will be investigated. The applications of the proposed methods in specific areas (e.g., rehabilitation, tele-operation, and human-robot collaboration) will be considered. Critical issues in practical implementations, such as time-delay and human factor, will be investigated.

References C. Wang, Y. Li, S. S. Ge and T. H. Lee, “Reference Adaptation for Robots in Physical Interactions with Unknown Environments,” IEEE Transactions on Cybernetics, 2016 Y. Li and S. S. Ge, “Human-Robot Collaboration Based on Motion Intention　Estimation,” IEEE Transactions on Mechatronics, 2014 Y. Li and S. S. Ge, “Impedance Learning for Robot Interacting with Unknown Environments,” IEEE Transactions on Control Systems Technology,　2013 S. S. Ge and Y. Li, “Force Tracking Control for Motion Synchronization in Human-Robot Collaboration,” Robotica, 2013 S. S. Ge, Y. Li and C. Wang, “Impedance Adaptation for Optimal Robot-　Environment Interaction,” International Journal of Control, 2013

Published Books Published Books Intelligent Control:

Published Books Switched Dynamical Systems: Robotic Systems:

International Journal of Social Robotics Editor in Chief: Shuzhi Sam Ge, NUS Co-Editor in Chief: Oussama Khatib, Stanford University, USA Aims and scope: Provide a platform for presenting findings and latest developments in social robotics, covering relevant advances in engineering, computing, arts and social sciences. Provide an overview of the current state of the social robotics scene. 1 Volume(-s) with 5 issue(-s) per annual subscription IF: 1.407

Welcome 2017 International Conference on Social Robotics (ICSR 2017) will be held in Tsukuba City, November 22-24, 2017—first time in Japan http://www.icsr2017.org/committees.html

Acknowledgement Machine Technology Intelligent Control Robot Sensing Machine Learning Cooperation Mechanisms Affective Sciences Behavioral Studies Psychological Impact Ethical Implications Interface Design