L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE A Relational Representation for Procedural.

Slides:

Advertisements

Similar presentations

Ch:8 Design Concepts S.W Design should have following quality attribute: Functionality Usability Reliability Performance Supportability (extensibility,

Advertisements

CSCTR Session 11 Dana Retová.  Start bottom-up  Create cognition based on sensori-motor interaction ◦ Cohen et al. (1996) – Building a baby ◦ Cohen.

Bayesian Network and Influence Diagram A Guide to Construction And Analysis.

Introduction University of Bridgeport 1 Introduction to ROBOTICS.

Dynamic Bayesian Networks (DBNs)

Jenkins — Modular Perception and Control Brown Computer — ROUGH DRAFT ( ) 1 Workshop Introduction: Modular Perception.

Sponsored by the U.S. Department of Defense © 2005 by Carnegie Mellon University 1 Pittsburgh, PA Dennis Smith, David Carney and Ed Morris DEAS.

OOP - Object Oriented Programming Object Oriented Programming is an approach to programming that was developed to make large programs easier to manage.

Randomized Kinodynamics Motion Planning with Moving Obstacles David Hsu, Robert Kindel, Jean-Claude Latombe, Stephen Rock.

Mining for High Complexity Regions Using Entropy and Box Counting Dimension Quad-Trees Rosanne Vetro, Wei Ding, Dan A. Simovici Computer Science Department.

Probabilistic reasoning over time So far, we’ve mostly dealt with episodic environments –One exception: games with multiple moves In particular, the Bayesian.

Laboratory for Perceptual Robotics – Department of Computer Science Hierarchical Mechanisms for Robot Programming Shiraj Sen Stephen Hart Rod Grupen Laboratory.

Autonomous Robot Navigation Panos Trahanias ΗΥ475 Fall 2007.

Challenges Bayesian Estimation for Autonomous Object Manipulation Based on Tactile Perception Anna Petrovskaya, Oussama Khatib, Sebastian Thrun, Andrew.

Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Generalized Grasping and Manipulation Laboratory.

Laboratory for Perceptual Robotics Department of Computer Science University of Massachusetts Amherst Natural Task Decomposition with Intrinsic Potential.

U NIVERSITY OF M ASSACHUSETTS, A MHERST D EPARTMENT OF C OMPUTER S CIENCE Advanced Compilers CMPSCI 710 Spring 2003 Computing SSA Emery Berger University.

A Formal Model of Computation for Sensory-Based Robotics

Evaluating Hypotheses

Part 2 of 3: Bayesian Network and Dynamic Bayesian Network.

Knowledge Representation and Organization

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Intent Recognition as a Basis for Imitation.

U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Optimal Fixed-Size Controllers for Decentralized POMDPs Christopher Amato Daniel.

Neural Networks (NN) Ahmad Rawashdieh Sa’ad Haddad.

October 7, 2010Neural Networks Lecture 10: Setting Backpropagation Parameters 1 Creating Data Representations On the other hand, sets of orthogonal vectors.

Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.

Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.

Part I: Classification and Bayesian Learning

Quantitative Methods. Introduction Experimental Data Non-Experimental Data & Inference Probabilistic versus Deterministic Models Political Methodology.

CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.

Chapter 1 Introduction to Simulation

U NIVERSITY OF M ASSACHUSETTS, A MHERST D EPARTMENT OF C OMPUTER S CIENCE Emery Berger University of Massachusetts, Amherst Advanced Compilers CMPSCI 710.

Towards Cognitive Robotics Biointelligence Laboratory School of Computer Science and Engineering Seoul National University Christian.

Department of Electrical Engineering, Southern Taiwan University Robotic Interaction Learning Lab 1 The optimization of the application of fuzzy ant colony.

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Learning Prospective Robot Behavior Shichao.

MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.

Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.

Ensembles. Ensemble Methods l Construct a set of classifiers from training data l Predict class label of previously unseen records by aggregating predictions.

Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.

Natural Tasking of Robots Based on Human Interaction Cues Brian Scassellati, Bryan Adams, Aaron Edsinger, Matthew Marjanovic MIT Artificial Intelligence.

A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,

Maximum a posteriori sequence estimation using Monte Carlo particle filters S. J. Godsill, A. Doucet, and M. West Annals of the Institute of Statistical.

Presented by Jian-Shiun Tzeng 5/7/2009 Conditional Random Fields: An Introduction Hanna M. Wallach University of Pennsylvania CIS Technical Report MS-CIS

Consistency An estimator is a consistent estimator of θ, if , i.e., if

Learning TFC Meeting, SRI March 2005 On the Collective Classification of “Speech Acts” Vitor R. Carvalho & William W. Cohen Carnegie Mellon University.

Chapter 7. Learning through Imitation and Exploration: Towards Humanoid Robots that Learn from Humans in Creating Brain-like Intelligence. Course: Robots.

ROBOT VISION LABORATORY 김 형 석 Robot Applications-B

Chapter 1. Cognitive Systems Introduction in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans Park, Sae-Rom Lee, Woo-Jin Statistical.

OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.

Classification Ensemble Methods 1

Enriching Assessment of the Core Albert Oosterhof, Faranak Rohani, & Penny J. Gilmer Florida State University Center for Advancement of Learning and Assessment.

OBJECT TRACKING USING PARTICLE FILTERS. Table of Contents Tracking Tracking Tracking as a probabilistic inference problem Tracking as a probabilistic.

Chapter 4 Motor Control Theories Concept: Theories about how we control coordinated movement differ in terms of the roles of central and environmental.

Reinforcement Learning for Mapping Instructions to Actions S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay Computer Science and Artificial.

Probabilistic Reasoning Inference and Relational Bayesian Networks.

U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Achieving Goals in Decentralized POMDPs Christopher Amato Shlomo Zilberstein UMass.

Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.

Functionality of objects through observation and Interaction Ruzena Bajcsy based on Luca Bogoni’s Ph.D thesis April 2016.

Statistical environment representation to support navigation of mobile robots in unstructured environments Sumare workshop Stefan Rolfes Maria.

Statistica /Statistics Statistics is a discipline that has as its goal the study of quantity and quality of a particular phenomenon in conditions of.

San Diego May 22, 2013 Giovanni Saponaro Giampiero Salvi

Modeling the development of mirror neurons Problem Solution

Classroom Assessment Validity And Bias in Assessment.

Artificial Intelligence

PSY 614 Instructor: Emily Bullock Yowell, Ph.D.

The Basic of Measurement

CIS 488/588 Bruce R. Maxim UM-Dearborn

Statistical environment representation to support navigation of mobile robots in unstructured environments Stefan Rolfes Maria Joao Rendas

Algebraic Specification Software Specification Lecture 34

Presentation transcript:

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE A Relational Representation for Procedural Task Knowledge Stephen Hart Roderic Grupen David Jensen Laboratory for Perceptual Robotics University of Massachusetts Amherst New England Manipulation Symposium May 25, 2005

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Introduction and Motivation Robots performing tasks in real-world environments require methods to: Produce fault-tolerant behavior Focus on most salient and relevant information Handle multi-modal, continuous data Leverage past experience (i.e. adapt and reuse) Can we learn probability estimates regarding the effects of sensorimotor variables on task success? –e.g. If I take these actions, how likely am I to succeed at my task?

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Generalized Task Expertise Declarative knowledge –Captures abstract knowledge about the task –e.g. find an object, reach to it, pick it up... Procedural knowledge –Captures knowledge about how to instantiate the abstract policy in a particular environmental context –e.g. turn my head to the left, use my left hand to reach, use an enveloping grasp...

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Schema Theory Arbib (1995) describes control programs composed of: –Perceptual schema - a Ball might be characterized by “size,” “color,” “velocity,” etc. –Motor schema - actions characterized by a “degree of readiness” and “activity level.” Are such distinctions misleading? –Gibsonian Affordances: a perceptual feature is only meaningful if it facilitates action –Mirror Neurons: the same neurons will activate when performing an action or when observing someone else perform that action Claim: All perceptual information can come from appropriately designed controllers

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE How do we learn procedural structure? We would like the robot to differentiate its actions based on environmental context –e.g. Pick and Place Which available sensorimotor features are correlated –structure learning How these features relate, probabilistically, to each other –parameter learning

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Relational Data Data with complex dependencies between instances or varying structure (not i.i.d.) Applicable to robotics domain because: –Different training episodes may exhibit varying structure Data designated as Objects and Attributes –Objects are related through the structure of the data –Attributes are related through learned statistical dependencies Relational Dependency Networks –approximate the full joint distribution of a set of variables with a set of conditional probability distributions –Perform Gibbs sampling to do joint inference

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE locale bounding box dimensions orientation convergence state lift-able fingers LocalizeReachGrasp convergence state Some Controller Objects

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE What is Relational About this Data? Reach Controller Grasp Controller Reach Controller Simple Assembly 1: Grasp Controller Assemble Controller

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE What is Relational About this Data? Reach Controller Grasp Controller Reach Controller Simple Assembly 2: Grasp Controller Assemble Controller Remanipulate Controller

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Gathering the Dataset Observe an autonomous program or a teleoperator performing a task a variety of ways Each trial may follow a different trajectory Data is collected after each trial Model is learned with Proximity

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Experiments PickUp with Dexter TM 2 objects (3 orientations) tall box, coffee can 2 grasps: 2 VF, 3 VF 2 reaches: top approach side approach 8 locales uniformly distributed

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE locale bounding box dimensions orientation convergence state lift-able fingers LocalizeReachGrasp convergence state The Learned Model Graph

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Attribute Trees The RDN algorithm estimates a CPD for each attribute –Learns a locally consistent Relational Probability Tree (RPT) for that attribute Each tree focuses attention on the most salient predictors of the corresponding attribute –Manages complexity –Allows for easy and intuitive interpretation –Each attribute (sensorimotor feature) has an affordance in terms of the current task

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE RPT for “Lift-able”

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Using the RDN to construct policy How do we use the learned schema to perform the task again? –At each action point: perform joint inference on task success variables and find most likely resource assignment Use this assignment and see how likely success is Perform next action with resource binding, possibly uncovering new information through interaction

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Yeah, but... how does it perform? Pick up the can with 2 or 3 fingers from the top Pick up the box with 2 fingers –From the side or the top standing up –From the top laying down Predicts little probability of success if object is outside reachable workspace

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Where to Next? How do we learn the declarative structure? –Previous work by Huber, Platt, etc. Capture dynamic response of controllers during execution –Learn dependencies through direct interaction with the environment Can we sample a set attributes from uncountable possible set –Resample if poor policies are learned

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE The End

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE RDNs in Robotics What do we know? –a collection of controllers are necessary for a task, usually organized as a sequence of sub-goals –controllers have state, attached resources, and can reveal perceptual information through execution –controllers can execute sequentially or in conjunction What don’t we know? –Which sensorimotor features of each controller are important and how they correlate

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE Localize Reach Grasp Localize Reach Grasp Localize Reach Grasp Localize Reach Grasp Four Training Structures

L ABORATORY FOR P ERCEPTUAL R OBOTICS U NIVERSITY OF M ASSACHUSETTS A MHERST D EPARTMENT OF C OMPUTER S CIENCE What is Relational About this Data? Reach Controller Grasp Controller Localize Controller Reach Controller Obstacle Avoidance Controller Obstacle Avoidance Controller Kinematic Conditioning Controller Pick and Transport: Not independently distributed!!! sequential relations conjunctive relations