Robotic Swarms for Assembly

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Lecture 18: Temporal-Difference Learning
Intel Research Internet Coordinate Systems - 03/03/2004 Internet Coordinate Systems Marcelo Pias Intel Research Cambridge
Robotics Research Laboratory 1 Chapter 6 Design Using State-Space Methods.
Kick-off Meeting, July 28, 2008 ONR MURI: NexGeNetSci Distributed Coordination, Consensus, and Coverage in Networked Dynamic Systems Ali Jadbabaie Electrical.
Coverage by Directional Sensors Jing Ai and Alhussein A. Abouzeid Dept. of Electrical, Computer and Systems Engineering Rensselaer Polytechnic Institute.
GRASP University of Pennsylvania NRL logo? Autonomous Network of Aerial and Ground Vehicles Vijay Kumar GRASP Laboratory University of Pennsylvania Ron.
ICCV 2007 tutorial Part III Message-passing algorithms for energy minimization Vladimir Kolmogorov University College London.
Multiscale Stochastic Simulation Algorithm with Stochastic Partial Equilibrium Assumption for Chemically Reacting Systems Linda Petzold and Yang Cao University.
CHAPTER 8 A NNEALING- T YPE A LGORITHMS Organization of chapter in ISSO –Introduction to simulated annealing –Simulated annealing algorithm Basic algorithm.
Dynamics of Learning VQ and Neural Gas Aree Witoelar, Michael Biehl Mathematics and Computing Science University of Groningen, Netherlands in collaboration.
Observers and Kalman Filters
Aspects of Conditional Simulation and estimation of hydraulic conductivity in coastal aquifers" Luit Jan Slooten.
Centre for Autonomous Systems Petter ÖgrenCAS talk1 A Control Lyapunov Function Approach to Multi Agent Coordination P. Ögren, M. Egerstedt * and X. Hu.
CAS 1999 Dynamic Financial Analysis Seminar Chicago, Illinois July 19, 1999 Calibrating Stochastic Models for DFA John M. Mulvey - Princeton University.
Sérgio Pequito Phd Student
458 Lumped population dynamics models Fish 458; Lecture 2.
Maximizing Path Durations in Mobile Ad- Hoc Networks Yijie Han and Richard J. La Department of ECE & ISR University of Maryland, College Park CISS, Princeton.
Yashar Ganjali High Performance Networking Group Stanford University September 17, 2003 Minimum-delay Routing.
Mapping and Localization with RFID Technology Matthai Philipose, Kenneth P Fishkin, Dieter Fox, Dirk Hahnel, Wolfram Burgard Presenter: Aniket Shah.
Stochastic Linear Programming by Series of Monte-Carlo Estimators Leonidas SAKALAUSKAS Institute of Mathematics&Informatics Vilnius, Lithuania
Smart Monte Carlo: Various Tricks Using Malliavin Calculus Quantitative Finance, NY, Nov 2002 Eric Benhamou Goldman Sachs International.
Aemen Lodhi (Georgia Tech) Amogh Dhamdhere (CAIDA)
Princeton University Department of Chemical Engineering and PACM Equation-Free Uncertainty Quantification on Stochastic Chemical Reactions SIAM Annual.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
1 Distributed and Optimal Motion Planning for Multiple Mobile Robots Yi Guo and Lynne Parker Center for Engineering Science Advanced Research Computer.
Suppressing Random Walks in Markov Chain Monte Carlo Using Ordered Overrelaxation Radford M. Neal 발표자 : 장 정 호.
Reinforcement Learning Control with Robust Stability Chuck Anderson, Matt Kretchmar, Department of Computer Science, Peter Young, Department of Electrical.
Automated Intruder Tracking using Particle Filtering and a Network of Binary Motion Sensors Jeremy Schiff EECS Department University of California, Berkeley.
Mathematical Modeling of Signal Transduction Pathways Biplab Bose IIT Guwahati.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
MURI Telecon, Update 7/26/2012 Summary, Part I:  Completed: proving and validating numerically optimality conditions for Distributed Optimal Control (DOC)
Primbs1 Receding Horizon Control for Constrained Portfolio Optimization James A. Primbs Management Science and Engineering Stanford University (with Chang.
Monte-Carlo based Expertise A powerful Tool for System Evaluation & Optimization  Introduction  Features  System Performance.
Decision Making Under Uncertainty CMSC 471 – Spring 2041 Class #25– Tuesday, April 29 R&N, material from Lise Getoor, Jean-Claude Latombe, and.
Playing Tic-Tac-Toe with Neural Networks
Colorado Center for Astrodynamics Research The University of Colorado 1 STATISTICAL ORBIT DETERMINATION Kalman Filter with Process Noise Gauss- Markov.
On Mobile Sink Node for Target Tracking in Wireless Sensor Networks Thanh Hai Trinh and Hee Yong Youn Pervasive Computing and Communications Workshops(PerComW'07)
Mean Field Methods for Computer and Communication Systems Jean-Yves Le Boudec EPFL Network Science Workshop Hong Kong July
University of Pennsylvania 1 GRASP Control of Multiple Autonomous Robot Systems Vijay Kumar Camillo Taylor Aveek Das Guilherme Pereira John Spletzer GRASP.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Supplementary Chapter B Optimization Models with Uncertainty
TEMPLE UNIVERSITY Deadline-Sensitive Mobile Data Offloading via Opportunistic Communications Guoju Gaoa, Mingjun Xiao∗a, Jie Wub, Kai Hana, Liusheng Huanga.
Monte Carlo methods 10/20/11.
Control of Swarm Robotic Systems for Coverage and Manipulation Tasks
Chapter 6: Temporal Difference Learning
System Control based Renewable Energy Resources in Smart Grid Consumer
You can check broken videos in this slide here :
Distributed Energy Efficient Clustering (DEEC) Routing Protocol
Queue Dynamics with Window Flow Control
Clustering (3) Center-based algorithms Fuzzy k-means
Autonomous Cyber-Physical Systems: Dynamical Systems
Unfolding Problem: A Machine Learning Approach
Igor V. Cadez, Padhraic Smyth, Geoff J. Mclachlan, Christine and E
Authors: Ing-Ray Chen; Yating Wang Present by: Kaiqun Fu
Lecture 2 – Monte Carlo method in finance
2016 International Conference on Grey Systems and Uncertainty Analysis
An Adaptive Middleware for Supporting Time-Critical Event Response
Mean-Independent Noise Control of Cell Fates via Intermediate States
October 6, 2011 Dr. Itamar Arel College of Engineering
Chapter 6: Temporal Difference Learning
Boltzmann Machine (BM) (§6.4)
Lecture 4 - Monte Carlo improvements via variance reduction techniques: antithetic sampling Antithetic variates: for any one path obtained by a gaussian.
Presented By: Darlene Banta
NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION
SENSOR BASED CONTROL OF AUTONOMOUS ROBOTS
NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION
Lecture 16. Classification (II): Practical Considerations
Distributed Reinforcement Learning for Multi-Robot Decentralized Collective Construction Gyu-Young Hwang
Stabilization of Multiple Robots on Stable Orbits via Local Sensing Mong-ying A. Hsieh, Savvas Loizou and Vijay Kumar GRASP Laboratory, University of.
Presentation transcript:

Robotic Swarms for Assembly Spring Berman and Vijay Kumar GRASP Laboratory, University of Pennsylvania Loïc Matthey University College London, formerly EPFL February 22, 2010

Motivation Design a reconfigurable manufacturing system that quickly assembles target amounts of products from a supply of heterogeneous parts

Design Criteria (1) Strategy should be scalable in the number of parts Decentralized approach: - Parts scattered randomly inside an arena Randomly moving autonomous robots assemble products Local sensing, local communication (2) Minimal adjustments when product demand changes - Probabilities of assembly and disassembly are robot control policies - Can be updated via a broadcast (3) System can be optimized for fast production Spatial homogeneity  Chemical Reaction Network model

Related Work Collision-based self-assembly modeled as Chemical Reaction Networks Hosokawa et al., Artificial Life, 1994 - Predict yield of complete assemblies from passive modules Baskaran et al., Micro Electro Mech. Syst., 2008 - Model micro scale batch assembly Klavins et al., Proc. Robotics Sci. Syst., 2007 - Optimize assembly detachment probabilities to maximize yield of a particular assembly type - Optimization requires enumeration of all reachable states

Outline Abstraction: continuous model of robots executing an assembly task Synthesis: top-down design of decentralized robot controllers Optimization: controller parameters are independent of the number of robots and parts Implementation: provable guarantees on performance

System Abstraction Ordinary differential equations M states: continuous populations of parts Reduced continuous model Robots find parts quickly, Ordinary differential equations States: continuous populations of robots and free/carried parts Complete continuous model Large Spatial homogeneity [D. Gillespie, Annu. Rev. Phys. Chem., 2007] 3D physics simulation N robots, Pi parts; i = 1,…,M types Physical system

Reduced continuous model Problem Statement ODE’s are functions of probabilities of assembly and disassembly: Optimize for fast assembly of target amounts of products Reduced continuous model Robots start assemblies and perform disassemblies according to optimized probabilities Physical system

2 types of final assemblies Example Implemented in the robot simulator Webots (www.cyberbotics.com) - Uses Open Dynamics Engine to simulate physics Predefined assembly plan: 2 types of final assemblies 4 types of basic parts

Magnets that bond to other parts Example Magnets that bond to other parts Khepera III + bar (www.k-team.com) Rotational servo Magnet Bonds to bar Magnets can be turned on or off Servo rotates bonded part to orientation for assembly Infra-red distance sensors for collision avoidance Emitter/receiver on each robot and basic part for local communication, computing relative bearing

Complete Continuous Model pe = prob. that a robot encounters a part or another robot ≈ [Correll and Martinoli, Coll. Beh. Workshop, ICRA 2007] A = arena area

Complete Continuous Model prob. of two robots successfully completing assembly process j (measured from simulations)

Complete Continuous Model Tunable: prob. of two robots starting assembly process j prob. per unit time of a robot performing disassembly process j

(Illustrated later for reduced model) Complete Continuous Model ODE Model (Illustrated later for reduced model)

Validation of Complete Continuous Model 15 robots, 15 basic parts measured Error bars show standard deviations Part populations averaged over 100 Webots simulations F2 F2 F1 F1 Time (sec)

Validation of Complete Continuous Model Continuous model is fairly accurate Discrepancies are due to: Relatively low populations; ODE most accurate for large ones Assembly disruption in simulation (not modeled) Final product populations Webots, average of 100 simulations F2 Complete cont. model (numerically integrated) F1 Time (sec)

Reduced Continuous Model Lower-dimensional model (abstract away robots): Vector of complexes: Conservation constraints:

Reduced Continuous Model The system has a unique, positive, globally asymptotically stable equilibrium. Proof: Reaction network is deficiency zero and weakly reversible, does not admit equilibria with some xi = 0  We can design K such that the system always converges to a target equilibrium, xd > 0

Design of Optimal Recall that K is a function of p, the vector of Select xd that satisfies conservation constraints Compute p that minimizes the system convergence time to xd subject to constraints:

Optimization Problems I. Linear Program Objective: Maximize the average inverse relaxation time τj τj = time for system mode to return to equilibrium after perturbation Estimated by linearizing the ODE model around xd [Heinrich and Schuster, The Regulation of Cellular Systems, 1996] For reaction : II. Monte Carlo Method Objective: Minimize time for system to reach

Mapping onto Physical System = simulation timestep (32 ms) = random number uniformly distributed over [0,1] Robot computes R at each Δt, disassembles the part if Robot computes R, executes assembly if

Optimization Improves Convergence Rate 15 robots, 15 basic parts Simulations averaged over 30 runs Simulation Model Target α = 0.5 (1– α) = 0.5 Randomly selected from from Linear Program Monte Carlo Time (sec)

Linearization is most effective for α ≈ 0.2 – 0.5 For all α, linear program only changes rates of disassembling F1, F2 Monte Carlo yield fastest convergence but take ~10 hrs to compute, vs. <1 s using the linear program (2 GHz laptop) 300 basic parts Time for reduced model to reach from: Random selection (Average of 100 values) Linear Program Monte Carlo Method

Linearization is most effective for α ≈ 0.2 – 0.5 15 robots, 15 basic parts Simulations averaged over 30 runs Simulation Model Target α = 0.9 (1– α) = 0.1 Randomly selected from from Linear Program Monte Carlo Time (sec)

Conclusions and Future Work Abstracted multi-robot assembly system to an accurate continuous model Optimization of stochastic control policies results in faster production of target quantities Linearization of model for optimization is most effective for a certain range of target product distributions Extensions: Optimize assembly plans Use inter-robot communication to improve the yield rate Ex. Recruitment of robots carrying needed parts