Stanley Bak, Fardin Abdi Taghi Abad, Zhenqi Huang, Marco Caccamo

Slides:

Advertisements

Similar presentations

Teaser - Introduction to Distributed Computing

Advertisements

PROTOCOL VERIFICATION & PROTOCOL VALIDATION. Protocol Verification Communication Protocols should be checked for correctness, robustness and performance,

Model Checker In-The-Loop Flavio Lerda, Edmund M. Clarke Computer Science Department Jim Kapinski, Bruce H. Krogh Electrical & Computer Engineering MURI.

Towards Autonomic Adaptive Scaling of General Purpose Virtual Worlds Deploying a large-scale OpenSim grid using OpenStack cloud infrastructure and Chef.

CSLI 5350G - Pervasive and Mobile Computing Week 3 - Paper Presentation “RPB-MD: Providing robust message dissemination for vehicular ad hoc networks”

Improving TCP Performance over Mobile Ad Hoc Networks by Exploiting Cross- Layer Information Awareness Xin Yu Department Of Computer Science New York University,

Song Han, Xiuming Zhu, Al Mok University of Texas at Austin

Fardin Abdi, Brett Robins, Marco Caccamo University of Illinois at Urbana-Champaign Urbana-Champaign, USA {abditag2, robbins3, 1UIUC.

Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.

More routing protocols Alec Woo June 18 th, 2002.

© nCode 2000 Title of Presentation goes here - go to Master Slide to edit - Slide 1 Reliable Communication for Highly Mobile Agents ECE 7995: Term Paper.

Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.

Architecture and Real Time Systems Lab University of Massachusetts, Amherst An Application Driven Reliability Measures and Evaluation Tool for Fault Tolerant.

Handout # 4: Scaling Controllers in SDN - HyperFlow

On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.

SMUCSE 8344 Constraint-Based Routing in MPLS. SMUCSE 8344 Constraint Based Routing (CBR) What is CBR –Each link a collection of attributes (performance,

COMPUTER NETWORKS LAB 3: HUBS AND SWITCHES

Presenter: Chi-Hung Lu 1. Problems Distributed applications are hard to validate Distribution of application state across many distinct execution environments.

What is it? A mobile robotics system controls a manned or partially manned vehicle-car, submarine, space vehicle | Website for Students.

Network Topologies.

Hashing it Out in Public Common Failure Modes of DHT-based Anonymity Schemes Andrew Tran, Nicholas Hopper, Yongdae Kim Presenter: Josh Colvin, Fall 2011.

On-Chip Control Flow Integrity Check for Real Time Embedded Systems Fardin Abdi Taghi Abad, Joel Van Der Woude, Yi Lu, Stanley Bak, Marco Caccamo, Lui.

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2007 (TPDS 2007)

Distributed Control of FACTS Devices Using a Transportation Model Bruce McMillin Computer Science Mariesa Crow Electrical and Computer Engineering University.

Robot Autonomous Perception Model For Internet-Based Intelligent Robotic System By Sriram Sunnam.

Simulation of the OLSRv2 Protocol First Report Presentation.

1 Distributed and Optimal Motion Planning for Multiple Mobile Robots Yi Guo and Lynne Parker Center for Engineering Science Advanced Research Computer.

Lecture 4 TTH 03:30AM-04:45PM Dr. Jianjun Hu CSCE569 Parallel Computing University of South Carolina Department of.

Efficient AOI-Cast for Peer-to-Peer Networked Virtual Environments.

University of Pennsylvania 7/15/98 Asymmetric Bandwidth Channel (ABC) Architecture Insup Lee University of Pennsylvania July 25, 1998.

A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,

Hwajung Lee. The State-transition model The set of global states = s 0 x s 1 x … x s m {s k is the set of local states of process k} S0  S1  S2  Each.

“Controlled Straight Mobility and Energy-Aware Routing in Robotic Wireless Sensor Networks ” Rafael Falcon, Hai Liu, Amiya Nayak and Ivan Stojmenovic

A N I N - MEMORY F RAMEWORK FOR E XTENDED M AP R EDUCE 2011 Third IEEE International Conference on Coud Computing Technology and Science.

Global Clock Synchronization in Sensor Networks Qun Li, Member, IEEE, and Daniela Rus, Member, IEEE IEEE Transactions on Computers 2006 Chien-Ku Lai.

COMP8330/7330/7336 Advanced Parallel and Distributed Computing Communication Costs in Parallel Machines Dr. Xiao Qin Auburn University

Cloud-based Parallel Implementation of SLAM for Mobile Robots Supun Kamburugamuve, Hengjing He, Geoffrey Fox, David Crandall School of Informatics & Computing.

Outline Introduction. Changes made to the Tycho design from last time (June 2005). Example Tycho setup. Tycho benchmark motivations and methodology. Some.

2010 IEEE Global Telecommunications Conference (GLOBECOM 2010)

Optimal Acceleration and Braking Sequences for Vehicles in the Presence of Moving Obstacles Jeff Johnson, Kris Hauser School of Informatics and Computing.

Authors: Jiang Xie, Ian F. Akyildiz

John Backes, Rockwell Collins Dan DaCosta, Rockwell Collins

Jack Dongarra University of Tennessee

Introduction to Wireless Sensor Networks

Datacenter Interconnection Network Design

The Echo Algorithm The echo algorithm can be used to collect and disperse information in a distributed system It was originally designed for learning network.

Vineet Mittal Should more be added here Committee Members:

Software Design and Architecture

Distribution and components

RSVP: A New Resource ReSerVation Protocol

Fardin Abdi, Renato Mancuso, Stanley Bak, Or Dantsker, Marco Caccamo

Surviving Holes and Barriers in Geographic Data Reporting for

Privacy and Fault-Tolerance in Distributed Optimization Nitin Vaidya University of Illinois at Urbana-Champaign.

Architecture of Parallel Computers CSC / ECE 506 Summer 2006 Scalable Programming Models Lecture 11 6/19/2006 Dr Steve Hunter.

Towards Next Generation Panel at SAINT 2002

Fault Tolerance Distributed Web-based Systems

ITEC452 Distributed Computing Lecture 5 Program Correctness

On-time Network On-chip

PRESENTATION COMPUTER NETWORKS

GATES: A Grid-Based Middleware for Processing Distributed Data Streams

Market-based Dynamic Task Allocation in Mobile Surveillance Systems

Automated Analysis and Code Generation for Domain-Specific Models

John Backes, Rockwell Collins Dan DaCosta, Rockwell Collins

Parallel Programming in C with MPI and OpenMP

Distributed Reinforcement Learning for Multi-Robot Decentralized Collective Construction Gyu-Young Hwang

Multiprocessors and Multi-computers

AN INEXPENSIVE ROBOTIC KIT FOR CHILDREN EDUCATION

Distributed Systems and Algorithms

Presentation transcript:

Using Run-Time Checking to Provide Safety and Progress for Distributed Cyber-Physical Systems Stanley Bak, Fardin Abdi Taghi Abad, Zhenqi Huang, Marco Caccamo Presentor: Renato Mancuso

Distributed Coordination 1 Interconnected systems that physically affect each other State of each node is a function of control inputs of other nodes based on system connection graph 2 Water Distribution system Electrical Grids Traffic Control system

Communication; An Essential Component Distributed systems rely on communication for: 1. Reaching the desired state Functionality 2. Maintain invariants: stability Stability/Safety …so what happens when communication is unreliable? Communication Faults Violation of Safety

Limits of Distributed Coordination Approach 1: Massive distributed sensor-fusion and unsafe state avoidance Not exhaustive Approach 2: Use middleware that provides guarantees of communication and latency Scalability Middleware Water Distribution system Electrical Grids Traffic Control system Image: “A Swarm of Nano Quadrotors”, UPENN, http://www.youtube.com/watch?v=YQIMGV5vtd4

Paper Goals Examine fundamental requirements for safety in distributed systems with unreliable communication Safety: global invariant (for example, collisions are avoided) Goal 1 Provide a mechanism for safe progress, if the communication works adequately well Progress: all distributed agents follow the same goal Water Distribution system Electrical Grids Traffic Control system Goal 2 Image: “A Swarm of Nano Quadrotors”, UPENN, http://www.youtube.com/watch?v=YQIMGV5vtd4

Safety Theorem Intuition: Goal 1: Safety Safety Theorem A coordinating distributed system is safe under unreliable communication if and only if both: Condition 1: The system is safe if no communication takes place Condition 2: For each message m that is received by any node, the system remains safe if no other messages are ever received after m Water Distribution system Electrical Grids Traffic Control system Intuition:

Runtime Checking …but progress? Goal 1: Safety Runtime Checking Note that: Condition 2 is difficult to check ahead of time, since it’s quantified for every message Proposed Solution To build a usable system with this result, we check this condition at runtime, and drop messages which violate it Water Distribution system Electrical Grids Traffic Control system …but progress?

Proposed Architecture Goal 1: Safety Proposed Architecture Perform a safety test on each command (check condition 2) Safe commands pass Command Filter Safeguard Unsafe commands are filtered

Safe Progress Compatible actions: Goal 2: Progress Safe Progress Compatible actions: actions which all agents can take that are globally safe. So: Build a chain of compatible actions for global progress The rate of progress depends on the quality of the comm. channel. Set-point 𝑖−1 Water Distribution system Electrical Grids Traffic Control system Set-point 𝑖 Set-point 𝑖+1 -ball Trajectory

Example System A flock of vehicles moves along a path The user can input “detour points”, to redirect the flock Collisions must be avoided Detour points should be reached, communication permitting

Non-Compatible Actions A new waypoint for the flock is entered Collision may occur due to a communication fault

Compatible Actions Iteratively Approach Goal

Compatible Actions Iteratively Approach Goal

Compatible Actions Iteratively Approach Goal

Compatible Actions Iteratively Approach Goal

Compatible Actions Iteratively Approach Goal

Compatible Actions Iteratively Approach Goal

Compatible Actions Iteratively Approach Goal

Compatible Actions Robustness to Communication Failures Tractor 1 did not receive the new path but safety is maintained. Paths sent to followers! Tractor 1 did not receive the path Desired final path for the flock Paths generated for all the followers New detour point entered by operator 1 2 3

Vehicle Flocking Application Flocking system with StarL StarL code can be run on a Roomba flock, or in a built-in simulator Communication effects can be simulated and evaluated Video: https://youtu.be/dIGU8OTfCh8

Evaluation We measured the effect of packet-loss and vehicle count on convergence time and number of messages sent With increasing loss ratio: convergence time grows quadratically With increasing vehicles: convergence time is constant bandwidth grows linearly

Future Extensions Replace runtime reachability checks with ahead-of-time computation Progress framework where commands do not originate from a centralized coordinator Implementation on a large swarm of robots

Thanks.