1 of 14 1/14 Design Optimization of Time- and Cost-Constrained Fault-Tolerant Distributed Embedded Systems Viaceslav Izosimov, Paul Pop, Petru Eles, Zebo.

Slides:



Advertisements
Similar presentations
Simulation of Feedback Scheduling Dan Henriksson, Anton Cervin and Karl-Erik Årzén Department of Automatic Control.
Advertisements

Courseware Scheduling of Distributed Real-Time Systems Jan Madsen Informatics and Mathematical Modelling Technical University of Denmark Richard Petersens.
1 of 14 1 /23 Flexibility Driven Scheduling and Mapping for Distributed Real-Time Systems Paul Pop, Petru Eles, Zebo Peng Department of Computer and Information.
Carnegie Mellon R-BATCH: Task Partitioning for Fault-tolerant Multiprocessor Real-Time Systems Junsung Kim, Karthik Lakshmanan and Raj Rajkumar Electrical.
1 of 14 1 Analysis of Mixed Time-/Event-Triggered Distributed Embedded Systems Paul Pop, Traian Pop, Petru Eles, Zebo Peng Embedded Systems Laboratory.
Making Services Fault Tolerant
1 of 16 April 25, 2006 Minimizing System Modification in an Incremental Design Approach Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer.
Communication Scheduling for Time-Triggered SystemsSlide 1 Communication Scheduling for Time-Triggered Systems Paul Pop, Petru Eles and Zebo Peng Dept.
Bogdan Tanasa, Unmesh D. Bordoloi, Petru Eles, Zebo Peng Department of Computer and Information Science, Linkoping University, Sweden December 3, 2010.
1 of 30 June 14, 2000 Scheduling and Communication Synthesis for Distributed Real-Time Systems Paul Pop Department of Computer and Information Science.
1 of 10 April 25, 2001 Minimizing System Modification in an Incremental Design Approach Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer.
Process Scheduling for Performance Estimation and Synthesis of Hardware/Software Systems Slide 1 Process Scheduling for Performance Estimation and Synthesis.
1 of 14 1/15 Schedulability Analysis and Optimization for the Synthesis of Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded.
Reliability-Aware Frame Packing for the Static Segment of FlexRay Bogdan Tanasa, Unmesh Bordoloi, Petru Eles, Zebo Peng Linkoping University, Sweden 1.
1 of 14 1 Fault-Tolerant Embedded Systems: Scheduling and Optimization Viacheslav Izosimov, Petru Eles, Zebo Peng Embedded Systems Lab (ESLAB) Linköping.
Scheduling with Optimized Communication for Time-Triggered Embedded Systems Slide 1 Scheduling with Optimized Communication for Time-Triggered Embedded.
1 of 16 March 30, 2000 Bus Access Optimization for Distributed Embedded Systems Based on Schedulability Analysis Paul Pop, Petru Eles, Zebo Peng Department.
1 of 12 May 3, 2000 Performance Estimation for Embedded Systems with Data and Control Dependencies Paul Pop, Petru Eles, Zebo Peng Department of Computer.
1 Oct 2, 2003 Design Optimization of Mixed Time/Event-Triggered Distributed Embedded Systems Traian Pop, Petru Eles, Zebo Peng Embedded Systems Laboratory.
1 of 14 1 Analysis and Synthesis of Communication-Intensive Heterogeneous Real-Time Systems Paul Pop Computer and Information Science Dept. Linköpings.
1 of 14 1/15 Design Optimization of Multi-Cluster Embedded Systems for Real-Time Applications Paul Pop, Petru Eles, Zebo Peng, Viaceslav Izosimov Embedded.
Holistic Scheduling and Analysis of Mixed Time/Event-Triggered Distributed Embedded System Traian Pop, Petru Eles, Zebo Peng EE249 Discussion Paper Review.
1 of 14 1 Scheduling and Optimization of Fault- Tolerant Embedded Systems Viacheslav Izosimov Embedded Systems Lab (ESLAB) Linköping University, Sweden.
Thermal-Aware SoC Test Scheduling with Test Set Partitioning and Interleaving Zhiyuan He 1, Zebo Peng 1, Petru Eles 1 Paul Rosinger 2, Bashir M. Al-Hashimi.
1 of 16 June 21, 2000 Schedulability Analysis for Systems with Data and Control Dependencies Paul Pop, Petru Eles, Zebo Peng Department of Computer and.
Misconceptions About Real-time Computing : A Serious Problem for Next-generation Systems J. A. Stankovic, Misconceptions about Real-Time Computing: A Serious.
The Rare Glitch Project: Verifying Bus Protocols for Embedded Systems Edmund Clarke, Daniel Kroening Carnegie Mellon University.
1 of 14 1 / 18 An Approach to Incremental Design of Distributed Embedded Systems Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer and.
A Tool for Describing and Evaluating Hierarchical Real-Time Bus Scheduling Policies Author: Trevor Meyerowitz, Claudio Pinello, Alberto DAC2003, June 24,2003.
DATE Optimizations of an Application- Level Protocol for Enhanced Dependability in FlexRay Wenchao Li 1, Marco Di Natale 2, Wei Zheng 1, Paolo Giusto.
Tradeoff Analysis for Dependable Real-Time Embedded Systems during the Early Design Phases Junhe Gan.
1 Memory and Time-Efficient Schedulability Analysis of Task Sets with Stochastic Execution Times Sorin Manolache, Petru Eles, Zebo Peng Department of Computer.
An efficient active replication scheme that tolerate failures in distributed embedded real-time systems Alain Girault, Hamoudi Kalla and Yves Sorel Pop.
© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.
1 of 14 1/15 Synthesis-driven Derivation of Process Graphs from Functional Blocks for Time-Triggered Embedded Systems Master thesis Student: Ghennadii.
Dependable communication synthesis for distributed embedded systems Nagarajan Kandasamy, John P. Hayes, Brian T. Murray Presented by John David Eriksen.
Scheduling policies for real- time embedded systems.
Integrated Scheduling and Synthesis of Control Applications on Distributed Embedded Systems Soheil Samii 1, Anton Cervin 2, Petru Eles 1, Zebo Peng 1 1.
Energy/Reliability Trade-offs in Fault-Tolerant Event-Triggered Distributed Embedded Systems Junhe Gan, Flavius Gruian, Paul Pop, Jan Madsen.
In-Vehicle Communication SAN Group RTS Regular Meeting Presentation December 2008.
Immune Genetic Algorithms for Optimization of Task Priorities and FlexRay Frame Identifiers Soheil Samii 1, Yanfei Yin 1,2, Zebo Peng 1, Petru Eles 1,
Prepare by : Ihab shahtout.  Overview  To give an overview of fixed priority schedule  Scheduling and Fixed Priority Scheduling.
5 May CmpE 516 Fault Tolerant Scheduling in Multiprocessor Systems Betül Demiröz.
Dynamic Scheduling and Control-Quality Optimization of Self-Triggered Control Applications Soheil Samii, Petru Eles, Zebo Peng Department of Computer and.
1 Scheduling Processes with Release Times, Deadlines, Precedence and Exclusion Relations J. Xu and D. L. Parnas IEEE Transactions on Software Engineering,
A flexible simulator for control- dominated distributed real-time systems Johannes Petersson IDA/SaS/ESLAB Johannes Petersson IDA/SaS/ESLAB Master’s Thesis.
Mixed Criticality Systems: Beyond Transient Faults Abhilash Thekkilakattil, Alan Burns, Radu Dobrin and Sasikumar Punnekkat.
1 of 14 1 Integrating Embedded System Design Tools in the Generic Modeling Environment Peter Tureby, Fredrik Wester Final Project at the Embedded Systems.
Tolerating Communication and Processor Failures in Distributed Real-Time Systems Hamoudi Kalla, Alain Girault and Yves Sorel Grenoble, November 13, 2003.
1 of 14 1/34 Embedded Systems Design: Optimization Challenges Paul Pop Embedded Systems Lab (ESLAB) Linköping University, Sweden.
A Fault-Tolerant Scheduling Algorithm for Real-Time Periodic Tasks with Possible Software Faults Ching-Chih Han, Kang G. Shin, and Jian Wu.
1 of 14 1/15 Schedulability-Driven Frame Packing for Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded Systems Lab (ESLAB)
1 Developing Aerospace Applications with a Reliable Web Services Paradigm Pat. P. W. Chan and Michael R. Lyu Department of Computer Science and Engineering.
Task Mapping and Partition Allocation for Mixed-Criticality Real-Time Systems Domițian Tămaș-Selicean and Paul Pop Technical University of Denmark.
An Algorithm for Automatically Obtaining Distributed and Fault Tolerant Static Schedules Alain Girault - Hamoudi Kalla - Yves Sorel - Mihaela Sighireanu.
Fault-Tolerant Rate- Monotonic Scheduling Sunondo Ghosh, Rami Melhem, Daniel Mosse and Joydeep Sen Sarma.
1 Schedulability Analysis of Multiprocessor Real-Time Applications with Stochastic Task Execution Times Sorin Manolache, Petru Eles, Zebo Peng {sorma,
Undergraduate course on Real-time Systems Linköping TDDD07 Real-time Systems Lecture 5: Real-time Communication Simin Nadjm-Tehrani Real-time Systems Laboratory.
Optimization of Time-Partitions for Mixed-Criticality Real-Time Distributed Embedded Systems Domițian Tămaș-Selicean and Paul Pop Technical University.
Reliable energy management System reliability is affected by use of energy management The use of DVS increases the probability of faults, thus damaging.
Prabhat Kumar Saraswat Paul Pop Jan Madsen
Chapter – 5.2 Static Process Scheduling
Paul Pop, Petru Eles, Zebo Peng
Fault and Energy Aware Communication Mapping with Guaranteed Latency for Applications Implemented on NoC Sorin Manolache, Petru Eles, Zebo Peng {sorma,
Design Optimization of Multi-Cluster Embedded Systems for Real-Time Applications Paul Pop, Petru Eles, Zebo Peng, Viaceslav Izosimov Embedded Systems.
Buffer Space Optimisation with Communication Mapping and Traffic Shaping for NoCs Sorin Manolache, Petru Eles, Zebo Peng Linköping University, Sweden.
Schedulability-Driven Partitioning and Mapping for Multi-Cluster Real-Time Systems Paul Pop, Petru Eles, Zebo Peng, Viaceslav Izosimov Embedded Systems.
Timing Analysis of Mixed-Criticality Hard Real-Time Applications Implemented on Distributed Partitioned Architectures Sorin Ovidiu Marinescu, Domițian.
Sorin Manolache, Petru Eles, Zebo Peng {sorma, petel,
Anand Bhat*, Soheil Samii†, Raj Rajkumar* *Carnegie Mellon University
Presentation transcript:

1 of 14 1/14 Design Optimization of Time- and Cost-Constrained Fault-Tolerant Distributed Embedded Systems Viaceslav Izosimov, Paul Pop, Petru Eles, Zebo Peng Embedded Systems Lab (ESLAB) Linköping University, Sweden

2 of 14 2/14 Motivation  Hard real-time applications  Timing constraints  Cost constraints  Online preemptive  Flexible  Off-line non-preemptive  Predictable vs.  Faults  Predictable  Transient  Intermittent  Hardware solutions  MARS, TTA, X-by-Wire  Permanent faults  Costly for transient faults  Software solutions  Re-execution/rollback recovery  Checkpointing/rollback recovery  Replication, primary-backup… vs.  Software solutions  Re-execution/rollback recovery  Checkpointing/rollback recovery  Replication, primary-backup…

3 of 14 3/14 Outline  Motivation  System architecture and fault-model  Fault-tolerance techniques  Problem formulation  Motivational examples  Tabu-search optimization strategy  Experimental results  Contributions and Message

4 of 14 4/14 Processes: Static cyclic scheduling Fault-Tolerant Time-Triggered Systems... Transient faults Processes: Re-execution and replication S1S1 S3S3 S2S2 S4S4 S1S1 S3S3 S2S2 S4S4 TDMA Round Cycle of two rounds Slot Time Triggered Protocol (TTP)  Bus access scheme: time-division multiple-access (TDMA)  Schedule table located in each TTP controller: message descriptor list (MEDL) Messages: Static schedule table Messages: Fault-tolerant protocol

5 of 14 5/14 Fault-Tolerant Techniques P1P1 P1P1 P1P1 Re-execution N1N1 P1P1 P1P1 P1P1 Replication N1N1 N2N2 N3N3 P1P1 P1P1 N1N1 N2N2 P1P1 Re-executed replicas 2

6 of 14 6/14 Problem Formulation  Given  Fault model  Number of transient faults in the system period  System architecture  Application  WCETs, message sizes, periods, deadlines Application: set of process graphsArchitecture: time-triggered system Fault-model: transient faults  Determine  Schedulable and fault-tolerant design implementation  Fault-tolerance policy assignment  Mapping of processes and messages  Schedule tables for processes and messages

7 of 14 7/14 Static Scheduling [Kandasamy et al. 03] P1P1 N 1 : S 1 N 2 : S 11 N 3 : S 14 P2P2 P4P4 P5P5 P3P3 m1m1 m2m2 2 P2P2 P4P4 P3P3 P5P5 P1P1 m1m1 m2m2 N1N1 N2N2 N3N3 S 11 S 12 S 13 P1P1 P1P1 N2N2 S1S1 S2S2 S3S3 P2P2 P2P2 S4S4 P3P3 S5S5 S6S6 S7S7 S8S8 S 10 S9S9 P4P4 P3P3 P4P4 P3P3 P4P4 P4P4 N1N1 S 14 S 15 S 18 P5P5 P5P5 N3N3 Root schedules P1P1 N 1 : S 2 N 2 : S 12 N 3 : S 14 P2P2 P4P4 P5P5 P3P3 m1m1 m2m2 P1P1 Contingency schedules S1S1 S 11 S2S2 S 12 P2P2 Contingency schedules Transparent re-execution Recovery slack

8 of 14 8/14 Re-execution vs. Replication N1N1 N2N2 P1P1 P3P3 P2P2 m1m1 1 P1P1 P2P2 P3P3 N1N1 N2N N1N1 N2N2 TTP P1P1 P2P2 S1S1 S2S2 P3P3 Met A1A1 N1N1 N2N2 TTP P1P1 P2P2 P3P3 S1S1 S2S2 Missed P1P1 N1N1 N2N2 TTP P1P1 P2P2 P2P2 P3P3 P3P3 S1S1 S2S2 m1m1 m1m1 m2m2 m2m2 Deadline Met P1P1 P3P3 P2P2 m1m1 m2m2 A2A2 Replication is better P1P1 S1S1 N1N1 N2N2 TTP P1P1 S2S2 P2P2 P2P2 P3P3 P3P3 Deadline Missed m1m1 m1m1 Re-execution is better

9 of 14 9/14 P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 P4P4 m2m2 Missed Fault-Tolerant Policy Assignment P1P1 P2P2 P3P3 N1N1 N2N P4P N1N1 N2N2 P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 m2m2 P4P4 N1N1 N2N2 P1P1 P3P3 S1S1 S2S2 P4P4 P2P2 P1P1 m1m1 m1m1 m2m2 m2m2 P2P2 m3m3 m3m3 P3P3 P4P4 Missed P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 m2m2 P4P4 No fault-tolerance: application crashes N1N1 N2N2 P1P1 P3P3 S1S1 S2S2 P4P4 P2P2 P1P1 m2m2 m1m1 TTP Met Optimization of fault-tolerance policy assignment Deadline

10 of 14 10/14 Mapping and Fault-Tolerance P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 m4m4 P1P1 P2P2 P3P3 P4P4 N1N1 N2N2 40X X 1 N1N1 N2N2 P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 m2m2 P4P4 m4m4 Best mapping without considering fault-tolerance Deadline Missed P1P1 N1N1 N2N2 TTP P2P2 P3P3 S1S1 S2S2 P4P4 m4m4 m2m2 P1P1 N1N1 N2N2 P2P2 P3P3 S1S1 S2S2 m4m4 m2m2 P4P4 Deadline Met Simultaneous mapping and fault-tolerance

11 of 14 11/14 Optimization Strategy  Design optimization:  Fault-tolerance policy assignment  Mapping of processes and messages  Root schedules  Three tabu-search optimization algorithms: 1.Mapping and Fault-Tolerance Policy assignment (MRX)  Re-execution, replication or both 2.Mapping and only Re-Execution (MX) 3.Mapping and only Replication (MR) Tabu-search List scheduling

12 of 14 12/14 MRX Tabu-Search Example P1P1 P2P2 P3P3 N1N1 N2N P4P N1N1 N2N2 P1P1 P4P4 P2P2 P3P3 m1m1 m2m2 m3m3 Design transformations

13 of 14 13/ Experimental Results Mapping and replication (MR) 20 Mapping and re-execution (MX) Mapping and policy assignment (MRX) Number of processes Avgerage % deviation from MRX Schedulability improvement under resource constraints  Case study  Vehicle cruise controller  MRX: schedulable fault-tolerant application with 65% overhead

14 of 14 14/14 Contributions and Message  Contributions  Combined re-execution and replication  Optimization algorithms for fault-tolerance policy assignment  Efficient contingency schedule generation Optimization of fault-tolerance policy assignment needed for cost-effective fault tolerance