Tradeoff Analysis for Dependable Real-Time Embedded Systems during the Early Design Phases Junhe Gan.

Slides:



Advertisements
Similar presentations
Courseware Scheduling of Distributed Real-Time Systems Jan Madsen Informatics and Mathematical Modelling Technical University of Denmark Richard Petersens.
Advertisements

1 of 14 1 /23 Flexibility Driven Scheduling and Mapping for Distributed Real-Time Systems Paul Pop, Petru Eles, Zebo Peng Department of Computer and Information.
Introduction and Background  Power: A Critical Dimension for Embedded Systems  Dynamic power dominates; static /leakage power increases faster  Common.
*time Optimization Heiko, Diego, Thomas, Kevin, Andreas, Jens.
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
Meeting Service Level Objectives of Pig Programs Zhuoyao Zhang, Ludmila Cherkasova, Abhishek Verma, Boon Thau Loo University of Pennsylvania Hewlett-Packard.
1 of 14 1/14 Design Optimization of Time- and Cost-Constrained Fault-Tolerant Distributed Embedded Systems Viaceslav Izosimov, Paul Pop, Petru Eles, Zebo.
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
CS244-Introduction to Embedded Systems and Ubiquitous Computing Instructor: Eli Bozorgzadeh Computer Science Department UC Irvine Winter 2010.
Bogdan Tanasa, Unmesh D. Bordoloi, Petru Eles, Zebo Peng Department of Computer and Information Science, Linkoping University, Sweden December 3, 2010.
1 of 30 June 14, 2000 Scheduling and Communication Synthesis for Distributed Real-Time Systems Paul Pop Department of Computer and Information Science.
Performance and Energy Bounds for Multimedia Applications on Dual-processor Power-aware SoC Platforms Weng-Fai WONG 黄荣辉 Dept. of Computer Science National.
1 of 14 1/15 Schedulability Analysis and Optimization for the Synthesis of Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded.
1 of 14 1 Fault-Tolerant Embedded Systems: Scheduling and Optimization Viacheslav Izosimov, Petru Eles, Zebo Peng Embedded Systems Lab (ESLAB) Linköping.
Scheduling with Optimized Communication for Time-Triggered Embedded Systems Slide 1 Scheduling with Optimized Communication for Time-Triggered Embedded.
1 of 16 March 30, 2000 Bus Access Optimization for Distributed Embedded Systems Based on Schedulability Analysis Paul Pop, Petru Eles, Zebo Peng Department.
1 of 14 1 Analysis and Synthesis of Communication-Intensive Heterogeneous Real-Time Systems Paul Pop Computer and Information Science Dept. Linköpings.
1 of 14 1 Scheduling and Optimization of Fault- Tolerant Embedded Systems Viacheslav Izosimov Embedded Systems Lab (ESLAB) Linköping University, Sweden.
1 of 16 June 21, 2000 Schedulability Analysis for Systems with Data and Control Dependencies Paul Pop, Petru Eles, Zebo Peng Department of Computer and.
HW/SW Co-Synthesis of Dynamically Reconfigurable Embedded Systems HW/SW Partitioning and Scheduling Algorithms.
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
1 of 14 1 / 18 An Approach to Incremental Design of Distributed Embedded Systems Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer and.
1 Embedded Computer System Laboratory RTOS Modeling in Electronic System Level Design.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
1 Telematics/Networkengineering Confidential Transmission of Lossless Visual Data: Experimental Modelling and Optimization.
Baoxian Zhao Hakan Aydin Dakai Zhu Computer Science Department Computer Science Department George Mason University University of Texas at San Antonio DAC.
1 of 14 1/15 Synthesis-driven Derivation of Process Graphs from Functional Blocks for Time-Triggered Embedded Systems Master thesis Student: Ghennadii.
Scheduling policies for real- time embedded systems.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Energy/Reliability Trade-offs in Fault-Tolerant Event-Triggered Distributed Embedded Systems Junhe Gan, Flavius Gruian, Paul Pop, Jan Madsen.
Robin McDougall Scott Nokleby Mechatronic and Robotic Systems Laboratory 1.
A Node and Load Allocation Algorithm for Resilient CPSs under Energy-Exhaustion Attack Tam Chantem and Ryan M. Gerdes Electrical and Computer Engineering.
Analysis and Optimization of Mixed-Criticality Applications on Partitioned Distributed Architectures Domițian Tămaș-Selicean, Sorin Ovidiu Marinescu and.
CS244-Introduction to Embedded Systems and Ubiquitous Computing Instructor: Eli Bozorgzadeh Computer Science Department UC Irvine Winter 2010.
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
Hard Real-Time Scheduling for Low- Energy Using Stochastic Data and DVS Processors Flavius Gruian Department of Computer Science, Lund University Box 118.
5 May CmpE 516 Fault Tolerant Scheduling in Multiprocessor Systems Betül Demiröz.
Lecture 4 TTH 03:30AM-04:45PM Dr. Jianjun Hu CSCE569 Parallel Computing University of South Carolina Department of.
An Energy-efficient Task Scheduler for Multi-core Platforms with per-core DVFS Based on Task Characteristics Ching-Chi Lin Institute of Information Science,
CUHK Learning-Based Power Management for Multi-Core Processors YE Rong Nov 15, 2011.
DTM and Reliability High temperature greatly degrades reliability
A flexible simulator for control- dominated distributed real-time systems Johannes Petersson IDA/SaS/ESLAB Johannes Petersson IDA/SaS/ESLAB Master’s Thesis.
Energy-Aware Resource Adaptation in Tessellation OS 3. Space-time Partitioning and Two-level Scheduling David Chou, Gage Eads Par Lab, CS Division, UC.
Mixed Criticality Systems: Beyond Transient Faults Abhilash Thekkilakattil, Alan Burns, Radu Dobrin and Sasikumar Punnekkat.
1 of 14 1/34 Embedded Systems Design: Optimization Challenges Paul Pop Embedded Systems Lab (ESLAB) Linköping University, Sweden.
Technical Reading Report Virtual Power: Coordinated Power Management in Virtualized Enterprise Environment Paper by: Ripal Nathuji & Karsten Schwan from.
Synthesis of Communication Schedules for TTEthernet-based Mixed-Criticality Systems Domițian Tămaș-Selicean 1, Paul Pop 1 and Wilfried Steiner 2 1 Technical.
Winter-Spring 2001Codesign of Embedded Systems1 Co-Synthesis Algorithms: Distributed System Co- Synthesis Part of HW/SW Codesign of Embedded Systems Course.
Workload Clustering for Increasing Energy Savings on Embedded MPSoCs S. H. K. Narayanan, O. Ozturk, M. Kandemir, M. Karakoy.
1 Hardware-Software Co-Synthesis of Low Power Real-Time Distributed Embedded Systems with Dynamically Reconfigurable FPGAs Li Shang and Niraj K.Jha Proceedings.
Task Mapping and Partition Allocation for Mixed-Criticality Real-Time Systems Domițian Tămaș-Selicean and Paul Pop Technical University of Denmark.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
1 Comparative Study of two Genetic Algorithms Based Task Allocation Models in Distributed Computing System Oğuzhan TAŞ 2005.
Multi-cellular paradigm The molecular level can support self- replication (and self- repair). But we also need cells that can be designed to fit the specific.
Optimization of Time-Partitions for Mixed-Criticality Real-Time Distributed Embedded Systems Domițian Tămaș-Selicean and Paul Pop Technical University.
Reliable energy management System reliability is affected by use of energy management The use of DVS increases the probability of faults, thus damaging.
Reducing the Number of Preemptions in Real-Time Systems Scheduling by CPU Frequency Scaling Abhilash Thekkilakattil, Anju S Pillai, Radu Dobrin, Sasikumar.
Prabhat Kumar Saraswat Paul Pop Jan Madsen
Parallel Programming By J. H. Wang May 2, 2017.
Paul Pop, Petru Eles, Zebo Peng
Architecture Synthesis for Cost Constrained Fault Tolerant Biochips
Babak Sorkhpour, Prof. Roman Obermaisser, Ayman Murshed
Gabor Madl Ph.D. Candidate, UC Irvine Advisor: Nikil Dutt
Improved schedulability on the ρVEX polymorphic VLIW processor
Fault-Tolerant Architecture Design for Flow-Based Biochips
Department of Electrical Engineering Joint work with Jiong Luo
Parallel Programming in C with MPI and OpenMP
Anand Bhat*, Soheil Samii†, Raj Rajkumar* *Carnegie Mellon University
Presentation transcript:

Tradeoff Analysis for Dependable Real-Time Embedded Systems during the Early Design Phases Junhe Gan

2 Embedded Systems Introduction: embedded systems General Purpose Computer Systems

3 Introduction: design metrics  Unit cost: the monetary cost of manufacturing each copy of the system, excluding NRE cost  NRE cost (Non-Recurring Engineering cost): The one-time monetary cost of designing the system  Performance: the execution time or throughput of the system  Predictability: the key property of any real-time system that the timing requirements must be met.  Power: the amount of power consumed by the system  Robustness: the ability of a system to resist change without altering its implementation  Flexibility: the ability to change the functionality of the system without incurring heavy NRE cost  Dependability (reliability, safety, security, maintainability, and availability)  Challenge: simultaneously optimize competing design metrics

4 Introduction: early design stages  Early design decisions have a high impact  More effort should be spent during the early design phases  Challenge: uncertainties

5 Introduction: system-level design DSE

6 Introduction: system models Application Architecture WCETs  Scheduling  Tasks are scheduled by fixed-priority preemptive scheduling, while messages are transmitted using a fixed-priority nonpreemptive policy.  We use response time analysis to calculate the worst-case response time r i for each task, which is compared to its deadline D i.  We use the degree of schedulability r S to measure which design alternative is “more schedulable”.

7 Introduction: system-level design tasks Function-to-task allocation  Deciding how to decompose functional blocks into tasks Mapping  Deciding in which PE to place a task Scheduling  Deciding the execution order of the mapped tasks on the PE Architecture selection  Determining the number, the type of conponents of the system platform Voltage scaling  Assigning the operating mode to execute the task

8 Outline  Introduction  Design for Robustness and Flexibility of Real-Time Distributed Applications during the Early Design Phases  Reliability-Aware Dynamic Energy Management for Fault-Tolerant Distributed Embedded Systems  Criticality-Aware Functionality Allocation for Distributed Real-Time Embedded Systems  Summary and contributions

9 WCET modelling: uncertainties ii  The uncertainty in the worst- case execution time (WCET) c i is due to lack of information “Percentile method” 50 th percentile: 30 ms 90 th percentile: 60 ms  Details of the PE not fully known  Full implementation not yet available PE Knowing the 50 th and 90 th percentiles, we can determine the cumulative distribution function of the WCET c i Hard real-time applications Jakob Axelsson. A method for evaluating uncertainties in the early development phases of embedded real-time systems. In Embedded and Real-Time Computing Systems and Applications, 2005.

10 Functionality modeling: uncertainties  Changes in requirements  New version of a product  Evolution of a product line We capture the functionality as a set of tasks, baseline S 0 S 1 is a functionality update, replaces  1 with  5 S 2 adds  6 to increase performance S 3 adds a new application, with tasks  7 and  8 S 3 adds a new application, with tasks  7 and  8 S 4 is a combination of S 1 and S 2 S 4 is a combination of S 1 and S 2 We capture the changes in functionality as scenarios I. Bate and P. Emberson. Incorporating scenarios and heuristics to improve flexibility in real-time embedded systems. In Real-Time and Embedded Technology and Applications Symposium, 2006.

11 Problem formulation  Given  Architecture model  Baseline functionality S 0  Set of future scenarios S i  Determine  Mapping M 0 of the tasks in S 0 such that the robustness and flexibility of M 0 are maximized  Robustness the tasks in S 0 have a high chance to be schedulable  FlexibilityM 0 has a high chance to successfully accommodate S i  Notes  This problem is especially relevant for system integrators  Changing the mapping is costly, especially in areas such as safety-critical

12 Motivational example: robustness Robustness: the probability of all tasks being schedulable Without capturing uncertainty in WCETs: M’ is preferred. Capturing uncertainty in WCETs: M has much higher chances (93%) to be schedulable, compared to M’ (67%).

13 Motivational example: flexibility Baseline functionality (×) Pareto-optimal solutions Exhaustive search (+)Straightforward Mapping, SFM ignores uncertainty in WCETs ignores future scenarios Future scenarios  Flexibility: the probability of all future scenarios being schedulable

14 Multiobjective optimization  Cost function: multiple objectives  Two alternatives:  Merge all design metrics into a single cost function by using a weighted sum then use meta-heuristics such as Tabu Search  Perform a multi-objective optimization approach such as Genetic Algorithm, which determine a Pareto-front of solutions  Pareto-front (trade-off curve)  Solutions are not dominated by each other

15 Mapping for Robustness and Flexibility (MRF)  Determining an optimal mapping is NP-hard (Non-deterministic-Polynomial-time hard)  Genetic algorithm-based approach: MRF (Mapping for Robustness and Flexibility optimization)  Non-dominated Sorting Genetic Algorithm-II  For each candidate solution M k  We calculate the robustness  We calculate the flexibility based on the robustness of each M i  We have to know the mapping M i of each future scenarios S i on top of M k  We use a Greedy mapping approach to determine the M k  Greedy mapping M i of each future scenarios S i on top of M k  Only the tasks which are not in S 0 have to be mapped  Greedy: tasks are sorted on utilization; mapped on the lowest utilized PE

16 Experimental setup  Baseline scenario  We have varied the size of the system from 22 tasks (S 0 ) and 3 PEs to 84 tasks (S 0 ) and 10 PEs  4 real-life case studies from Embedded System Synthesis Benchmark Suite  8 eight synthetic benchmarks generated using Task Graphs For Free  Future scenarios  For each benchmark, we have four future scenarios  Implementation  Matlab 2010 and run on an Intel Core i7 CPU 920 (2.67 GHz)  NSGA-II parameters are tuned such that no improvements were seen after a very long runtime

17 Experimental results: real-life case studies Conclusion: It is very important to model the uncertainties and to take them into account during design space exploration.

18 Outline  Introduction  Design for Robustness and Flexibility of Real-Time Distributed Applications during the Early Design Phases  Reliability-Aware Dynamic Energy Management for Fault- Tolerant Distributed Embedded Systems  Criticality-Aware Functionality Allocation for Distributed Real- Time Embedded Systems  Summary and contributions

19 Architecture model  A set of heterogeneous processing elements interconnected by a communication channel  Each processor element have a set of discrete operating modes  For each operating mode we know

20 Application model  A set of periodic tasks  Transient faults are tolerated using task replication  Number of replicas k i (critical task: k i > 0, non-critical task: k i = 0)  Reliability goal R g  The system should have a reliability greater than Rg, otherwise it is not fault-tolerant (more replicas would be needed).

21 Energy/reliability trade-off model  The fault rate increases exponentially when normalized voltage V and the normalized frequency F decreases The equation is adapted from: D. Zhu and H. Aydin, “Reliability-Aware Energy Management for Periodic Real-Time Tasks”, IEEE Transactions on Computers, 58(10), pp , 2009.

22 Problem formulation  Given:  Application and architecture models  Reliability goal and corresponding number of replicas for each task  Determine offline:  the mapping of each task to processing element  the operating mode for executing each task  Such that:  all tasks meet their timing requirements  the application reliability meets the given reliability goal  the energy consumption of the system is minimized

23 Motivational example  Application and architecture  Initial solution: no voltage and frequency scaling  Runs all the tasks in the maximum speed operating mode and maps the tasks on the low power PEs.  The given reliability goal: which means that we accept at most a 10 times decrease in reliability.

24 Motivational example: offline Energy minimization without concern for reliability Energy/reliability trade-off optimization

25 Optimization strategy: offline synthesis  Optimization Problem  NP-hard (Non-deterministic-Polynomial-time hard)  Minimize the cost function:  Use a Tabu search-based algorithm to explore the design space  Iteratively explores neighborhood solutions by performing  mapping moves  operating mode moves Energy ReliabilitySchedulability

26 Experimental Results: offline synthesis Conclusion: we are able to reduce the negative impact of energy minimization on reliability with very little decrease in energy savings

27 Outline  Introduction  Design for Robustness and Flexibility of Real-Time Distributed Applications during the Early Design Phases  Reliability-Aware Dynamic Energy Management for Fault- Tolerant Distributed Embedded Systems  Criticality-Aware Functionality Allocation for Distributed Real- Time Embedded Systems  Summary and contributions

28 Function-to-task allocation  Design-level: applications are modeled as functional blocks.  Implementation-level: applications are modeled as a set of interacting tasks.  Safety-Integrity Levels (SILs): are assigned to functional blocks/tasks to capture the required level of risk reduction, from SIL 4 (most critical) to SIL 0 (non critical).  Development and certification costs increase dramatically with SILs.  Trade off between cost and schedulability.  SIL decomposition based on the coresponding certification standards. ISO 26262

29 Problem formulation  Given:  Application and architecture models  The library of function-to-task decomposition  The library of architecture implementations for the PEs  Determine an implementation:  the function-to-task decomposition  the mapping of tasks to PEs  the types of PEs in the architecture  Such that:  total costs are minimized  the schedulability of the applications maximized  the requirements of safety and integrity are satisfied

30 Motivational example: decomposition library

31 Motivational example: hardware component library The unit cost increases with the increased reliability: use lowest cost PEs which provide required reliability

32 Motivational example: SFS and optimized results  Straightforward Solution (SFS):  Do not decompose the functional blocks into tasks with lower SILs  Cluster all tasks based on SILs for the mapping  Criticality-Aware Mapping Optimization (CMO):  Optimize the mapping  Criticality-Aware Functional Decomposition and Mapping Optimization (CDMO):  Optimize the functional decomposition  Optimize the mapping

33 Optimization strategy  Optimization problem is NP-hard (Non-deterministic-Polynomial-time hard)  Genetic algorithm-based approach, called CDMO (Criticality-aware functional Decomposition and task Mapping Optimization)  Non-dominated Sorting Genetic Algorithm-II  For each candidate implementation S i  We calculate the degree of schedulability  We calculate the total cost that includes the unit cost of the PEs and the development and certification costs of software tasks.

34 Experimental results: real-life case study Conclusion: By taking into account SIL decomposition, we are able to find schedulable solutions at a reduced cost.

35 Outline  Introduction  Design for Robustness and Flexibility of Real-Time Distributed Applications during the Early Design Phases  Reliability-Aware Dynamic Energy Management for Fault- Tolerant Distributed Embedded Systems  Criticality-Aware Functionality Allocation for Distributed Real- Time Embedded Systems  Summary and contributions

36 Summary  I addressed the architecture selection and the mapping of hard real-time applications on distributed heterogeneous architectures.  I modeled the uncertainties in WCETs, functionality requirements, and hardware component costs, during the early design phases.  I addressed the mapping, voltage and frequency scaling for fault- tolerant hard real-time applications mapped on distributed embedded systems.  I proposed both offline and online approaches that can take reliability into account when performing voltage and frequency scaling.  I addressed the function-to-task allocation and task mapping of mixed-criticality applications on distributed architectures.  I took into account safety and integrity requirements while performing functional decomposition and architecture selection.

37 Contributions  I have addressed competing design metrics, and to support the designer making early design decisions.  I have proposed methods to perform automatic design space exploration for design of embedded systems.  The implemented design space exploration tools are able to determine good quality solutions in a reasonable time.

Thank You