Analysis and Optimization of Mixed-Criticality Applications on Partitioned Distributed Architectures Domițian Tămaș-Selicean, Sorin Ovidiu Marinescu and.

Slides:



Advertisements
Similar presentations
© 2004 Wayne Wolf Topics Task-level partitioning. Hardware/software partitioning.  Bus-based systems.
Advertisements

Domițian Tămaș-Selicean
System-level Trade-off of Networks-on-Chip Architecture Choices Network-on-Chip System-on-Chip Group, CSE-IMM, DTU.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
A Centralized Scheduling Algorithm based on Multi-path Routing in WiMax Mesh Network Yang Cao, Zhimin Liu and Yi Yang International Conference on Wireless.
Courseware Scheduling of Distributed Real-Time Systems Jan Madsen Informatics and Mathematical Modelling Technical University of Denmark Richard Petersens.
REAL-TIME COMMUNICATION ANALYSIS FOR NOCS WITH WORMHOLE SWITCHING Presented by Sina Gholamian, 1 09/11/2011.
1 of 14 1 /23 Flexibility Driven Scheduling and Mapping for Distributed Real-Time Systems Paul Pop, Petru Eles, Zebo Peng Department of Computer and Information.
7. Physical Memory 7.1 Preparing a Program for Execution
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 15 –QoS Admission, QoS Negotiation, and Establishment of AV Connections Klara Nahrstedt.
Droplet-Aware Module-Based Synthesis for Fault-Tolerant Digital Microfluidic Biochips Elena Maftei, Paul Pop, and Jan Madsen Technical University of Denmark.
Distributed Algorithms for Secure Multipath Routing
Towards Feasibility Region Calculus: An End-to-end Schedulability Analysis of Real- Time Multistage Execution William Hawkins and Tarek Abdelzaher Presented.
1 of 30 June 14, 2000 Scheduling and Communication Synthesis for Distributed Real-Time Systems Paul Pop Department of Computer and Information Science.
Service Disciplines for Guaranteed Performance Service Hui Zhang, “Service Disciplines for Guaranteed Performance Service in Packet-Switching Networks,”
Multiprocessing Memory Management
Process Scheduling for Performance Estimation and Synthesis of Hardware/Software Systems Slide 1 Process Scheduling for Performance Estimation and Synthesis.
1 of 14 1/15 Schedulability Analysis and Optimization for the Synthesis of Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded.
Scheduling with Optimized Communication for Time-Triggered Embedded Systems Slide 1 Scheduling with Optimized Communication for Time-Triggered Embedded.
1 of 16 March 30, 2000 Bus Access Optimization for Distributed Embedded Systems Based on Schedulability Analysis Paul Pop, Petru Eles, Zebo Peng Department.
1 Oct 2, 2003 Design Optimization of Mixed Time/Event-Triggered Distributed Embedded Systems Traian Pop, Petru Eles, Zebo Peng Embedded Systems Laboratory.
1 of 14 1 Analysis and Synthesis of Communication-Intensive Heterogeneous Real-Time Systems Paul Pop Computer and Information Science Dept. Linköpings.
1 of 14 1/15 Design Optimization of Multi-Cluster Embedded Systems for Real-Time Applications Paul Pop, Petru Eles, Zebo Peng, Viaceslav Izosimov Embedded.
Thermal-Aware SoC Test Scheduling with Test Set Partitioning and Interleaving Zhiyuan He 1, Zebo Peng 1, Petru Eles 1 Paul Rosinger 2, Bashir M. Al-Hashimi.
HW/SW Co-Synthesis of Dynamically Reconfigurable Embedded Systems HW/SW Partitioning and Scheduling Algorithms.
1 of 14 1 / 18 An Approach to Incremental Design of Distributed Embedded Systems Paul Pop, Petru Eles, Traian Pop, Zebo Peng Department of Computer and.
A Tool for Describing and Evaluating Hierarchical Real-Time Bus Scheduling Policies Author: Trevor Meyerowitz, Claudio Pinello, Alberto DAC2003, June 24,2003.
Memory Management ◦ Operating Systems ◦ CS550. Paging and Segmentation  Non-contiguous memory allocation  Fragmentation is a serious problem with contiguous.
TitleEfficient Timing Channel Protection for On-Chip Networks Yao Wang and G. Edward Suh Cornell University.
Tradeoff Analysis for Dependable Real-Time Embedded Systems during the Early Design Phases Junhe Gan.
A Generalized Processor Sharing Approach to Flow Control in Integrated Services Networks: The Single-Node Case Abhay K. Parekh, Member, IEEE, and Robert.
Course Outline DayContents Day 1 Introduction Motivation, definitions, properties of embedded systems, outline of the current course How to specify embedded.
Low Contention Mapping of RT Tasks onto a TilePro 64 Core Processor 1 Background Introduction = why 2 Goal 3 What 4 How 5 Experimental Result 6 Advantage.
Analysis Methods for Mixed-Criticality Applications on TTEthernet-based Distributed Architectures Sorin Ovidiu Marinescu Technical University of Denmark.
ROBUST RESOURCE ALLOCATION OF DAGS IN A HETEROGENEOUS MULTI-CORE SYSTEM Luis Diego Briceño, Jay Smith, H. J. Siegel, Anthony A. Maciejewski, Paul Maxwell,
1 of 14 1/15 Synthesis-driven Derivation of Process Graphs from Functional Blocks for Time-Triggered Embedded Systems Master thesis Student: Ghennadii.
Energy/Reliability Trade-offs in Fault-Tolerant Event-Triggered Distributed Embedded Systems Junhe Gan, Flavius Gruian, Paul Pop, Jan Madsen.
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
1 SYNTHESIS of PIPELINED SYSTEMS for the CONTEMPORANEOUS EXECUTION of PERIODIC and APERIODIC TASKS with HARD REAL-TIME CONSTRAINTS Paolo Palazzari Luca.
Virtual Memory The memory space of a process is normally divided into blocks that are either pages or segments. Virtual memory management takes.
Dzmitry Kliazovich University of Luxembourg, Luxembourg
CSCI1600: Embedded and Real Time Software Lecture 23: Real Time Scheduling I Steven Reiss, Fall 2015.
Message routing in multi-segment FTT networks: the isochronous approach Paulo Pedreiras, Luís Almeida Workshop on Parallel and.
Wajid Minhass, Paul Pop, Jan Madsen Technical University of Denmark
Synthesis of Communication Schedules for TTEthernet-based Mixed-Criticality Systems Domițian Tămaș-Selicean 1, Paul Pop 1 and Wilfried Steiner 2 1 Technical.
1 of 14 1/15 Schedulability-Driven Frame Packing for Multi-Cluster Distributed Embedded Systems Paul Pop, Petru Eles, Zebo Peng Embedded Systems Lab (ESLAB)
Timing Analysis of Rate Constrained Traffic for the TTEthernet Communication Protocol Domițian Tămaș-Selicean 1, Paul Pop 1 and Wilfried Steiner 2 1 Technical.
PERFORMANCE EVALUATION OF LARGE RECONFIGURABLE INTERCONNECTS FOR MULTIPROCESSOR SYSTEMS Wim Heirman, Iñigo Artundo, Joni Dambre, Christof Debaes, Pham.
1 Hardware-Software Co-Synthesis of Low Power Real-Time Distributed Embedded Systems with Dynamically Reconfigurable FPGAs Li Shang and Niraj K.Jha Proceedings.
SERENA: SchEduling RoutEr Nodes Activity in wireless ad hoc and sensor networks Pascale Minet and Saoucene Mahfoudh INRIA, Rocquencourt Le Chesnay.
Task Mapping and Partition Allocation for Mixed-Criticality Real-Time Systems Domițian Tămaș-Selicean and Paul Pop Technical University of Denmark.
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
MEMORY MANAGEMENT. memory management  In a multiprogramming system, in order to share the processor, a number of processes must be kept in memory. 
Optimization of Time-Partitions for Mixed-Criticality Real-Time Distributed Embedded Systems Domițian Tămaș-Selicean and Paul Pop Technical University.
A Strategy to Compute the InfiniBand Arbitration Tables
Module 11: File Structure
Providing Security for Embedded Devices Through Virtualization
Paul Pop, Petru Eles, Zebo Peng
Architecture Synthesis for Cost Constrained Fault Tolerant Biochips
Fault and Energy Aware Communication Mapping with Guaranteed Latency for Applications Implemented on NoC Sorin Manolache, Petru Eles, Zebo Peng {sorma,
Module 5: CPU Scheduling
Schedulability-Driven Partitioning and Mapping for Multi-Cluster Real-Time Systems Paul Pop, Petru Eles, Zebo Peng, Viaceslav Izosimov Embedded Systems.
Timing Analysis of Mixed-Criticality Hard Real-Time Applications Implemented on Distributed Partitioned Architectures Sorin Ovidiu Marinescu, Domițian.
Modeling and Simulation of TTEthernet
Networked Real-Time Systems: Routing and Scheduling
Linköping University, IDA, ESLAB
Operating System , Fall 2000 EA101 W 9:00-10:00 F 9:00-11:00
Module 5: CPU Scheduling
Guaranteeing Message Latencies on Controller Area Network (CAN)
Module 5: CPU Scheduling
Presentation transcript:

Analysis and Optimization of Mixed-Criticality Applications on Partitioned Distributed Architectures Domițian Tămaș-Selicean, Sorin Ovidiu Marinescu and Paul Pop Technical University of Denmark

2 Outline  Motivation  Separation of mixed-criticality applications  At processing element level  At communication level  Problem formulation and example  Optimization strategy  Experimental results  Conclusions

3 Motivation  Safety is the property of a system that will not endanger human life or the environment  A safety-related system needs to be certified  A Safety Integrity Level (SIL) is assigned to each safety related function, depending on the required level of risk reduction  There are 4 SILs:  SIL4 (most critical)  SIL1 (least critical)  SIL0 (non-critical) – not covered by standards  SILs dictate the development process and certification procedures

4 Federated Architecture Motivation  Real time applications implemented using distributed systems PE Application A 1 Application A 2 Application A 3  Mixed-criticality applications share the same architecture SIL3 SIL4 SIL1 SIL2 SIL1 Solution: partitioned architecture Integrated Architecture

5 Separation at PE-level  Partition = virtual dedicated machine  Partitioned architecture  Spatial partitioning  protects one application’s memory and access to resources from another application  Temporal partitioning  partitions the CPU time among applications

6 Separation at PE-level  Temporal partitioning  Static partition table  Repeated with a period MF  Partition switch overhead  Each partition can have its own scheduling policy  A partition has a certain SIL Partition slice Major Frame PE 1 PE 2 PE 3 PE 1 PE 2 PE 3 Problem: optimize task mapping and allocation of partitions

7 Separation at Network-level ES 1 ES 2 NS 1 NS 2 ES 3 ES 4  Full-Duplex Ethernet-based data network for safety-critical applications  Compliant with ARINC 664p7 “Aircraft Data Network” End System Network Switch

8 Separation at Network-level NS 1 NS 2 vl 2 vl 1 ES 1 τ1τ1 ES 2 τ4τ4 ES 3 τ2τ2 τ5τ5 ES 4 τ3τ3  Highly critical application A 1 : τ 1, τ 2 and τ 3  τ 1 sends message m 1 to τ 2 and τ 3  Non-critical application A 2 : τ 4 and τ 5  τ 4 sends message m 2 to τ 5 virtual link

9 Separation at Network-level NS 1 NS 2 dp 1 vl 1 dp 2 l1l1 l2l2 l3l3 l4l4 ES 1 τ1τ1 ES 2 τ4τ4 ES 3 τ2τ2 τ5τ5 ES 4 τ3τ3 dataflow path  Highly critical application A 1 : τ 1, τ 2 and τ 3  τ 1 sends message m 1 to τ 2 and τ 3  Non-critical application A 2 : τ 4 and τ 5  τ 4 sends message m 2 to τ 5 dataflow link

10 TTEthernet  Traffic classes  Time Triggered (TT)  based on static schedule tables  Rate Constrained (RC)  deterministic unsynchronized communication  ARINC 664p7 traffic  Best Effort (BE)  no timing guarantees provided

11 Application Model  SCS apps transmit TT messages  FPS apps transmit RC messages

12 Problem formulation  Given  A set of applications  The criticality level (or SIL) of each task  A set of N processing elements (PEs) and topology of the network  The set of TT and RC frames  The set of virtual links  The size of the Major Frame and of the Application Cycle  Determine  The mapping of tasks to PEs  The sequence and length of partition slices on each processor  The assignment of tasks to partitions  The schedule for all the tasks and TT frames in the system  Such that  All applications meet their deadline  The response times of the FPS tasks and RC frames is minimized

13 Motivational Example 1  Mapping and partitioning optimization

14 Motivational Example 1

15 Motivational Example 1

16 Motivational Example 2 ES 1 ES 2 NS 1 ES 3 vl 3 vl 1 vl 2 period (us)deadline (us)C i (us) M f1 ∈ F RC vl 1 f2 ∈ F TT vl 2 f3 ∈ F TT vl 3  Optimization of TT message schedules

17 Motivational Example 2 ES 1 ES 2 NS 1 ES 3 vl 3 vl 1 vl 2 period (us) deadline (us) C i (us) M f1 ∈ F RC vl 1 f2 ∈ F TT vl 2 f3 ∈ F TT vl 3  Initial TT schedule

18 Motivational Example 2 ES 1 ES 2 NS 1 ES 3 vl 3 vl 1 vl 2 period (us) deadline (us) C i (us) M f1 ∈ F RC vl 1 f2 ∈ F TT vl 2 f3 ∈ F TT vl 3  Optimized TT schedule

19 Optimization Strategy  Tabu Search meta-heuristic  Task mapping and partition slice optimization (TO)  Considering TT frame schedules fixed  TT frame schedules optimization (TM)  Considering the task mapping and partition slices fixed  Tabu Search  Minimizes the cost function  Explores the solution space using design transformations

20 Optimization Strategy  Degree of schedulability  Captures the difference between the worst-case response time and the deadline  Cost Function

21 Optimization Strategy: Design Transformations  Partition slice moves  resize partition slice  swap two partition slices  join two partition slices  split partition slice into two  Task moves  re-assign task to another partition

22 Optimization Strategy: Design Transformations

23 Optimization Strategy: Design Transformations

24 Optimization Strategy: Design Transformations

25 Optimization Strategy: Design Transformations

26 Optimization Strategy: Design Transformations

27 Optimization Strategy: Design Transformation  Task re-assignment move  To another partition of the same application  To a partition of another application  To a newly created partition  Empty partitions are deleted

28 Optimization Strategy: Design Transformations  TT frame moves  advance frame transmission time  advance frame predecessors transmission time  postpone frame transmission time  postpone frame successors transmission time  RC frame moves  reserve space for RC frame  resize reserved space for RC frame  remove reserved space for RC frame

29 Frame Representation for Moves ES 1 ES 2 NS 1 NS 2 ES 3 ES 4 vl 1 f 1,1 [ES 1, NS 1 ] f 1,1 [NS 1, NS 2 ] f 1,1 [NS 1, NS 2 ] f 1,1 [NS 1, NS 2 ]

30 Design transformations: Postpone move

31 Design transformations: Advance move

32 Design transformations: Reserve space for RC

33 Design transformations: Resize RC reserved space

34 RC Frame End-to-End Analysis  On a dataflow link, a RC frame can be delayed by:  scheduled TT frames  queued RC frames  technical latency  policy specific:  timely block  pre-emption

35 RC Frame End-to-End Analysis ES 1 NS 2 NS 1 ES 4 vl 3 vl 2 vl 1 NS 3 NS 2 → NS 1 f 3,j f 4,1 NS 3 → NS 1 NS 1 → ES 4 f 2,1 ES 1 → NS 1 f 1,i f 2,1 f 4,1 f 1,i f 3,j C [NS 1, ES 4 ] f1f1 Q TT [NS 1, ES 4 ] Q RC [NS 1, ES 4 ] Q TL NS 1 R f1f1 vl 4

36 RC Frame End-to-End Analysis  Approaches for analysis of ARINC 644p7 network traffic:  Network Calculus, (Boyer, 2008)  Finite State Machine, (Saha, 2007)  Timed Automata, (Adnan, 2010)  Trajectory Approach, (Bauer, 2009)  We use the method proposed in (Steiner, 2011)  it takes into account also the TT traffic  it is pessimistic:  does not ignore frames that already delayed a RC frame on a previous link  assumes uniformly distributed intervals of equal length reserved for RC traffic

37 Experimental Results: TO  Benchmarks  5 synthetic  2 real life test cases from E3S  TO compared to:  Straightforward Solution for Tasks (SST)  Simple partitioning scheme, each application A i is allocated a total time proportional to the utilization of tasks of A i on the processor they are mapped to

38 Experimental Results: TO

39 Experimental Results: TO

40 Experimental Results: TM  Benchmarks  7 synthetic  1 real life test case based on the SAE Automotive benchmark  TM compared to:  Straightforward Solution for Messages (SSM)  Builds TT schedules with the goal to optimize the end-to-end response time of the TT frames without considering the RC traffic

41 Experimental Results: TM

42 Experimental Results: TM

43 Experimental Results: TM

44 Conclusions  Applications of different criticality levels can be integrated onto the same architecture only if there is enough separation:  Separation at PE-level achieved with IMA.  Separation at network-level using TTEthernet.  We proposed a Tabu Search based optimization of task mapping and allocation to partitions, and of time partitions.  Only by optimizing the implementation of the applications, taking into account the particularities of IMA and TTEthernet, are we able to support the designer in obtaining schedulable implementations.

45