FAST: Frequency-Aware Static Timing Analysis

Slides:



Advertisements
Similar presentations
Feedback EDF Scheduling Exploiting Dynamic Voltage Scaling Yifan Zhu and Frank Mueller Department of Computer Science Center for Embedded Systems Research.
Advertisements

Simulation of Feedback Scheduling Dan Henriksson, Anton Cervin and Karl-Erik Årzén Department of Automatic Control.
Xianfeng Li Tulika Mitra Abhik Roychoudhury
1 EE5900 Advanced Embedded System For Smart Infrastructure Energy Efficient Scheduling.
1 “Scheduling with Dynamic Voltage/Speed Adjustment Using Slack Reclamation In Multi-processor Real-Time Systems” Dakai Zhu, Rami Melhem, and Bruce Childers.
International Symposium on Low Power Electronics and Design Energy-Efficient Non-Minimal Path On-chip Interconnection Network for Heterogeneous Systems.
Combining Statistical and Symbolic Simulation Mark Oskin Fred Chong and Matthew Farrens Dept. of Computer Science University of California at Davis.
Real- time Dynamic Voltage Scaling for Low- Power Embedded Operating Systems Written by P. Pillai and K.G. Shin Presented by Gaurav Saxena CSE 666 – Real.
EECC551 - Shaaban #1 Fall 2003 lec# Pipelining and Exploiting Instruction-Level Parallelism (ILP) Pipelining increases performance by overlapping.
1 A Self-Tuning Cache Architecture for Embedded Systems Chuanjun Zhang*, Frank Vahid**, and Roman Lysecky *Dept. of Electrical Engineering Dept. of Computer.
Harini Ramaprasad, Frank Mueller North Carolina State University Center for Embedded Systems Research Tightening the Bounds on Feasible Preemption Points.
Sim-alpha: A Validated, Execution-Driven Alpha Simulator Rajagopalan Desikan, Doug Burger, Stephen Keckler, Todd Austin.
Microarchitectural Approaches to Exceeding the Complexity Barrier © Eric Rotenberg 1 Microarchitectural Approaches to Exceeding the Complexity Barrier.
1 Lecture 17: Basic Pipelining Today’s topics:  5-stage pipeline  Hazards and instruction scheduling Mid-term exam stats:  Highest: 90, Mean: 58.
1 Center for Embedded Systems Research (CESR) Department of Computer Science North Carolina State University Frank Mueller Timing Analysis: In Search of.
NC STATE UNIVERSITY Anantaraman © 2004RTSS–25 Enforcing Safety of Real-Time Schedules on Contemporary Processors using a Virtual Simple Architecture (VISA)
Computer Architecture 2010 – Out-Of-Order Execution 1 Computer Architecture Out-Of-Order Execution Lihu Rappoport and Adi Yoaz.
Processor Pipelines and Their Properties for Static WCET Analysis Jakob Engblom and Bengt Jonsson Presented by Kiran Seth.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
VOLTAGE SCHEDULING HEURISTIC for REAL-TIME TASK GRAPHS D. Roychowdhury, I. Koren, C. M. Krishna University of Massachusetts, Amherst Y.-H. Lee Arizona.
Pipelines for Future Architectures in Time Critical Embedded Systems By: R.Wilhelm, D. Grund, J. Reineke, M. Schlickling, M. Pister, and C.Ferdinand EEL.
ParaScale : Exploiting Parametric Timing Analysis for Real-Time Schedulers and Dynamic Voltage Scaling Sibin Mohan 1 Frank Mueller 1,William Hawkins 2,
WCET Analysis for a Java Processor Martin Schoeberl TU Vienna, Austria Rasmus Pedersen CBS, Denmark.
Computer Science Department University of Pittsburgh 1 Evaluating a DVS Scheme for Real-Time Embedded Systems Ruibin Xu, Daniel Mossé and Rami Melhem.
1 Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction Rakesh Kumar, Keith I. Farkas, Norman P. Jouppi, Parthasarathy.
1 EE5900 Advanced Embedded System For Smart Infrastructure Energy Efficient Scheduling.
Critical Power Slope Understanding the Runtime Effects of Frequency Scaling Akihiko Miyoshi, Charles Lefurgy, Eric Van Hensbergen Ram Rajamony Raj Rajkumar.
Scheduling policies for real- time embedded systems.
1 Tuning Garbage Collection in an Embedded Java Environment G. Chen, R. Shetty, M. Kandemir, N. Vijaykrishnan, M. J. Irwin Microsystems Design Lab The.
1 Estimating the Worst-Case Energy Consumption of Embedded Software Ramkumar Jayaseelan Tulika Mitra Xianfeng Li School of Computing National University.
Zheng Wu. Background Motivation Analysis Framework Intra-Core Cache Analysis Cache Conflict Analysis Optimization Techniques WCRT Analysis Experiment.
NC STATE UNIVERSITY Center for Embedded Systems Research (CESR) Electrical & Computer Engineering North Carolina State University Ali El-Haj-Mahmoud and.
Hard Real-Time Scheduling for Low- Energy Using Stochastic Data and DVS Processors Flavius Gruian Department of Computer Science, Lund University Box 118.
NC STATE UNIVERSITY 1 Feedback EDF Scheduling w/ Async. DVS Switching on the IBM Embedded PowerPC 405 LP Frank Mueller North Carolina State University,
Safely Exploiting Multithreaded Processors to Tolerate Memory Latency
F A S T Frequency-Aware Static Timing Analysis
Static WCET Analysis vs. Measurement: What is the Right Way to Access Real-Time Task Timing? David Fleeman { Center.
NC STATE UNIVERSITY Center for Embedded Systems Research (CESR) Department of Electrical & Computer Eng’g North Carolina State University Ali El-Haj-Mahmoud,
Lecture 08: Memory Hierarchy Cache Performance Kai Bu
Critical Power Slope: Understanding the Runtime Effects of Frequency Scaling Akihiko Miyoshi †,Charles Lefurgy ‡, Eric Van Hensbergen ‡, Ram Rajamony ‡,
Static WCET Analysis vs. Measurement: What is the Right Way to Assess Real-Time Task Timing? Worst Case Execution Time Prediction by Static Program Analysis.
Yifan Zhu, Frank Mueller North Carolina State University Center for Efficient, Secure and Reliable Computing DVSleak: Combining Leakage Reduction and Voltage.
Harini Ramaprasad, Frank Mueller North Carolina State University Center for Embedded Systems Research Bounding Preemption Delay within Data Cache Reference.
Combining Software and Hardware Monitoring for Improved Power and Performance Tuning Eric Chi, A. Michael Salem, and R. Iris Bahar Brown University Division.
CprE 458/558: Real-Time Systems (G. Manimaran)1 Energy Aware Real Time Systems - Scheduling algorithms Acknowledgement: G. Sudha Anil Kumar Real Time Computing.
Static Timing Analysis
Equalizer: Dynamically Tuning GPU Resources for Efficient Execution Ankit Sethia* Scott Mahlke University of Michigan.
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
Timing Anomalies in Dynamically Scheduled Microprocessors Thomas Lundqvist, Per Stenstrom (RTSS ‘99) Presented by: Kaustubh S. Patil.
RT-OPEX: Flexible Scheduling for Cloud-RAN Processing
CHaRy Software Synthesis for Hard Real-Time Systems
Reducing the Number of Preemptions in Real-Time Systems Scheduling by CPU Frequency Scaling Abhilash Thekkilakattil, Anju S Pillai, Radu Dobrin, Sasikumar.
Design-Space Exploration
Wayne Wolf Dept. of EE Princeton University
EEE 6494 Embedded Systems Design
Decoupled Access-Execute Pioneering Compilation for Energy Efficiency
Flavius Gruian < >
Dr. Javier Navaridas Pipelining Dr. Javier Navaridas COMP25212 System Architecture.
Instruction Scheduling for Instruction-Level Parallelism
Improved schedulability on the ρVEX polymorphic VLIW processor
Lecture 08: Memory Hierarchy Cache Performance
Address-Value Delta (AVD) Prediction
Aravindh Anantaraman*, Kiran Seth†, Eric Rotenberg*, Frank Mueller‡
Hardik Shah, Kai Huang and Alois Knoll
Human Computer Interaction Lecture 14 HCI in Software Process
The University of Adelaide, School of Computer Science
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
ECE 721, Spring 2019 Prof. Eric Rotenberg.
Research Topics Embedded, Real-time, Sensor Systems Frank Mueller moss
Presentation transcript:

FAST: Frequency-Aware Static Timing Analysis Center for Embedded Systems Research North Carolina State University Advisors: Dr. Frank Mueller & Dr. Eric Rotenberg FAST: Frequency-Aware Static Timing Analysis Kiran Seth & Aravindh Anantaraman Motivation Solution Results Frequency Model Static timing analysis tool Frequency-Aware Static Timing Analysis (FAST) tool. Scheduling in Real-Time systems requires Worst-case Execution time (WCET) and Worst-case Execution Cycles (WCEC) Dynamic Voltage Scaling (DVS) saves energy Good fit for Real-Time Systems. Calculate WCEC accounting for effects of memory accesses. new parametric frequency model. Parametric Frequency Model WCEC = i + mN i: Ideal # of worst-case cycles (for non-memory operations). m: Total # of worst-case memory accesses. N: # of cycles for memory access. Depends on frequency and memory latency. Features of the frequency model can be used with any simple pipeline. can easily be incorporated into scheduling schemes. use of caches with the frequency model is optional. Static timing analysis Traditional static timing analysis tool Problem 6 C-lab benchmarks used (fft, adpcm, lms, cnt, mm and srt). DVS schemes assume that with frequency modulation WCET scales linearly. WCEC remains constant. Current DVS schemes ignore the effects of frequency scaling on WCET. Assumption valid for most operations. BUT not valid for memory operations  constant memory latency systems. DVS schemes overestimate WCET at lower frequencies. Cannot fully utilize available slack. Power savings potential largely wasted. Extent of overestimation of WCET depends on cache analysis. Not true ? Using the Parametric Frequency Model Consider the following sequence of instructions simulated through a simple pipe. A: add R2, R1, R3 B: load R4, [M1] C: add R2, R1, R4 D: add R2, R1, R5 The simple pipeline has 6 stages, a data and instruction cache for N = 10. Overestimation by FAST tool is less than 1 %. Frequency model improves DVS schemes  accurate WCET. Implemented frequency model for 3 DVS algorithms (by [Pillai and Shin]). Improved energy savings. The 3 FAST DVS schemes are compared with the original algorithms. 3 tasksets composed of C-lab benchmarks. utilizations = 0.5 and 0.9 WCET and WCEC for 6-stage pipeline with caches for C-lab benchmark fft. Example 1: Capturing effect of I-cache miss (WCEC = 9 + 1N ) Original DVS schemes use : FAST DVS schemes use : Assumed WCEC is always the WCEC at the highest frequency. Example 2: Capturing effect of D-cache miss (WCEC = 9 + 1N ) Static DVS scheme: most energy savings FAST equations: lower power in cycle-conserving look-ahead Example 3: Capturing effect of I-cache and D-cache miss (WCEC = 9 + 2N ) Assumed WCET is overestimated as compared to actual WCET. As N changes with frequency, the frequency model accurately measures WCEC. Benefits for FAST are observed in all cases.