Andrea Acquaviva, Luca Benini, Bruno Riccò


Software-Controlled Processor Speed Setting for Low-Power Streaming Multimedia. Andrea Acquaviva, Luca Benini, Bruno Riccò. D.E.I.S., Università di Bologna.

Motivation and Basic Idea: Energy consumption in wearable devices affects battery size, weight, and lifetime, as well as system cost and reliability. In microprocessor-based architectures the CPU is the largest contributor to energy consumption, and such architectures allow software-driven energy optimization. Application-driven CPU speed setting improves energy efficiency through just-in-time computation, even at a fixed supply voltage.

Outline: Background; Contribution of the work; Speed-Setting & Energy Optimization; Clock Frequency & Performance; Experimental Results; Conclusions.

Background: System-level power optimization. Application side (workload-adaptive algorithms): workload information, fast adaptation. Operating-system level (task scheduling): system information, slow adaptation. Both problems have been investigated in various works included in the bibliography of the paper.

Background (cont’d): Traditional power optimization techniques. Variable-voltage based: workload-dependent voltage scheduling; external hardware; discrete range of frequency and Vdd. Shutdown based: binary version, with worse adaptation; time and energy spent during transitions. In contrast with previous work, we state that energy can be saved even with fixed voltage (memory latency, I/O synchronization).

Contribution of the work: Effectiveness of clock speed setting in multimedia streaming algorithms. Automatic run-time processor frequency setting for energy minimization of MP3 decoding. Streaming-multimedia workload characterization for speed-setting policies.

Variable Frequency: Energy as a function of frequency. Energy consumption: [equation missing]. T is given by: [equation missing]. Hence the energy equation can be written as: [equation missing].

Variable Frequency (cont’d): Real-time constraint: [equation missing]. Nuseful and TMAX are fixed, while Nidle = Nidle(f). f > fmax. The relation between f and the frame rate is not linear.
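As a hedged sketch only, the relation between energy and frequency can be written down under the standard first-order CMOS dynamic-power model. This model is an assumption and not necessarily the formulation used on the slides; the symbols C_eff (effective switched capacitance) and V_dd (supply voltage) are introduced here, while Nuseful, Nidle(f) and TMAX are the quantities named on the slide.

```latex
\begin{align}
E &= P \cdot T, \qquad
P \approx C_{\mathrm{eff}} V_{dd}^{2} f, \qquad
T = \frac{N_{\mathrm{useful}} + N_{\mathrm{idle}}(f)}{f} \\
E &\approx C_{\mathrm{eff}} V_{dd}^{2}
   \left( N_{\mathrm{useful}} + N_{\mathrm{idle}}(f) \right) \\
T \le T_{\mathrm{MAX}}
  &\;\Longleftrightarrow\;
  f \ge \frac{N_{\mathrm{useful}} + N_{\mathrm{idle}}(f)}{T_{\mathrm{MAX}}}
\end{align}
```

Under these assumptions, with V_dd fixed, the energy per frame changes with f only through Nidle(f), so the lowest frequency that still satisfies the real-time constraint gives the lowest energy.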

Variable Frequency (cont’d): Reduces the cost of memory latency; reduces the cost of I/O synchronization; discrete frequency range; adaptation mismatch.

Multimedia Systems: Hardware: wireless network or wired link from a host. Wearable system: general-purpose μP (e.g. SoC), I/O hardware units (DMA, IC, buffers…), and some external chips (e.g. audio CODEC). Software: data processing algorithm, MPEG decoding of an audio stream. [Diagram blocks: μP, I/O, EXT]

The MP3 decoder: An MPEG stream is composed of frames. The decoder produces audio samples by processing blocks of frames. Software and hardware buffering allows synchronization among input rate, output rate, and processing time. Each block must be processed within a fixed time; during this time the CPU does not access the input or output buffers. Output data are sent to the audio CODEC by the DMA.

System characterization: The effect of speed setting on performance depends on hardware characteristics and on the workload. System characterization: frame rate vs. frequency.

Decision Algorithm: off-line phase. Characteristics determination: FRA, FRB, FRW. Overall normalized characteristics determination: FRAo, FRBo, FRWo. [Figure: normalized frame rate NFR (frame/s) vs. frequency f; curves FRA, FRB, FRW; labels FRmax(br, sr), bit rate, sample rate]

Decision Algorithm: on-line phase. [Flow diagram: audio stream; br, sr; look-up; FR; FRmax; f]

Decision Algorithm: on-line phase (cont’d). fAVG: worst case (large jitter). fMAX: always guaranteed. fMIN: best case. [Figure labels: FRMAX, sr, FRREQ; fMIN, fAVG, fMAX; MIN energy, AVG energy, MAX energy]
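The two phases of the decision algorithm can be illustrated with a minimal sketch. It assumes the off-line phase produced a normalized frame-rate curve per available frequency and a table of FRmax values indexed by (br, sr), and that the on-line phase picks the lowest frequency whose frame rate covers the rate required for playback. The table contents, the frequency steps, and the function names are hypothetical; only the figure of 1152 samples per MPEG-1 Layer III frame is a known property of the format.

```python
# Sketch of the on-line look-up. All table values below are hypothetical.

SAMPLES_PER_FRAME = 1152  # samples per MPEG-1 Layer III frame

# Hypothetical off-line result: normalized frame rate at each available
# clock frequency (1.0 at the highest frequency, non-linear in f).
NORMALIZED_FRAME_RATE = {100: 0.45, 150: 0.62, 200: 0.78, 250: 0.90, 300: 1.00}

# Hypothetical off-line result: maximum frame rate (frames/s) at the highest
# frequency, indexed by (bit rate in kbit/s, sample rate in Hz).
FR_MAX = {(128, 44100): 80.0, (192, 44100): 60.0}


def required_frame_rate(sample_rate_hz):
    """Frame rate needed to keep the audio output fed without gaps."""
    return sample_rate_hz / SAMPLES_PER_FRAME


def select_frequency(bit_rate_kbps, sample_rate_hz):
    """Return the lowest frequency whose frame rate meets the requirement."""
    fr_max = FR_MAX[(bit_rate_kbps, sample_rate_hz)]
    fr_req = required_frame_rate(sample_rate_hz)
    for f in sorted(NORMALIZED_FRAME_RATE):
        if NORMALIZED_FRAME_RATE[f] * fr_max >= fr_req:
            return f
    raise ValueError("no available frequency meets the required frame rate")


print(select_frequency(128, 44100))
print(select_frequency(192, 44100))
```

With these illustrative tables the 128 kbit/s stream is assigned frequency step 150 and the 192 kbit/s stream step 200, since the heavier stream needs a higher clock to reach the same required frame rate.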

Energy Penalty: The memory system and its interface do not speed up with the processor as the clock frequency increases, so increasing f increases the energy penalty.
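A toy model can make the energy penalty concrete. It assumes that each memory access has a fixed latency in nanoseconds, so the number of stall cycles per access grows with the clock frequency, and that every cycle (useful or stalled) costs the same energy at fixed voltage. All parameter values, the MHz unit for f, and the function names are hypothetical and are not measurements from the presentation.

```python
import math


def cycles_per_frame(f_mhz, n_useful=2_000_000, n_mem=50_000, t_mem_ns=100.0):
    """Total cycles per frame: useful cycles plus memory stall cycles.

    A fixed latency of t_mem_ns costs more cycles at a higher clock:
    stall cycles per access = ceil(t_mem_ns * f_mhz / 1000).
    """
    stall_per_access = math.ceil(t_mem_ns * f_mhz / 1000.0)
    return n_useful + n_mem * stall_per_access


def energy_per_frame_mj(f_mhz, energy_per_cycle_nj=4.0):
    """Energy per frame in mJ, assuming a constant energy cost per cycle."""
    return cycles_per_frame(f_mhz) * energy_per_cycle_nj * 1e-6


for f in (100, 200, 300):
    print(f, cycles_per_frame(f), round(energy_per_frame_mj(f), 3))
```

In this model the useful work per frame is constant, so the extra energy at higher f comes entirely from the additional stall cycles, which is the penalty the slide describes.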

Experimental Results. [Figure: energy per frame E (mJ) vs. frequency f, with the energy penalty and fMIN marked]

Conclusions and future work: An approach to automatic run-time setting of the optimum processor frequency for energy minimization of streaming MP3 decoding. System characterization for speed-setting policies. Future work: other embedded applications (e.g. MPEG video) and closed-loop policies.