Chapter 3 Embedded Computing in the Emerging Smart Grid Arindam Mukherjee, ValentinaCecchi, Rohith Tenneti, and Aravind Kailas Electrical and Computer.

Slides:



Advertisements
Similar presentations
Dynamic Power Redistribution in Failure-Prone CMPs Paula Petrica, Jonathan A. Winter * and David H. Albonesi Cornell University *Google, Inc.
Advertisements

TRIPS Primary Memory System Simha Sethumadhavan 1.
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Lecture 6: Multicore Systems
Thread Criticality Predictors for Dynamic Performance, Power, and Resource Management in Chip Multiprocessors Abhishek Bhattacharjee Margaret Martonosi.
Combining Statistical and Symbolic Simulation Mark Oskin Fred Chong and Matthew Farrens Dept. of Computer Science University of California at Davis.
Scalable Multi-Cache Simulation Using GPUs Michael Moeng Sangyeun Cho Rami Melhem University of Pittsburgh.
Computer Abstractions and Technology
Jared Casper, Ronny Krashinsky, Christopher Batten, Krste Asanović MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA A Parameterizable.
Sim-alpha: A Validated, Execution-Driven Alpha Simulator Rajagopalan Desikan, Doug Burger, Stephen Keckler, Todd Austin.
Is SC + ILP = RC? Presented by Vamshi Kadaru Chris Gniady, Babak Falsafi, and T. N. VijayKumar - Purdue University Spring 2005: CS 7968 Parallel Computer.
CSC457 Seminar YongKang Zhu December 6 th, 2001 About Network Processor.
1 Design and Implementation of Turbo Decoders for Software Defined Radio Yuan Lin 1, Scott Mahlke 1, Trevor Mudge 1, Chaitali.
1 Instant replay  The semester was split into roughly four parts. —The 1st quarter covered instruction set architectures—the connection between software.
Joram Benham April 2,  Introduction  Motivation  Multicore Processors  Overview, CELL  Advantages of CMPs  Throughput, Latency  Challenges.
Chapter 18 Multicore Computers
Secure Embedded Processing through Hardware-assisted Run-time Monitoring Zubin Kumar.
Embedded Systems Design ICT Embedded System What is an embedded System??? Any IDEA???
Intel Architecture. Changes in architecture Software architecture: –Front end (Feature changes such as adding more graphics, changing the background colors,
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
University of Michigan Electrical Engineering and Computer Science 1 Extending Multicore Architectures to Exploit Hybrid Parallelism in Single-Thread Applications.
Multi-core architectures. Single-core computer Single-core CPU chip.
Multi-Core Architectures
1 Multi-core processors 12/1/09. 2 Multiprocessors inside a single chip It is now possible to implement multiple processors (cores) inside a single chip.
Multicore Systems CET306 Harry R. Erwin University of Sunderland.
1 Computer Architecture Research Overview Rajeev Balasubramonian School of Computing, University of Utah
History of Microprocessor MPIntroductionData BusAddress Bus
Multiprocessing. Going Multi-core Helps Energy Efficiency William Holt, HOT Chips 2005 Adapted from UC Berkeley "The Beauty and Joy of Computing"
Predictive Design Space Exploration Using Genetically Programmed Response Surfaces Henry Cook Department of Electrical Engineering and Computer Science.
Hyper Threading (HT) and  OPs (Micro-Operations) Department of Computer Science Southern Illinois University Edwardsville Summer, 2015 Dr. Hiroshi Fujinoki.
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
Processor Architecture
HyperThreading ● Improves processor performance under certain workloads by providing useful work for execution units that would otherwise be idle ● Duplicates.
Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance Presented by: Peyman Nov 2007.
Intel Multimedia Extensions and Hyper-Threading Michele Co CS451.
Advanced Computer Architecture pg 1 Embedded Computer Architecture 5SAI0 Chip Multi-Processors (ch 8) Henk Corporaal
Computer Structure 2015 – Intel ® Core TM μArch 1 Computer Structure Multi-Threading Lihu Rappoport and Adi Yoaz.
Page 1 2P13 Week 1. Page 2 Page 3 Page 4 Page 5.
Processor Level Parallelism 2. How We Got Here Developments in PC CPUs.
Fall 2012 Parallel Computer Architecture Lecture 4: Multi-Core Processors Prof. Onur Mutlu Carnegie Mellon University 9/14/2012.
Pentium 4 Deeply pipelined processor supporting multiple issue with speculation and multi-threading 2004 version: 31 clock cycles from fetch to retire,
Lynn Choi School of Electrical Engineering
Microarchitecture.
Visit for more Learning Resources
Lynn Choi School of Electrical Engineering
Multi-core processors
Computer Structure Multi-Threading
Embedded Systems Design
Intel Atom Architecture – Next Generation Computing
Embedded Computer Architecture 5SAI0 Chip Multi-Processors (ch 8)
Hyperthreading Technology
Microprocessors Chapter 4.
Levels of Parallelism within a Single Processor
Coe818 Advanced Computer Architecture
Tapestry: Reducing Interference on Manycore Processors for IaaS Clouds
Intel Core I7 Pipeline Wei-Tse Sun.
Die Stacking (3D) Microarchitecture -- from Intel Corporation
Computer Evolution and Performance
Embedded Computer Architecture 5SAI0 Chip Multi-Processors (ch 8)
Levels of Parallelism within a Single Processor
Chapter 4 Multiprocessors
CS 286 Computer Organization and Architecture
Realizing Closed-loop, Online Tuning and Control for Configurable-Cache Embedded Systems: Progress and Challenges Islam S. Badreldin*, Ann Gordon-Ross*,
Chip&Core Architecture
Application-Specific Customization of Soft Processor Microarchitecture
What Are Performance Counters?
Emulating Massively Parallel (PetaFLOPS) Machines
Presentation transcript:

Chapter 3 Embedded Computing in the Emerging Smart Grid Arindam Mukherjee, ValentinaCecchi, Rohith Tenneti, and Aravind Kailas Electrical and Computer Engineering Department, University of North Carolina, Charlotte

Information technology back bone of smart grid Compute and Control Communication Data

Smart meters in advanced metering infrastructure sensor/control information Renewable and less predictable power sources status/control information Data collected by remote terminal units from field units

Communication Secure transfer encryption/decryption State of the art transfer protocol for communication via power lines/wireless/dedicated wirelines Two way communication at all levels

Compute and Control Computations for signal processing Computations for cyber security Power flow calculations for control Intelligent control for optimal power usage

Computations in smart grid? Analysis and Control Sensing and Measurement infrastructure Communication and security Computations Commun ication and Security Analysis and Control Sensing and Measurem ent infrastrutc ure

Intel Atom (state of the art) Advantages In-order execution Low power Lesser die space Disadvantages Memory access has long latencies for floating point and SIMD instructions In-order execution (Long latency) Intel Atom Pine trail

ARM Cortex A8 (state of the art) Advantage In-order cores Low power Lesser area Disadvantages Deep pipelines, introduce latency Instruction level parallelism cannot be exploited ARM Cortex A8 neon integer pipeline

New processor? Microarchitecture is currently not optimized for the smart grid applications Customize architecture for specific computations for better efficiency Efficiency Latency and throughput requirements for real time applications Consume lesser power compared to state of the art processors One embedded processor to handle varied applications

Design space exploration

Design space exploration - Basic Steps Identify the application and profile the benchmarks Identify the processor simulators Optimize the architecture

Applications Power flow studies – Aid in control Fast fourier transform – Signal processing Blowfish encryption – Security The benchmarks are optimized based on the architecture.

Processor simulators Casper - A Sparc V9 based Cycle accurate chip-multithreaded Architecture Simulator for Performance, Energy and aRea analysis. Based on open sourced Suns Ultra Sparc-T1 architecture. MPTLSim - A cycle-accurate, full-system simulator for x86-64 multicore architectures with coherent caches MV5 - An Event-driven, Cycle-accurate Simulator for Heterogeneous Manycore Architectures

Casper Customize Cores on chip Threads per core L1, L2 cache size/associativity/banks Size of load miss queue, missed instruction list, data fill queue, branch address buffer, store buffer, cache fill buffer Set instruction/data cache latency Measure Power Area CPI Throughput Latency Pipeline stalls Wait time for threads Instruction/data cache misses

MPTLSim All parameters listed in Casper are configurable in MPTLSim Additional features Full system capability Implements branch prediction Out of order execution (no In-order execution) Configure RTL models of the pipeline units

MV5 Implements all features of the earlier simulators (no support for full system emulation) Additional features X-86, Alpha, Sparc all architectures are supported Heterogeneous configuration in terms of the microarchitecture features like SIMD/Out of order/In order are supported Different On chip networks can be explored Run different benchmarks on different cores Example configuration

Optimal configuration Few methods Linear/Non-linear regression Genetic algorithms Artificial neural networks Strength Pareto evolution Fuzzy logic Algorithms for best initial training set Random sampling Placket-Burman design of experiments Latin hypercube Configurable parameter set Simulate the architecture and find the value to be optimized Optimizer predicts next set of parameters

Open research Benchmark suite development for the smart grid applications Operating system for the smart grid applications