Using Custom Accelerators in Wireless Systems Alex Papakonstantinou, Deming Chen Illinois Center for Wireless Systems Wireless SoC Design Trends and Challenges.

Slides:



Advertisements
Similar presentations
VHDL Design of Multifunctional RISC Processor on FPGA
Advertisements

Automatic Data Movement and Computation Mapping for Multi-level Parallel Architectures with Explicitly Managed Memories Muthu Baskaran 1 Uday Bondhugula.
Philips Research ICS 252 class, February 3, The Trimedia CPU64 VLIW Media Processor Kees Vissers Philips Research Visiting Industrial Fellow
ARM Cortex A8 Pipeline EE126 Wei Wang. Cortex A8 is a processor core designed by ARM Holdings. Application: Apple A4, Samsung Exynos What’s the.
CPU Review and Programming Models CT101 – Computing Systems.
1 ITCS 3181 Logic and Computer Systems B. Wilkinson Slides9.ppt Modification date: March 30, 2015 Processor Design.
Introducing the Si2170: Highest Performance, Hybrid TV Tuner June 2009.
Chapter1 Fundamental of Computer Design Dr. Bernard Chen Ph.D. University of Central Arkansas.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
1-1 Microprocessor Engineering Microprocessor Systems Microcontrollers Infineon 16-bit Processor Family  specifically 167CS microcontroller C Programming.
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
11/14/05ELEC Fall Multi-processor SoCs Yijing Chen.
ECE 232 L2 Basics.1 Adapted from Patterson 97 ©UCBCopyright 1998 Morgan Kaufmann Publishers ECE 232 Hardware Organization and Design Lecture 2 Computer.
1 Chapter 13 Cores and Intellectual Property. 2 Overview FPGA intellectual property (IP) can be defined as a reusable design block (Hard, Firm or soft)
Pipelining By Toan Nguyen.
COM181 Computer Hardware Ian McCrumRoom 5B18,
A Company Selling Technology and not just a Product.
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
(1) Introduction © Sudhakar Yalamanchili, Georgia Institute of Technology, 2006.
Introduction to Symmetric Multiprocessors Süha TUNA Bilişim Enstitüsü UHeM Yaz Çalıştayı
1 Presenter: Ming-Shiun Yang Sah, A., Balakrishnan, M., Panda, P.R. Design, Automation & Test in Europe Conference & Exhibition, DATE ‘09. A Generic.
An Energy-Efficient Reconfigurable Multiprocessor IC for DSP Applications Multiple programmable VLIW processors arranged in a ring topology –Balances its.
L29:Lower Power Embedded Architecture Design 성균관대학교 조 준 동 교수,
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
Chap. 1 Overview of Digital Design with Verilog. 2 Overview of Digital Design with Verilog HDL Evolution of computer aided digital circuit design Emergence.
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
Infrastructure for Better Quality Internet Access & Web Publishing without Increasing Bandwidth Prof. Chi Chi Hung School of Computing, National University.
Architectures for mobile and wireless systems Ese 566 Report 1 Hui Zhang Preethi Karthik.
SYSTEM-ON-CHIP (SoC) AND USE OF VLSI CIRCUIT DESIGN TECHNOLOGY.
Automated Design of Custom Architecture Tulika Mitra
May 17, USB Semiconductor IP How to Integrate USB into Your Design Eric Huang inSilicon Corporation.
Section 10: Advanced Topics 1 M. Balakrishnan Dept. of Comp. Sci. & Engg. I.I.T. Delhi.
C OMPUTER O RGANIZATION AND D ESIGN The Hardware/Software Interface 5 th Edition Chapter 1 Computer Abstractions and Technology Sections 1.5 – 1.11.
Data and Computer Communications Circuit Switching and Packet Switching.
COE 405 Design and Modeling of Digital Systems
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
A Methodology for Architecture Exploration of heterogeneous Signal Processing Systems Paul Lieverse, Pieter van der Wolf, Ed Deprettere, Kees Vissers.
F. Gharsalli, S. Meftali, F. Rousseau, A.A. Jerraya TIMA laboratory 46 avenue Felix Viallet Grenoble Cedex - France Embedded Memory Wrapper Generation.
A Graph Based Algorithm for Data Path Optimization in Custom Processors J. Trajkovic, M. Reshadi, B. Gorjiara, D. Gajski Center for Embedded Computer Systems.
Computer Organization and Design Computer Abstractions and Technology
Hardware-software Interface Xiaofeng Fan
Embedded Systems Design: A Unified Hardware/Software Introduction 1 Chapter 3 General-Purpose Processors: Software.
General Concepts of Computer Organization Overview of Microcomputer.
Adaptive Multi-Threading for Dynamic Workloads in Embedded Multiprocessors 林鼎原 Department of Electrical Engineering National Cheng Kung University Tainan,
University of Michigan Electrical Engineering and Computer Science University of Michigan Electrical Engineering and Computer Science 1 Bundled Execution.
System-level power analysis and estimation September 20, 2006 Chong-Min Kyung.
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
VLSI Design System-on-Chip Design
Moore’s Law and Its Future Mark Clements. 15/02/2007EADS 2 This Week – Moore’s Law History of Transistors and circuits The Integrated circuit manufacturing.
Von Neumann Computers Article Authors: Rudolf Eigenman & David Lilja
Hy-C A Compiler Retargetable for Single-Chip Heterogeneous Multiprocessors Philip Sweany 8/27/2010.
Basic Elements of Processor ALU Registers Internal data pahs External data paths Control Unit.
NISC set computer no-instruction
University of Michigan Electrical Engineering and Computer Science Automatic Synthesis of Customized Local Memories for Multicluster Application Accelerators.
بسم الله الرحمن الرحيم MEMORY AND I/O.
Parallel Computing Presented by Justin Reschke
Winter-Spring 2001Codesign of Embedded Systems1 Essential Issues in Codesign: Architectures Part of HW/SW Codesign of Embedded Systems Course (CE )
Software Design and Development Computer Architecture Computing Science.
1 The user’s view  A user is a person employing the computer to do useful work  Examples of useful work include spreadsheets word processing developing.
Controller Implementation
Andreas Hoffmann Andreas Ropers Tim Kogel Stefan Pees Prof
ECE354 Embedded Systems Introduction C Andras Moritz.
Embedded Systems Design
Advanced Topic: Alternative Architectures Chapter 9 Objectives
System Interconnect Fabric
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
Chapter 1 Introduction.
Srinivas Aluri Jaimin Mehta
ARM ORGANISATION.
Presentation transcript:

Using Custom Accelerators in Wireless Systems Alex Papakonstantinou, Deming Chen Illinois Center for Wireless Systems Wireless SoC Design Trends and Challenges Shrinking transistor technologies have transformed die into a host of extraordinary size and complexity systems –All the analog and digital components that were implemented in 3-4 different ICs in past technologies, can now fit in a single chip Designer Productivity does not rise at the same rate as transistor capacity –Design reuse and use of Commercial Off-The-Self (COTS) Intellectual Property (IP) help meet Time-To-Market (TTM) constraints but have other downsides Design space exploration is becoming a daunting task and conflicts with the shrinking TTM requirements System customization suffers in terms of functionality/ performance/power/area from “one system fits all” tactic Design focus is shifting from single thread speed optimization to execution parallelization through multi-processor systems Typical Design Practice & Design Paradigm Shift COTS IP modules are integrated to meet the required system functionality –Usually a generic microprocessor/micro- controller is used for the control part and a separate DSP processor for the signal processing part –Fixed-functionality IP modules are integrated for the various data processing IP-use speeds up the design phase but: –imposes coarse granularity on optimization decisions regarding functionality, performance and power dissipation –does not eliminate design time entirely, as interfacing between different IP modules can take up considerable engineering resources Design Paradigm needs a shift to higher abstraction level –Design systems efficiently with higher flexibility and on-demand customization Instruction-less custom processor / accelerator: –Microcode memory stores microcode words which control Functional-Units (FU) and data transfers each cycle –Program Counter (PC) holds next microcode memory address –Microcode words do not require any decoding –FUs customized according to application domain –Application-custom forwarding paths between FUs can eliminate unnecessary Register File (RF) reads/writes EPOS ( Explicitly Parallel Operations System ) Instruction-Level Parallelism (ILP) extraction: –The front-end of the IMPACT compiler is used to optimize the HLL description using: Traditional compiler techniques Superblock and Hyperblock creation The EPOS accelerators generated can substitute the generic COTS IP by: –Offering high customization according to the system requirements –Providing better performance and power efficiency than a generic DSP- core/microprocessor EPOS – based Wireless SoC Solution Each module is mapped directly onto a customized EPOS accelerator The interfaces between the EPOS accelerators, as well as, between other IP and EPOS modules are defined in the HLL program and automatically synthesized along with the EPOS datapaths Exploration of alternative system implementations becomes efficient and extremely fast Each EPOS processor can be re-programmed within the system to execute optimized/modified versions of its original functionality EPOS Performance Results EPOS Configuration used: –4xALU –1xMUL –1xST-Port –1xLD-Port FU Latencies: –ALU: 1 –MUL: 3 –LD: 4 –ST: 1 Application NISC (cycles) EPOS (cycles) startup dijkstra bubble