ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.

Slides:



Advertisements
Similar presentations
© 2003 Xilinx, Inc. All Rights Reserved Course Wrap Up DSP Design Flow.
Advertisements

Real-Time DSP Multiprocessor Implementation for Future Wireless Base-Station Receivers Bryan Jones, Sridhar Rajagopal, and Dr. Joseph Cavallaro.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Extensible Processors. 2 ASIP Gain performance by:  Specialized hardware for the whole application (ASIC). −  Almost no flexibility. −High cost.  Use.
Lecture 26: Reconfigurable Computing May 11, 2004 ECE 669 Parallel Computer Architecture Reconfigurable Computing.
Behavioral Design Outline –Design Specification –Behavioral Design –Behavioral Specification –Hardware Description Languages –Behavioral Simulation –Behavioral.
Processor Architectures and Program Mapping 5kk10 TU/e 2006 Henk Corporaal Jef van Meerbergen Bart Mesman.
Tejas Bhatt and Dennis McCain Hardware Prototype Group, NRC/Dallas Matlab as a Development Environment for FPGA Design Tejas Bhatt June 16, 2005.
The Effect of Data-Reuse Transformations on Multimedia Applications for Different Processing Platforms N. Vassiliadis, A. Chormoviti, N. Kavvadias, S.
Enhancing Embedded Processors with Specific Instruction Set Extensions for Network Applications A. Chormoviti, N. Vassiliadis, G. Theodoridis, S. Nikolaidis.
UCB November 8, 2001 Krishna V Palem Proceler Inc. Customization Using Variable Instruction Sets Krishna V Palem CTO Proceler Inc.
SSS 4/9/99CMU Reconfigurable Computing1 The CMU Reconfigurable Computing Project April 9, 1999 Mihai Budiu
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
FPGA Based Fuzzy Logic Controller for Semi- Active Suspensions Aws Abu-Khudhair.
L29:Lower Power Embedded Architecture Design 성균관대학교 조 준 동 교수,
Development in hardware – Why? Option: array of custom processing nodes Step 1: analyze the application and extract the component tasks Step 2: design.
DSPs in Wireless Communication Systems Vishwas Sundaramurthy Electrical and Computer Engineering Department, Rice University, Houston,TX.
Trigger design engineering tools. Data flow analysis Data flow analysis through the entire Trigger Processor allow us to refine the optimal architecture.
RICE UNIVERSITY Implementing the Viterbi algorithm on programmable processors Sridhar Rajagopal Elec 696
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
A Reconfigurable Processor Architecture and Software Development Environment for Embedded Systems Andrea Cappelli F. Campi, R.Guerrieri, A.Lodi, M.Toma,
Architectures for mobile and wireless systems Ese 566 Report 1 Hui Zhang Preethi Karthik.
Paper Review: XiSystem - A Reconfigurable Processor and System
CASTNESS11, Rome Italy © 2011 Target Compiler Technologies L 1 Ideas for the design of an ASIP for LQCD Target Compiler Technologies CASTNESS’11, Rome,
Automated Design of Custom Architecture Tulika Mitra
A bit-streaming, pipelined multiuser detector for wireless communications Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Section 10: Advanced Topics 1 M. Balakrishnan Dept. of Comp. Sci. & Engg. I.I.T. Delhi.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
Software Defined Radio 長庚電機通訊組 碩一 張晉銓 指導教授 : 黃文傑博士.
J. Christiansen, CERN - EP/MIC
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
Page 1 Reconfigurable Communications Processor Principal Investigator: Chris Papachristou Task Number: NAG Electrical Engineering & Computer Science.
RICE UNIVERSITY DSPs for 4G wireless systems Sridhar Rajagopal, Scott Rixner, Joseph R. Cavallaro and Behnaam Aazhang This work has been supported by Nokia,
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
TI DSPS FEST 1999 Implementation of Channel Estimation and Multiuser Detection Algorithms for W-CDMA on Digital Signal Processors Sridhar Rajagopal Gang.
R2D2 team R2D2 team Reconfigurable and Retargetable Digital Devices  Application domains Mobile telecommunications  WCDMA/UMTS (Wideband Code Division.
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
RICE UNIVERSITY DSP architectures for wireless communications Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia.
MAPLD 2005/254C. Papachristou 1 Reconfigurable and Evolvable Hardware Fabric Chris Papachristou, Frank Wolff Robert Ewing Electrical Engineering & Computer.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Evaluating and Improving an OpenMP-based Circuit Design Tool Tim Beatty, Dr. Ken Kent, Dr. Eric Aubanel Faculty of Computer Science University of New Brunswick.
Development of Programmable Architecture for Base-Band Processing S. Leung, A. Postula, Univ. of Queensland, Australia A. Hemani, Royal Institute of Tech.,
RICE UNIVERSITY DSPs for future wireless systems Sridhar Rajagopal.
DSP Architectural Considerations for Optimal Baseband Processing Sridhar Rajagopal Scott Rixner Joseph R. Cavallaro Behnaam Aazhang Rice University, Houston,
Implementing algorithms for advanced communication systems -- My bag of tricks Sridhar Rajagopal Electrical and Computer Engineering This work is supported.
Implementing Multiuser Channel Estimation and Detection for W-CDMA Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro and Behnaam Aazhang Rice.
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
NISC set computer no-instruction
SR: 599 report Channel Estimation for W-CDMA on DSPs Sridhar Rajagopal ECE Dept., Rice University Elec 599.
A 1.2V 26mW Configurable Multiuser Mobile MIMO-OFDM/-OFDMA Baseband Processor Motivations –Most are single user, SISO, downlink OFDM solutions –Training.
Real-Time System-On-A-Chip Emulation.  Introduction  Describing SOC Designs  System-Level Design Flow  SOC Implemantation Paths-Emulation and.
RICE UNIVERSITY Handset architectures Sridhar Rajagopal ASICsProgrammable The support for this work in.
INTRODUCTION:- The approaching 4G (fourth generation) mobile communication systems are projected to solve still-remaining problems of 3G (third generation)
Multi-cellular paradigm The molecular level can support self- replication (and self- repair). But we also need cells that can be designed to fit the specific.
Channel Equalization in MIMO Downlink and ASIP Architectures Predrag Radosavljevic Rice University March 29, 2004.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
A programmable communications processor for future wireless systems
Anne Pratoomtong ECE734, Spring2002
How to ATTACK Problems Facing 3G Wireless Communication Systems
Matlab as a Development Environment for FPGA Design
A Digital Signal Prophecy The past, present and future of programmable DSP and the effects on high performance applications Continuing technology enhancements.
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
DSPs for Future Wireless Base-Stations
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
DSP Architectures for Future Wireless Base-Stations
Suman Das, Sridhar Rajagopal, Chaitali Sengupta and Joseph R.Cavallaro
DSPs for Future Wireless Base-Stations
Presentation transcript:

ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia Communication ECE Department, Houston, Texas USA WWRF11 Meeting June 2004

Page 2 WWRF11 Meeting, June 2004, Oslo, Norway Processors in Future Wireless Systems  Future generations of mobile handsets:  High speed and low power  Flexibility  Traditional approaches: ASIC and DSP processors  ASIC drawbacks:  No flexibility: family of ASICs are needed  High probability of design errors, high design cost  DSP drawbacks:  Not optimized for a specific application  Often limited instruction and data level parallelism

Page 3 WWRF11 Meeting, June 2004, Oslo, Norway Processors in Future Wireless Systems  ASIPs (Application Specific Instruction Processors):  Excellent tradeoff between efficiency of ASICs and flexibility of DSPs

Page 4 WWRF11 Meeting, June 2004, Oslo, Norway ASIP Architecture Design  Flexible processors for mobile handsets:  Different modifications of wireless base-band algorithms (processing in slow/fast fading, low/high scattering environments)  Support for evolution of standards (3GPP, 4G, x, WiFi, etc)  Efficient processors to achieve high-demanding real time requirements:  Customized architecture is needed  Extension of ASIP instruction set with application-specific operations

Page 5 WWRF11 Meeting, June 2004, Oslo, Norway ASIP Architecture Based on TTA  TTA: Transport Triggered Architecture  Operations are triggered by data transport  Automatic hardware design flow:  Software environment enables reconfigurable implementation  Retargetable compiler  Conversion from C/C++ code of application to gate level processor design  Fast processor design  Automatic search for optimal (performance/cost) processor  VHDL representation of processor core obtained by software tool

Page 6 WWRF11 Meeting, June 2004, Oslo, Norway ASIP Architecture Based on TTA  Flexible architecture  No limitations to add new FUs, buses, registers  Customizable architecture  Implementation of Special Function Units (SFUs)  Instruction and data level parallelism  Efficient and parallel data flow  Sub-word parallelism by implementing appropriate SFU

Page 7 WWRF11 Meeting, June 2004, Oslo, Norway General Structure of TTA  VLIW architecture principle  Expanded compiler characteristics  Software specification of data transports  Simpler architecture than traditional VLIW  Large flexibility  Independent design of interconnection network and FUs  Easy to add/remove FUs and RFs  Software design environment

Page 8 WWRF11 Meeting, June 2004, Oslo, Norway ASIP Design Flow  Fast and efficient hardware/software co-design  Modified MOVE tool: exploration for optimal TTA processor with SFUs  MOVEGen tool: conversion of processor description into VHDL representation  Xilinx ISE Foundation for fast FPGA prototyping  Mentor Graphics tools: generation of processor layout

Page 9 WWRF11 Meeting, June 2004, Oslo, Norway Our Application: MIMO Downlink Equalization  Physical layer of mobile handset in MIMO downlink  ASIP architecture based on TTA  Flexible architecture solution for different modifications of channel equalization algorithm  Highly optimized for the most computationally complex version of channel equalization

Page 10 WWRF11 Meeting, June 2004, Oslo, Norway High Computational Complexity  Number of operations per chip in one second  Workload varies with velocity and number of channel multipaths  Required processor flexibility  Higher workload than typical operation count specification for TI C5x DSP (widely used in 2G mobile handsets)

Page 11 WWRF11 Meeting, June 2004, Oslo, Norway TTA Processor for Equalization  Customized processor  Application-specific SFUs for equalization  Area of approximately 140K Gates  Flexible design  Equalization in broad range of environments  Dynamic power dissipation: 32mW – 54 mW  Minimum clock frequency to achieve real time: 42MHz – 109MHz Processor core including data memory

Page 12 WWRF11 Meeting, June 2004, Oslo, Norway SFU Example: Complex Multiplication  Reduction of data transports between FUs  Less number of buses and smaller interconnection network  Smaller instruction word  Instruction and data parallelism placed inside CXMUL Real multipliers Complex multiplier

Page 13 WWRF11 Meeting, June 2004, Oslo, Norway Gate Level CMOS Synthesis  Mentor Graphics Tools  Leonardo Spectrum for low level synthesis  IC Station for automatic layout generation  CMOS library of logic cells  Customized equalizer with user detection  Included library of SFUs  Synthesis estimate of processor core: 182,887 gates

Page 14 WWRF11 Meeting, June 2004, Oslo, Norway Conclusions  ASIP architecture based on TTA for future mobile handsets  Same efficiency as ASIC while keeping flexibility  Customization  SFUs for application specific operations  Optimization of architecture  Reduction of power dissipation and area while achieving speedup  Interaction between hardware implementation and software environment  Fast and efficient design flow  Easy to fix design errors