Task Partitioning Wireless Base-Station Receiver Algorithms on Multiple DSPs and FPGAs
Sridhar Rajagopal, Bryan A. Jones, and Joseph R. Cavallaro
Rice University
This work is supported by Nokia, TI, TATP, and NSF.
Motivation
- Build wireless multimedia communication systems (Kbps to Mbps)
- Sophisticated algorithms have exponential complexity
- Approach: sub-optimal algorithms with O(n^2) to O(n^3) complexity
- Better hardware implementations needed
Hardware implementations
- DSP: programmable
- ASIC: customized hardware
- FPGA: programmable "ASICs"
- A single DSP is too slow
- Need flexibility (for different protocols) and speed (to meet real-time deadlines)
- A multiple DSP-FPGA solution is investigated
Contributions
- Efficient task partitioning of multiuser estimation and detection algorithms on fixed hardware
  - maximize performance, minimize overhead
- 1.19X-5.92X speedup with 2 DSPs (additional processing power and internal memory)
- Use of FPGAs to accelerate multiuser detection
- Multiple DSPs and FPGAs to meet real-time requirements
Outline
- Introduction
- Multiprocessor system at Rice
- Single and multiprocessor simulations
- FPGAs for acceleration
- Summary
Multiuser estimation and detection
[Figure: base station receiving direct and reflected paths from User 1 and User 2, plus noise and interference]
- Jointly estimate attenuations, fading, and delays
- Jointly detect the data of all users
Benefits of multiuser estimation and detection
[Figure: bit error rate vs. SNR (dB), comparing single-user (channel estimation + detection), multiuser estimation + single-user detection, and multiuser (channel estimation + detection)]
Base-station receiver
[Figure: antenna -> multiuser channel estimation (training, tracking) -> multiuser detection -> decoding -> information bits]
Sub-optimal estimation and detection
- Maximum-likelihood estimation: O(users^2 * spreading gain)
  - avoids matrix inversion by an iterative scheme
- Multiuser detection with interference cancellation:
  - single-user detector (code matched filter): O(users * spreading gain)
  - 3 stages of parallel interference cancellation: O(users^2)
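The detection chain above can be sketched as follows. This is a minimal illustration, not the implementation from the slides: the function and variable names are hypothetical, and the cross-correlation matrix is assumed precomputed.

```python
def matched_filter(received, codes):
    """Stage 0: correlate received chips with each user's spreading code.
    Cost is O(users * spreading_gain)."""
    return [sum(r * c for r, c in zip(received, code)) for code in codes]

def pic_stage(soft, cross_corr):
    """One parallel interference cancellation (PIC) stage: subtract the
    estimated interference of all other users from each user's statistic.
    Cost is O(users^2)."""
    bits = [1 if s >= 0 else -1 for s in soft]          # hard decisions
    n = len(soft)
    return [soft[k] - sum(cross_corr[k][j] * bits[j]
                          for j in range(n) if j != k)
            for k in range(n)]
```

Three such PIC stages, applied in sequence after the matched filter, give the interference-cancelling detector whose per-stage O(users^2) cost the slide quotes.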
Multiprocessor implementations
- A single DSP is too slow
- Multiple DSPs introduce communication overhead
- Partition estimation and detection onto different DSPs
  - narrow communication link
  - maximize performance
- Data rates then depend only on detection
Multiprocessor system at Rice
- Prototype multiprocessor board from Sundance Inc.
- Two TI C67 DSPs and two Xilinx 300K-gate FPGAs
- Inter-processor communication at 20 MBps
[Figure: PC host connected to DSP1 (multiuser estimation) and DSP2 (multiuser detection), with FPGA1 and FPGA2; received bits in, detected bits out]
Base-case implementation (single DSP)
- Multiuser estimation is 10X-50X slower than multiuser detection
  - different algorithm complexity
- Multiuser detection fits in internal memory (64 KB)
- Multiuser estimation uses internal and off-chip memory
Base-case simulation
[Figure: execution time (seconds, log scale) vs. users (5-25) for multiuser estimation, single-user estimation, multiuser detection, and single-user detection]
Dual-DSP implementation
- Both estimation and detection now fit in internal memory
- 2X-12.66X speedup in estimation (DSP1 vs. single DSP)
- No change in detection performance
- Estimation still 3X slower than detection
- Inter-processor communication overhead: O(users * spreading gain) = 16-512 KB
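A rough sizing of the DSP1-to-DSP2 transfer can make the overhead concrete. Only the O(users * spreading gain) scaling and the 20 MBps link rate come from the slides; the coefficient width and any oversampling or window factors are guesses, which is why this toy formula does not reproduce the 16-512 KB range exactly.

```python
LINK_MBPS = 20  # Sundance board inter-processor link rate from the slides

def transfer_cost(users, spreading_gain, bytes_per_coeff=4):
    """Hypothetical per-update transfer: users * spreading_gain coefficients,
    bytes_per_coeff bytes each. Returns (size in KB, time in ms) over the
    20 MBps link. Constants are illustrative, not measured."""
    size_kb = users * spreading_gain * bytes_per_coeff / 1024.0
    time_ms = size_kb / (LINK_MBPS * 1024) * 1000.0
    return size_kb, time_ms
```

The point of the sketch is the scaling: transfer size (and hence time on the fixed 20 MBps link) grows linearly with both the user count and the spreading gain.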
Dual-DSP simulations
[Figure: execution time (seconds, log scale) vs. users for multiuser estimation (single DSP vs. DSP1), multiuser detection (single DSP vs. DSP2), and the DSP1-DSP2 communication overhead]
Balancing the division of tasks
- Unbalanced task division:
  - estimation 3X slower than detection
  - huge communication overhead (> estimation, detection)
- Data rates depend only on detection
- Update channel estimates less frequently
  - reasonable for slow-fading channels (e.g., indoor environments)
Frequency of estimation updates
- Can update more frequently with more users
  - once every 48 bits for a single user
  - once every 9 bits for 32 users
- Relatively larger overhead for fewer users
  - estimation, detection = O(users^2)
  - communication overhead = O(users)
Frequency of channel estimate updates
[Figure: estimation update frequency (1 in 'x' bits) vs. users (5-50)]
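The amortization argument behind the update-frequency trade-off can be sketched as follows. The O(users^2) estimation/detection and O(users) communication scalings are from the slides; the constant factors are placeholders chosen only for illustration.

```python
def per_bit_cost(users, x, c_det=1.0, c_est=10.0, c_comm=50.0):
    """Amortized work per detected bit when the channel is re-estimated
    once every x bits: detection runs per bit, while estimation and the
    inter-DSP transfer are paid once per update and spread over x bits.
    The c_* constants are hypothetical."""
    detection = c_det * users ** 2
    estimation = c_est * users ** 2
    comm = c_comm * users
    return detection + (estimation + comm) / x
```

Because the communication term grows only linearly while estimation grows quadratically, the overhead is relatively larger for fewer users, consistent with the plotted trend (updates every 48 bits for 1 user vs. every 9 bits for 32 users).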
Limitations of DSP implementations
- Further acceleration needed for real-time performance
- Matrix-based, massively parallel algorithms
- Detection of bits in {+1, -1} requires bit-level operations
- On DSPs:
  - bit multiplications are wasteful (add/subtract suffices, as on an FPGA)
  - bit storage is not convenient
  - parallelism cannot be fully exploited
FPGAs for acceleration
- Combine flexibility with the efficiency of ASICs
- Good for parallelism and bit-level operations
[Figure: pipeline of DSP1 (multiuser estimation), FPGA1 (code matched filter detector, PIC stage 1), FPGA2 (PIC stage 2), and DSP2; received bits in, detected bits out]
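The bit-level trick the slides allude to can be shown in a few lines. Since detected bits are in {+1, -1}, a multiply-accumulate y += b * x reduces to a conditional add/subtract, which maps to cheap FPGA logic with no hardware multipliers. The sketch below (hypothetical names) expresses that reduction in software:

```python
def mac_with_bits(xs, bits):
    """Accumulate sum(b * x) for bits b in {+1, -1} without multiplying:
    each term is just an add or a subtract, mirroring the FPGA datapath."""
    acc = 0
    for x, b in zip(xs, bits):
        acc = acc + x if b > 0 else acc - x  # add/sub replaces multiply
    return acc
```

On an FPGA, one such add/subtract unit per user can run in parallel, which is how the interference-cancellation stages exploit both the bit-level and the matrix-level parallelism that a DSP leaves on the table.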
Multiprocessor simulations
[Figure: execution time (seconds, log scale) vs. users (5-35) for the single-DSP implementation, the 2-DSP implementation, and 2 DSPs + 2 FPGAs, against the target data rate of 128 Kbps/user]
Multiprocessor advantages
- 1.19X-5.92X speedup using 2 DSPs
- Up to 50X acceleration possible by task balancing with additional FPGAs
  - subject to DSP-FPGA communication overhead
- Just 2 DSPs and 2 FPGAs can meet the 128 Kbps/user real-time requirement for up to 7 users
Outline
- Introduction
- Multiprocessor system at Rice
- Single and multiprocessor simulations
- FPGAs for acceleration
- Future work and summary
Future work
- DSP-FPGA communication overhead (transferring KBs of data into FPGAs)
- Implementation of channel decoding
- Complete real-time system
Summary
- Efficient task partitioning of multiuser estimation and detection algorithms on fixed hardware
  - maximize performance, minimize overhead
- 1.19X-5.92X speedup with 2 DSPs (additional processing power and internal memory)
- Use of FPGAs to accelerate multiuser detection
- Multiple DSPs and FPGAs to meet real-time requirements