Sridhar Rajagopal and Joseph R. Cavallaro Rice University

Presentation transcript:

A bit-streaming, pipelined multiuser detector for wireless communications
Sridhar Rajagopal and Joseph R. Cavallaro
Rice University
{sridhar,cavallar}@rice.edu
This work is supported by Nokia, TI, TATP and NSF

Motivation
Implementing multiuser detection for 3G wireless systems at the base-station
Challenges:
- large complexity
- block-based algorithms (latency)
Unable to meet real-time requirements (3GPP)

Contributions
Developed a simple architecture for asynchronous multiuser detection [ + , x ]
Bit-streaming
- reduced latency
- no window edge computations
- lower memory requirements
Pipelined stages
- higher throughput (with more hardware)
DSP-based implementation closer to real-time

Multiuser detection
[Figure: User 1 and User 2 reach the base-station over direct paths and reflections; the received signal contains noise + interference.]
Jointly detect data of all users

Benefits of multiuser detection
[Figure: error rate vs. SNR. x-axis: SNR (in dB); y-axis: bit error rate. Curves:
- Single-user (channel estimation + detection)
- Multi-user estimation + single-user detection
- Multi-user (channel estimation + detection)]

Asynchronous multiuser interference
Interference due to past, current and future bits of other users
[Figure: timing diagram over received bit intervals r_{i-1}, r_i, r_{i+1}, r_{i+2}. Because of the users' relative delays, the desired user's bit overlaps previous bits of some users and future bits of others (bit labels such as b_{k,i} and b_{j,i+1}).]

Multistage Parallel Interference Cancellation (PIC)
Conventional code matched filter, followed by PIC stages
Iterate for convergence (PIC)
Notation:
- A: channel estimates
- y: soft decision
- d: detected bits
- S = diag(A^H A)
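The slide's equations did not survive in the transcript, so the following is only a minimal sketch of a textbook multistage PIC, not necessarily the authors' exact formulation. It assumes real-valued signals (so A^H becomes a transpose), hard decisions between stages, and the update y = y0 - (A^T A - S) d with S = diag(A^T A). The function names, the two-user matrix A and the bit vector b are hypothetical and chosen only for illustration.

```python
def sign(x):
    """Hard decision on one soft value (ties go to +1)."""
    return 1 if x >= 0 else -1

def matched_filter(A, r):
    """y0 = A^T r: correlate the received chips with each user's column of A."""
    users = len(A[0])
    return [sum(A[c][k] * r[c] for c in range(len(A))) for k in range(users)]

def correlation(A):
    """R = A^T A, a real-valued stand-in for A^H A."""
    users = len(A[0])
    return [[sum(row[i] * row[j] for row in A) for j in range(users)]
            for i in range(users)]

def pic_detect(R, y0, stages=3):
    """Multistage PIC: subtract the other users' estimated interference."""
    d = [sign(v) for v in y0]                # stage 0: matched-filter decisions
    for _ in range(stages):
        # (R - diag(R)) d: skip j == i so a user never cancels itself
        y = [y0[i] - sum(R[i][j] * d[j] for j in range(len(d)) if j != i)
             for i in range(len(d))]
        d = [sign(v) for v in y]
    return d

# Hypothetical near-far example: user 2 is four times stronger than user 1.
A = [[0.5, 2.0], [0.5, 2.0], [0.5, 2.0], [0.5, -2.0]]   # chips x users
b = [1, -1]                                             # transmitted bits
r = [sum(a * x for a, x in zip(row, b)) for row in A]   # noiseless received chips
y0 = matched_filter(A, r)
mf_bits = [sign(v) for v in y0]
pic_bits = pic_detect(correlation(A), y0)
```

In this toy case the matched filter alone gets the weak user's bit wrong, and the first PIC stage recovers it once the strong user's contribution is subtracted.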

Multistage Parallel Interference Cancellation (PIC)
Tri-diagonal block Toeplitz matrix [KD x KD]
D: detection window length
Previous work: make the block Toeplitz matrix circulant
S. Das, J. R. Cavallaro, and B. Aazhang. Computationally Efficient Multiuser Detectors. PIMRC 1997

Block Based Detector
- 2 extra edge bit computations per stage
- Latency: variable [worst case (1st bit): D*latency]
[Figure: timing diagram over TIME. A window of bits 1-12 passes through MF, PIC1, PIC2 and PIC3 and yields bits 2-11; the next window, bits 11-22, yields bits 12-21.]
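The window bookkeeping in the timing diagram can be sketched as follows. This is an illustration under the assumption that each window of D bits discards its first and last bit and that consecutive windows overlap by two bits, which matches the 1-12 / 11-22 example on the slide; the function names are hypothetical.

```python
def block_windows(first_bit, last_bit, D):
    """Yield ((window_start, window_end), (output_start, output_end)).

    Each window of D bits yields only its D - 2 interior bits, because the
    two edge bits lack a neighbour on one side.
    """
    start = first_bit
    while start + D - 1 <= last_bit:
        end = start + D - 1
        yield (start, end), (start + 1, end - 1)
        start = end - 1          # overlap by 2 bits so outputs stay contiguous

def edge_overhead(D):
    """Fraction of per-stage computations spent on the 2 discarded edge bits."""
    return 2 / D

windows = list(block_windows(1, 22, 12))
```

For D = 12 this reproduces the slide's two windows, and `edge_overhead(12)` evaluates to 2/12, the per-stage work a detector without window edges would avoid.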

Bit-streaming the multiuser detection algorithm
Savings in memory by D^2
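One way to picture the bit-streaming idea: with a tri-diagonal block Toeplitz matrix, the update for bit i only touches the decisions for bits i-1, i and i+1, so a stage can keep two K x K blocks instead of the full KD x KD matrix. The sketch below assumes real-valued data, a sub-diagonal block L, a diagonal block C with zero diagonal, and the update y_i = y0_i - L d_{i-1} - C d_i - L^T d_{i+1}. These names and the exact form of the update are assumptions, since the slide's equations are not in the transcript.

```python
def sign(x):
    return 1 if x >= 0 else -1

def matvec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def transpose(M):
    return [list(col) for col in zip(*M)]

def _update(y0, d_prev, d_cur, d_next, L, C, Lt):
    """Cancel interference from the previous, current and next bit-times."""
    past, present, future = matvec(L, d_prev), matvec(C, d_cur), matvec(Lt, d_next)
    return [sign(y0[k] - past[k] - present[k] - future[k])
            for k in range(len(y0))]

def pic_stage_stream(y0_stream, d_stream, L, C):
    """One PIC stage, bit by bit.

    Consumes matched-filter vectors y0_i and previous-stage decisions d_i,
    and yields the new d_i as soon as d_{i+1} has arrived. Only three
    decision vectors are held at any time; bits outside the stream count as 0.
    """
    Lt = transpose(L)
    zero = [0] * len(C)
    d_prev, d_cur, y_cur = zero, None, None
    for y_next, d_next in zip(y0_stream, d_stream):
        if d_cur is not None:
            yield _update(y_cur, d_prev, d_cur, d_next, L, C, Lt)
            d_prev = d_cur
        d_cur, y_cur = d_next, y_next
    if d_cur is not None:                     # flush the last bit
        yield _update(y_cur, d_prev, d_cur, zero, L, C, Lt)
```

Each output bit is available one bit-time after its input rather than after a whole window, and no bit has to be recomputed at a window edge.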

Pipelining the multiuser detector
[Figure: timing diagram over TIME with four rows, each processing bits 1-12 in sequence: Matched Filter (causal), PIC - Stage 1, PIC - Stage 2, PIC - Stage 3.]
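A rough model of the pipeline in the timing diagram is sketched below. It assumes that every stage lags the one before it by exactly one bit-time, which is a simplification introduced here and not a figure taken from the slides; the stage names follow the diagram and the function name is hypothetical.

```python
STAGES = ["MF", "PIC1", "PIC2", "PIC3"]

def pipeline_schedule(num_bits, stages=STAGES):
    """Return {time_step: {stage_name: bit_index}} for a streamed pipeline.

    Assumption: stage s works on bit (t - s) at time step t, so all stages
    are busy at once after the pipeline fills.
    """
    schedule = {}
    last_step = num_bits + len(stages) - 1
    for t in range(1, last_step + 1):
        active = {}
        for lag, name in enumerate(stages):
            bit = t - lag
            if 1 <= bit <= num_bits:
                active[name] = bit
        schedule[t] = active
    return schedule

sched = pipeline_schedule(12)
```

Under this model every bit leaves the last stage a fixed three bit-times after it enters the matched filter, so latency is constant and does not depend on the bit's position in a window.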

Pipelined architecture for multiuser detection

DSP: code matched filter detector
FPGAs for pipelining
- Flexibility of ASICs
- Good for parallelism and bit-level operations
[Figure: block diagram. Received bits and multiuser estimation feed the DSP (code matched filter detector), followed by FPGA1 (PIC Stage 1), FPGA2 (PIC Stage 2) and FPGA3 (PIC Stage 3), which outputs the detected bits.]

DSP simulations
[Figure: execution time (in seconds) vs. number of users (5 to 35) for the DSP implementation.]
Target data rate: 128 Kbps/user
DSP: MF; FPGAs: PIC

Summary
Simple, bit-streaming, pipelined multiuser detector
Avoids block computations
- Savings in memory by D^2
No edge bit computations in a window
- 2/D computational savings per stage
Lower constant latency by D
Can achieve real-time for up to 7 users

Test chip built as part of a VLSI course project
- Number of users supported: 4
- Area available: 3000x3000 inside the pad frame
- Area used: ~85%
- CMOS process: 0.5 micron
- Chip speed: 2 Mbps
http://www.owlnet.rice.edu/~sunbeam/422/
