Implementing Multiuser Channel Estimation and Detection for W-CDMA Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro and Behnaam Aazhang Rice.

Slides:



Advertisements
Similar presentations
Multiuser Detection for CDMA Systems
Advertisements

Programmable FIR Filter Design
Development of Parallel Simulator for Wireless WCDMA Network Hong Zhang Communication lab of HUT.
Real-Time DSP Multiprocessor Implementation for Future Wireless Base-Station Receivers Bryan Jones, Sridhar Rajagopal, and Dr. Joseph Cavallaro.
1 Wireless Communication Low Complexity Multiuser Detection Rami Abdallah University of Illinois at Urbana Champaign 12/06/2007.
EE360: Lecture 8 Outline Multiuser Detection
DSP Implementation of a BPSK SNR Estimation Algorithm for OFDM Systems in AWGN Channel University of Patras Department of Electrical & Computer Engineering.
Receiver Performance for Downlink OFDM with Training Koushik Sil ECE 463: Adaptive Filter Project Presentation.
Technion – Israel Institute of Technology Department of Electrical Engineering High Speed Digital Systems Lab Written by: Haim Natan Benny Pano Supervisor:
Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.
Viterbi Decoder Project Alon weinberg, Dan Elran Supervisors: Emilia Burlak, Elisha Ulmer.
Sub-Nyquist Sampling DSP & SCD Modules Presented by: Omer Kiselov, Daniel Primor Supervised by: Ina Rivkin, Moshe Mishali Winter 2010High Speed Digital.
Implementation Issues for Channel Estimation and Detection Algorithms for W-CDMA Sridhar Rajagopal and Joseph Cavallaro ECE Dept.
DSPs in Wireless Communication Systems Vishwas Sundaramurthy Electrical and Computer Engineering Department, Rice University, Houston,TX.
RICE UNIVERSITY Implementing the Viterbi algorithm on programmable processors Sridhar Rajagopal Elec 696
Efficient FPGA Implementation of QR
Techniques for Low Power Turbo Coding in Software Radio Joe Antoon Adam Barnett.
A bit-streaming, pipelined multiuser detector for wireless communications Sridhar Rajagopal and Joseph R. Cavallaro Rice University
1 UTILIZING CHANNEL CODING INFORMATION IN CIVA- BASED BLIND SEQUENCE DETECTORS Xiaohua(Edward) Li Department of Electrical and Computer Engineering State.
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
J. Christiansen, CERN - EP/MIC
FPGA Implementations for Volterra DFEs
RICE UNIVERSITY DSPs for 4G wireless systems Sridhar Rajagopal, Scott Rixner, Joseph R. Cavallaro and Behnaam Aazhang This work has been supported by Nokia,
TI DSPS FEST 1999 Implementation of Channel Estimation and Multiuser Detection Algorithms for W-CDMA on Digital Signal Processors Sridhar Rajagopal Gang.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal Srikrishna Bhashyam, Joseph R. Cavallaro,
RICE UNIVERSITY DSP architectures for wireless communications Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia.
RICE UNIVERSITY Advanced Wireless Receivers: Algorithmic and Architectural Optimizations Suman Das Rice University Department of Electrical and Computer.
RICE UNIVERSITY A real-time baseband communications processor for high data rate wireless systems Sridhar Rajagopal ECE Department Ph.D.
VIRGINIA POLYTECHNIC INSTITUTE & STATE UNIVERSITY MOBILE & PORTABLE RADIO RESEARCH GROUP MPRG Combined Multiuser Detection and Channel Decoding with Receiver.
RICE UNIVERSITY DSPs for future wireless systems Sridhar Rajagopal.
DSP Architectural Considerations for Optimal Baseband Processing Sridhar Rajagopal Scott Rixner Joseph R. Cavallaro Behnaam Aazhang Rice University, Houston,
Implementing algorithms for advanced communication systems -- My bag of tricks Sridhar Rajagopal Electrical and Computer Engineering This work is supported.
Pipelining and number theory for multiuser detection Sridhar Rajagopal and Joseph R. Cavallaro Rice University This work is supported by Nokia, TI, TATP.
Real-Time Turbo Decoder Nasir Ahmed Mani Vaya Elec 434 Rice University.
Spatiotemporal Saliency Map of a Video Sequence in FPGA hardware David Boland Acknowledgements: Professor Peter Cheung Mr Yang Liu.
RICE UNIVERSITY On the architecture design of a 3G W-CDMA/W-LAN receiver Sridhar Rajagopal and Joseph R. Cavallaro Rice University Center for Multimedia.
Overview of Implementation Issues for Multitier Networks on DSPs Joseph R. Cavallaro Electrical & Computer Engineering Dept. Rice University August 17,
SR: 599 report Channel Estimation for W-CDMA on DSPs Sridhar Rajagopal ECE Dept., Rice University Elec 599.
Algorithms and Architectures for Future Wireless Base-Stations Sridhar Rajagopal and Joseph Cavallaro ECE Department Rice University April 19, 2000 This.
RICE UNIVERSITY Handset architectures Sridhar Rajagopal ASICsProgrammable The support for this work in.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Optimal Sequence Allocation and Multi-rate CDMA Systems Krishna Kiran Mukkavilli, Sridhar Rajagopal, Tarik Muharemovic, Vikram Kanodia.
Channel Equalization in MIMO Downlink and ASIP Architectures Predrag Radosavljevic Rice University March 29, 2004.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Improved Resource Sharing for FPGA DSP Blocks
A programmable communications processor for future wireless systems
Sridhar Rajagopal April 26, 2000
Adaptation Behavior of Pipelined Adaptive Filters
Optimal Sequence Allocation and Multi-rate CDMA Systems
How to ATTACK Problems Facing 3G Wireless Communication Systems
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Ian C. Wong, Zukang Shen, Jeffrey G. Andrews, and Brian L. Evans
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
DSPs for Future Wireless Base-Stations
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
On-line arithmetic for detection in digital communication receivers
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal, Srikrishna Bhashyam,
DSPs in emerging wireless systems
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
DSP Architectures for Future Wireless Base-Stations
On-line arithmetic for detection in digital communication receivers
Suman Das, Sridhar Rajagopal, Chaitali Sengupta and Joseph R.Cavallaro
DSPs for Future Wireless Base-Stations
Presentation transcript:

Implementing Multiuser Channel Estimation and Detection for W-CDMA Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro and Behnaam Aazhang Rice University This work is supported by Nokia, Texas Instruments, Texas Advanced Technology Program and NSF

Organization Joint Estimation & Detection An Implementation-Friendly Scheme Simulations Architectural Features –Task Partitioning –Area-Time Tradeoffs Conclusions Future Work

Base-Station with MUD Multiple Users Channel Estimation Multiuser Detection Decoder Data Pilot Demod -ulator Antenna Decision Feedback MUXMUX Detected Bits + Base-station Receiver Delay MUXMUX d b

Joint Estimation & Detection Jointly estimate the channel response and detect all the user’s bits. Shown to have better performance as well as reduced computational complexity. Maximum Likelihood Based Channel Estimation –[C.Sengupta et al. : PIMRC’1998 WCNC’1999] Differencing Multistage Detection based on Parallel Interference Cancellation –[G.Xu et al. : SPIE’1999]

Computations Involved Model Compute Correlation Matrices Bits of K async. users aligned at times I and I-1 Received bits of spreading length N for K users riri bibi b i-1 time delay

Multishot Detection Solve for the channel estimate, A i

Differencing Multistage Detection Stage 0 [ Matched Filter Detector] Stage 1 [ to build differencing vector] Successive Stages S=diag(A H A) y - soft decision d - detected bits (hard decision )

Structure of A H A Not difficult to Compute A H A Block Bi-Diagonal Matrix : Use Structure

Drawbacks Matrix Inversion/ Decomposition Needed Result not available till end of computation –Delay before Detection Difficult for Tracking Higher Precision Needed – Floating Point Units Larger Memory Requirements –Storage of elements to compute inverse –Float = 32 bits / Input accuracy = bits SLOW! - Difficult to meet Real-Time –[S.Rajagopal et al. : TI DSPFest’1999]

Proposed Base-Station No Multiuser Detection TI's Wireless Basestation (

New Scheme Iterative Method to find the Channel Estimates – [S.Bhashyam et al. : WCNC’2000 (submitted)] Can be easily adapted to Tracking for Fading Channels Fixed Point Implementation Estimates ready for detection Immediately Simpler Hardware and Software. –Computation Savings only Per Bit

Iterative Scheme Tracking –Slow Fading : Large Window L –Fast Fading : Smaller Window L Method of Steepest Descent Stable convergence behavior μ fixed : Bit-by-Bit update Matches Closely to the Scheme with Inversions

Simulations - AGWN Channel Detection Window = 12 SINR = 0 Paths =3 Preamble = bits/user MF – Matched Filter ML- Maximum Likelihood ACT – using inversion

Fading Channel with Tracking Doppler = 10 Hz, 1000 Bits,15 users, 3 Paths

DSP Implementation C6201 Texas Instruments –Fixed Point Processor –200 MHz 32 -bit VLIW Architecture 8 Functional Units –2 Multipliers –4 Adders –2 Load/Store TI C Compiler

Simulation Work in Progress! Why better? –Fixed Point Implementation - Faster on DSPs –Higher Clock Speeds / Faster Multiplications –More SIMD Parallelism due to smaller wordlength. –Software Code Simpler to write Smaller Program Size Problems –Input Bit Precision Analysis –Overflows

Task - Partitioning the Algorithm Multiple Users Channel Estimation Multiuser Detection Decoder Data Pilot Demod -ulator Antenna Decision Feedback MUXMUX Detected Bits + Base-station Receiver Delay MUXMUX d b

Task Decomposition Matrix Products IterateCorrelation Matrices (Per Bit) R br [I] O(KN) A 0 H A 1 O(K 2 N) A H r O(KND ) A 1 H A 1 O(K 2 N) A 0 H A 0 O(K 2 N) A[I] O(K 2 N) Multistage Detection (Per Window) O(DK 2 M) b Pilot Data MUXMUX d Data’ MUXMUX A[R] O(K 2 N) d R br [R] O(KN) R bb O(K 2 ) Block I Block II Block III Block IV Channel Estimation Multistage Detection Task A Task B S.Das et al : Asilomar’99 TIME

Channel Estimation Architecture Detection Architecture –One version already ready –[G.Xu - Master’s Thesis 1999] Advantages over DSP Implementation: –Optimal Memory Utilization –Custom Blocks for exploiting available pipelining and parallelism –Parts could be mapped to FPGA / Reconfigurable logic –Shows theoretical bounds for maximum achievable Data Rates –Shows how tasks could be split among different processors

Block Diagram b 0 b 0 ’ (2K 2 ) bb’ (2 K 2 ) MUX (2K) MUX (N) MUX (2 K 2 ) Inverter (2 K 2 ) R bb (2 K 2 ) R br [R] (KN) Multiplier (2 K 2 N) A tmp [R] >> (4 K 2 ) A [R] (KN) b0 (2K) r0 (N) b r[R] Inverter (2K) Window MUX (2K) MUX (N) R br [I] (KN) A tmp >> (4 K 2 ) r0 (N) r[I] Inverter (2K) Multiplier (2 K 2 N) A [I] (KN) bit 8-bit REAL IMAG Each block shows no. of “operations” in it.

Channel Estimation Window bit 8-bit b 0 b 0 ’ (2K 2 ) bb’ (2 K 2 ) MUX (2K) MUX (N) MUX (2 K 2 ) Inverter (2 K 2 ) R bb (2 K 2 ) R br [R] (KN) Multiplier (2 K 2 N) A tmp [R] >> (4 K 2 ) A [R] (KN) b0 (2K) r0 (N) b r[R] Inverter (2K) REAL Each block shows no. of “operations” in it.

Auto-correlation Structure b 0 b 0 ’ (2K 2 ) bb’ (2 K 2 ) MUX (2 K 2 ) Inverter (2 K 2 ) b,b 0 are 1-bit Subtraction by using inverter Rbb using a Counter Fully Parallel 2K 2 elements O(1) Time Pipelined [with LOAD] 2K elements O(K) Time Serial [with LOAD] 1 element O(2K 2 ) Time R bb (2 K 2 )

Cross-Correlation Structure MUX (2K) MUX (N) R br [R] (KN) Inverter (2K) r is 8-bit, b is 1-bit Rbr using 8-bit Adders Based on sign of b Fully Parallel KN, O(1) Pipelined N, O(K) Serial 1, O(KN)

Iterative Update Structure R bb (2 K 2 ) R br [R] (KN) Multiplier (2 K 2 N) A tmp [R] >> (4 K 2 ) A [R] (KN) REAL 8-bit Multipliers 16-bit Adders for Multiplier 8-bit Adders for A Parallel KN, O(K) Pipelined N, O(K 2 ) Serial 1, O(K 2 N)

Elements in each block Example : N = 32,L =100, K =32 Fully Parallel Solution : 4K Multipliers, 12K Adders : O(32) Time Pipelined Solution :100 Multipliers, 300 Adders : O(1K) Time

Conclusions Iterative Scheme for Joint Estimation & Detection No loss in algorithm performance Suitable for Hardware Implementation –On DSPs, FPGAs and ASICs Supports Tracking for Fading Channels Fixed Point Implementation Feasible ASIC architecture –To exploit available pipelining and parallelism Multiuser Channel Estimation and Detection algorithms POSSIBLE to IMPLEMENT for W-CDMA.

Future Work MS Extend Architecture to Long Codes Task Partition the algorithm on the Sundance Multi- DSP/FPGA board to achieve real-time Post-MS Downlink Architectures to Min. Power Consumption /Area Implementing Coding/Decoding Blocks and integrate RENE’

EXTRA SLIDES

Data Rates Achieved Assuming Channel Estimation Real-Time

Fading Channel SNR = 10 dB, Doppler = 10 Hz, 1000 Bits