Implementation Issues for Channel Estimation and Detection Algorithms for W-CDMA Sridhar Rajagopal and Joseph Cavallaro ECE Dept.

Slides:



Advertisements
Similar presentations
Multiuser Detection for CDMA Systems
Advertisements

DSPs Vs General Purpose Microprocessors
1 Multi-user Detection Gwo-Ruey Lee. Wireless Access Tech. Lab. CCU Wireless Access Tech. Lab. 2 Outlines Multiple Access Communication Synchronous CDMA.
Interference Cancellation Algorithm with Pilot in 3GPP/LTE
Development of Parallel Simulator for Wireless WCDMA Network Hong Zhang Communication lab of HUT.
Real-Time DSP Multiprocessor Implementation for Future Wireless Base-Station Receivers Bryan Jones, Sridhar Rajagopal, and Dr. Joseph Cavallaro.
1 Wireless Communication Low Complexity Multiuser Detection Rami Abdallah University of Illinois at Urbana Champaign 12/06/2007.
Multiuser Detection in CDMA A. Chockalingam Assistant Professor Indian Institute of Science, Bangalore-12
MUD: Multi-user detection Jetmir Palushi Stevens Institute of Technology EE613 DSP for Communications.
The 3 rd MCM of COST 289: TU Košice, October 30-31, 2003 Technical University of Košice, Slovakia 1 of 27 THE PIECE-WISE LINEAR MICROSTATISTIC MULTI-USER.
EE360: Lecture 8 Outline Multiuser Detection
King Fahd University of Petroleum &Minerals Electrical Engineering Department EE-400 presentation CDMA systems Done By: Ibrahim Al-Dosari Mohammad.
Multi-user CDMA Enhancing capacity of wireless cellular CDMA.
10 January,2002Seminar of Master Thesis1 Helsinki University of Technology Department of Electrical and Communication Engineering WCDMA Simulator with.
1 Multiuser Detection for CDMA Anders Høst-Madsen (with contributions from Yu Jaechon, Ph.D student) TRLabs & University of Calgary.
Receiver Performance for Downlink OFDM with Training Koushik Sil ECE 463: Adaptive Filter Project Presentation.
King Fahd University of Petroleum & Minerals  Electrical Engineering Department EE 578 Simulation of Wireless Systems Code Division Multiple Access Transmission.
DSPs in Wireless Communication Systems Vishwas Sundaramurthy Electrical and Computer Engineering Department, Rice University, Houston,TX.
RICE UNIVERSITY Implementing the Viterbi algorithm on programmable processors Sridhar Rajagopal Elec 696
Cellular Mobile Communication Systems Lecture 5
1 Techniques to control noise and fading l Noise and fading are the primary sources of distortion in communication channels l Techniques to reduce noise.
System parameters and performance CDMA-2000, W-CDMA (UMTS), GSM 900, WLAN a, WLAN b, Bluetooth. By Øystein Taskjelle.
A bit-streaming, pipelined multiuser detector for wireless communications Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
RICE UNIVERSITY DSPs for 4G wireless systems Sridhar Rajagopal, Scott Rixner, Joseph R. Cavallaro and Behnaam Aazhang This work has been supported by Nokia,
TI DSPS FEST 1999 Implementation of Channel Estimation and Multiuser Detection Algorithms for W-CDMA on Digital Signal Processors Sridhar Rajagopal Gang.
Iterative Multi-user Detection for STBC DS-CDMA Systems in Rayleigh Fading Channels Derrick B. Mashwama And Emmanuel O. Bejide.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal Srikrishna Bhashyam, Joseph R. Cavallaro,
RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia.
RICE UNIVERSITY Advanced Wireless Receivers: Algorithmic and Architectural Optimizations Suman Das Rice University Department of Electrical and Computer.
Channel Independent Viterbi Algorithm (CIVA) for Blind Sequence Detection with Near MLSE Performance Xiaohua(Edward) Li State Univ. of New York at Binghamton.
RICE UNIVERSITY DSPs for future wireless systems Sridhar Rajagopal.
DSP Architectural Considerations for Optimal Baseband Processing Sridhar Rajagopal Scott Rixner Joseph R. Cavallaro Behnaam Aazhang Rice University, Houston,
Implementing algorithms for advanced communication systems -- My bag of tricks Sridhar Rajagopal Electrical and Computer Engineering This work is supported.
Pipelining and number theory for multiuser detection Sridhar Rajagopal and Joseph R. Cavallaro Rice University This work is supported by Nokia, TI, TATP.
RICE UNIVERSITY On the architecture design of a 3G W-CDMA/W-LAN receiver Sridhar Rajagopal and Joseph R. Cavallaro Rice University Center for Multimedia.
Implementing Multiuser Channel Estimation and Detection for W-CDMA Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro and Behnaam Aazhang Rice.
Overview of Implementation Issues for Multitier Networks on DSPs Joseph R. Cavallaro Electrical & Computer Engineering Dept. Rice University August 17,
Outline Transmitters (Chapters 3 and 4, Source Coding and Modulation) (week 1 and 2) Receivers (Chapter 5) (week 3 and 4) Received Signal Synchronization.
SR: 599 report Channel Estimation for W-CDMA on DSPs Sridhar Rajagopal ECE Dept., Rice University Elec 599.
Algorithms and Architectures for Future Wireless Base-Stations Sridhar Rajagopal and Joseph Cavallaro ECE Department Rice University April 19, 2000 This.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
Optimal Sequence Allocation and Multi-rate CDMA Systems Krishna Kiran Mukkavilli, Sridhar Rajagopal, Tarik Muharemovic, Vikram Kanodia.
Outline  Introduction (Resource Management and Utilization).  Compression and Multiplexing (Other related definitions).  The Most Important Access Utilization.
Channel Equalization in MIMO Downlink and ASIP Architectures Predrag Radosavljevic Rice University March 29, 2004.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Differencing Multistage Detector
Techniques to control noise and fading
A programmable communications processor for future wireless systems
Sridhar Rajagopal April 26, 2000
Equalization in a wideband TDMA system
Optimal Sequence Allocation and Multi-rate CDMA Systems
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
A 100 µW, 16-Channel, Spike-Sorting ASIC with On-the-Fly Clustering
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
VLIW DSP vs. SuperScalar Implementation of a Baseline H.263 Encoder
DSPs for Future Wireless Base-Stations
On-line arithmetic for detection in digital communication receivers
Enhancing capacity of wireless cellular CDMA
Enhancing capacity of wireless cellular CDMA
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal, Srikrishna Bhashyam,
DSP Architectures for Future Wireless Base-Stations
On-line arithmetic for detection in digital communication receivers
Suman Das, Sridhar Rajagopal, Chaitali Sengupta and Joseph R.Cavallaro
DSPs for Future Wireless Base-Stations
Presentation transcript:

Implementation Issues for Channel Estimation and Detection Algorithms for W-CDMA Sridhar Rajagopal and Joseph Cavallaro ECE Dept.

Contents Introduction W-CDMA Channel Estimation and Detection DSP Implementation ASIC Implementation Other Current Projects Future Work

The CDMA Research Group We cover the entire (spread) spectrum! – Algorithms – Implementation issues D M

Implementation Issues Important because –Real-time –Low Power –Mobility /Size DSPs –Signal Processing Communications ASICs / FPGAs –Speed / Size

W-CDMA CDMA : Code Division Multiple Access W-CDMA : Wideband CDMA (5 MHz) –Next Generation Communication Systems –Integrating Multimedia Capabilities –QoS /Multi-rate Services –Higher Data Rates 2048,384,144 Kbps

Uplink - Async, Multiuser Uplink - Async, Multiuser Direct Path Reflected Paths Base Station Noise + MAI User 1 User 2

Downlink - Sync, Single User Direct Path Reflected Paths Base Station Noise + MAI User 1 User 2 User 1

Determining the Channel Channel Estimation –Need to know the Channel for proper detection Delays and Amplitudes : Multiuser/path –Send sequence of known bits (Pilot / Preamble) –2 types Code Multiplexed with Data Time Multiplexed with Data Detection –Use knowledge of channel for detection of Data bits

W-CDMA Standards Not fixed yet…... Uplink –Channel Estimation - Time Multiplexed –Multiuser Detection Downlink –Channel Estimation - Common Pilot –Detection : Rake Receivers/ Equalizers

Channel Estimation Uplink –Time Multiplexed Maximum Likelihood Subspace Downlink –Continuous LMS Based Adaptive

Multiuser Detection Optimal Sub-optimal Linear Interference Cancellation Neural Network MAI Whitening Decorrelating MMSE Serial SIC Parallel PIC MLSE (Viterbi)

Base-Station Receiver Channel Estimator Multiuser Detector Demux Decoder Data Pilot Estimated Amplitudes & Delays Demodulator Antenna

CDMA Uplink System Channel Encoder Channel Encoder Channel Encoder Spreading AWGN Matched Filter Matched Filter Channel Estimator Matched Filter Multi- User Detector Channel Decoder + User 1 d 1 User 2 d 2 User K d K R(t) User 1 d 1 ' User 2 d 2 ' User K d K ' y1y1 y2y2 yKyK Demux

Maximum Likelihood Channel Estimation Send a time-multiplexed Preamble (Pilot). Channel properties extracted Compare with known pilot and estimate. Keep estimate for remaining data bits (static). Repeat preamble every frame, if no tracking.

The Maximum Likelihood Algorithm Compute the correlation matrices Compute the channel estimate Calculate the noise covariance matrix K. Calculate the channel impulse response vector z. Extract the ampitudes and delays using least squares fit.

The ML Algorithm Complexity Complex-Real Dot Product. Complex-Real Matrix Product. Complex -Real Product. Real Square roots. –Solving quadratic equation for least squares fit. Critical code : Matrix-vector / Dot Product Assuming Unity Noise Covariance Offline

Differencing Multistage Multiuser Detection Based on the principle of Parallel Interference Cancellation (PIC) Cross-correlation information used to remove interference of other users Repeated iterations for convergence Differencing techniques to improve performance

The Differencing Multistage Detector Split the cross correlation matrix into lower, upper and the diagonal matrix. Calculate impulse response x is called the differencing vector.

Multistage Detector Complexity Matrix Multiplication: –Computed only once for one frame Dot Product: –Computed iteratively Critical code: Dot Product

TI Tools Used Evaluation Modules (EVM) for C6201 and C6701 fixed and floating point DSPs –64 KB each internal program & data memory –256 KB SBSRAM, 8 MB SDRAM (external) C Compiler ver 3.0 from Code Generation Tools Code Composer ver 4.02 for profiling

DSP Implementation: Channel Estimation Floating point implementation found more feasible due to matrix inversions and square-roots. Code optimized for the DSP Use of Specialized approximate instructions –Approximate reciprocal square roots –Approximate reciprocals Use of Assembly Code for critical part. –TI's C67 floating point benchmarks for Matrix-Vector Multiplication & Dot Product Data Memory requirements for Channel Estimation

Approximate Instructions & Assembly L = 150, P =3, N= 31, SNR = 5dB, SINR = -10 dB Number of users --> Execution time(in milliseconds) --> Use of specialized instructions and assembly code on C6701 DSP C6701: Original C6701: with Intrinsics C6701: with Assembly 10% improvement 100% improvement

Data Memory Requirements Data to be placed in External memory 130 6

DSP Implementation: Multistage Detection 16-bit Fixed Point C Code Code optimized for the DSP Use of Assembly Code for critical part –TI's C62 fixed point assembly benchmarks for Dot Product Data memory requirements for Multistage Detection

Data Memory Requirements Data can be placed completely in Internal memory

x 10 4 Total Number of Iterations Number of Flops Users:K=15 SNR=6dB Conventional Method Differencing Method Flops Count conventional differencing 2X speedup for a three-stage detector

Real-Time Requirements Real-Time capability by C6201 DSP NUMBER OF USERS MAX BIT RATE PER USER (kb/s) SNR=10dB Window Size=12 Conventional Method Differencing Method 12users 150kb/s

Trends in Recent DSPs More internal memory and higher clock speeds –C6203 : 512 KB data, 384 KB program, 250 MHz –useful for uplink channel estimation algorithms. Specialized Blocks in the DSP Core. –Viterbi decoding in C54. Lower Voltage operation –1.2 V in C5402, useful for saving power consumption in the mobile.

ASIC Implementation MOSIS Tiny-Chip (40-pin DIP) –8 synchronous users –12-bit fixed point implementation –6000 transistors –1.2  m CMOS technology –190kb/s for each user – 3-stage cascade delay < 15  s

Advantages of ASICs Highly paralleled instructions: 4 RISC IPC (instructions per cycle) –accumulating while shifting, loading and storing –recoding while loading Application specific architecture –faster I/O –smaller on chip memory –smaller ALU

Chip (Single Stage) Architecture SHIFT ALUALU RECODER REG (L+L’)A Control Logic Internal signals External signals

Chip Layout 12-bit ALU Soft Decisions Cross- Correlation Recoding logic 2.0 mm

The Actual Chip Photograph

3-stage Cascade Mode Sin Hin Fin Load CLK Sout Hout Fout 1/2 Sin Hin Fin Load CLK Sout Hout Fout 1/2 Sin Hin Fin Load CLK Sout Hout Fout 1/2 Matche d Filter Output Detector Output Hand Shakin g Load R Clock Output Valid

System Timing Load R 1st Stage2nd Stage 3rd Stage Final Output

Interference Cancellation

Scalable ASIC Design Xilinx FPGA XC4000: 500k gates, 96MHz

DSP-ASIC Comparison TI’s ‘C54xx: General purpose DSP core + ASIC

Other Current Projects Simulation Testbed –Entire Chain of Algorithms Simulink - RTW Rapid Prototyping Matlab to DSP Copper Contest –Implementation of Multistage Detector using 0.15 micron Copper Technology

Wireless LAN Project Home Area Wireless LAN High Speed Office Wireless LAN Outdoor CDMA Cellular Network

Future Work Fixed Point Implementations on DSPs/ASICs – Uplink & Downlink Algorithms Approximations using Linear Algebra Support Long Codes and Fading Multistage Detector –Execution time Predictability – Increase Efficiency GPP Comparisons : Praful, Partha, Dr.Adve Effect of DMA and Caches