DSP Architectures for Future Wireless Base-Stations

Slides:



Advertisements
Similar presentations
Multiuser Detection for CDMA Systems
Advertisements

Development of Parallel Simulator for Wireless WCDMA Network Hong Zhang Communication lab of HUT.
Real-Time DSP Multiprocessor Implementation for Future Wireless Base-Station Receivers Bryan Jones, Sridhar Rajagopal, and Dr. Joseph Cavallaro.
1 Wireless Communication Low Complexity Multiuser Detection Rami Abdallah University of Illinois at Urbana Champaign 12/06/2007.
Multiuser Detection in CDMA A. Chockalingam Assistant Professor Indian Institute of Science, Bangalore-12
Submission May, 2000 Doc: IEEE / 086 Steven Gray, Nokia Slide Brief Overview of Information Theory and Channel Coding Steven D. Gray 1.
Sub-Nyquist Sampling DSP & SCD Modules Presented by: Omer Kiselov, Daniel Primor Supervised by: Ina Rivkin, Moshe Mishali Winter 2010High Speed Digital.
Implementation Issues for Channel Estimation and Detection Algorithms for W-CDMA Sridhar Rajagopal and Joseph Cavallaro ECE Dept.
Lecture 29 Fall 2006 Lecture 29: Parallel Programming Overview.
DSPs in Wireless Communication Systems Vishwas Sundaramurthy Electrical and Computer Engineering Department, Rice University, Houston,TX.
RICE UNIVERSITY Implementing the Viterbi algorithm on programmable processors Sridhar Rajagopal Elec 696
A bit-streaming, pipelined multiuser detector for wireless communications Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
RICE UNIVERSITY DSPs for 4G wireless systems Sridhar Rajagopal, Scott Rixner, Joseph R. Cavallaro and Behnaam Aazhang This work has been supported by Nokia,
TI DSPS FEST 1999 Implementation of Channel Estimation and Multiuser Detection Algorithms for W-CDMA on Digital Signal Processors Sridhar Rajagopal Gang.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal Srikrishna Bhashyam, Joseph R. Cavallaro,
RICE UNIVERSITY DSP architectures for wireless communications Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia.
RICE UNIVERSITY Advanced Wireless Receivers: Algorithmic and Architectural Optimizations Suman Das Rice University Department of Electrical and Computer.
1 WP2.3 “Radio Interface and Baseband Signal Processing” Content of D15 and Outline of D18 CAPANINA Neuchatel Meeting October 28th, 2005 – Marina Mondin.
RICE UNIVERSITY A real-time baseband communications processor for high data rate wireless systems Sridhar Rajagopal ECE Department Ph.D.
VIRGINIA POLYTECHNIC INSTITUTE & STATE UNIVERSITY MOBILE & PORTABLE RADIO RESEARCH GROUP MPRG Combined Multiuser Detection and Channel Decoding with Receiver.
RICE UNIVERSITY DSPs for future wireless systems Sridhar Rajagopal.
DSP Architectural Considerations for Optimal Baseband Processing Sridhar Rajagopal Scott Rixner Joseph R. Cavallaro Behnaam Aazhang Rice University, Houston,
Implementing algorithms for advanced communication systems -- My bag of tricks Sridhar Rajagopal Electrical and Computer Engineering This work is supported.
Pipelining and number theory for multiuser detection Sridhar Rajagopal and Joseph R. Cavallaro Rice University This work is supported by Nokia, TI, TATP.
RICE UNIVERSITY On the architecture design of a 3G W-CDMA/W-LAN receiver Sridhar Rajagopal and Joseph R. Cavallaro Rice University Center for Multimedia.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Implementing Multiuser Channel Estimation and Detection for W-CDMA Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro and Behnaam Aazhang Rice.
Overview of Implementation Issues for Multitier Networks on DSPs Joseph R. Cavallaro Electrical & Computer Engineering Dept. Rice University August 17,
SR: 599 report Channel Estimation for W-CDMA on DSPs Sridhar Rajagopal ECE Dept., Rice University Elec 599.
Algorithms and Architectures for Future Wireless Base-Stations Sridhar Rajagopal and Joseph Cavallaro ECE Department Rice University April 19, 2000 This.
RICE UNIVERSITY Handset architectures Sridhar Rajagopal ASICsProgrammable The support for this work in.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
An FFT for Wireless Protocols Dr. J. Greg Nash Centar ( HAWAI'I INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES Mobile.
Optimal Sequence Allocation and Multi-rate CDMA Systems Krishna Kiran Mukkavilli, Sridhar Rajagopal, Tarik Muharemovic, Vikram Kanodia.
Channel Equalization in MIMO Downlink and ASIP Architectures Predrag Radosavljevic Rice University March 29, 2004.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Andreas Hoffmann Andreas Ropers Tim Kogel Stefan Pees Prof
Differencing Multistage Detector
Hiba Tariq School of Engineering
Suman Das Rice University
A programmable communications processor for future wireless systems
Sridhar Rajagopal April 26, 2000
ADSL Time-Domain Equalizer
Optimal Sequence Allocation and Multi-rate CDMA Systems
How to ATTACK Problems Facing 3G Wireless Communication Systems
Pipelining and Vector Processing
Introduction to Digital Signal Processors (DSPs)
Sum of Absolute Differences Hardware Accelerator
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
VLIW DSP vs. SuperScalar Implementation of a Baseline H.263 Encoder
DSPs for Future Wireless Base-Stations
High Throughput LDPC Decoders Using a Multiple Split-Row Method
Physical Layer Approach for n
On-line arithmetic for detection in digital communication receivers
Sridhar Rajagopal COMP 625 April 17, 2000
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal, Srikrishna Bhashyam,
DSPs in emerging wireless systems
EM based Multiuser detection in Fading Multipath Environments
On-line arithmetic for detection in digital communication receivers
Suman Das, Sridhar Rajagopal, Chaitali Sengupta and Joseph R.Cavallaro
Information Sciences and Systems Lab
DSPs for Future Wireless Base-Stations
Presentation transcript:

DSP Architectures for Future Wireless Base-Stations Sridhar Rajagopal and Joseph Cavallaro ECE Department Rice University April 10, 2000 This work is supported by Texas Instruments, Nokia, Texas Advanced Technology Program and NSF

Overview Future Base-Stations Current DSP Implementation Our Approach Make Algorithms Computationally effective Task Partitioning for pipelining, parallelism DSP/ARM Extensions for Performance Acceleration TI Meeting 4/10/00

Evolution of Wireless Comm First Generation Voice Second/Current Generation Voice + Low-rate Data (9.6Kbps) Third Generation + Voice + High-rate Data (2 Mbps) + Multimedia W-CDMA TI Meeting 4/10/00

Communication System Uplink Direct Path Reflected Paths Noise +MAI User 1 User 2 Base Station TI Meeting 4/10/00

Main Processing Blocks Channel Estimation Detection Decoding Baseband Layer of Base-Station Receiver TI Meeting 4/10/00

No Multiuser Detection Proposed Base-Station TI's Wireless Basestation (http://www.ti.com/sc/docs/psheets/diagrams/basestat.htm) TI Meeting 4/10/00

Real -Time Requirements Multiple Data Rates by Varying Spreading Factors Detection needs to be done in real-time 1953 cycles available in a C6x DSP at 250MHz to detect 1 bit at 128 Kbps TI Meeting 4/10/00

Current DSP Implementation 9 10 11 12 13 14 15 2 4 6 8 16 18 x 10 Number of Users Data Rates Achieved Data Rate Comparisons for Matched Filter and Multiuser Detector Multiuser Detector(C67) Matched Filter(C67) Multiuser Detector(C64)* Matched Filter(C64)* Targeted Data Rate Targeted Data Rate = 128Kbps C67 at 166MHz Projected (8x) TI Meeting 4/10/00

Complexity Algorithm Choice Limited by Complexity Main Features Multistage reduces data rate by half. Main Features Matrix based operations High levels of parallelism Bit level computations 32x32 problem size for the Detector shown Estimation, Decoding assumed pipelined. TI Meeting 4/10/00

Reasons Sophisticated, Compute-Intensive Algorithms Need more MIPs/FLOPs performance Unable to fully exploit pipelining or parallelism Bit - level computations / Storage TI Meeting 4/10/00

Our Approach Make algorithms computationally effective without sacrificing error rate performance Task Partitioning on Multiple Processing Elements DSPs : Core FPGAs : Application Specific / Bit-level Computations VLSI Implementation to find extensions for DSPs. TI Meeting 4/10/00

Algorithms Channel Estimation Detection Avoid inversion by iterative scheme Detection Avoid block-based detection by pipelining TI Meeting 4/10/00

Computations Involved delay Model Compute Correlation Matrices ri bi bi+1 time Bits of K async. users aligned at times I and I-1 Received bits of spreading length N for K users TI Meeting 4/10/00

Solve for the channel estimate, Ai Multishot Detection Solve for the channel estimate, Ai Multishot Detection TI Meeting 4/10/00

Differencing Multistage Detection Stage 0- Matched Filter Stage 1 Successive Stages S=diag(AHA) y - soft decision d - detected bits (hard decision) TI Meeting 4/10/00

Iterative Scheme Tracking Method of Steepest Descent Stable convergence behavior Same Performance TI Meeting 4/10/00

Simulations - AWGN Channel 4 5 6 7 8 9 10 11 12 -3 -2 -1 Comparison of Bit Error Rates (BER) Signal to Noise Ratio (SNR) BER MF ActMF ML ActML O(K2N) O(K3+K2N) Detection Window = 12 SINR = 0 Paths =3 Preamble L =150 Spreading N = 31 Users K = 15 10000 bits/user MF – Matched Filter ML- Maximum Likelihood ACT – using inversion TI Meeting 4/10/00

Block Based Detector Matched Filter 1 12 Stage 1 Stage 2 Stage 3 1 12 11 22 Matched Filter Stage 1 Stage 2 Stage 3 Bits 2-11 Bits 12-21 TI Meeting 4/10/00

Pipelined Detector 1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5 6 7 8 9 10 11 12 Matched Filter 1 2 3 4 5 6 7 8 9 10 11 12 Stage 1 1 2 3 4 5 6 7 8 9 10 11 12 Stage 2 Stage 3 1 2 3 4 5 6 7 8 9 10 11 12 TI Meeting 4/10/00

Task Decomposition [Asilomar99] Block I Block II Block III Multistage Detector Correlation Matrices (Per Bit) Inverse Matrix Products Block IV M U X d A0HA1 O(K2N) Multistage Detection (Per Window) Rbr[R] O(KN) RbbAH = Rbr[R] O(K2N) b A0HA0 O(K2N) Rbr[I] O(KN) Data’ M U X RbbAH = Rbr[I] O(K2N) d O(DK2Me) Rbb O(K2) A1HA1 O(K2N) Pilot AHr O(KND) Data Channel Estimation Matched Filter TI Meeting 4/10/00

Achieved Data Rates 9 10 11 12 13 14 15 0.5 1 1.5 2 2.5 3 x 10 5 Number of Users Data Rates Data Rates for Different Levels of Pipelining and Parallelism (Parallel A) (Parallel+Pipe B) (Parallel A) (Pipe B) (Parallel A) B A B Sequential A + B Data Rate Requirement = 128 Kbps TI Meeting 4/10/00

VLSI Implementation Channel Estimation as a Case Study Area - Time Efficient Architecture Real - Time Implementation Bit- Level Computations - FPGAs Core Operations - DSPs TI Meeting 4/10/00

DSP Extensions for Performance Bit-level storage / processing support Registers / Memory / ALU Efficient Matrix -Based operations Matrix- Vector Multiply Support for Complex-valued data Efficient memory accesses Pre-fetching Data - C64 TI Meeting 4/10/00

(Converts Frames to Bits) Use of ARM Core Work on Higher Base Station Layers User Interface Translation Synchronization Transport Network OSI Layers 3-7 Data Link Layer (Converts Frames to Bits) Layer 2 Physical Layer (hardware; raw bit stream) 1 ARM DSP TI Meeting 4/10/00

Software Suggestions Limited OS Support Compiler Efficiency No more Assembly! Performance Analysis Tools Code Composer Studio 1.2 TI Meeting 4/10/00

Conclusions DSPs to play major role in Future Base-Station Implementations. Search for Computationally Efficient Algorithms and Better Processor Designs to meet Real-Time TI Meeting 4/10/00