Download presentation
Presentation is loading. Please wait.
Published byGwen Hopkins Modified over 8 years ago
1
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal Srikrishna Bhashyam, Joseph R. Cavallaro, and Behnaam Aazhang This work is supported by Nokia, TI, TATP and NSF
2
Motivation Computationally complex algorithms for base-stations – multiple users, high data rates – matrix inversions, floating point accuracy needed – DSP solutions infeasible for real-time [S.Das’99] Real-time implementations for baseband receiver? – multiuser channel estimation *S.Das et al., “Arithmetic Acceleration Techniques for Wireless Base-station Receivers”, Asilomar 1999
3
Contributions New estimation scheme – designed from an implementation perspective – bit-streaming, fixed-point architecture – reduced complexity, same error rate performance Real-time architecture design – exploit bit-level parallelism – area-constrained, time-constrained – real-time with minimum area
4
Baseband signal processing Multiple Users Base-Station Receiver Multiuser Channel estimation Multiuser Detection Decoding Antenna Information Bits TrackingTraining
5
Channel estimation Direct Path Reflected Path Noise +MAI User 1 User 2 Base Station Estimates unknown fading amplitudes and asynchronous delays.
6
Need for multiuser channel estimation Detector performance depends on estimation accuracy Best estimator : Maximum Likelihood => jointly estimate parameters for all users => Multiuser channel estimation Single-user sliding correlator used for implementation
7
Multiuser channel estimation algorithm - Training/Tracking bits - Received signal N - Spreading gain (typically fixed,e.g: 32) K - Number of users (variable, <=N) - Maximum Likelihood channel estimate
8
Outline Background Channel Estimation - An implementation perspective VLSI architectures –Area-constrained, Time-constrained, Area-Time efficient DSP Comparisons and Conclusions
9
Iterative scheme for channel estimation Bit-streaming, method of gradient descent Stable convergence behavior with µ Simple fixed-point architecture
10
456789101112 10 -3 10 -2 10 Comparison of Bit Error Rates (BER) Signal to Noise Ratio (SNR) BER Iterative Channel Est. Original Channel Est. O(K 2 N) O(K 3 +K 2 N) Simulations - Static multipath channel SINR = 0 dB Paths =3 Training =150 bits Spreading N = 31 Users K = 15
11
Outline Background Channel Estimation - An implementation perspective VLSI architectures – Area-constrained, Time-constrained, Area-Time efficient DSP Comparisons and Conclusions
12
Design specifications 32 Users (K) 32 spreading code length (N) Target = 128 Kbps – 4000 cycles available at 500 MHz Single cycle addition/multiplication
13
Task decomposition IterateCorrelation Matrices (Per Bit) A O(4K 2 N,8) R br O(2KN,8 ) R bb O(2K 2,8) TIME Channel Estimate to Detector b 0 (2K,1) Tracking Window r 0 (N,8) b L (2K,1) r L (N,8) L
14
Architecture design XNOR gates, UP/DOWN counters 8-bit adders 8-bit multipliers [Schulte’93] * Schulte, Swartzlander “Truncated Multiplication with Correction Constant”, Workshop on VLSI Signal Processing,1993
15
Area-constrained : Min. area, not real- time b0b0 bLbL Channel Estimate
16
Area-constrained : Hardware used
17
Time-constrained : Real time, large area b*b T b 0 *b 0 T bLbL b0b0 MUX R br MUXMUX rLrL r0r0 MUXMUX R bb A Mult Subtract >> Subtract 2K*1 K(2K-1)*1 2K 2 *8 2KN*16 2KN*8 2K*1 N*8 2KN*8 Channel Estimate
18
Time-constrained : Hardware used
19
Area-Time efficient architecture design Area - constrained – single 8-bit multiplier – cycles (128,000) [3.81 Kbps, 248 FA Cells] Time-constrained – 8-bit multipliers – log 2 (2K) cycles (6) [83.33 Mbps, 20,000,000 FA Cells] Goal : real-time with minimum area Different parallelism levels for multipliers
20
Area-Time efficient : Real-time, min. area b L *b L T b 0 *b 0 T bLbL b0b0 MUX MUXMUX rLrL r0r0 Mult Subtract >> Subtract 2K*1 2K*8 1*16 1*8 1*1 1*8 N*8 1*8 R br Counters StoreLoad R bb A (i) DEMUX MUX A (i-1) 1*8 Adder 1*8 2K*1 2K*8 Channel Estimate
21
Area-Time efficient : Hardware used
22
Outline Background Channel Estimation - An implementation perspective VLSI architectures –Area-constrained, Time-constrained, Area-Time efficient DSP Comparisons and Conclusions
23
DSP comparisons DSPs unable to exploit bit-level parallelism Inefficient storage of bits Unable to replace bit-multiplications by add/sub.
24
Scalability of architectures Design for maximum number of users in the system Fewer users – turn off functional units to reduce power – reconfigure hardware for higher data rates (FPGA) Investigating K-user design using K/2-user designs. Investigating DSP extensions
25
Conclusions New estimation scheme – designed from an implementation perspective – bit-streaming, fixed-point architecture – reduced complexity, same error rate performance Real-time architecture designs – exploit bit-level parallelism – area-constrained, time-constrained – real-time with minimum area => Real-time architectures for base-band signal processing
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.