Presentation is loading. Please wait.

Presentation is loading. Please wait.

High Throughput LDPC Decoders Using a Multiple Split-Row Method

Similar presentations


Presentation on theme: "High Throughput LDPC Decoders Using a Multiple Split-Row Method"— Presentation transcript:

1 High Throughput LDPC Decoders Using a Multiple Split-Row Method
Tinoosh Mohsenin and Bevan M. Baas VLSI Computation Lab, ECE Department University of California, Davis

2 Outline Introduction to LDPC Codes and Decoders
Multi-Split-Row Decoding Method Implementing Multi-Split-Row Decoders Conclusion

3 Error Correction in Communication Systems
Error correction is widely used in communication systems

4 LDPC Codes Applications
Standards Digital Video Broadcasting (DVB-S2): 2005 10 Gigabit Ethernet (10GBASE-T): 2006 Next generation of WiMAX Challenges with LDPC decoders High memory bandwidth requirement High interconnect complexity Many target applications are power and cost constrained

5 LDPC Decoding: Message Passing Algorithm
α Performs row and column operations iteratively Example (9,5) LDPC Code Code length (N) = 9 Information length = 5 Row weight (Wr) = 3 Column weight (Wc) = 2 Row processing Column processing β ú û ù ê ë é 1 Row Processing Column Processing H =

6 Message Passing (Row processing )
ú û ù ê ë é = 1 H SPA: MinSum:

7 Message Passing (Column processing )
ú û ù ê ë é = 1 H Column Processing is the received information from the channel

8 Decoder Architectures
Serial decoders Single row processor, column processor, shared memory Simple and small area Disadvantages Low throughput: 100 Kbps - 10 Mbps Semi-parallel decoders Multiple row and column processors, multiple memory banks Higher throughput Example: 2048-bit, rate-1/2, (3,6) programmable decoder [Mansour 2006] 14.3 mm2, 0.18 μm CMOS 125 MHz, 640 Mbps

9 Full Parallel Decoders
Row and column processors are directly mapped according to the parity check matrix Highest throughput Major challenges Routing congestion due to extrinsic information passed between row and column processors Large delay, area, and power caused by long wires Example: 1024-bit, irregular code, 4 bits per symbol, [Blanksby 2002] 52.5 mm2, 0.16 μm CMOS 64 MHz, 1Gbit/sec 5x384x32 =61440 5x2048x6 Row 1 2 384 Col 3 2048 M N

10 Multi-Split-Row Decoding Method
Outline Introduction to LDPC Codes Split-Row Decoder Algorithm Multi-Split-Row Decoding Method Implementing Multi-Split-Row Decoders Conclusion

11 Goals Very high throughputs Area efficient (small circuit area)
Therefore more energy efficient Well suited for long-length LDPC codes Well suited for hardware implementations

12 The Multi-Split-Row Decoder
Key ideas H matrix is split into multiple blocks Each block is processed almost independently Minimal information is shared between blocks Results Lower interconnect complexity Reduced processor complexity Hardware results Higher throughput Smaller decoder area and higher area utilization Slightly increased error rate

13 Standard vs. Multi-Split-Row Decoder

14 Multi-Split-Row Algorithm
The magnitude portion of the row processor output α is larger for the Multi-Split-Row decoder Sign Magnitude By normalizing the α values with a scale factor S<1 the error performance of Multi-Split-Row decoder is improved S

15 Optimum Scale factor Multi-Split-2 Multi-Split-4
Bit Error Probability Bit Error Probability Scale Factor = 0.2 Scale Factor = 0.3 (2048,1723) RS-based LDPC code used by 10 Gbit Ethernet standard Row weight: 32 Column weight: 6 No. of iterations:15

16 Bit Error Rate Performance Comparison
Code length: 2048 bits Message length: 1723 bits Row weight: 32 Column weight: 6 No. of iterations:15 SPA: Sum Product Algorithm [Mackay 1999] MinSum: [Fossorier 2002] WBF: Weighted Bit Flipping [Kou, Lin 2001] Improved WBF: [Fossorier 2004] BF: Bit Flipping [Gallager 1963] 0.35dB 0.25dB

17 Bit Error Rate Performance Comparison
Code length: 5256 bits Message length: bits Row weight: 72 Column weight: 6 No. of iterations: 15 0.25 dB 0.3 dB

18 Optimum Scale Factors for Different Codes
(N, K) (Wc,Wr) Optimum Scale Factor SP-2 SP-4 SP-6 SP-8 SP-12 (1536,770) (3,6) 0.45 - (1008,507) (4,8) 0.35 (1536,1155) (4,16) 0.4 0.25 (8088,6743) (4,24) 0.27 0.22 (2048,1723) (6,32) 0.3 0.2 * 0.15 (16352,14329) 0.16 (5248,4842) (5,64) (5256,4823) (6,72) 0.18 0.14 Multi-split row works best for: Regular codes High row-weight codes The optimum scale factor decreases as the partitioning of the H matrix increases

19 Implementing Multi-Split-Row Decoders
Outline Introduction to LDPC Codes and Decoder Arch Multi-Split-Row Decoding Method Implementing Multi-Split-Row Decoders Conclusion

20 Sign-wire implementation

21 Full-Parallel Decoder Implementations
Standard Multi-Split-Row-2 Multi-Split-Row-4 (2048,1723) RS-based (6,32) LDPC code

22 A Full-Parallel Decoder Implementation
Number of sign-passing wires is negligible compared to the total number of wires. TotalNumofWires = 2bMWr + 2(Spn-1)M (2048,1723) LDPC code with N = 2048 M (number of rows) = 384 b (bits per symbol) = 5 Wr = 32 Total number of wires Sign passing wires Ratio sign/total wires Split-Row-2 123,648 768 0.6% Split-Row-4 125,184 2304 2.0%

23 Full Parallel Decoder Chips
0.18 µm CMOS Technology, 6M layer No. of input + output registers Number of column processors row processors Individual row processor area (μm2) Standard 2x2048 2048 384 31,411 Split-Row-2 768 2 x 13,014 = 26,028 Split-Row-4 1536 4 x 5897 = 23,588

24 Three Full Parallel MinSum Decoders
Avg. wire length (mm) Chip size (mm2) Worst case speed (MHz) Decoding throughput (Gbps) Standard 0.32 139.1 10 1.4 Split-Row-2 0.20 75.8 16 2.2 Split-Row-4 0.11 43.9 52 7.1 Improvements for 2.9x 3.2x 5.1x (6,32) (2048,1723) RS-based LDPC code Resolution of 5 bits per message Throughputs calculated at 15 decoding iterations Results based on 0.18 µm CMOS, C

25 Conclusion Multi-Split-Row decoder method provides a significant reduction in circuit area Results in: Reduced wire interconnect complexity Increased circuit area utilization Increased speed Simpler implementation A good tradeoff between hardware complexity and error performance

26 Acknowledgments Support Thanks Intel Corporation UC MICRO
NSF Grant No NSF CAREER Award No UCD Faculty Research Grant Thanks Prof. Shu Lin Lan Lan Eric Work Zhiyi Yu


Download ppt "High Throughput LDPC Decoders Using a Multiple Split-Row Method"

Similar presentations


Ads by Google