Tinoosh Mohsenin 2, Houshmand Shirani-mehr 1, Bevan Baas 1 1 University of California, Davis 2 University of Maryland Baltimore County Low Power LDPC Decoder.

Slides:



Advertisements
Similar presentations
Doc.: IEEE /0071r1 Submission January 2004 Aleksandar Purkovic, Nortel NetworksSlide 1 LDPC vs. Convolutional Codes for n Applications:
Advertisements

Error Correction and LDPC decoding CMPE 691/491: DSP Hardware Implementation Tinoosh Mohsenin 1.
Inserting Turbo Code Technology into the DVB Satellite Broadcasting System Matthew Valenti Assistant Professor West Virginia University Morgantown, WV.
Houshmand Shirani-mehr 1,2, Tinoosh Mohsenin 3, Bevan Baas 1 1 VCL Computation Lab, ECE Department, UC Davis 2 Intel Corporation, Folsom, CA 3 University.
Network Coding Testbed Using Software-Defined Radio Abstract In current generation networks, network nodes operate by replicating and forwarding the packets.
Improving BER Performance of LDPC Codes Based on Intermediate Decoding Results Esa Alghonaim, M. Adnan Landolsi, Aiman El-Maleh King Fahd University of.
Arbitrary Bit Generation and Correction Technique for Encoding QC-LDPC Codes with Dual-Diagonal Parity Structure Chanho Yoon, Eunyoung Choi, Minho Cheong.
Cooperative Multiple Input Multiple Output Communication in Wireless Sensor Network: An Error Correcting Code approach using LDPC Code Goutham Kumar Kandukuri.
Submission May, 2000 Doc: IEEE / 086 Steven Gray, Nokia Slide Brief Overview of Information Theory and Channel Coding Steven D. Gray 1.
Turbo Codes Azmat Ali Pasha.
On the Construction of Energy- Efficient Broadcast Tree with Hitch-hiking in Wireless Networks Source: 2004 International Performance Computing and Communications.
Computer Architecture Project
An FPGA Based Adaptive Viterbi Decoder Sriram Swaminathan Russell Tessier Department of ECE University of Massachusetts Amherst.
Code and Decoder Design of LDPC Codes for Gbps Systems Jeremy Thorpe Presented to: Microsoft Research
Interconnect Efficient LDPC Code Design Aiman El-Maleh Basil Arkasosy Adnan Al-Andalusi King Fahd University of Petroleum & Minerals, Saudi Arabia Aiman.
Generalized Communication System: Error Control Coding Occurs In Right Column. 6.
Improving the Performance of Turbo Codes by Repetition and Puncturing Youhan Kim March 4, 2005.
The Role of Specialization in LDPC Codes Jeremy Thorpe Pizza Meeting Talk 2/12/03.
1 1 © 2011 The MathWorks, Inc. Accelerating Bit Error Rate Simulation in MATLAB using Graphics Processors James Lebak Brian Fanous Nick Moore High-Performance.
COGNITIVE RADIO FOR NEXT-GENERATION WIRELESS NETWORKS: AN APPROACH TO OPPORTUNISTIC CHANNEL SELECTION IN IEEE BASED WIRELESS MESH Dusit Niyato,
Low Density Parity Check (LDPC) Code Implementation Matthew Pregara & Zachary Saigh Advisors: Dr. In Soo Ahn & Dr. Yufeng Lu Dept. of Electrical and Computer.
COE4OI5 Engineering Design. Copyright S. Shirani 2 Course Outline Design process, design of digital hardware Programmable logic technology Altera’s UP2.
Labs Practicing in Design of Combinational Networks and FSM with Concurrent Error Detection Tatjana Stanković, Goran Djordjević, Mile Stojčev 2075 Microprocessor.
Tinoosh Mohsenin and Bevan M. Baas VLSI Computation Lab, ECE Department University of California, Davis Split-Row: A Reduced Complexity, High Throughput.
A 240ps 64b Carry-Lookahead Adder in 90nm CMOS Faezeh Montazeri Advanced VLSI Course Presentation University of Tehran December.
Optimal digital circuit design Mohammad Sharifkhani.
EKT 221/4 DIGITAL ELECTRONICS II  Registers, Micro-operations and Implementations - Part3.
Distributed computing using Projective Geometry: Decoding of Error correcting codes Nachiket Gajare, Hrishikesh Sharma and Prof. Sachin Patkar IIT Bombay.
Miss Insah Bhurtah Main Supervisor: Prof. K.M.S. Soyjaudah Associate Supervisor: Dr C Catherine.
Design of a High-Throughput Low-Power IS95 Viterbi Decoder Xun Liu Marios C. Papaefthymiou Advanced Computer Architecture Laboratory Electrical Engineering.
Introduction of Low Density Parity Check Codes Mong-kai Ku.
Coding Theory. 2 Communication System Channel encoder Source encoder Modulator Demodulator Channel Voice Image Data CRC encoder Interleaver Deinterleaver.
Performance and Power Analysis of Globally Asynchronous Locally Synchronous Multiprocessor Systems Zhiyi Yu, Bevan M. Baas VLSI Computation Lab, ECE department,
ISSCC 2008 Student Forum An 18 Gbps 2048-bit 10GBASE-T Ethernet LDPC Decoder Tinoosh Mohsenin Electrical & Computer Engineering, UC Davis
Real-Time Turbo Decoder Nasir Ahmed Mani Vaya Elec 434 Rice University.
Part 1: Overview of Low Density Parity Check(LDPC) codes.
Multi-Split-Row Threshold Decoding Implementations for LDPC Codes
Tufts University. EE194-WIR Wireless Sensor Networks. February 17, 2005 Increased QoS through a Degraded Channel using a Cross-Layered HARQ Protocol Elliot.
An ARQ Technique Using Related Parallel and Serial Concatenated Convolutional Codes Yufei Wu formerly with: Mobile and Portable Radio Research Group Virginia.
Semi-Parallel Reconfigurable Architecture for Real-time LDPC decoding Karkooti, M.; Cavallaro, J.R.; Information Technology: Coding and Computing, 2004.
Unit 1 Lecture 4.
Turbo Codes. 2 A Need for Better Codes Designing a channel code is always a tradeoff between energy efficiency and bandwidth efficiency. Lower rate Codes.
Code Construction and FPGA Implementation of a Low-Error-Floor Multi-Rate Low-Density Parity-Check Code Decoder Lei Yang, Hui Liu, C.-J Richard Shi Transactions.
Doc.: IEEE /0146r1 Submission March 2005 John Benko, Marie-Helene Hamon, France TelecomSlide 1 Advanced Coding Comparison Marie-Helene Hamon,
Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles Zhiyi Yu, Bevan Baas VLSI Computation Lab, ECE Department University of California,
A Low-Area Interconnect Architecture for Chip Multiprocessors Zhiyi Yu and Bevan Baas VLSI Computation Lab ECE Department, UC Davis.
Overcoming the Sensing-Throughput Tradeoff in Cognitive Radio Networks ICC 2010.
1 Aggregated Circulant Matrix Based LDPC Codes Yuming Zhu and Chaitali Chakrabarti Department of Electrical Engineering Arizona State.
Waseda University Low-Density Parity-Check Code: is an error correcting code which achieves information rates very close to the Shanon limit. Message-Passing.
Channel Coding and Error Control 1. Outline Introduction Linear Block Codes Cyclic Codes Cyclic Redundancy Check (CRC) Convolutional Codes Turbo Codes.
Institute for Experimental Mathematics Ellernstrasse Essen - Germany DATA COMMUNICATION introduction A.J. Han Vinck May 10, 2003.
Error Correction and LDPC decoding
ECE Department, University of California, Davis
VLSI Architectures For Low-Density Parity-Check (LDPC) Decoders
Q. Wang [USTB], B. Rolfe [BCA]
A Scalable Architecture for LDPC Decoding
Rate 7/8 (1344,1176) LDPC code Date: Authors:
Interleaver-Division Multiple Access on the OR Channel
January 2004 Turbo Codes for IEEE n
An Improved Split-Row Threshold Decoding Algorithm for LDPC Codes
Chapter 6.
High Throughput LDPC Decoders Using a Multiple Split-Row Method
Physical Layer Approach for n
Yiyu Shi*, Wei Yao*, Jinjun Xiong+ and Lei He*
On-line arithmetic for detection in digital communication receivers
January 2019 Project: IEEE P Working Group for Wireless Personal Area Networks (WPANs) Submission Title: [Rate 1/4 LDPC interleaver proposal] Date.
Low-Density Parity-Check Codes
<month year> <doc.: IEEE doc> January 2013
On-line arithmetic for detection in digital communication receivers
Comparisons of HARQ transmission schemes for 11be
Presentation transcript:

Tinoosh Mohsenin 2, Houshmand Shirani-mehr 1, Bevan Baas 1 1 University of California, Davis 2 University of Maryland Baltimore County Low Power LDPC Decoder with Efficient Stopping Scheme for Undecodable Blocks 1

LDPC Codes and Their Applications Low Density Parity Check (LDPC) codes have superior error correction performance Standards and applications 10 Gigabit Ethernet (10GBASE-T) Digital Video Broadcasting (DVB-S2, DVB-T2, DVB-C2) Next-Gen Wired Home Networking (G.hn) WiMAX (802.16e) WiFi (802.11n) Hard disks Deep-space satellite missions Signal to Noise Ratio (dB) Bit Error Probability dB Convolutional Uncoded Figure courtesy of B. Nikolic, 2003 (modified) 2 3 dB LDPC

Message Passing: Variable node processing λ is the original received information from the channel 3 α : message from check to variable node β : message from variable to check node

Message Passing: Check node processing (MinSum) 4 Sign Magnitude After check node processing, the next iteration starts with another variable node processing (begins a new iteration)

Early Termination for Decoder Convergence With early termination a high energy efficiency for a variety of SNRs can be achieved Existing work to detect undecodable blocks requires the knowledge of SNR or adds large hardware complexity [1] [2] [3] [4]. 5 [1] Z. Kai et al., 2008 [2] L. Z.Cui et al., 2007 [3] D. Shin,et al., 2007 [4] J. Li et al.,2006

LDPC Decoder Design Goals and Features Key goals Very high throughput and energy efficiency Area efficient (small circuit area) Good error performance Contributions Termination scheme for undecodable blocks Very low complexity Nearly no error performance loss Split-Row Threshold decoding Reduced interconnect complexity Reduced processor complexity 6

Outline Iterative LDPC Decoding Termination Scheme for Undecodable Blocks Split-Row Threshold Decoding Decoder Implementations and Results Conclusion

Proposed Stopping Method A block is most likely decodable if checksum value (S Check ) monotonically decreases as decoding iteration count increases [1], [2]. By checking checksum value in marked region, undecodable codewords can be identified. Results for (6,32) (1723,2048) 10GBASE-T code 8 SNR = 4.0 dB SNR = 3.6 dB [1] Z. Kai et al, 2008 [2] L. Z.Cui et al, 2007

Threshold Determination Checksum values for three consecutive iterations are compared with predefined TH1, TH2, and TH3 values. The iteration check and threshold values are obtained by simulations. BER results for (6,32) (1723,2048) 10GBASE-T code at SNR=4.3 dB. Optimum threshold values are between

Outline Iterative LDPC Decoding Termination Scheme for Undecodable Blocks Split-Row Threshold Decoding Decoder Implementations and Results Conclusion

MinSum vs. Split-Row Threshold Decoding MinSum decoding Split-Row Threshold decoding reduction of input wires to check processor reduction of check processor area each message is sent with at least 6 bit wire 11 Sign Sp0 Sign Sp1 Thresh Sp0 Thresh Sp1 Mohsenin, et al., ICC 2009, ISCAS 2009, TCAS2010, Patent 12/605078, filed 2009

Error Performance for 2048-bit 10GBASE-T Code dB 0.12 dB Sum Product Algorithm MinSum Normalized Split-Row-2 Threshold Split-Row-4 Threshold Split-Row-8 Threshold Split-Row-16 Threshold Split-Row-2 (Original)

13 Error Correction Performance and Convergence (contd.) 0.05 dB SNR loss compared to original decoding At SNR<3.2 dB, average no. of iterations is 2.3x smaller Results for (6,32) (1723,2048) 10GBASE-T code

Outline Iterative LDPC Decoding Termination Scheme for Undecodable Blocks Split-Row Threshold Decoding Decoder Implementations and Results Conclusion

Full parallel Decoder Implementation Check node partitions simultaneously compute locally, final output is updated using Sign and Threshold_en signals from nearest partition. Implemented five full parallel decoders for (6,32) (1723,2048) 10GBASE-T code 2048 variable processors, 384 check processors

Split-Row Threshold Decoder Physical Layout Synthesis RTL Power & Floor plan Placement Clk tree placement Route Post route optimization Chk Proc Var Proc

Comparison of Decoders 10GBASE-T Code 65 nm, 7 M, 1.3 V MinSumSplit-2 Threshold Split-4 Threshold Split-8 Threshold Split-16 Threshold Split-16 vs.MinSum Final area utilization38%51%85%92%97% 2.5x Area (mm 2 ) ÷3.8 Speed (MHz) x 15 iter (Gbps) x Energy per 15 iter (pJ/bit) ÷4.3 CAD route CPU time (hour) ÷ MinSum Split-2 Threshold Split-4 Threshold Split-16 Threshold Split-8 Threshold

Proposed Early-stopping Method Comparison 18

Conclusion Efficient method for stopping decoding for undecodable blocks is introduced. Split-Row Threshold decoding reduces the number of connections between check and variable processors. This results in a higher logic utilization and a smaller circuit. Energy efficiency is improved by 2.4x for SNR 4.3 dB over original decoding. 19

Acknowledgements Support ST Microelectronics NSF Grant and CAREER award Intel SRC GRC Grant 1598 and CSR Grant 1659 Intellasys UC Micro SEM