C Model Sim (Fixed-Point) -A New Approach to Pipeline FFT Processor

Slides:

Advertisements

Similar presentations

David Hansen and James Michelussi

Advertisements

Cost-Effective Pipeline FFT/IFFT VLSI Architecture for DVB-H System Present by: Yuan-Chu Yu Chin-Teng Lin and Yuan-Chu Yu Department of Electrical and.

Digital Kommunikationselektronik TNE027 Lecture 5 1 Fourier Transforms Discrete Fourier Transform (DFT) Algorithms Fast Fourier Transform (FFT) Algorithms.

ECE 734: Project Presentation Pankhuri May 8, 2013 Pankhuri May 8, point FFT Algorithm for OFDM Applications using 8-point DFT processor (radix-8)

1 Final project Speaker: Team 5 電機三黃柏森趙敏安 Mentor : 陳圓覺 Adviser: Prof. An-Yeu Wu Date: 2007/1/22.

Image Compression System Megan Fuller and Ezzeldin Hamed 1.

Design and VLSI implementation of a digital audio-specific DSP core for MP3/AAC Kyoung Ho Bang, Nam Hun Jeong, Joon Seok Kim, Young Cheol Park and Dae.

Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.

Real time DSP Professors: Eng. Julian Bruno Eng. Mariano Llamedo Soria.

ELEN 5346/4304 DSP and Filter Design Fall Lecture 12: Number representation and Quantization effects Instructor: Dr. Gleb V. Tcheslavski Contact:

Graduate Category: Engineering and Technology Degree Level: Ph.D. Abstract ID# 122 On-Chip Spectral Analysis for Built-In Testing and Digital Calibration.

Under-Graduate Project Mid-Term Paper Reading Presentation Adviser: Prof. An-Yeu Wu Mentor: 詹承洲第二組溫仁揚溫昌懌.

Student : Andrey Kuyel Supervised by Mony Orbach Spring 2011 Final Presentation High speed digital systems laboratory High-Throughput FFT Technion - Israel.

Computational Technologies for Digital Pulse Compression

IMAGE COMPRESSION USING BTC Presented By: Akash Agrawal Guided By: Prof.R.Welekar.

HW/SW PARTITIONING OF FLOATING POINT SOFTWARE APPLICATIONS TO FIXED - POINTED COPROCESSOR CIRCUITS - Nalini Kumar Gaurav Chitroda Komal Kasat.

Fast Memory Addressing Scheme for Radix-4 FFT Implementation Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Xin Xiao, Erdal Oruklu and.

EKT 221/4 DIGITAL ELECTRONICS II  Registers, Micro-operations and Implementations - Part3.

J. Greg Nash ICNC 2014 High-Throughput Programmable Systolic Array FFT Architecture and FPGA Implementations J. Greg.

Radix-2 2 Based Low Power Reconfigurable FFT Processor Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Gin-Der Wu and Yi-Ming Liu Department.

ESPL 1 Wordlength Optimization with Complexity-and-Distortion Measure and Its Application to Broadband Wireless Demodulator Design Kyungtae Han and Brian.

Paper Reading - A New Approach to Pipeline FFT Processor Presenter:Chia-Hsin Chen, Yen-Chi Lee Mentor:Chenjo Instructor:Andy Wu.

ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU CORDIC (Coordinate rotation digital computer) Ref: Y. H. Hu, “CORDIC based VLSI architecture.

ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Under-Graduate Project Case Study: Single-path Delay Feedback FFT Speaker: Yu-Min.

Automatic Evaluation of the Accuracy of Fixed-point Algorithms Daniel MENARD 1, Olivier SENTIEYS 1,2 1 LASTI, University of Rennes 1 Lannion, FRANCE 2.

Implementation of a noise subtraction algorithm using Verilog HDL University of Massachusetts, Amherst Department of Electrical & Computer Engineering,

Speaker: Darcy Tsai Advisor: Prof. An-Yeu Wu Date: 2013/10/31

Digital Signal Processing

DEPARTMENTT OF ECE TECHNICAL QUIZ-1 AY Sub Code/Name: EC6502/Principles of digital Signal Processing Topic: Unit 1 & Unit 3 Sem/year: V/III.

Under-Graduate Project Adviser: Prof. An-Yeu Wu Mentor: 詹承洲第二組溫仁揚溫昌懌.

A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.

Fast Fourier Transforms. 2 Discrete Fourier Transform The DFT pair was given as Baseline for computational complexity: –Each DFT coefficient requires.

Seok-jae, Lee VLSI Signal Processing Lab. Korea University

ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU CORDIC (COordinate Rotation DIgital Computer) For Advanced VLSI and VLSI Signal Processing.

An FFT for Wireless Protocols Dr. J. Greg Nash Centar ( HAWAI'I INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES Mobile.

FFT VLSI Implementation

EC1358 – DIGITAL SIGNAL PROCESSING

EE345S Real-Time Digital Signal Processing Lab Fall 2006 Lecture 17 Fast Fourier Transform Prof. Brian L. Evans Dept. of Electrical and Computer Engineering.

Low Power Design for a 64 point FFT Processor

CORDIC (Coordinate rotation digital computer)

CORDIC Based 64-Point Radix-2 FFT Processor

1 Paper reading A New Approach to FFT Processor Speaker: 吳紋浩第六組洪聖揚吳紋浩 Adviser: Prof. Andy Wu Mentor: 陳圓覺.

The content of lecture This lecture will cover: Fourier Transform

Lecture 1: Introduction and Numbers

Final Project Report 64 points FFT

CORDIC (Coordinate rotation digital computer)

DIGITAL SIGNAL PROCESSING ELECTRONICS

Fixed-pointed FFT model

Multipliers Multipliers play an important role in today’s digital signal processing and various other applications. The common multiplication method is.

Fast Fourier Transform

Subject Name: Digital Signal Processing Algorithms & Architecture

Centar ( Global Signal Processing Expo

A New Approach to Pipeline FFT Processor

Applications of Distributed Arithmetic to Digital Signal Processing:

Finite Wordlength Effects

DEPARTMENT OF INFORMATION TECHNOLOGY DIGITAL SIGNAL PROCESSING UNIT 4

University of Texas at Austin

Data Wordlength Reduction for Low-Power Signal Processing Software

A Parallel Fast Fourier Transform for Millimeter-wave Applications

Speaker: Yumin Adviser: Prof. An-Yeu Wu Date: 2013/10/24

DEPARTMENT OF INFORMATION TECHNOLOGY DIGITAL SIGNAL PROCESSING UNIT 4

95-1 Under-Graduate Project Fixed-point Analysis

FFT VLSI Implementation

95-1 Under-Graduate Project Paper Reading Presentation

Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.

Fixed-point Analysis of Digital Filters

Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.

Speaker: Chris Chen Advisor: Prof. An-Yeu Wu Date: 2014/10/28

Fast Fourier Transform

Presentation transcript:

C Model Sim (Fixed-Point) -A New Approach to Pipeline FFT Processor Presenter: Chia-Hsin Chen, Yen-Chi Lee Mentor: Chenjo Instructor: Andy Wu

Outline FFT Review FFT on Hardware What’s Fixed-Point Model Analysis RTL Implementation Conclusion & Future Works Reference 2019/4/7 Owen, Lee

FFT Review An efficient algorithm computes DFT Twiddle Factor: 2019/4/7 Owen, Lee

Radix-22 DIF Algorithm Proposed by S. He and M. Torkelson Applying a 3-dimensional linear index map 2019/4/7 Owen, Lee

Radix-22 DIF Algorithm (cont.) 2019/4/7 Owen, Lee

R22SDF Pipeline FFT Example: N=256 2019/4/7 Owen, Lee

Floating Point vs Fixed-point Use many bits to represent a number More precise but more computationally demanding Fixed-point Use finite bits to represent a number For hardware implementation 2019/4/7 Owen, Lee

Fixed-Point Model Quantization Saturation Truncation input & twiddle factor Saturation BF2I & BF2II Truncation multiplication 2019/4/7 Owen, Lee

Fixed-Point Model (Cont.) Quantization For initial values (input & twiddle factor) Table look up techniques Saturation For addition If there is a carry-out from MSB, set the number to maximum Truncation For multiplication Store the required bits form MSB, and omit the rest 2019/4/7 Owen, Lee

C/C++ Simulation 2019/4/7 Owen, Lee

Signal-of-Quantization-Noise Ratio SQNR : ratio of signal power to noise power Error range lies in ±1/2 LSB For each additional bit, the SQNR goes up by about 6dB 2019/4/7 Owen, Lee

Requirements 64-point R22SDF Dynamic range for input data:[-4,4] Number of bits for input data:10~14 Number of bits for output data:16~20 SQNR≧50dB Evaluation Equation: Score = Area * (clock period)2 2019/4/7 Owen, Lee

Analysis Overview Worst case for integer part : input x[n]=4 for n=0~63 → maximum output X[0]=4*64=256 → needs 8 bits for integer part But it is not often the case Based on the simulation result, 7 bits is enough 2019/4/7 Owen, Lee

Analysis Overview(Cont.) Fractional part and twiddle factor are closely related Fix one of them and alter the other, the effect is not obvious Therefore, fractional part and twiddle factor have to be increased simultaneously 2019/4/7 Owen, Lee

Plotting (Fix Integer Part) Fix integer part(5 – 8 bits) 2019/4/7 Owen, Lee

Average Case vs Single Case 2019/4/7 Owen, Lee

Plotting (Fix Fractional Part) Fix fractional part(7 – 10bits) 2019/4/7 Owen, Lee

Plotting (Fix Twiddle Factor) Fix twiddle factor(8 – 11bits) 2019/4/7 Owen, Lee

Average SQNR Table (7-bit integer part) frac\twi 6 7 8 9 10 11 12 13 5 33.5778 35.2070 37.5579 38.8910 39.3686 39.5589 39.6900 39.6237 35.6725 37.8124 41.4096 43.8977 44.9417 45.3195 45.4752 45.5751 36.7447 39.3552 43.9892 48.0684 50.0453 50.8486 51.2961 51.4483 37.2861 40.1783 45.5570 50.8939 54.2680 55.9788 56.8690 57.1086 37.6925 40.5367 46.2683 52.6784 57.2449 60.0804 62.0658 62.5697 37.7501 40.6730 46.7541 53.5746 59.0382 63.1174 66.3099 67.2398 37.8557 40.8907 46.9100 53.8641 59.9621 64.9456 69.5297 71.0005 37.8833 40.8881 47.0758 54.1090 60.3129 65.9172 71.5897 73.5548 2019/4/7 Owen, Lee

Possible Sets 2019/4/7 Owen, Lee

RTL Realization Just for functional work Word-lengths are the same through the whole process Integer : 8 bits (including sign bit) Fractional : 8 bits Twiddle : 12 bits (including sign bit) Omit area and timing consideration 2019/4/7 Owen, Lee

RTL Realization (Cont.) 16 points 64 points Logic elements: 2990 Registers : 2054 fmax : 21.46MHz 2019/4/7 Owen, Lee

RTL Realization (Cont.) Memory storage reducing Symmetry of twiddle factor Area reducing Different word-lengths of adder and multiplier Power saving Treat delay block as RAM 2019/4/7 Owen, Lee

Conclusion & Future Works We find some possible sets that meet the requirement Apply these sets of wordlengths to our RTL model Take area & clock period into consideration Power, if possible 2019/4/7 Owen, Lee

References S. He and M. Torkelson. “A new approach to pipeline FFT processor.” IEEE Proceedings of IPPS ’96. S. He and M. Torkelson. “Designing Pipeline FFT Processor for OFDM (de)Modulation.” ISSSE, pp. 257-262, Sept. 1998. J. Y. Oh and M. S. Lim. “New Radix-2 to the 4th Power Pipeline FFT Processor.” IEICE Trans. Electron., Vol.E88-C, No.8 Aug. 2005 E. E. Swartzlander, W. K. W. Young, and S. J. Joseph. “A radix 4 delay commutator for fast Fourier transform processor implementation.” IEEE J. Solid-State Circuits, SC-19(5):702-709, Oct. 1984. C. D. Thompson. “Fourier transform in VLSI.” IEEE Trans. Comput., C-32(11):1047-1057, Nov.1983. Y. Jung, Y. Tak, J. Kim, J. Park, D. Kim, and H. Park. “Efficient FFT Algorithm for OFDM Modulation.” Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. Vol.2 pp.676-678, 2001. A. M. Despain. “Very Fast Fourier Transform Algorithms Hardware for Implementation.” IEEE Trans. on Computers, Vol. c-28, No. 5, May 1979 A. –Y. Wu. “CORDIC.” Slides of Advanced VLSI Y. H. Hu. “CORDIC-based VLSI architectures for digital signal processing.” IEEE Signal Processing Magazine. Pp. 16-35. July 1992 J. G. Proakis. D. G. Manolakis. “Digital signal processing” 3rd edition, Prentice Hall 2019/4/7 Owen, Lee

Thanks for Your Attention Q & A ? 2019/4/7 Owen, Lee