Fast Memory Addressing Scheme for Radix-4 FFT Implementation Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Xin Xiao, Erdal Oruklu and.

Slides:



Advertisements
Similar presentations
David Hansen and James Michelussi
Advertisements

Cost-Effective Pipeline FFT/IFFT VLSI Architecture for DVB-H System Present by: Yuan-Chu Yu Chin-Teng Lin and Yuan-Chu Yu Department of Electrical and.
Control path Recall that the control path is the physical entity in a processor which: fetches instructions, fetches operands, decodes instructions, schedules.
Digital Kommunikationselektronik TNE027 Lecture 5 1 Fourier Transforms Discrete Fourier Transform (DFT) Algorithms Fast Fourier Transform (FFT) Algorithms.
Chapter 9 Computer Design Basics. 9-2 Datapaths Reminding A digital system (or a simple computer) contains datapath unit and control unit. Datapath: A.
ECE 734: Project Presentation Pankhuri May 8, 2013 Pankhuri May 8, point FFT Algorithm for OFDM Applications using 8-point DFT processor (radix-8)
Las Palmas de G.C., Dec IUMA Projects and activities.
1 Final project Speaker: Team 5 電機三 黃柏森 趙敏安 Mentor : 陳圓覺 Adviser: Prof. An-Yeu Wu Date: 2007/1/22.
Image Compression System Megan Fuller and Ezzeldin Hamed 1.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Henry Hexmoor1 Chapter 7 Henry Hexmoor Registers and RTL.
1 ReCPU:a Parallel and Pipelined Architecture for Regular Expression Matching Department of Computer Science and Information Engineering National Cheng.
Introduction to Fast Fourier Transform (FFT) Algorithms R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2003.
Chapter 7. Register Transfer and Computer Operations
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Recap – Our First Computer WR System Bus 8 ALU Carry output A B S C OUT F 8 8 To registers’ input/output and clock inputs Sequence of control signal combinations.
1 Real time signal processing SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Pipelining By Toan Nguyen.
Prepared by: Hind J. Zourob Heba M. Matter Supervisor: Dr. Hatem El-Aydi Faculty Of Engineering Communications & Control Engineering.
Computer Architecture and Organization
Basics and Architectures
Processor Architecture Needed to handle FFT algoarithm M. Smith.
Presenter: Hong-Wei Zhuang On-Chip SOC Test Platform Design Based on IEEE 1500 Standard Very Large Scale Integration (VLSI) Systems, IEEE Transactions.
Computer Architecture. “The design of a computer system. It sets the standard for all devices that connect to it and all the software that runs on it.
Computer Architecture and Organization Introduction.
Abdullah Aldahami ( ) Feb26, Introduction 2. Feedback Switch Logic 3. Arithmetic Logic Unit Architecture a.Ripple-Carry Adder b.Kogge-Stone.
Computer Design Basics
Lecture 8: Processors, Introduction EEN 312: Processors: Hardware, Software, and Interfacing Department of Electrical and Computer Engineering Spring 2014,
Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.
Radix-2 2 Based Low Power Reconfigurable FFT Processor Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Gin-Der Wu and Yi-Ming Liu Department.
Chap 7. Register Transfers and Datapaths. 7.1 Datapaths and Operations Two types of modules of digital systems –Datapath perform data-processing operations.
Paper Reading - A New Approach to Pipeline FFT Processor Presenter:Chia-Hsin Chen, Yen-Chi Lee Mentor:Chenjo Instructor:Andy Wu.
CDA 3101 Fall 2013 Introduction to Computer Organization The Arithmetic Logic Unit (ALU) and MIPS ALU Support 20 September 2013.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Under-Graduate Project Case Study: Single-path Delay Feedback FFT Speaker: Yu-Min.
Introduction to Microprocessors
E X C E E D I N G E X P E C T A T I O N S L3-CPU IS 4490 N-Tier Client/Server Architectures Dr. Hoganson Kennesaw State University Layer 3 - CPU CPU has.
Copyright © 2004, Dillon Engineering Inc. All Rights Reserved. An Efficient Architecture for Ultra Long FFTs in FPGAs and ASICs  Architecture optimized.
Speaker: Darcy Tsai Advisor: Prof. An-Yeu Wu Date: 2013/10/31
Updating Designed for Fast IP Lookup Author : Natasa Maksic, Zoran Chicha and Aleksandra Smiljani´c Conference: IEEE High Performance Switching and Routing.
Fundamentals of Programming Languages-II
A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.
REGISTER TRANSFER LANGUAGE (RTL) INTRODUCTION TO REGISTER Registers1.
1 3 Computing System Fundamentals 3.2 Computer Architecture.
Parallel Implementation of Fast Fourier Transform on a Multi-core System Tao Liu Chi-Li Yu Nov. 29, 2007.
An FFT for Wireless Protocols Dr. J. Greg Nash Centar ( HAWAI'I INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES Mobile.
NCTU, CS VLSI Information Processing Research Lab 研究生 : ABSTRACT Introduction NEW Recursive DFT/IDFT architecture Low computation cycle  1/2: Chebyshev.
Low Power Design for a 64 point FFT Processor
1 Paper reading A New Approach to FFT Processor Speaker: 吳紋浩 第六組 洪聖揚 吳紋浩 Adviser: Prof. Andy Wu Mentor: 陳圓覺.
Optimizing Interconnection Complexity for Realizing Fixed Permutation in Data and Signal Processing Algorithms Ren Chen, Viktor K. Prasanna Ming Hsieh.
Basic Computer Organization and Design
Computer Organization and Architecture + Networks
Final Project Report 64 points FFT
REGISTER TRANSFER LANGUAGE (RTL)
Computer Design Basics
Embedded Systems Design
KU College of Engineering Elec 204: Digital Systems Design
Architecture & Organization 1
Processor Architecture: Introduction to RISC Datapath (MIPS and Nios II) CSCE 230.
Processor Organization and Architecture
Architecture & Organization 1
A New Approach to Pipeline FFT Processor
ARM implementation the design is divided into a data path section that is described in register transfer level (RTL) notation control section that is viewed.
Multiplexor A multiplexor is a device that takes a number of data inputs and selects one of them to pass through as its output. The interface of a multiplexor.
Computer Design Basics
C Model Sim (Fixed-Point) -A New Approach to Pipeline FFT Processor
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Speaker: Chris Chen Advisor: Prof. An-Yeu Wu Date: 2014/10/28
Computer Architecture
Presentation transcript:

Fast Memory Addressing Scheme for Radix-4 FFT Implementation Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Xin Xiao, Erdal Oruklu and Jafar Saniie (Illinois Institute of Technology) Source: IEEE International Conference on Electro/Information Technology, eit ’09

Outline – Introduction – Radix4-FFT – Related Work – Proposed Method – Experimental Results – Conclusion 2

Introduction 3 Fast Fourier Transform (FFT) is widely applied in the speech processing, image processing, and communication system. One of the key components for various signal processing and communications applications such as software defined radio and OFDM.

Introduction(cont’d) 4

5 The main objective – This study is primarily Concerned Improving the performance of the address generation unit of the FFT processor by eliminating the complex critical path components.

Outline – Introduction – Radix4-FFT – Related Work – Proposed Method – Experimental Results – Conclusion 6

Introduction(cont’d) 7 Important FFT issues – High throughput – FFT size – Power consumption – Low cost – Area

Outline – Introduction – Radix4-FFT – Related Work – Proposed Method – Conclusion 8

Radix-4 9 The N-point discrete Fourier transform is defined by

Data Path of Radix-4 10

Butterfly Units The N-point FFT can be decomposed to repeated micro- operations called butterfly operations. When the size of the butterfly is r, the FFT operation is called a radix-r FFT. 11

Butterfly Units in Radix-4 12

Memory-based FFT In memory-based FFT architecture, only one butterfly structure is implemented in the chip, this butterfly unit will execute all the calculations recursively. 13

Execution Time 14

Outline – Introduction – Radix4-FFT – Related Work – Proposed Method – Experimental Results – Conclusion 15

Related Work 16 YearTitle 1969Organization of Large Scale Fourier Processors J. Assoc. Comput. Mach. 1976Simplified control of FFT hardwareIEEE Trans. Acoust., Speech, Signal Processing 1992Conflict free memory addressing for dedicated FFT hardware IEEE Trans. Circuits Syst. 1999An effective memory addressing scheme for FFT processors IEEE Trans. on Signal Process 2008An Efficient FFT Engine With Reduced Addressing Logic IEEE Transactions on Circuits and Systems II

Data Path of Radix-2 17

Data Path of Radix-4 18

Outline – Introduction – Radix4-FFT – Related Work – Proposed Method – Experimental Results – Conclusion 19

Memory Banks 20 Four memory banks are used to store the data.

Read Ports and Write Ports However, for pass 1 and pass 2, four inputs and four outputs of any butterfly stage belong to same memory bank. Since each memory bank is a two-port memory, at each clock cycle, each memory bank can export (read) once and import(write) once. Four clock cycles are necessary to perform four read and four write accesses in pass 1 and pass 2. 21

Counter D 22

Barrel Shifter The barrel shifter generates all the addresses for four memory banks based on the pass number of the FFT, which can be expressed as: RR(counter B, 2p) where RR(counter B, 2p) means rotate-right butterfly counter B by 2p bits, and p is the pass number of FFT. 23

Twiddle Factor 24

For Larger FFT Size For different length FFT transforms, the control logic of the multiplexers only depends on the last three bits of the counter,so the register and multiplexer structures are fixed for different length FFTs resulting in a common architecture for any N-point FFT. 25

Logic Minimization After logic minimization, it results in only primitive logic gates such as AND/OR gates using the least significant bits of the butterfly counter B. 26

Address Sequences(R0~R15) 27

Address Sequences(R16~R31) 28

Outline – Introduction – Radix4-FFT – Related Work – Proposed Method – Experimental Results – Conclusion 29

Experimental Results 30

Experimental Results 31

Outline – Introduction – Radix4-FFT – Related Work – Proposed Method – Experimental Results – Conclusion 32

Conclusions 33

Thanks for Listening 34