FFT VLSI Implementation

Slides:



Advertisements
Similar presentations
David Hansen and James Michelussi
Advertisements

CS364 CH16 Control Unit Operation
1 ECE734 VLSI Arrays for Digital Signal Processing Chapter 3 Parallel and Pipelined Processing.
Cost-Effective Pipeline FFT/IFFT VLSI Architecture for DVB-H System Present by: Yuan-Chu Yu Chin-Teng Lin and Yuan-Chu Yu Department of Electrical and.
1 Final project Speaker: Team 5 電機三 黃柏森 趙敏安 Mentor : 陳圓覺 Adviser: Prof. An-Yeu Wu Date: 2007/1/22.
ELEC692 VLSI Signal Processing Architecture Lecture 9 VLSI Architecture for Discrete Cosine Transform.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Why Systolic Architecture ? VLSI Signal Processing 台灣大學電機系 吳安宇.
Instruction Level Parallelism (ILP) Colin Stevens.
Chapter 15 Digital Signal Processing
DSP Implementation of a BPSK SNR Estimation Algorithm for OFDM Systems in AWGN Channel University of Patras Department of Electrical & Computer Engineering.
Low power and cost effective VLSI design for an MP3 audio decoder using an optimized synthesis- subband approach T.-H. Tsai and Y.-C. Yang Department of.
1 DSP Implementation on FPGA Ahmed Elhossini ENGG*6090 : Reconfigurable Computing Systems Winter 2006.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU A New Algorithm to Compute the Discrete Cosine Transform VLSI Signal Processing 台灣大學電機系.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Multirate Processing of Digital Signals: Fundamentals VLSI Signal Processing 台灣大學電機系 吳安宇.
An Energy-Efficient Reconfigurable Multiprocessor IC for DSP Applications Multiple programmable VLIW processors arranged in a ring topology –Balances its.
1 Miodrag Bolic ARCHITECTURES FOR EFFICIENT IMPLEMENTATION OF PARTICLE FILTERS Department of Electrical and Computer Engineering Stony Brook University.
VLSI Arithmetic Adders & Multipliers Prof. Vojin G. Oklobdzija University of California
A Bit-Serial Method of Improving Computational Efficiency of Dot-Products 1.
Under-Graduate Project Mid-Term Paper Reading Presentation Adviser: Prof. An-Yeu Wu Mentor: 詹承洲 第二組 溫仁揚 溫昌懌.
Software Defined Radio 長庚電機通訊組 碩一 張晉銓 指導教授 : 黃文傑博士.
Fast Memory Addressing Scheme for Radix-4 FFT Implementation Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Xin Xiao, Erdal Oruklu and.
Radix-2 2 Based Low Power Reconfigurable FFT Processor Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Gin-Der Wu and Yi-Ming Liu Department.
RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Undergraduate Projects Speaker: Wes Adviser: Prof. An-Yeu Wu Date: 2015/09/22 Lab.
Area: VLSI Signal Processing.
HPEC04 Panel Session 1 HPEC 2004 Panel Session: Amending Moore’s Law for Embedded Applications The Second Path: The Role of Algorithms in Maintaining Progress.
Paper Reading - A New Approach to Pipeline FFT Processor Presenter:Chia-Hsin Chen, Yen-Chi Lee Mentor:Chenjo Instructor:Andy Wu.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU CORDIC (Coordinate rotation digital computer) Ref: Y. H. Hu, “CORDIC based VLSI architecture.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Under-Graduate Project Case Study: Single-path Delay Feedback FFT Speaker: Yu-Min.
Applications of Distributed Arithmetic to Digital Signal Processing:
Implementing algorithms for advanced communication systems -- My bag of tricks Sridhar Rajagopal Electrical and Computer Engineering This work is supported.
DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –
OPTIMIZING DSP SCHEDULING VIA ADDRESS ASSIGNMENT WITH ARRAY AND LOOP TRANSFORMATION Chun Xue, Zili Shao, Ying Chen, Edwin H.-M. Sha Department of Computer.
Speaker: Darcy Tsai Advisor: Prof. An-Yeu Wu Date: 2013/10/31
A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU CORDIC (COordinate Rotation DIgital Computer) For Advanced VLSI and VLSI Signal Processing.
Fast VLSI Implementation of Sorting Algorithm for Standard Median Filters Hyeong-Seok Yu SungKyunKwan Univ. Dept. of ECE, Vada Lab.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Multirate Processing of Digital Signals (II): Short-Length FIR Filter VLSI Signal Processing.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Brief Overview of Residue Number System (RNS) VLSI Signal Processing 台灣大學電機系 吳安宇.
FFT VLSI Implementation
VLSI SP Course 2001 台大電機吳安宇 1 Why Systolic Architecture ? H. T. Kung Carnegie-Mellon University.
NCTU, CS VLSI Information Processing Research Lab 研究生 : ABSTRACT Introduction NEW Recursive DFT/IDFT architecture Low computation cycle  1/2: Chebyshev.
Array Multiplier Haibin Wang Qiong Wu. Outlines Background & Motivation Principles Implementation & Simulation Advantages & Disadvantages Conclusions.
CORDIC (Coordinate rotation digital computer)
CORDIC Based 64-Point Radix-2 FFT Processor
1 Paper reading A New Approach to FFT Processor Speaker: 吳紋浩 第六組 洪聖揚 吳紋浩 Adviser: Prof. Andy Wu Mentor: 陳圓覺.
CORDIC (Coordinate rotation digital computer)
DIGITAL SIGNAL PROCESSING ELECTRONICS
Embedded Systems Design
VLSI Testing Lecture 6: Fault Simulation
doc.: IEEE <doc#>
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTES
VLSI Testing Lecture 6: Fault Simulation
Comp 541 Wrap Up! Montek Singh Apr 27, 2018.
Introduction to Digital Signal Processors (DSPs)
Brief Overview of Residue Number System (RNS)
Real-time double buffer For hard real-time
A New Approach to Pipeline FFT Processor
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Fast Fourier Transformation (FFT)
Applications of Distributed Arithmetic to Digital Signal Processing:
C Model Sim (Fixed-Point) -A New Approach to Pipeline FFT Processor
A Parallel Fast Fourier Transform for Millimeter-wave Applications
Speaker: Yumin Adviser: Prof. An-Yeu Wu Date: 2013/10/24
95-1 Under-Graduate Project Paper Reading Presentation
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Speaker: Chris Chen Advisor: Prof. An-Yeu Wu Date: 2014/10/28
Suman Das, Sridhar Rajagopal, Chaitali Sengupta and Joseph R.Cavallaro
Presentation transcript:

FFT VLSI Implementation 台灣大學電機系 吳安宇 VLSI Signal Processing Shousheng He and Mats Torkelson, A new approach to pipeline FFT processor. IEEE Proc. Of IPPS, P766-770, 1996. E. Bidet, D. Castelain, C. Joanblanq, and P. Senn, A fast single-chip implementation of 8192 complex point FFT. IEEE J. Solid-State Circuits, P300-305, March 1995 Updated on 4/2/2001

FFT Review

Implementation --- Two Extreme Method Slow  ----------------- Speed ----------------- Fast Small  ------------------Area------------------- Large Complicated  ------------ Control --------------- Simple

Design Consideration System Requirement e.g., speed, area,power … Trade-off in these two cases, we need More Processing Elements (PE’s) Better Processing Element Utilization Rate Better Control Scheme

FFT Processor --- Block Diagram

Some Current Themes Radix-2 Single-path Delay Feedback. ( N = 16 ) Radix-2 Multi-path Delay Commutator. ( N = 16 )

Some Current Themes (cont.) Radix-4 Single-path Delay Feedback. ( N = 256 ) Radix-4 Single-path Delay Commutator. ( N = 256 ) Radix-4 Multi-path Delay Commutator. ( N = 256 )

Comparison Radix / Speed Low  ----------------------------------- High Control Theme Simple  ----------------------------------- Complex Processing Ability / Unit Low  ----------------------------------- High Combine the advantages  Further decompose high radix PE

Decompose Method (1) Simply ‘‘reuse’’ the repeated micro unit A radix-4 PE

Decompose Method (2) From algorithm level N/4 point FFT

Graphical Explanation (N=16)

Graphical Explanation (cont.) The Eqs are equivalent to the operations below

Circuit of BF2I and BF2II

Radix-22 Single-path Delay Feedback FFT architecture using the above technique, for N=256 Compare with original architecture

Conclusions FFT Applications: Radar Signal Processing, Fast convolution, Spectrum Estimation, OFDM-based Modulation/demodulations Efficient VLSI architectures (parallel processing) are required for real-time processing. However, most systems still employ DSP processors (e.g., TI C3x/C5x) for computations (fast algorithms like DIT and DIF FFT). VLIW (Very Long-length Instruction Word)-based processors (TI C6x) need new programming skills to utilize the two parallel MAC units.