Implementing and Optimizing a Direct Digital Frequency Synthesizer on FPGA Jung Seob LEE Xiangning YANG.

Slides:



Advertisements
Similar presentations
ADSP Lecture2 - Unfolding VLSI Signal Processing Lecture 2 Unfolding Transformation.
Advertisements

1 ECE734 VLSI Arrays for Digital Signal Processing Chapter 3 Parallel and Pipelined Processing.
ECE 734: Project Presentation Pankhuri May 8, 2013 Pankhuri May 8, point FFT Algorithm for OFDM Applications using 8-point DFT processor (radix-8)
Altera FLEX 10K technology in Real Time Application.
Dr. Subbarao Wunnava June 2006 “ Functional Microcontroller Design and Implementation ” Paper Authors : Vivekananda Jayaram Dr. Subbarao Wunnava Research.
Software Defined Radio Lec 7 – Digital Generation of Signals Sajjad Hussain, MCS-NUST.
6.375 Project Arthur Chang Omid Salehi-Abari Sung Sik Woo May 11, 2011
VLSI Communication SystemsRecap VLSI Communication Systems RECAP.
Minimizing Clock Skew in FPGAs
Bryan Lahartinger. “The Apriori algorithm is a fundamental correlation-based data mining [technique]” “Software implementations of the Aprioiri algorithm.
Hardware Implementation of Antenna Beamforming using Genetic Algorithm Kevin Hsiue Bryan Teague.
Institute of Applied Microelectronics and Computer Engineering © 2014 UNIVERSITY OF ROSTOCK | College of Computer Science and Electrical Engineering.
Caltech CS184a Fall DeHon1 CS184a: Computer Architecture (Structures and Organization) Day17: November 20, 2000 Time Multiplexing.
IUCEE Workshop presentation-YVJoshi VLSI Signal Processing Y. V. Joshi SGGS Institute of Engineering and Technology, Nanded.
Data Partitioning for Reconfigurable Architectures with Distributed Block RAM Wenrui Gong Gang Wang Ryan Kastner Department of Electrical and Computer.
Storage Assignment during High-level Synthesis for Configurable Architectures Wenrui Gong Gang Wang Ryan Kastner Department of Electrical and Computer.
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
Digital signature using MD5 algorithm Hardware Acceleration
Sub-Nyquist Sampling DSP & SCD Modules Presented by: Omer Kiselov, Daniel Primor Supervised by: Ina Rivkin, Moshe Mishali Winter 2010High Speed Digital.
Efficient Multi-Ported Memories for FPGAs Eric LaForest Greg Steffan University of Toronto Computer Engineering Research Group February 22, 2010.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.
Multirate Signal Processing
Coarse and Fine Grain Programmable Overlay Architectures for FPGAs
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Color.
EE302 Lesson 19: Digital Communications Techniques 3.
Implementation of MAC Assisted CORDIC engine on FPGA EE382N-4 Abhik Bhattacharya Mrinal Deo Raghunandan K R Samir Dutt.
The GNU in RADIO Shravan Rayanchu. SDR Getting the code close to the antenna –Software defines the waveform –Replace analog signal processing with Digital.
Software Defined Radio 長庚電機通訊組 碩一 張晉銓 指導教授 : 黃文傑博士.
A Fast Hardware Approach for Approximate, Efficient Logarithm and Anti-logarithm Computation Suganth Paul Nikhil Jayakumar Sunil P. Khatri Department of.
AMIN FARMAHININ-FARAHANI CHARLES TSEN KATHERINE COMPTON FPGA Implementation of a 64-bit BID-Based Decimal Floating Point Adder/Subtractor.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Selected.
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
Radix-2 2 Based Low Power Reconfigurable FFT Processor Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Gin-Der Wu and Yi-Ming Liu Department.
Accelerating Homomorphic Evaluation on Reconfigurable Hardware Thomas Pöppelmann, Michael Naehrig, Andrew Putnam, Adrian Macias.
1 A ROM-less DDFS Using A Nonlinear DAC With An Error Compensation Current Array Chua-Chin Wang, Senior Member, IEEE, Chia-Hao Hsu, Student Member, IEEE,
Digital Phase Control System for SSRF LINAC C.X. Yin, D.K. Liu, L.Y. Yu SINAP, China
1 Implementation in Hardware of Video Processing Algorithm Performed by: Yony Dekell & Tsion Bublil Supervisor : Mike Sumszyk SPRING 2008 High Speed Digital.
Floating-Point Divide and Square Root for Efficient FPGA Implementation of Image and Signal Processing Algorithms Xiaojun Wang, Miriam Leeser
Generating Sinusoidal Signals Prof. Brian L. Evans Dept. of Electrical and Computer Engineering The University of Texas at Austin EE 445S Real-Time Digital.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.
A High-Speed Hardware Implementation of the LILI-II Keystream Generator Paris Kitsos...in cooperation with Nicolas Sklavos and Odysseas Koufopavlou Digital.
ELEC692 VLSI Signal Processing Architecture Lecture 3
Spatiotemporal Saliency Map of a Video Sequence in FPGA hardware David Boland Acknowledgements: Professor Peter Cheung Mr Yang Liu.
© 2010 Altera Corporation - Public Lutiac – Small Soft Processors for Small Programs David Galloway and David Lewis November 18, 2010.
Software Defined Radios 長庚電機通訊組 碩一 張晉銓 指導教授 : 黃文傑博士.
Copyright © 2004, Dillon Engineering Inc. All Rights Reserved. An Efficient Architecture for Ultra Long FFTs in FPGAs and ASICs  Architecture optimized.
ESS | FPGA for Dummies | | Maurizio Donna FPGA for Dummies Basic FPGA architecture.
CORDIC-Based Processor
November 29, 2011 Final Presentation. Team Members Troy Huguet Computer Engineer Post-Route Testing Parker Jacobs Computer Engineer Post-Route Testing.
Exploiting Parallelism
MEG trigger system This short presentation describes the present status of the trigger algorithms of the MEG experiment implemented on the Xilinx FPGA.
An optimization of the SAFER+ algorithm for custom hardware and TMS320C6x DSP implementation. By: Sachin Garg Vikas Sharma.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.
VLSI Design of 2-D Discrete Wavelet Transform for Area-Efficient and High- Speed Image Computing - End Presentation Presentor: Eyal Vakrat Instructor:
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR Topics n FPGA fabric architecture concepts.
Resource Sharing in LegUp. Resource Sharing in High Level Synthesis Resource Sharing is a well-known technique in HLS to reduce circuit area by sharing.
CORDIC Based 64-Point Radix-2 FFT Processor
Backprojection Project Update January 2002
Hiba Tariq School of Engineering
Software Defined Radio PhD Program on Electrical Engineering
By: Mohammadreza Meidnai Urmia university, Urmia, Iran Fall 2014
FPGA Implementation of Multicore AES 128/192/256
Direct Digital Synthesis: Applications to Radar
C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs Shuo Wang1, Zhe Li2, Caiwen Ding2, Bo Yuan3, Qinru Qiu2, Yanzhi Wang2,
Software Defined Radio Expanded
Centar ( Global Signal Processing Expo
Multiplier-less Multiplication by Constants
C Model Sim (Fixed-Point) -A New Approach to Pipeline FFT Processor
Alireza Hodjat IVGroup
Presented by Mohsen Shakiba
Presentation transcript:

Implementing and Optimizing a Direct Digital Frequency Synthesizer on FPGA Jung Seob LEE Xiangning YANG

Direct Digital Frequency Synthesizer (DDFS) Generate sinusoid and cosinusoid waveform Widely used in digital communication system – Software radio – Spread-spectrum modulation – Phase shift-keying modulation

DFFS Algorithm Look-up table – Fast, less sophisticated design, high precision – Huge table == large area, inflexible Angle rotation: CORDIC, interpolation, etc. – Compact, area efficient, flexible – Complex design, low speed, wide internal data- path to ensure desired output precision

DFFS Algorithm Used in our project: hybrid method – Use LUT to gives a coarse position of the angle – Use angle rotation method to fine turn the angle Engineering trade-off of area efficiency, design complexity, computation need, and cycle times 16-bit internal data path to give 15 bit output precision 2 128x16-bit LUTs are needed – [Lee & Park] S. Lee and I. Park, “Quadrature direct digital synthesis using fine-grain angle rotation”, ISCAS’2004

Implementation Implementing the algorithm on FPGA with algorithm mapping and transforming approach – Non-pipeline Optimizing – cut-set retiming techniques => pipeline version

FPGA Implementation Choose Altera Cyclone II EP2C5 – Low cost FPGA, – On-line listed price: $12.8 – Internal RAM for LUT

Current Status A software implementation of the algorithm – For studying the characteristics of the algorithm Non-pipeline version is finished – Test bench – it works! – Synthesis: max sample output frequency (max operating frequency) : MHz

Next Stage Implementing the pipe-line version Further optimization: – Parallel processing ? – Buffering ?

Thank You Question ?

Pipelined Retiming Add registers to 3 stages: 2 stage -> 6 stage pipelined structure

Comparison between retiming and non-retiming version Maximum Clock rate – Non-pipelined version: 42.84MHz – Pipelined version: 116 MHz Critical path delay is reduced as much as 2.7 times Total cell Area change? – Non-pipelined version: – Pipelined version: