Computational Technologies for Digital Pulse Compression

Slides:

Advertisements

Similar presentations

DFT & FFT Computation.

Advertisements

Your Name Your Title Your Organization (Line #1) Your Organization (Line #2) Week 8 Update Joe Hoatam Josh Merritt Aaron Nielsen.

David Hansen and James Michelussi

© 2003 Xilinx, Inc. All Rights Reserved Course Wrap Up DSP Design Flow.

DSPs Vs General Purpose Microprocessors

Programmable FIR Filter Design

Digital Kommunikationselektronik TNE027 Lecture 5 1 Fourier Transforms Discrete Fourier Transform (DFT) Algorithms Fast Fourier Transform (FFT) Algorithms.

Distributed Arithmetic

Masters Presentation at Griffith University Master of Computer and Information Engineering Magnus Nilsson

ECE 734: Project Presentation Pankhuri May 8, 2013 Pankhuri May 8, point FFT Algorithm for OFDM Applications using 8-point DFT processor (radix-8)

ACHIZITIA IN TIMP REAL A SEMNALELOR. Three frames of a sampled time domain signal. The Fast Fourier Transform (FFT) is the heart of the real-time spectrum.

FPGA Programming for Real Time Analysis of Lidar Systems

Digital Signal Processing

CENG536 Computer Engineering Department Çankaya University.

Implement a 2x2 MIMO OFDM-based channel measurement system (no data yet) at 2.4 GHz Perform baseband processing and digital up and down conversion on Nallatech.

DFT Filter Banks Steven Liddell Prof. Justin Jonas.

Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.

Chapter 8: The Discrete Fourier Transform

1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.

Sampling, Reconstruction, and Elementary Digital Filters R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2002.

Chapter 15 Digital Signal Processing

20 October 2003WASPAA New Paltz, NY1 Implementation of real time partitioned convolution on a DSP board Enrico Armelloni, Christian Giottoli, Angelo.

Digital Kommunikationselektronik TNE027 Lecture 4 1 Finite Impulse Response (FIR) Digital Filters Digital filters are rapidly replacing classic analog.

Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.

A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.

1 Real time signal processing SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.

Kathy Grimes. Signals Electrical Mechanical Acoustic Most real-world signals are Analog – they vary continuously over time Many Limitations with Analog.

Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.

GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.

1 DSP Implementation on FPGA Ahmed Elhossini ENGG*6090 : Reconfigurable Computing Systems Winter 2006.

FPGA Based Fuzzy Logic Controller for Semi- Active Suspensions Aws Abu-Khudhair.

Inputs to Signal Generation.vi: -Initial Distance (m) -Velocity (m/s) -Chirp Duration (s) -Sampling Info (Sampling Frequency, Window Size) -Original Signal.

Prepared by: Hind J. Zourob Heba M. Matter Supervisor: Dr. Hatem El-Aydi Faculty Of Engineering Communications & Control Engineering.

Real time DSP Professors: Eng. Julian Bruno Eng. Mariano Llamedo Soria.

1 Miodrag Bolic ARCHITECTURES FOR EFFICIENT IMPLEMENTATION OF PARTICLE FILTERS Department of Electrical and Computer Engineering Stony Brook University.

Digital Pulse Amplitude Modulation (PAM)

Highest Performance Programmable DSP Solution September 17, 2015.

Student : Andrey Kuyel Supervised by Mony Orbach Spring 2011 Final Presentation High speed digital systems laboratory High-Throughput FFT Technion - Israel.

Floating Point vs. Fixed Point for FPGA 1. Applications Digital Signal Processing -Encoders/Decoders -Compression -Encryption Control -Automotive/Aerospace.

Efficient FPGA Implementation of QR

200/MAPLD 2004 Craven1 Super-Sized Multiplies: How Do FPGAs Fare in Extended Digit Multipliers? Stephen Craven Cameron Patterson Peter Athanas Configurable.

HW/SW PARTITIONING OF FLOATING POINT SOFTWARE APPLICATIONS TO FIXED - POINTED COPROCESSOR CIRCUITS - Nalini Kumar Gaurav Chitroda Komal Kasat.

Software Defined Radio 長庚電機通訊組碩一張晉銓指導教授 : 黃文傑博士.

ECE 448: Lab 6 DSP and FPGA Embedded Resources (Digital Downconverter)

Week 7 Lecture 1+2 Digital Communications System Architecture + Signals basics.

1 Reconfigurable Acceleration of Microphone Array Algorithms for Speech Enhancement Ka Fai Cedric Yiu, Yao Lu, Xiaoxiang Shi The Hong Kong Polytechnic.

ESPL 1 Wordlength Optimization with Complexity-and-Distortion Measure and Its Application to Broadband Wireless Demodulator Design Kyungtae Han and Brian.

MITSUBISHI ELECTRIC RESEARCH LABORATORIES Cambridge, Massachusetts High resolution SAR imaging using random pulse timing Dehong Liu IGARSS’ 2011 Vancouver,

Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.

Spatiotemporal Saliency Map of a Video Sequence in FPGA hardware David Boland Acknowledgements: Professor Peter Cheung Mr Yang Liu.

Copyright © 2004, Dillon Engineering Inc. All Rights Reserved. An Efficient Architecture for Ultra Long FFTs in FPGAs and ASICs  Architecture optimized.

Sponsored By Abstract 1 Ritamar Siurano – Undergraduate Student Prof. Domingo Rodriguez – Advisor Abigail Fuentes – Graduate StudentProf. Ana B. Ramirez.

DEPARTMENTT OF ECE TECHNICAL QUIZ-1 AY Sub Code/Name: EC6502/Principles of digital Signal Processing Topic: Unit 1 & Unit 3 Sem/year: V/III.

A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.

Recursive Architectures for 2DLNS Multiplication RESEARCH CENTRE FOR INTEGRATED MICROSYSTEMS - UNIVERSITY OF WINDSOR 11 Recursive Architectures for 2DLNS.

An FFT for Wireless Protocols Dr. J. Greg Nash Centar ( HAWAI'I INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES Mobile.

بسم الله الرحمن الرحيم Digital Signal Processing Lecture 14 FFT-Radix-2 Decimation in Frequency And Radix -4 Algorithm University of Khartoum Department.

FP7 Uniboard project Digital Receiver G. Comoretto, A. Russo, G. Tuccari, A Baudry, P. Camino, B. Quertier Dwingeloo, February 27, 2009.

Fang Fang James C. Hoe Markus Püschel Smarahara Misra

FFT-based filtering and the

Embedded Systems Design

Measurements of digital signals with spectrum analyzers

Centar ( Global Signal Processing Expo

Measurements of digital signals with spectrum analyzers

Convolution, GPS and the TigerSHARC XCORRS instr.

The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.

Govt. Polytechnic Dhangar(Fatehabad)

Real time signal processing

Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.

Introduction SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.

Presentation transcript:

Computational Technologies for Digital Pulse Compression A Comparison of Two Computational Technologies for Digital Pulse Compression Presented by Michael J. Bonato Vice President of Engineering Catalina Research Inc. – A Paravant Company High Performance Embedded Computing Conference 2002 MIT Lincoln Laboratory September 24, 2002

Goals of Presentation Highlight major design trade-offs when comparing an ASIC and FPGA solution for pulse compression Provide information to help choose the right tool for the right job

Outline Overview of pulse compression Comparison of computational approaches Trade-offs when mapping algorithm to an ASIC or FPGA Example analysis Other considerations Summary

Pulse Compression Overview Convolves return signal with complex conjugate of transmit waveform Produces peak where correlation occurs [1] Indicates location of target in range Compressed pulse narrower than width of transmit waveform (higher range resolution) Helps radar obtain good ranging accuracy with low instantaneous transmitter power Ability to produce narrow peaks depends upon transmit waveform’s Bandwidth Duration (length) Bandwidth • duration = Time Bandwidth Product (TBP) Higher TBP [2] Finer range resolution Lower instantaneous transmitting power Requires more computational horsepower

Pulse Compression Illustration (convolution with complex conjugate of transmit waveform) Received Signal (t) Compressed Received Signal (t) Two targets in receive window hard to pinpoint in time (range) Targets clearly stand out after compression

Approaches to Digital Pulse Compression Time domain convolution Filter time samples of receive window using Finite Impulse Response (FIR) filter Use transmit waveform samples as tap values (number of taps = TBP) Frequency domain complex multiplication FFT (of receive window) Complex multiplication by complex conjugate of FFT (transmit waveform) IFFT Overlap by TBP if sectioned convolution* Both approaches mathematically equivalent Convolution (time)  multiplication (frequency) * For DSP implementation, TBP = duration • sampling rate

Which Approach to Use? Computational efficiency is the driving factor Operations defined here as total number of multiplies and adds Number of FIR operations per input sample: Number of FFT operations per input vector: Both equations assume complex data = 8N – 2 where N = number of taps = 5 N log2 N where N = FFT length

Example: TBP = 256 FIR operations = 8 * 256 - 2 = 2046  2046 operations need to happen every new input sample FFT operations:  assume an FFT length of twice the TBP 5 * 512 * log2 (512) = 23,040  this needs to happen twice (once for FFT, once for IFFT)* = 2 * 23,040 = 46,080 operations  i.e. for every input vector, 46,080 operations need to occur  assuming sectioned convolution, overlap input vectors by TBP  thus, effective operations per input sample: 46,080 / ( 512 – 256 ) = 180 operations per new input sample FFT approach is over 11 times as efficient as FIR in this case! * Time domain window can be folded into first pass of FFT Complex multiplication can be folded in with first pass of IFFT

Computational Efficiency of FFT vs. FIR

Mapping FFTs into Hardware ASIC or FPGA? ASIC: Pathfinder-2 programmable frequency domain vector processor FPGA: Xilinx VirtexE Trade space considerations: Radar system parameters TBP Number of samples in the receive window Number of bits (precision and dynamic range) Performance (measured in Pulse Repetition Frequency)

Radar System Parameters FFT size determined by ( TBP + Ns - 1 ) [3] TBP = number of samples representing transmit pulse Ns = number of samples in receive window Longer FFTs need more Processing Larger radix cores More passes through the data Memory Bits = [ Pw + 2 (Rw / c) ] • Fs Pw = pulse width of transmit waveform Rw = range window of the radar c = speed of light Fs = sampling rate of digital receiver system

Number of Bits Today’s high speed ADCs 14 bits up to 100 MSPS 12 bits up to 200 MSPS FFT radix computations create word growth Radix 2 can cause growth of one bit just due to additions Radix 4: two bits Radix 16: four bits Longer FFT lengths require more radix passes More opportunity for growth

Floating Point vs. Fixed Point [4] Can lead to truncation or rounding errors for both addition and multiplication Overflows highly unlikely due to very large dynamic range Requires more hardware resources than fixed point (adders in particular) Fixed point Truncation or rounding errors occur only for multiplication Addition can lead to overflows Avoid by making word length sufficiently long (may not be practical) Avoid by shifting (scaling), but this can compromise precision

Performance: Pulse Repetition Frequency Defines how often the radar transmits pulses Higher PRFs imply Faster update rates and track loop closure Lower Doppler ambiguity Higher range ambiguity Time between transmit pulses sets a limit on the processing time available Conversely, the processing time required for a given FFT size limits the achievable PRF

Example Analysis Assume the following radar system parameters: Transmit Pulse Width 10.2 usec A/D Sampling Rate (Baseband) 10 MSPS Range Window 10 Km

Calculate FFT Size TBP = pulse width • sampling rate 10.2 usec • 10 MSPS = 102 samples Ns (number of samples in the receive window) [ 10.2 usec + 2 ( 10 Km / c ) ] • 10 MSPS = 769 samples FFT size = 102 + 769 – 1 = 870 samples minimum Round to power of two: 1024 points Well within capabilities of Pathfinder-2 or FPGA

Define Word Length Assume 14 bit ADC Assume one bit growth per radix 2 stage (ten stages for 1K FFT) Implies word length of 24 bits for fixed point operations For worst case input to FFT Assuming rest of system can support the dynamic range Fixed point implementation must Define sufficiently large word (accumulator), or Scale data input to each radix stage Blindly shift at every iteration (Xilinx 1K FFT 16 bit core) [5] Implement “intelligent” shifting (e.g. block floating point) Not an issue for floating point (Pathfinder-2)

Processing Performance Algorithm: window  CFFT  CMUL  IFFT for 1K vector Pathfinder-2 35.4 usec at 133 MHz clock Achievable PRF = 1 / 35.4 usec = 28.3 KHz assuming one channel 32 bit IEEE floating point Xilinx XCV2000E sizing estimate Assume 80 MHz clock rate Achievable PRF (with 75% utilization)  15 KHz (one channel) 24 bit fixed point Overflow still a concern 24 bits would suffice for 1K FFT alone (most applications) Does not provide for growth due to IFFT Scaling / shifting logic will still be needed

Additional Design Considerations Part count Minimum Pathfinder-2 solution requires Pathfinder-2 ASIC Three external address generators Three SRAM banks Small FPGA to act as a controller Entire solution could fit in XCV2000E Parts costs (estimated) Pathfinder-2 solution = $1,500 Xilinx XCV2000E = $2,900 Design flexibility and development What if you decide to change FFT sizes? What if you want to match against multiple transmit waveforms?

Pulse Compression Application (1K Vector Size) Summary Less demanding pulse compression application good match for FPGAs More demanding system requirements quickly drive solution towards a Pathfinder-2 type of approach Pulse Compression Application (1K Vector Size) Pathfinder-2 (ASIC) XCV2000E (FPGA) Higher PRFs Lower PRFs Higher Parts Count Lower Parts Count Less Expensive More Expensive Minimal Precision and Dynamic Range Concerns Valid Dynamic Range and Precision Concerns Easily Scalable to More Demanding Algorithms Not Easily Scalable to More Demanding Algorithms

References [1] Cook, Charles E., “Pulse Compression – Key to More Efficient Radar Transmission,” Barton Radar Systems Volume III, 1960. [2] Skolnik, Merrill I., Introduction to Radar Systems, McGraw-Hill Book Co., NY, 1962. [3] Brigham, Oran E., The Fast Fourier Transform, Prentice-Hall Inc., Englewood Cliffs, NJ, 1974. [4] Rabiner, L. R. and Gold, B., Theory and Application of Digital Signal Processing, Prentice-Hall Inc., Englewood Cliffs, NJ, 1975. [5] Xilinx Product Specification., “High Performance 1024-Point Complex FFT/IFFT V1.0.5,” Xilinx Inc., 2000.