UniBoard Meeting, October 12-13th 2010 Jonathan Hargreaves, JIVE EVN Correlator Design UniBoard Meeting, October 12-13 th 2010 Contract no. 227290.

Slides:



Advertisements
Similar presentations
Network II.5 simulator ..
Advertisements

1/1/ / faculty of Electrical Engineering eindhoven university of technology Speeding it up Part 3: Out-Of-Order and SuperScalar execution dr.ir. A.C. Verschueren.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
NetFPGA Project: 4-Port Layer 2/3 Switch Ankur Singla Gene Juknevicius
Digital FX Correlator Nimish Sane Center for Solar-Terrestrial Research New Jersey Institute of Technology, Newark, NJ EOVSA Technical Design Meeting.
FPGA Configuration. Introduction What is configuration? – Process for loading data into the FPGA Configuration Data Source Configuration Data Source FPGA.
FPGA Programming for Real Time Analysis of Lidar Systems
Spring 2003CSE P5481 Reorder Buffer Implementation (Pentium Pro) Hardware data structures retirement register file (RRF) (~ IBM 360/91 physical registers)
David Hawkins Exascale Signal Processing for Millimeter-Wavelength Radio Interferometers David Hawkins
Alice EMCAL Meeting, July 2nd EMCAL global trigger status: STU design progress Olivier BOURRION LPSC, Grenoble.
Reliable Data Storage using Reed Solomon Code Supervised by: Isaschar (Zigi) Walter Performed by: Ilan Rosenfeld, Moshe Karl Spring 2004 Part A Final Presentation.
Sampling, Reconstruction, and Elementary Digital Filters R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2002.
IO Controller Module Arbitrates IO from the CCP Physically separable from CCP –Can be used as independent data logger or used in future projects. Implemented.
Reliable Data Storage using Reed Solomon Code Supervised by: Isaschar (Zigi) Walter Performed by: Ilan Rosenfeld, Moshe Karl Spring 2004 Midterm Presentation.
Final Presentation Momentum Measurement Card Project supervised by: Mony Orbach Project performed by: Hadas Preminger, Uri Niv.
David Nelson STAVE Test Electronics July 1, ATLAS STAVE Test Electronics Preliminary V3 Presented by David Nelson.
6. The CPU and Memory Chapt. 7.
Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
Viterbi Decoder Project Alon weinberg, Dan Elran Supervisors: Emilia Burlak, Elisha Ulmer.
Inter TEL62 communication M. Raggi, M. Piccini, F. Gonnella 16 th October 2013 TDAQ Working Group Meeting.
DARPA Digital Audio Receiver, Processor and Amplifier Group Z James Cotton Bobak Nazer Ryan Verret.
Hardware Design of High Speed Switch Fabric IC. Overall Architecture.
Dr. Rabie A. Ramadan Al-Azhar University Lecture 6
PROCStar III Performance Charactarization Instructor : Ina Rivkin Performed by: Idan Steinberg Evgeni Riaboy Semestrial Project Winter 2010.
Casper 2010Marc Torres Part 2: Building blocks for the next generation.
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
ECE 448: Lab 6 DSP and FPGA Embedded Resources (Digital Downconverter)
LZRW3 Decompressor dual semester project Part A Mid Presentation Students: Peleg Rosen Tal Czeizler Advisors: Moshe Porian Netanel Yamin
Tera-Pixel APS for CALICE Jamie Crooks, Microelectronics/RAL SRAM Column Concept.
Real-time Acquisition and Processing of Data from the GMRT Pulsar Back- ends Ramchandra M. Dabade (VNIT, Nagpur) Guided By, Yashwant Gupta.
Testing of Latch-TDC Da-Shung Su Jia-Ye Chen, Hsi-Hung Yao, Su-Yin Wang, Ting-Hua Chang, Wen-Chen Chang 2011/07/13.
4/19/20021 TCPSplitter: A Reconfigurable Hardware Based TCP Flow Monitor David V. Schuehler.
Jason Manley, Aaron Parsons, Don Backer, Henry Chen, Terry Filiba, David MacMahon, Peter McMahon, Arash Parsa, Andrew Siemion, Dan Werthimer, Mel Wright.
Chapter 4 MARIE: An Introduction to a Simple Computer.
Company LOGO Final presentation Spring 2008/9 Performed by: Alexander PavlovDavid Domb Supervisor: Mony Orbach GPS/INS Computing System.
Microprocessor Microprocessor (cont..) It is a 16 bit μp has a 20 bit address bus can access upto 220 memory locations ( 1 MB). It can support.
FPGA firmware of DC5 FEE. Outline List of issue Data loss issue Command error issue (DCM to FEM) Command lost issue (PC with USB connection to GANDALF)
80386DX functional Block Diagram PIN Description Register set Flags Physical address space Data types.
High Speed Digital System Lab Final Presentation 1 semester project  Instructor: Mony Orbach  Students: Pavel Shpilberg Ohad Fundoianu.
Scanning FM Receiver SOC FPGA Design Lab Project by Marc Chiesa.
Chapter 2 Data Manipulation © 2007 Pearson Addison-Wesley. All rights reserved.
A Programmable Single Chip Digital Signal Processing Engine MAPLD 2005 Paul Chiang, MathStar Inc. Pius Ng, Apache Design Solutions.
CERN, 18 december 2003Coincidence Matrix ASIC PRR Coincidence ASIC modifications E.Petrolo, R.Vari, S.Veneziano INFN-Rome.
Mid presentation Part A Project Netanel Yamin & by: Shahar Zuta Moshe porian Advisor: Dual semester project November 2012.
Philippe Picard 2 nd SKADS Workshop October 2007 Station Processing Philippe Picard Observatoire de Paris Meudon, 11th October 2007.
Fundamentals of Programming Languages-II
DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester:
LKr readout and trigger R. Fantechi 3/2/2010. The CARE structure.
Atacama Large Millimeter/submillimeter Array Karl G. Jansky Very Large Array Robert C. Byrd Green Bank Telescope Very Long Baseline Array ALMA Correlator.
GBT protocol implementation on Xilinx FPGAs Csaba SOOS PH-ESE-BE-OT.
UniBoard Progress Meeting, December 2009 Jonathan Hargreaves, JIVE EVN Correlator Design UniBoard Progress Meeting, December 2009 Contract no
H. Krüger, , DEPFET Workshop, Heidelberg1 System and DHP Development Module overview Data rates DHP function blocks Module layout Ideas & open questions.
Status and Plans for Xilinx Development
UniBoard Meeting,October 12-13th 2010 Jonathan Hargreaves, JIVE Eric Kooistra, ASTRON UniBoard Testing UniBoard Meeting, October12-13 th 2010 Contract.
Netherlands Institute for Radio Astronomy 1 ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) Board Design Gijs Schoonderbeek.
JIVE UniBoard Correlator External Review
The UniBoard Generic Hardware for Radio Astronomy Signal Processing
The UniBoard A RadioNet FP7 Joint Research Activity, 9 partners
A Streaming FFT on 3GSPS ADC Data using Core Libraries and DIME-C
LFD Correlator MWA-LFD Kick-off Meeting San Diego 5-June-2006
JIVE UniBoard Correlator (JUC) Firmware
Correlator – Backend System Overview
New Crate Controller Development
Centar ( Global Signal Processing Expo
Layered Protocol Wrappers Design and Interface review
IV. Convolutional Codes
The Uniboard  FPGA Processing for Astronomy
UNIBOARD : VLBI APPLICATION CHARACTERIZATION
ADSP 21065L.
Presentation transcript:

UniBoard Meeting, October 12-13th 2010 Jonathan Hargreaves, JIVE EVN Correlator Design UniBoard Meeting, October th 2010 Contract no

UniBoard Meeting, October 12-13th 2010 Basic Configuration FN 0 Station Pol, 8 Bit, 64 MHz BW FN 1 Station Pol, 8 Bit, 64 MHz BW FN 2 Station Pol, 8 Bit, 64 MHz BW FN 3 Station Pol, 8 Bit, 64 MHz BW BN corr products 16 MHz BW BN corr products 16 MHz BW BN corr products 16 MHz BW BN corr products 16 MHz BW Station Data in to 10GbE Ports Correlation Products out to backend computer Works with 800MT/s DDR3

UniBoard Meeting, October 12-13th 2010 Expanded Configuration FN 0 Station Pol, 8 Bit, 128 MHz BW FN 1 Station Pol, 8 Bit, 128 MHz BW FN 2 Station Pol, 8 Bit, 128 MHz BW FN 3 Station Pol, 8 Bit, 128 MHz BW BN corr products 32 MHz BW BN corr products 32 MHz BW BN corr products 32 MHz BW BN corr products 32 MHz BW Station Data in to 10GbE Ports Correlation Products out to backend computer Needs 4GB DDR for FN

UniBoard Meeting, October 12-13th 2010 Spectral Line FN 0 Station Pol, 8 Bit, 64 MHz BW FN 1 Station Pol, 8 Bit, 64 MHz BW FN 2 Station Pol, 8 Bit, 64 MHz BW FN 3 Station Pol, 8 Bit, 64 MHz BW BN corr products 16 MHz BW BN corr products 16 MHz BW BN corr products 16 MHz BW BN corr products 16 MHz BW Station Data in to 10GbE Ports Correlation Products out to backend computer Subset of frequency points from each FN routed via spare 10GbE ports to another UniBoard

UniBoard Meeting, October 12-13th 2010 Proof of Concept First show we can get data flowing through by doing autocorrelations: 4 x 16MHz bands per station 2 polarizations in separate frames 2 bits per sample Use validity bits to tell the correlator which data to process Simplifications Delay model not needed for autocorrelations Only need one FN Can read out correlation products through 1Gb ports on BNs

UniBoard Meeting, October 12-13th 2010 FN - VDIF Frame Buffering

UniBoard Meeting, October 12-13th 2010 FN - VDIF Frame Buffering Assume 4 stations per port. If they are not used data gets marked invalid in the buffer VDIF header is checked for frame length, station, and band. If any are outside the range set by the controller, or the frame is marked invalid, the frame is ignored FIFOs can store two 8192 byte frames. A frame must be stored in the buffer before another one arrives Use validity bits to tell the correlator which data to process. One validity bit per frame which is set when good data arrives and reset when the data is readout Readout starts on an FFT period tick The scheduler tops up the read side FIFOs to maintain a continuous flow of data for the complete FFT period. Data can be read in an order which minimises overhead by jumping between DDR3 banks

UniBoard Meeting, October 12-13th 2010 FN – Polyphase Filter Bank PFB Sin/Cos Gen 0 X8 Phase Rotator PFB DM Control Sync and Validity FB Control Sync and validity Re-Quantize to 9 bits RQ Control Re-Quantize to 9 bits Validity Phase Rotator DS0 DS1 DS14 DS15 PFB Sync Validit y 0-15 Sin/Cos Gen 7 Phase Counter Sync Valid 0-15 X8

UniBoard Meeting, October 12-13th 2010 FN – Polyphase Filter Bank Each Filter bank produces 4096 frequency bins Can clock at over 300MHz but choose as this is the DDR3 local bus speed This is over 4 times real time for a 64MHz band (complex) but probably need to time share them due to memory constraints Still 128MHz per FN is realistic We are comparing 4 taps vs. 6 taps per frequency bin We can easily try different window functions... Kaiser vs. Blackman Harris etc. FFT should not re-order data – this can be done in the corner turner

UniBoard Meeting, October 12-13th 2010 FN to BN mesh Val, sync Framer for frequency bins Mesh Re-orderMesh Re-order To BN0 Data x16 Val, sync Data x16 Val, sync Data x16 Val, sync Data x16 Framer for frequency bins Framer for frequency bins Framer for frequency bins ALTGX 6.25Gbps serialize 8b/10b encoder ALTGX 6.25Gbps serialize 8b/10b encoder ALTGX 6.25Gbps serialize 8b/10b encoder ALTGX 6.25Gbps serialize 8b/10b encoder To BN1 To BN2 To BN3 FN0 ALTGX 6.25Gbps deserialize 8b/10b decoder From FN0 Alignment Logic Mesh Re-order Mesh Re-order FIFO FIFO BN0 ALTGX 6.25Gbps deserialize 8b/10b decoder ALTGX 6.25Gbps deserialize 8b/10b decoder ALTGX 6.25Gbps deserialize 8b/10b decoder From FN1 From FN2 From FN3 FIFO FIFO FIFO FIFO FIFO FIFO

UniBoard Meeting, October 12-13th 2010 BN Overview Frame no. within integration period DDR3 Module I (1GB) Scheduler Deframer FIFOFIFO DDR3 Controller IP DDR3 Module II (1GB) Data, validity DDR3 Controller IP Deframer FIFOFIFO Corner Turner Station code Start of integration Scheduler Correlator 132 Complex MACs Correlator 132 Complex MACs Data Buffer A Data Buffer B Validity Buffer A Validity Buffer B Validity bit accumulat or Copy output to Avalon MM registers

UniBoard Meeting, October 12-13th 2010 BN Correlation Engine a) b) c) 132 MAC cells clocked > 260MHz One half-DSP block plus 72 registers per MAC cell for 36 bit accumulation Sixteen passes: L-L, R-R, L-R, R-L

UniBoard Meeting, October 12-13th 2010 Development Path Delay models to be calculated in the FNs using coefficients downloaded from the control computer The four FNs need to be synchronised Measure average power in each frequency bin Increase BW throughput in the FN by increasing the duty cycle of the filter banks Increase BW correlated in the BN by inplementing two correlation engines Implement 10Gbps ports on the BN to export correlation products