BEE3 Update
January 2007 RAMP Retreat
Chuck Thacker, Technical Fellow, Microsoft Research
11 January 2007



Outline
  What is BEE3?
  BEE2 – BEE3 Differences
  Project participants
  Engineering plan, schedule

What is BEE3?
  Follow-on to BEE2 (BWRC, 2004)
  Board with several highly-connected FPGAs
  Vehicle for computer architecture research
    – Microsoft’s primary interest
  Potential platform for high performance DSP applications
    – Astronomers, and perhaps others.
  Allows large scale architectural experiments
    – Although perhaps not as large as originally hoped
    – And certainly not at the speed of a real implementation
  Can scale smoothly from a single board to 64 boards (256 FPGAs)

BEE2

BEE2 – BEE3 Differences
  4 Xilinx Virtex 5 vs 5 Virtex 2 Pro FPGAs
    – We use XC5VLX110T-ff1136
    – V2Pro is now obsolete (130nm)
    – V5 is a major improvement (65nm)
        6-input LUT (64-bit DP RAM)
        Better Block RAMs
        Improved interconnect
        Better signal integrity
  8 Infiniband/CX4 channels vs 18
  4 x8 PCI Express Low Profile slots

BEE2 – BEE3 Differences (2)
  2 Banks DDR2 x 2 vs 4 Banks DDR2 x 1
    – Same capacity (64 GB likely)
    – Lower bandwidth
    – Mandated by fewer signal pins on V5
  4 10/100/1000 Ethernet channels
  No SATA
    – BEE2 SATA didn’t work anyway
    – iSCSI instead (?)
  No PowerPCs
    – This version has not yet been released by Xilinx

BEE2 – BEE3 Differences (3)
  Divided the system into two boards, Main and Control
    – Main board has FPGAs, all high speed logic
    – Control board handles downloading, monitoring
    – Simplifies main board engineering; can design control board in parallel
  Smaller main board
    – 168 vs 374 in²
    – Fewer layers for lower cost
  Much more “PC-like”
  Can use PC power supplies, peripherals
  Several layouts are being considered
    – All fit in 2U enclosure
    – Much more attention is being given to thermal design
    – Must pass UL, FCC

BEE3 Main Board

Bandwidths (per-FPGA)
  Memory
    – 400 MT/s * 8 B/T * 2 channels: 6.4 GB/s
  Ring
    – 400 MT/s * 12 B/T: 4.8 GB/s
  QSH
    – 400 MT/s * 10 B/T: 4 GB/s
  Ethernet
    – 125 MB/s
  CX4
    – 1.25 GB/s * 2 directions * 2 channels: 5 GB/s
  PCI Express
    – Same as CX4
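
The per-FPGA figures on this slide are simple products of transfer rate, transfer width and channel count. The sketch below only re-derives them as a check. The helper name `parallel_mb_s` and the dictionary layout are illustrative assumptions and not part of any BEE3 tooling. Values are peak rates in decimal MB/s, and protocol overhead is ignored.

```python
# Re-derive the slide's per-FPGA peak bandwidths in MB/s (1 GB/s = 1000 MB/s).
# All input numbers come from the slide; the helper and names are hypothetical.

def parallel_mb_s(mega_transfers_per_s, bytes_per_transfer, channels=1):
    """Peak bandwidth in MB/s: MT/s * bytes per transfer * channel count."""
    return mega_transfers_per_s * bytes_per_transfer * channels

bandwidth_mb_s = {
    "Memory": parallel_mb_s(400, 8, channels=2),  # two DDR2 channels
    "Ring": parallel_mb_s(400, 12),
    "QSH": parallel_mb_s(400, 10),
    "Ethernet": 125,                              # 1000 Mb/s divided by 8
    "CX4": 1250 * 2 * 2,                          # 2 directions * 2 channels
}
# The slide states that PCI Express matches CX4.
bandwidth_mb_s["PCI Express"] = bandwidth_mb_s["CX4"]

for name, mb_s in bandwidth_mb_s.items():
    print(f"{name:12s} {mb_s / 1000:.3f} GB/s")
```

Working in integer MB/s avoids floating-point rounding when comparing against the slide's numbers.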

BEE3 Clocking, JTAG

BEE3 Control Board

BEE3 System (v1)

BEE3 System (v2)

BEE3 Main Board (v3)

Remaining Issues
  Precise EATX compatibility, or not?
    – Affects layout complexity, thermal design
  Power supply sizing
    – We don’t want to leave the overclockers in the lurch
  Standard power supplies (?)
    – “2U” supplies aren’t as efficient, have fewer vendors
    – Prefer Intel/Google “12V only” supplies (minimum loading issue), if available in time and at reasonable cost
  PCI Express is nonstandard
    – Xilinx hard macro is “device only”, not host
    – Need an intrepid graduate student
    – Can still use it for additional Infiniband/CX4 channels

Project Participants and Roles
  Microsoft Research (Silicon Valley)
    – Funds and manages system engineering
  Celestica (Ottawa and elsewhere)
    – Does main board engineering, produces final systems.
    – Microsoft has a very deep relationship with Celestica
  Function Engineering (Palo Alto)
    – Does thermal and mechanical engineering
  Xilinx (San Jose)
    – Provides FPGAs for academic machines
    – Provides FPGA application expertise
  RAMP Group (BWRC)
    – Control board, basic software
  RAMP Community
    – Uses the systems for research

Why is Microsoft interested?
  We believe the overall RAMP effort will have significant impact, and want to support it in the most effective way we can.
    – Simply paying for grad students seems suboptimal
  We observe that universities aren’t very good at this sort of system engineering and production.
    – Grad students are great for many things, but board layout isn’t among them.
    – Requires deep understanding of tools and production processes. Pros have this.
    – We can open doors that academia can’t.
    – We have experience in managing this sort of program.
  We want the systems themselves
    – As infrastructure for our new effort in computer architecture (yes, this is a recruiting pitch).
  We also want systems to be available to other industrial users
    – This might be more difficult if the systems came from academia.
    – But we don’t want to be in the hardware business.

Plan, schedule
  Generate design spec: 6 weeks
    – Scope layout problems and layer count
  Layout and signal integrity: 12 weeks
    – Parts procurement proceeds in parallel
    – Will probably do 4-5 prototypes.
  Board fab, test and assembly: 3 weeks
  Design verification testing: 5 weeks
    – This happens at Microsoft or BWRC
  Production can start in Summer ’07
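
As a rough sanity check on the "Summer ’07" target, the phase durations can be summed. This sketch rests on two assumptions that the slide does not state: that the four phases run strictly back to back, and that work starts on the date of the talk (11 January 2007). The variable names are illustrative only.

```python
from datetime import date, timedelta

# Phase durations in weeks, copied from the slide.
phases = [
    ("Generate design spec", 6),
    ("Layout and signal integrity", 12),
    ("Board fab, test and assembly", 3),
    ("Design verification testing", 5),
]

# Assumption: work begins on the talk date; the slide gives no start date.
start = date(2007, 1, 11)

# Assumption: phases are sequential with no gaps or overlap
# (the slide notes only that parts procurement runs in parallel).
total_weeks = sum(weeks for _, weeks in phases)
finish = start + timedelta(weeks=total_weeks)

print(f"Total: {total_weeks} weeks, earliest finish {finish.isoformat()}")
```

Under these assumptions the phases total 26 weeks and end in mid-July 2007, which is consistent with production starting in the summer. Any slip in the start date or gap between phases pushes that later.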

Discussion?