1 RAMP White RAMP Retreat, BWRC, Berkeley, CA 20 January 2006 RAMP collaborators: Arvind (MIT), Krste Asanovíc (MIT), Derek Chiou (Texas), James Hoe (CMU),

Slides:



Advertisements
Similar presentations
RAMP Gold : An FPGA-based Architecture Simulator for Multiprocessors Zhangxi Tan, Andrew Waterman, David Patterson, Krste Asanovic Parallel Computing Lab,
Advertisements

© Krste Asanovic, 2014CS252, Spring 2014, Lecture 5 CS252 Graduate Computer Architecture Spring 2014 Lecture 5: Out-of-Order Processing Krste Asanovic.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture Cache III Steve Ko Computer Sciences and Engineering University at Buffalo.
4/16/2013 CS152, Spring 2013 CS 152 Computer Architecture and Engineering Lecture 19: Directory-Based Cache Protocols Krste Asanovic Electrical Engineering.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture Snoopy Caches I Steve Ko Computer Sciences and Engineering University at Buffalo.
© Krste Asanovic, 2014CS252, Spring 2014, Lecture 12 CS252 Graduate Computer Architecture Spring 2014 Lecture 12: Synchronization and Memory Models Krste.
Jared Casper, Ronny Krashinsky, Christopher Batten, Krste Asanović MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA A Parameterizable.
RAMP in Retrospect David Patterson August 25, 2010.
Computer Architecture Lab at Combining Simulators and FPGAs “An Out-of-Body Experience” Eric S. Chung, Brian Gold, James C. Hoe, Babak Falsafi {echung,
1 Jan 07 RAMP PI Report: Plans until next Retreat & Beyond Krste Asanovíc (MIT), Derek Chiou (Texas), James Hoe(CMU), Christos Kozyrakis (Stanford), Shih-Lien.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture Snoopy Caches II Steve Ko Computer Sciences and Engineering University at Buffalo.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture Directory-Based Caches II Steve Ko Computer Sciences and Engineering University at Buffalo.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture Cache II Steve Ko Computer Sciences and Engineering University at Buffalo.
EECS Electrical Engineering and Computer Sciences B ERKELEY P AR L AB P A R A L L E L C O M P U T I N G L A B O R A T O R Y EECS Electrical Engineering.
CSE 490/590, Spring 2011 CSE 490/590 Computer Architecture Directory-Based Caches I Steve Ko Computer Sciences and Engineering University at Buffalo.
1 RAMP Implementation J. Wawrzynek. 2 RDL supports multiple platforms:  XUP, pure software, BEE2 BEE2 will be the standard RAMP platform for the next.
© Derek Chiou 1 RAMP-White Derek Chiou and Hari Angepat The University of Texas at Austin Supported in part by DOE, NSF, IBM, Intel, and Xilinx.
1 Research Accelerator for MultiProcessing Dave Patterson, UC Berkeley January RAMP collaborators: Arvind (MIT), Krste Asanovíc (MIT), Derek Chiou.
February 9, 2010CS152, Spring 2010 CS 152 Computer Architecture and Engineering Lecture 7 - Memory Hierarchy-II Krste Asanovic Electrical Engineering and.
RAMP Gold: Architecture and Timing Model Andrew Waterman, Zhangxi Tan, Rimas Avizienis, Yunsup Lee, David Patterson, Krste Asanović Parallel Computing.
© 2006 Regents University of California. All Rights Reserved RAMP Blue: A Message- Passing Many-Core System in FPGAs ISCA Tutorial/Workshop June 10th,
Computer Architecture Lab at 1 P ROTO F LEX : FPGA-Accelerated Hybrid Functional Simulator Eric S. Chung, Eriko Nurvitadhi, James C. Hoe, Babak Falsafi,
BEEKeeper Remote Management and Debugging of Large FPGA Clusters Terry Filiba Navtej Sadhal.
UC Berkeley 1 Time dilation in RAMP Zhangxi Tan and David Patterson Computer Science Division UC Berkeley.
RAMP 1 RAMP Tutorial and Workshop, ISCA 2007 Wrap-up Release Schedule More information – RAMP website:
1 Breakout thoughts (compiled with N. Carter): Where will RAMP be in 3-5 Years (What is RAMP, where is it going?) Is it still RAMP if it is mapping onto.
Research Accelerator for Multiple Processors
CS 152 Computer Architecture and Engineering Lecture 21: Directory-Based Cache Protocols Scott Beamer (substituting for Krste Asanovic) Electrical Engineering.
1 RAMP Models and Platforms Krste Asanovic UC Berkeley RAMP Retreat, Berkeley, CA January 15, 2009.
1 A Community Vision for a Shared Experimental Parallel HW/SW Platform Dave Patterson, Pardee Professor of Comp. Science, UC Berkeley President, Association.
1 Introduction to Research Accelerator for Multiple Processors David Patterson (Berkeley, CO-PI), Arvind (MIT), Krste Asanovíc (Berkeley/MIT), Derek Chiou.
© 2006 Regents University of California. All Rights Reserved RAMP Blue: A Message Passing Multi-Processor System on the BEE2 Andrew Schultz and Alex Krasnov.
CIS 314 : Computer Organization Lecture 1 – Introduction.
1 RAMP Tutorial Introduction/Overview Krste Asanovic UC Berkeley RAMP Tutorial, ASPLOS, Seattle, WA March 2, 2008.
1 IRAM Testing / Verification Sam Williams UC Berkeley
Murali Vijayaraghavan MIT Computer Science and Artificial Intelligence Laboratory RAMP Retreat, UC Berkeley, January 11, 2007 A Shared.
1 Some things we think we learned & the road ahead The RAMPants (as prepared by Mark Oskin) But first, let us thank you for the invaluable feedback you.
1 RAMP Infrastructure Krste Asanovic UC Berkeley RAMP Tutorial, ISCA/FCRC, San Diego June 10, 2007.
Krste Asanovic MIT Computer Science and Artificial Intelligence Laboratory Embedded RAMP Workshop, BWRC August.
RAMP Retreat Summer 2006 Break Session Leaders & Questions Greg Gibeling, Derek Chiou, James Hoe, John Wawrzynek & Christos Kozyrakis 6/21/2006.
April 18, 2011CS152, Spring 2011 CS 152 Computer Architecture and Engineering Lecture 19: Directory-Based Cache Protocols Krste Asanovic Electrical Engineering.
Dynamic Runtime Testing for Cycle-Accurate Simulators Saša Tomić, Adrián Cristal, Osman Unsal, Mateo Valero Barcelona Supercomputing Center (BSC) Universitat.
Comp-TIA Standards.  AMD- (Advanced Micro Devices) An American multinational semiconductor company that develops computer processors and related technologies.
1 Berkeley RAD Lab Technical Overview Armando Fox, Randy Katz, Michael Jordan, Dave Patterson, Scott Shenker, Ion Stoica March 2006.
Computer Maintenance Unit Subtitle: Bus Structures Excerpted from Copyright © Texas Education Agency, All rights reserved.
LiNK: An Operating System Architecture for Network Processors Steve Muir, Jonathan Smith Princeton University, University of Pennsylvania
RAMPing Down Chuck Thacker Microsoft Research August 2010.
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
Parallel Programming on the SGI Origin2000 With thanks to Igor Zacharov / Benoit Marchand, SGI Taub Computer Center Technion Moshe Goldberg,
Derek Chiou Prototyping to Emulation 1 From Prototyping to Emulation: The StarT (*T) Era ( ) Derek Chiou (Dataflow-StarT-Synthesis Era occupant)
Operating System Issues in Multi-Processor Systems John Sung Hardware Engineer Compaq Computer Corporation.
FPGA-based Fast, Cycle-Accurate Full System Simulators Derek Chiou, Huzefa Sanjeliwala, Dam Sunwoo, John Xu and Nikhil Patil University of Texas at Austin.
Network design Topic 6 Testing and documentation.
1 Retreat (Advance) John Wawrzynek UC Berkeley January 15, 2009.
3/12/07CS Visit Days1 A Sea Change in Processor Design Uniprocessor SpecInt Performance: From Hennessy and Patterson, Computer Architecture: A Quantitative.
Running Commodity Operating Systems on Scalable Multiprocessors Edouard Bugnion, Scott Devine and Mendel Rosenblum Presentation by Mark Smith.
1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.
CS 152 Computer Architecture and Engineering Lecture 18: Snoopy Caches
Andrew Putnam University of Washington RAMP Retreat January 17, 2008
Structural Simulation Toolkit / Gem5 Integration
Dr. George Michelogiannakis EECS, University of California at Berkeley
Derek Chiou The University of Texas at Austin
David Patterson Electrical Engineering and Computer Sciences
Krste Asanovic Electrical Engineering and Computer Sciences
RAMP Retreat, UC Berkeley
The Stanford FLASH Multiprocessor
Combining Simulators and FPGAs “An Out-of-Body Experience”
Cloud Web Filtering Platform
CS 152 Computer Architecture and Engineering CS252 Graduate Computer Architecture Lecture 18 Cache Coherence Krste Asanovic Electrical Engineering and.
CSE 486/586 Distributed Systems Cache Coherence
Presentation transcript:

1 RAMP White RAMP Retreat, BWRC, Berkeley, CA 20 January 2006 RAMP collaborators: Arvind (MIT), Krste Asanovíc (MIT), Derek Chiou (Texas), James Hoe (CMU), Christos Kozyrakis (Stanford), Shih-Lien Lu (Intel), Mark Oskin (Washington), David Patterson (Berkeley), Jan Rabaey (Berkeley), and John Wawrzynek (Berkeley)

2 RAMP White Goals Large-scale multiprocessor that can boot standard operating systems and run commercial applications Cycle-accurate emulation of target system Cycle-level reproducibility and debugging Unprecedented observability

3 RAMP White Structure DRAM DRAM Cntl. Mem. Sched. Coherence Engine Router L2$ + Coherence CPU + L1$ + Coherenc e To Other Nodes ISA Independent RAMP White uses scalable directory-based coherence protocol Multiple different ISAs will eventually be supported L2$ optional Target router topology independent of host link topology Host DRAM used to support host emulation (e.g., L2 cache image) and tracing, as well as target memory Non-target accesses

4 RAMP White Components RDL & Diagnostic/Debug Architecture (UCB & MIT) Processor+L1$ (Industrial partners)  Need port to RDL (SUN for Niagara) Router (Mark Oskin, UW) 1. single packet router (64 bits to where) 2. Burst transfers 3. reliability Shared L2$ (Krste Asanovic, MIT) Coherence Protocol (James Hoe, CMU) Memory Access Scheduler (Krste Asanovic, MIT) Memory/DRAM controller (UCB)  Starts with BLUE (multiport access/physical) BWRC System Integration (Derek Chiou, UT Austin) All of the above need graduate student contribution.

5 White Implementation Stages White 1.0, 2Q06  64 CPUs, use Xilinx hard PowerPC core  No L2$  Simplest coherence protocol (maybe MSI, or even just MI)  Simple store+forward router White 2.0, 3Q06  128 CPUs, 32-bit soft core  Include L2$  Improved coherence protocol White 3.0, 4Q06  64 CPUs, 64-bit soft core  Cut-through router White 4.0, 1Q07  Multiple ISAs supported  Advanced coherence protocol

6 White Validation Formal verification of implementation  baseline protocol already verified Comparison against C reference model  Convert Verilog into C model (Verilator and/or VTOC)  Compare results while running workloads “Headless” dynamic verification at full speed  CPUs replaced with memory traffic generators and checkers Regression suites at full speed  Run software workloads, compare outputs with reference

7 Contributions for RAMP White NSF/CRI proposal pending for staff  2 full-time staff (one HW/gateware, one OS/software)  Integration, Testing, Documentation, Distribution, etc. UCB/Xilinx ports Microblaze to RDL  Serves in RAMP blue and as an example of processor in RDL  (companies to do the others)