Southampton: Oct 99AMULET3i - 1 AMULET3i - asynchronous SoC Steve Furber - n Agenda: AMULET3i Design tools Future problems.

Slides:



Advertisements
Similar presentations
Verilog HDL -Introduction
Advertisements

Designing, Deploying and Managing Workflow in SharePoint Sites Steve Heaney Product Development Manager OBS
Handling Complexity in FEV Erik Seligman CS 510, Lecture 6, January 2009.
CSE 341 Verilog HDL An Introduction. Hardware Specification Languages Verilog  Similar syntax to C  Commonly used in  Industry (USA & Japan) VHDL 
Reading1: An Introduction to Asynchronous Circuit Design Al Davis Steve Nowick University of Utah Columbia University.
Fault-Tolerant Delay-Insensitive Inter-Chip Communication Yebin Shi Apt Group The University of Manchester.
Clockless Logic System-Level Specification and Synthesis Ack: Tiberiu Chelcea.
Southampton: Oct 99Asynchronous Circuit Compilation- 1 Asynchronous Circuit Compilation Dr. Doug Edwards
Altera FLEX 10K technology in Real Time Application.
Overview: Chapter 7  Sensor node platforms must contend with many issues  Energy consumption  Sensing environment  Networking  Real-time constraints.
Asynchronous comparator design
Southampton: Oct 99Asynchronous Circuit Compilation- 1 AMULET3-H n Asynchronous macrocell ARM compatible processor core Full custom RAM Compiled ROM Balsa.
On-Chip Cache Analysis A Parameterized Cache Implementation for a System-on-Chip RISC CPU.
Graduate Computer Architecture I Lecture 15: Intro to Reconfigurable Devices.
ASPDAC / VLSI Tutorial on "Large Asynchronous Systems" 1 Logic design of asynchronous circuits Part IV: Large Asynchronous Systems.
Hardware and Petri nets Synthesis of asynchronous circuits from Signal Transition Graphs.
CSE115: Introduction to Computer Science I Dr. Carl Alphonce 219 Bell Hall 1.
The Design Process Outline Goal Reading Design Domain Design Flow
TCSS 372A Computer Architecture. Getting Started Get acquainted (take pictures) Discuss purpose, scope, and expectations of the course Discuss personal.
ECE Synthesis & Verification1 ECE 667 Spring 2011 Synthesis and Verification of Digital Systems Verification Introduction.
Energy Evaluation Methodology for Platform Based System-On- Chip Design Hildingsson, K.; Arslan, T.; Erdogan, A.T.; VLSI, Proceedings. IEEE Computer.
Configurable System-on-Chip: Xilinx EDK
Lab for Reliable Computing Generalized Latency-Insensitive Systems for Single-Clock and Multi-Clock Architectures Singh, M.; Theobald, M.; Design, Automation.
Logic Design Outline –Logic Design –Schematic Capture –Logic Simulation –Logic Synthesis –Technology Mapping –Logic Verification Goal –Understand logic.
ARMOR Asynchronous RISC Microprocessor הטכניון - מכון טכנולוגי לישראל המעבדה למערכות ספרתיות מהירות הפקולטה להנדסת חשמל Submitted by: Tziki Oz-Sinay, Ori.
Analysis and Avoidance of Cross-talk in on-chip buses Chunjie Duan Ericsson Wireless Communications Anup Tirumala Jasmine Networks Sunil P Khatri University.
Fall 2009 / Winter 2010 Ran Ginosar (
Behavioural synthesis of asynchronous controllers: a case study with a self-timed communication channel Alex Yakovlev, Frank Burns, Alex Bystrov, Albert.
CSS Lecture 2 Chapter 3 – Connecting Computer Components with Buses Bus Structures Synchronous, Asynchronous Typical Bus Signals Two level, Tri-state,
From Concept to Silicon How an idea becomes a part of a new chip at ATI Richard Huddy ATI Research.
Spring 2002EECS150 - Lec0-intro Page 1 EECS150 - Digital Design Lecture 8 - Hardware Description Languages February 14, 2002 John Wawrzynek.
Octavo: An FPGA-Centric Processor Architecture Charles Eric LaForest J. Gregory Steffan ECE, University of Toronto FPGA 2012, February 24.
VerificationTechniques for Macro Blocks (IP) Overview Inspection as Verification Adversarial Testing Testbench Design Timing Verification.
Robust Low Power VLSI R obust L ow P ower VLSI Memory Management Units for Instruction and Data Cache for OR1200 CPU Core Arijit Banerjee ASIC/SOC Class.
Delevopment Tools Beyond HDL
(1) Modeling Digital Systems © Sudhakar Yalamanchili, Georgia Institute of Technology, 2006.
1 Chapter 2. The System-on-a-Chip Design Process Canonical SoC Design System design flow The Specification Problem System design.
VERIFICATION OF I2C INTERFACE USING SPECMAN ELITE By H. Mugil Vannan Experts Mr. Rahul Hakhoo, Section Manager, CMG-MCD Mr. Umesh Srivastva, Project Leader.
Chap. 1 Overview of Digital Design with Verilog. 2 Overview of Digital Design with Verilog HDL Evolution of computer aided digital circuit design Emergence.
Asynchronous Datapath Design Adders Comparators Multipliers Registers Completion Detection Bus Pipeline …..
MOUSETRAP Ultra-High-Speed Transition-Signaling Asynchronous Pipelines Montek Singh & Steven M. Nowick Department of Computer Science Columbia University,
Hardware Design Environment Instructors: Fu-Chiung Cheng ( 鄭福炯 ) Associate Professor Computer Science & Engineering Tatung University.
1 H ardware D escription L anguages Modeling Digital Systems.
Chonnam national university VLSI Lab 8.4 Block Integration for Hard Macros The process of integrating the subblocks into the macro.
1 Fly – A Modifiable Hardware Compiler C. H. Ho 1, P.H.W. Leong 1, K.H. Tsoi 1, R. Ludewig 2, P. Zipf 2, A.G. Oritz 2 and M. Glesner 2 1 Department of.
Developing software and hardware in parallel Vladimir Rubanov ISP RAS.
Hardware-software Interface Xiaofeng Fan
Area and Speed Oriented Implementations of Asynchronous Logic Operating Under Strong Constraints.
CSS 372 Oct 4th - Lecture 3 Chapter 3 – Connecting Computer Components with Buses Bus Structures Synchronous, Asynchronous Typical Bus Signals Two level,
Chapter 8 CPU and Memory: Design, Implementation, and Enhancement The Architecture of Computer Hardware and Systems Software: An Information Technology.
Reading1: An Introduction to Asynchronous Circuit Design Al Davis Steve Nowick University of Utah Columbia University.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
RELIABILITY ENGINEERING 28 March 2013 William W. McMillan.
An Overview of Hardware Design Methodology Ian Mitchelle De Vera.
CHAPTER 8 Developing Hard Macros The topics are: Overview Hard macro design issues Hard macro design process Physical design for hard macros Block integration.
Tools - LogiBLOX - Chapter 5 slide 1 FPGA Tools Course The LogiBLOX GUI and the Core Generator LogiBLOX L BX.
IMPLEMENTATION OF MIPS 64 WITH VERILOG HARDWARE DESIGN LANGUAGE BY PRAMOD MENON CET520 S’03.
LYU0703 Parallel Distributed Programming on PS3 1 Huang Hiu Fung Wong Chung Hoi Supervised by Prof. Michael R. Lyu Department of Computer.
Proposal for an Open Source Flash Failure Analysis Platform (FLAP) By Michael Tomer, Cory Shirts, SzeHsiang Harper, Jake Johns
ECE 171 Digital Circuits Chapter 8 Hardware Description Language Herbert G. Mayer, PSU Status 11/23/2015 Copied with Permission from prof. Mark
Chapter 11 System-Level Verification Issues. The Importance of Verification Verifying at the system level is the last opportunity to find errors before.
What’s New in Xilinx Ready-to-use solutions. Key New Features of the Foundation Series 1.5/1.5i Release  New device support  Integrated design environment.
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
ASIC/FPGA design flow. Design Flow Detailed Design Detailed Design Ideas Design Ideas Device Programming Device Programming Timing Simulation Timing Simulation.
Problem: design complexity advances in a pace that far exceeds the pace in which verification technology advances. More accurately: (verification complexity)
EECE 320 L8: Combinational Logic design Principles 1Chehab, AUB, 2003 EECE 320 Digital Systems Design Lecture 8: Combinational Logic Design Principles.
1 Advanced Digital Design Asynchronous Design Automation by A. Steininger and J. Lechner Vienna University of Technology.
A Case for Standard-Cell Based RAMs in Highly-Ported Superscalar Processor Structures Sungkwan Ku, Elliott Forbes, Rangeen Basu Roy Chowdhury, Eric Rotenberg.
Synthesis from HDL Other synthesis paradigms
HIGH LEVEL SYNTHESIS.
Presentation transcript:

Southampton: Oct 99AMULET3i - 1 AMULET3i - asynchronous SoC Steve Furber - n Agenda: AMULET3i Design tools Future problems

Southampton: Oct 99AMULET3i - 2 AMULET3 n a third generation asynchronous ARM performance comparable with ARM9 radically new internal organisation –based on reorder buffer –Harvard core, unified I/D memory n under development within the OMI ATOM project first application as as part of a telecommunications controller

Southampton: Oct 99AMULET3i - 3 AMULET3i SoC organisation

Southampton: Oct 99AMULET3i - 4 AMULET3i - physical layout

Southampton: Oct 99AMULET3i - 5 AMULET3 core organisation n Harvard core forward from reorder buffer out-of-order completion in-order register update aborts handled at writeback

Southampton: Oct 99AMULET3i - 6 AMULET3H local bus RAM n segmented memory I & D ports arbitrate at each block quad-word I & D line buffers

Southampton: Oct 99AMULET3i - 7 AMULET3 tools n LARD behavioural modelling tool for async design 10x designer productivity vs Asim n Petrify much enhanced FORCAGE descendant can handle wider range of circuits n Balsa synthesis tool used for DMA controller

Southampton: Oct 99AMULET3i - 8 Tools - LARD n Language for Asynchronous Research and Development parallel processes with communication primitives extensive data types modelling of elapsed time used to model AMULET3 available from AMULET web site

Southampton: Oct 99AMULET3i - 9 Tools - LARD n Features time view block view HLL debug test generation co-simulation n Platforms UNIX/Linux

Southampton: Oct 99AMULET3i - 10 Tools - BALSA n Synthesis system for asynchronous circuits similar to Philips ‘Tangram’ used for AMULET3H DMA controller direct HLL to netlist compilation –syntax directed translation peephole optimisation

Southampton: Oct 99AMULET3i - 11 Tools - BALSA

Southampton: Oct 99AMULET3i - 12 Tools - Petrify n Petri Net modelling tool for low-level asynchronous circuits speed-independent synthesis technology mapping very powerful –can be tricky to use extensively used to design AMULET3 modules

Southampton: Oct 99AMULET3i - 13 AMULET3 validation n workstations now powerful enough to run ARM validation suite under TimeMill around 8 CPU-weeks total n testing full functionality now very hard very complex system-on-chip n design aimed at high performance timing margins much reduced validation complex and uncertain

Southampton: Oct 99AMULET3i - 14 AMULET3 - problems n high performance target timing margins must be small timing is hard to verify –very dependent on accurate extraction, models n modelling tools are imperfect e.g. crosstalk –bus wire delay 1.5ns +/- 1ns crosstalk –careful layout gives 0.9ns +/- 0.15ns how can we be sure such factors are OK?

Southampton: Oct 99AMULET3i - 15 The Future n timing accuracy is getting harder wire delays will become more significant crosstalk will get worse on-chip transistor variance will increase higher speeds will lead to higher noise n will delay-matching be viable? alternatives are dual-rail or other DI codes –incur significant area and power overheads

Southampton: Oct 99AMULET3i - 16 Gate vs (2mm) wire delays, ps

Southampton: Oct 99AMULET3i - 17 Alternatives to bundled data n Delay-insensitive codes timing encoded in data dual-rail encoding –100% area overhead c.f. bundled data –significant power cost –e.g. NCL from Theseus deal just announced with Motorola use conventional synthesis tools timing closure ceases to be an issue

Southampton: Oct 99AMULET3i - 18 Alternatives to bundled data n Delay-insensitive codes N-of-M codes –3-of-6 code 50% area overhead 3 transitions to send 4 bits –2-of-7 code 75% area overhead 2 transitions to send 4 bits –well-suited to inter-chip communication –may suit on-chip buses

Southampton: Oct 99AMULET3i - 19 Conclusions n complex async design is feasible standard tools are just about survivable additional tools improve productivity n ideal design flow: LARD-like specification formal verification of high-level properties automated synthesis onto module library n timing closure is the major problem may ultimately rule out bundled data