Advances in Clockless and Mixed-Timing Digital Systems Prof. Steven M. Nowick Department of Computer Science Columbia University.

Slides:



Advertisements
Similar presentations
1-1 ECE 424 Design of Microprocessor-Based Systems Haibo Wang ECE Department Southern Illinois University Carbondale, IL
Advertisements

1 Clockless Logic  Recap: Lookahead Pipelines  High-Capacity Pipelines.
Cellular Phones as Embedded Systems Liza Rodriguez Aurelio Morales EEL Embedded Systems Dept. of Electrical and Computer Engineering University.
Advances in Designing Clockless Digital Systems Prof. Steven M. Nowick Department of Computer Science Columbia University New York,
1 Clockless Logic Montek Singh Tue, Mar 16, 2004.
COMP Clockless Logic and Silicon Compilers Lecture 3
Chapter 1. Introduction This course is all about how computers work But what do we mean by a computer? –Different types: desktop, servers, embedded devices.
1 Clockless Logic Prof. Montek Singh Feb. 3, 2004.
1 Clockless Logic Montek Singh Tue, Mar 21, 2006.
Next-generation Chips & Computing with Atoms Igor Markov ACAL / EECS, Univ. of Michigan.
A. A. Jerraya Mark B. Josephs South Bank University, London System Timing.
1 Effect of Increasing Chip Density on the Evolution of Computer Architectures R. Nair IBM Journal of Research and Development Volume 46 Number 2/3 March/May.
CIS 314 : Computer Organization Lecture 1 – Introduction.
1 Clockless Logic or How do I make hardware fast, power- efficient, less noisy, and easy-to-design? Montek Singh Thu, Jan 8, 2004.
ELEN468 Lecture 11 ELEN468 Advanced Logic Design Lecture 1Introduction.
1 Clockless Computing Montek Singh Thu, Sep 13, 2007.
1 Recap: Lectures 5 & 6 Classic Pipeline Styles 1. Williams and Horowitz’s PS0 pipeline 2. Sutherland’s micropipelines.
1 Clockless Logic: Dynamic Logic Pipelines (contd.)  Drawbacks of Williams’ PS0 Pipelines  Lookahead Pipelines.
ENEE 644 Dr. Ankur Srivastava Office: 1349 A.V. Williams URL: Computer-Aided Design of.
ECE 232 L1 Intro.1 Adapted from Patterson 97 ©UCBCopyright 1998 Morgan Kaufmann Publishers ECE 232 Hardware Organization and Design Lecture 1 Introduction.
Clockless Logic Montek Singh Tue, Apr 6, Case Study: An Adaptively-Pipelined Mixed Synchronous-Asynchronous System Montek Singh Univ. of North Carolina.
EE141 © Digital Integrated Circuits 2nd Introduction 1 The First Computer.
Low Power Design of Integrated Systems Assoc. Prof. Dimitrios Soudris
Department of Computer Engineering
Department of Computer and Information Science, School of Science, IUPUI Dale Roberts, Lecturer Computer Science, IUPUI CSCI.
4.0 rtos implementation part II
February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.
Chap. 1 Overview of Digital Design with Verilog. 2 Overview of Digital Design with Verilog HDL Evolution of computer aided digital circuit design Emergence.
Clockless Chips Date: October 26, Presented by:
1 Seminar on High-Speed Asynchronous Pipelines Montek Singh Thursdays 10-11, SN325.
COLUMBIA UNIVERSITY Computer Engineering Program The Fu Foundation School of Engineering and Applied Science IN THE CITY OF NEW YORK Computer Engineering:
CAD for Physical Design of VLSI Circuits
MOUSETRAP Ultra-High-Speed Transition-Signaling Asynchronous Pipelines Montek Singh & Steven M. Nowick Department of Computer Science Columbia University,
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
2/6/2003IDEAL-IST Workshop, Christos P. Sotiriou, ICS-FORTH 1 IDEAL-IST Workshop Christos P. Sotiriou, Institute of Computer Science, FORTH.
CMP 4202: VLSI System Design Lecturer: Geofrey Bakkabulindi
Paper review: High Speed Dynamic Asynchronous Pipeline: Self Precharging Style Name : Chi-Chuan Chuang Date : 2013/03/20.
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
COE 405 Design and Modeling of Digital Systems
Reminder Lab 0 Xilinx ISE tutorial Research Send me an if interested Looking for those interested in RC with skills in compilers/languages/synthesis,
Computer Organization and Design Computer Abstractions and Technology
1 COMP Clockless Logic and Silicon Compilers or How do I take “hard” out of hardware design? Montek Singh Thu, Jan 12, 2006.
Introduction Advantage of DSP: - Better signal quality & repeatable performance - Flexible  Easily modified (Software Base) - Handle more complex processing.
Computer Organization & Assembly Language © by DR. M. Amer.
Reader: Pushpinder Kaur Chouhan
Present – Past -- Future
Curtis A. Nelson 1 Technology Mapping of Timed Circuits Curtis A. Nelson University of Utah September 23, 2002.
1 Clockless Logic or How do I make hardware fast, power- efficient, less noisy, and easy-to-design? Montek Singh Tue, Jan 14, 2003.
By Nasir Mahmood.  The NoC solution brings a networking method to on-chip communication.
EE141 © Digital Integrated Circuits 2nd Introduction 1 Principle of CMOS VLSI Design Introduction Adapted from Digital Integrated, Copyright 2003 Prentice.
VLSI Design System-on-Chip Design
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO CS 219 Computer Organization.
1 Practical Design and Performance Evaluation of Completion Detection Circuits Fu-Chiung Cheng Department of Computer Science Columbia University.
VADA Lab.SungKyunKwan Univ. 1 L5:Lower Power Architecture Design 성균관대학교 조 준 동 교수
Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles Zhiyi Yu, Bevan Baas VLSI Computation Lab, ECE Department University of California,
Clockless Chips Under the esteemed guidance of Romy Sinha Lecturer, REC Bhalki Presented by: Lokesh S. Woldoddy 3RB05CS122 Date:11 April 2009.
Submitted by Abi Mathew Roll No:1
1 Clockless Logic Montek Singh Thu, Mar 2, Review: Logic Gate Families  Static CMOS logic  Dynamic logic, or “domino” logic  Transmission gates,
Welcome To Seminar Presentation Seminar Report On Clockless Chips
Roadmap History Synchronized vs. Asynchronous overview How it works
Introduction to Microprocessors
Scalable Processor Design
Steven Lewis, Andrey Martchovsky, Chengcheng Xu
Trends in Electronics Reliability Testing
Emerging Technologies of Computation
Combinational Circuits
Combinational Circuits
1.Introduction to Advanced Digital Design (14 marks)
Presentation transcript:

Advances in Clockless and Mixed-Timing Digital Systems Prof. Steven M. Nowick Department of Computer Science Columbia University

OUTLINE I. Asynchronous & Mixed-Timing Design: Overview & Recent Developments II. Low-Latency Interface Circuits for Mixed-Timing Domains

Trends and Challenges Trends in Chip Design: next decade “Semiconductor Industry Association (SIA) Roadmap” (97-8) Unprecedented Challenges: complexity and scale (= size of systems) clock speeds power management “time-to-market” Design becoming unmanageable using a centralized (synchronous) approach….

Trends and Challenges (cont.) 1. Clock Rate: 1980: several MegaHertz 2001: ~750 MegaHertz - 1+ GigaHertz 2004: several GigaHertz Design Challenge: “clock skew”: clock must be near-simultaneous across entire chip

Trends and Challenges (cont.) 2. Chip Size and Density: Total #Transistors per Chip: 60-80% increase/year –~1970: 4 thousand (Intel 4004) –today: million –2004 and beyond: 100 million-1 billion Design Challenges: system complexity, design time, clock distribution soon, clock will not reach across chip in 1 cycle!

Trends and Challenges (cont.) 3. Power Consumption Low power: ever-increasing demand –consumer electronics: battery-powered – high-end processors: avoid expensive fans, packaging Design Challenge: clock inherently consumes power continuously “power-down” techniques: only partly effective

Trends and Challenges (cont.) 4. Design Re-Use, Scalability Increasing pressure for faster “time-to-market”. Need: reusable components: “plug-and-play” design scalable design: easy system upgrades Design Challenge: mismatch w/ central fixed-rate clock

Trends and Challenges (cont.) 5. Future Trends: “Mixed Timing” Domains Chips themselves becoming distributed systems…. contain many sub-regions, operating at different speeds: Design Challenge: breakdown of single central clock control

Introduction Synchronous vs. Asynchronous Systems? Synchronous Systems: use a global clock –entire system operates at fixed-rate –uses “centralized control” clock

Introduction (cont.) Synchronous vs. Asynchronous Systems? (cont.) Asynchronous Systems: no global clock –components can operate at varying rates –communicate locally via “handshaking” –uses “distributed control” “handshaking interfaces”

Introduction (cont.) Asynchronous Circuits: –long history (since early 1950’s), but... –early approaches often impractical: slow, complex Synchronous Circuits: –used almost everywhere: highly successful –benefits: simplicity, support by existing design tools But recently: renewed interest in asynchronous circuits

Asynchronous Design Several Potential Advantages: Lower Power –no clock ==> components use power only “on demand” Robustness, Scalability –no global timing==>“mix-and-match” varied components Higher Performance –systems not limited to “worst-case” clock rate

Asynchronous Design: Challenges Critical Design Issues: components must communicate cleanly = “hazard-free” highly-concurrent designs: much harder to understand! Lack of Existing Design Tools: most commercial “CAD” tools targeted to synchronous

Asynchronous Design: Recent Commercial Interest 1. Philips Semiconductors [86-present] async chips now in commercial pagers, cell phones 3-4x lower power than synchronous much lower electromagnetic interference (EMI) 2. Motorola/Theseus Logic [99-] Joint venture: develop async embedded processor 3. Intel [96-98] experimental high-speed design: instruction-length decoder 3-4x faster than synchronous

Asynchronous Design: Recent Commercial Interest 4. Sun Labs [~95-present] experimental high-speed pipelines, routing fabric, systems 5. IBM Research [~98-present] experimental high-speed pipelines, etc. 6. Several recent async startups: Theseus Logic (Florida) ADD (Pasadena) Self-Timed Solutions (UK)

My Research: Highlights 3 Main Asynchronous Areas: 1. CAD Tools: optimization algorithms + software packages 2. High-Speed Asynchronous Pipelines 3. Interface Circuits: for mixed-timing domains

My Research: Funding NSF: 2 Large-Scale “ITR” Awards ($2.5 Million) [2000] 1. “CAD Tools” to Design/Optimize Asynchronous Systems (joint with USC) 2. 3rd-Generation Wireless Systems (async, very low power) (joint with Columbia EE - Ken Shepard) Other Funding: NSF, Sun, NYS CAT, Sloan Fdtn.

1. Developing Asynchronous CAD Tools Focus: 2 types of CAD tools (a) for individual controllers (i.e., finite-state machines) (b) for entire digital systems (a) The “MINIMALIST” Package [ICCAD-91/95/97/99, DAC-96] –R. Fuhrer, M. Theobald –Downloaded to 60+ sites/18+ countries (b) High-Level Synthesis Package [DAC-01, DATE-02] –M. Theobald, T. Chelcea Include: many sophisticated optimization algorithms Goal: provide many options for design-space exploration

1(a). Synthesizing A Controller Using the “MINIMALIST” CAD Tool Inputs: req-send treq rd-iq adbld-out ack-pkt Outputs: tack peack adbld req-send+ treq+ rd-iq+/ adbld+ adbld-out+/ peack+ rd-iq-/ peack- adbld- tack+ adbld-out- treq- rd-id+/ adbld+ adbld-out+/ peack+ rd-iq-/ peack- adbld- tack- adbld-out- treq+ ack-pkt+/ peack+ tack+ ack-pkt- treq-/ peack- tack- treq-/ tack- treq+/ tack+ ack-pkt+/ peack- tack- adbld-out- treq- ack-pkt+/ peack+ req-send-/ -- adbld-out- treq+ rd-iq+/ adbld+ From HP Labs “Mayfly” Project

EXAMPLE (cont.): Examples:

Basic Digital Building Blocks = datapath components adders, multipliers, dividers, … central to almost all digital systems Asynchronous Design: several potential advantages high speed (not limited by commercial clock rates) adaptible interfacing (easy reuse in different environments) Goal: new architectures + designs for very fast async datapath components Use Pipelining: to improve performance 2. High-Speed Digital Design

global clock SYNCHRONOUS ASYNCHRONOUS PIPELINED COMPUTATION: like an assembly line no global clock 2. High-Speed Digital Design

Function Block Completion Detector Data in Data out PC AN ASYNCHRONOUS PIPELINE: Williams/Horowitz (Stanford 86-91) 2. High-Speed Digital Design

Our Goal: extremely high-speed digital components much faster than commercial processors Contribution: 3 new async pipeline styles [Singh/Nowick] dynamic logic: 1. Lookahead Pipelines [Async-00] 2. High-Capacity Pipelines [ISSCC-02, Async-02, WVLSI-00] static logic: 3. MOUSETRAP Pipelines [ICCD-01] 2. High-Speed Digital Design

Contributions (cont.): introduce novel highly-concurrent protocols basic operating speed: ~3.5+ GigaHertz [0.25 micron] gracefully handle variable input/output rates Technology Transfer: IBM T.J. Watson [ ] in fabricated experimental FIR filter chip (for disk drives) 2. High-Speed Digital Design

Critical challenge: interface sync/async, sync/sync systems -- operating at different clock rates --robustly, at high-speed! ASYNC SYSTEM Interface Circuits = “glue circuits” SYNC SYSTEM: CLOCK 1 SYNC SYSTEM: CLOCK 2 3. Robust Interface Circuits for “Mixed-Timing” Domains [DAC-01]

4. Low-Power Applications Now investigating several promising async applications: 3rd-Generation Wireless Systems (with K. Shepard, EE) –very low power, reconfigurable to different standards Embedded Processors –used in cell phones, automobiles, digital cameras,...

5. Tech Transfer: IBM Research Invited to transfer pipeline technology: PhD Student (Montek Singh): 5-month internship (5-12/00) IBM Application: filter design –async design -- sandwiched between sync interfaces Fabricated Chip: evaluated in Feb.-March 2001 Benefits: “adaptive-pipelining” [ISSCC-02] Potential for future use in IBM products….