How to Reach Zettaflops Erik P. DeBenedictis Avogadro-Scale Computing April 17, 2008 The Bartos Theater Building E15 MIT SAND2008-2492C Sandia is a multiprogram.

Slides:



Advertisements
Similar presentations
Beyond CMOS CTSG Dec. 15, 2009 Work in Progress: Not for Distribution Beyond CMOS CTSG IRC Meeting December 15, 2009 DRAFT.
Advertisements

EVALUATION OF A CIRCUIT PATH DELAY TUNING TECHNIQUE FOR NANOMETER CMOS Advisor: Dr. Adit D. Singh Committee members: Dr. Vishwani D. Agrawal and Dr. Victor.
Topics Electrical properties of static combinational gates:
Treinamento: Testes Paramétricos em Semicondutores Setembro 2012
An International Technology Roadmap for Semiconductors
Resonant Tunnelling Devices A survey on their progress.
Kameshwar K. Yadavalli, Alexei O. Orlov, Ravi K. Kummamuru, John Timler, Craig Lent, Gary Bernstein, and Gregory Snider Department of Electrical Engineering.
Kwangok Jeong and Andrew B. Kahng UCSD VLSI CAD Laboratory
Semiconductor Memory Design. Organization of Memory Systems Driven only from outside Data flow in and out A cell is accessed for reading by selecting.
Erik P. DeBenedictis, Organizer Sandia National Laboratories Los Alamos Computer Science Institute Symposium 2004 The Path To Extreme Computing Sandia.
Erik P. DeBenedictis Sandia National Laboratories February 24, 2005 Sandia Zettaflops Story A Million Petaflops Sandia is a multiprogram laboratory operated.
Ensemble Emulation Feb. 28 – Mar. 4, 2011 Keith Dalbey, PhD Sandia National Labs, Dept 1441 Optimization & Uncertainty Quantification Abani K. Patra, PhD.
From analog to digital circuits A phenomenological overview Bogdan Roman.
Roadmap of Microelectronic Industry. Scaling of MOSFET Reduction of channel length L  L/α Integration density  α 2 Speed  α; Power/device  1/α 2 Power.
Unstructured Data Partitioning for Large Scale Visualization CSCAPES Workshop June, 2008 Kenneth Moreland Sandia National Laboratories Sandia is a multiprogram.
Advance Nano Device Lab. Fundamentals of Modern VLSI Devices 2 nd Edition Yuan Taur and Tak H.Ning 0 Ch5. CMOS Performance Factors.
Microelectronic Circuits, Sixth Edition Sedra/Smith Copyright © 2010 by Oxford University Press, Inc. C H A P T E R 13 CMOS Digital Logic Circuits.
Alexei O. Orlov Department of Electrical Engineering University of Notre Dame, IN, USA Temperature dependence of locked mode in a Single-Electron Latch.
Advanced Semiconductor Devices Y-BRANCH SWITCH (YBS) Anubhav Khandelwal.
Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:
Emerging Technologies – A Critical Review Presenter: Qufei Wu 12/05/05.
Low Power Design and Adiabatic Circuits P.Ranjith M.Tech(ICT)
The CMOS Inverter Slides adapted from:
Advanced Computing and Information Systems laboratory Device Variability Impact on Logic Gate Failure Rates Erin Taylor and José Fortes Department of Electrical.
EE466: VLSI Design Power Dissipation. Outline Motivation to estimate power dissipation Sources of power dissipation Dynamic power dissipation Static power.
INTEGRATED CIRCUIT LOGIC FAMILY
Erik P. DeBenedictis Sandia National Laboratories May 16, 2005 Petaflops, Exaflops, and Zettaflops for Science and Defense Sandia is a multiprogram laboratory.
Overview of Future Hardware Technologies The chairman’s notes from the plenary session appear in blue throughout this document. Subject to these notes.
SOS8 Erik P. DeBenedictis Sandia National Laboratories Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for.
Lecture 2 1 Computer Elements Transistors (computing) –How can they be connected to do something useful? –How do we evaluate how fast a logic block is?
Device Physics – Transistor Integrated Circuit
Keeping Up with Moore’s Law Who the heck is this Moore guy anyway? Gordon E. Moore was the cofounder of Intel Corporation Gordon E. Moore was the cofounder.
1 Memory Technology Comparison ParameterZ-RAMDRAMSRAM Size11.5x4x StructureSingle Transistor Transistor + Cap 6 Transistor Performance10.5x2x.
Review: Basic Building Blocks  Datapath l Execution units -Adder, multiplier, divider, shifter, etc. l Register file and pipeline registers l Multiplexers,
Erik P. DeBenedictis Sandia National Laboratories October 24-27, 2005 Workshop on the Frontiers of Extreme Computing Sandia is a multiprogram laboratory.
1 ERD 2011 ITRS Winter Conference – Incheon, Korea – 11 December ERD chapters, scope, difficult challenges, taxonomy, etc. An Chen GLOBALFOUNDRIES.
Figure 9.1. Use of silicon oxide as a masking layer during diffusion of dopants.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Limitations of Digital Computation William Trapanese Richard Wong.
The George Washington University School of Engineering and Applied Science Department of Electrical and Computer Engineering ECE122 – Lab 7 MOSFET Parameters.
Ratioed Circuits Ratioed circuits use weak pull-up and stronger pull-down networks. The input capacitance is reduced and hence logical effort. Correct.
Prospects for High-Aspect-Ratio FinFETs in Low-Power Logic Mark Rodwell, Doron Elias University of California, Santa Barbara 3rd Berkeley Symposium on.
By Francesco Maddalena 500 nm. 1. Introduction To uphold Moore’s Law in the future a new generation of devices that fully operate in the “quantum realm”
Erik P. DeBenedictis Sandia National Laboratories May 5, 2005 Reversible Logic for Supercomputing How to save the Earth with Reversible Computing Sandia.
Extreme Hardware “Extreme” Definition: very great Synonyms:...consummate, great, greatest, high, highest, intense, maximal, maximum, severe, sovereign,
Extreme Computing 2005 Center for Nano Science and Technology University of Notre Dame Craig S. Lent University of Notre Dame Molecular QCA and the limits.
Designing for Uncertainty: Critical Issues for New Nanoelectronic Technologies Evelyn L. Hu California NanoSystems Institute UCSB UNCERTAINTIES Technologies.
VLSI Design Lecture 5: Logic Gates Mohammad Arjomand CE Department Sharif Univ. of Tech. Adapted with modifications from Wayne Wolf’s lecture notes.
University of Notre Dame Lecture 19 - Intro to MQCA Nanomagnetic Logic Devices.
Classical Control for Quantum Computers Mark Whitney, Nemanja Isailovic, Yatish Patel, John Kubiatowicz U.C. Berkeley.
1 The piezoelectronic transistor: A nanoactuator-based post-CMOS digital switch with high speed and low power Class: Bio MEMS Components and System Teacher.
The End of CMOS Scaling will be Good for Space Computing Fault Tolerant Spaceborne Computing Employing New Technologies May 29, 2008 Sandia National Laboratories.
Erik P. DeBenedictis Sandia National Laboratories October 27, 2005 Workshop on the Frontiers of Extreme Computing Overall Outbrief Sandia is a multiprogram.
S. E. Thompson EEL Quantum cellular Automata Quantum Cellular Automata (QCA) refers to any one of several models of quantum computation, which have.
Clocked Molecular Quantum-dot Cellular Automata
EE141 © Digital Integrated Circuits 2nd Introduction 1 Principle of CMOS VLSI Design Introduction Adapted from Digital Integrated, Copyright 2003 Prentice.
Erik P. DeBenedictis Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of.
Trends in IC technology and design J. Christiansen CERN - EP/MIC
FPGA-Based System Design: Chapter 2 Copyright  2004 Prentice Hall PTR Topics n Logic gate delay. n Logic gate power consumption. n Driving large loads.
EE201C : Stochastic Modeling of FinFET LER and Circuits Optimization based on Stochastic Modeling Shaodi Wang
Physical Limits of Computing Dr. Mike Frank Slides from a Course Taught at the University of Florida College of Engineering Department of Computer & Information.
CS203 – Advanced Computer Architecture
Computer Organization IS F242. Course Objective It aims at understanding and appreciating the computing system’s functional components, their characteristics,
CS203 – Advanced Computer Architecture
Architecture & Organization 1
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Architecture & Organization 1
Device Physics – Transistor Integrated Circuit
Design Technologies Custom Std Cell Performance Gate Array FPGA Cost.
Device Physics – Transistor Integrated Circuit
Presentation transcript:

How to Reach Zettaflops Erik P. DeBenedictis Avogadro-Scale Computing April 17, 2008 The Bartos Theater Building E15 MIT SAND C Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy's National Nuclear Security Administration under contract DE-AC04-94AL85000.

Outline Why Strive for Zettaflops? –Global Warming Mission How Far Will CMOS Go? “Beyond CMOS” Alternatives The Zettaflops Workshops: Political Landscape International Technology Roadmap for Semiconductors (ITRS) Activity My Answers on “How to Reach Zettaflops”

Climate Modeling as an Application SCaLeS study included section on climate Understanding and mitigating global warming analyzed and requires 1 Zettaflops

Outline Why Strive for Zettaflops? –Global Warming Mission How Far Will CMOS Go? “Beyond CMOS” Alternatives The Zettaflops Workshops: Political Landscape International Technology Roadmap for Semiconductors (ITRS) Activity My Answers on “How to Reach Zettaflops”

ITRS Process Integration Spreadsheet Big Spreadsheet –Columns are years –Rows are 100+ transistor parameters –Manual entry of process parameters by year –Excel computes operating parameters –Extra degrees of freedom go to making Moore’s Law smooth – not the best computers

ITRS Spreadsheet Structure Target is exponential in “Years in Future” Fprocessor is result of 96 rows of targets, inputs, and iterative calculation Result usually matches to one decimal place! Line Width Scaling ITRS 2003 supplementary material

Workup for Climate Modeling Conclusion: CMOS to 200 Petaflops; QDCA to.5 Zettaflops  QDCA  P  QDCA Best Case Logic CMOS  P  CMOS Best Case Logic 

Outline Why Strive for Zettaflops? –Global Warming Mission How Far Will CMOS Go? “Beyond CMOS” Alternatives The Zettaflops Workshops: Political Landscape International Technology Roadmap for Semiconductors (ITRS) Activity My Answers on “How to Reach Zettaflops”

Reversible Logic Parameters We need some data point on performance Graph to right from a published paper by Lent et. al. Notre Dame on quantum dot cellular automata However, architectural considerations say their operating points are not ideal E diss reversible operation Energy/E k E k =0.5 eV T=60K E diss irreversible operation Operating Point Memory Operatin g Point T c (s) k B Tlog e (2) Figure 8: Molecular Quantum Dot Cellular Automata speed-energy curve for irreversible and reversible operation (courtesy of C. Lent) with operating points used in this paper labeled.

Workup for Climate Modeling Conclusion: CMOS to 200 Petaflops; QDCA to.5 Zettaflops  QDCA  P  QDCA Best Case Logic CMOS  P  CMOS Best Case Logic 

CMOS and Beyond CMOS Limits CMOS per ITRS roadmap –With operating points adjusted for climate modeling machines instead of matching Moore’s Law –200 2 MW DARPA Exascale study –1 >2 MW A New Computing Device –Notre Dame QDCA –Reversible Logic –.5 Zettaflops

Outline Why Strive for Zettaflops? –Global Warming Mission How Far Will CMOS Go? “Beyond CMOS” Alternatives The Zettaflops Workshops: Political Landscape International Technology Roadmap for Semiconductors (ITRS) Activity My Answers on “How to Reach Zettaflops”

Testing the Political Waters in 2005 & 2007 Question: Among policy makers, is Moore’s Law more powerful than Landauer’s Limit? –If Moore’s Law is more powerful, no chance of funding physics of computing research –If Landauer’s limit is more powerful, physics of computation research is the only way to continue the computer industry Method: Run a workshop on Exaflops and Zettaflops and see how the participants sort out the difference

Testing Procedure 2007

Results of Experiment Answer in 2005 –No distinction between Exaflops and Zettaflops Answer in 2007 –Audience renamed it the Exaflops conference (because 90% of the conclusions pertained to Exaflops) –Everybody left understanding the distinction between Exaflops and Zettaflops

Outline Why Strive for Zettaflops? –Global Warming Mission How Far Will CMOS Go? “Beyond CMOS” Alternatives The Zettaflops Workshops: Political Landscape International Technology Roadmap for Semiconductors (ITRS) Activity My Answers on “How to Reach Zettaflops”

Selecting Successor to CMOS by 12/31/2008 From meeting and to committee  The semiconductor industry is waking up Downselect “beyond CMOS” options through a advocate/skeptic competition

Downselect Criteria Basic description This section comprises a description of the proposed device family. The section may include textual and graphical descriptions but should be independent of (or parameterized by) feature size F Principle of OperationControl mechanism Thermal injection over gate barrirer Operating temperature Usually 25C - 125C Materials and GeometryBase Si Device Architecture FET Patterning Lithography Design 2D layout Circuit element Transistor, 3 or 4 terminal Device density as a function of feature size F ~ 1/F^2 Size in units of feature size F of a gate equivalent to a 2-input NAND gate, including contacts and isolation and necessary peripheral circuitry >~65 F^2 State variables and control State variableVoltage Number of logic states2 (high and low) Logic FamilyInformation processing basis Universal set comprising NAND, NOR, NOT logic gates, also pass gates Interconnects Wire Compatible memory SRAM (fast), DRAM (dense) Clock CMOS based clock circuits CMOS compatible N/A

Downselect Criteria LimitationsThis section comprises a list of known limiting factors for performance and manufacturing Materials and Geometry Sources of variability LER, Doping fluctuations ~ 1/SQRT (LW) External parasitics Access resistance, fringe capacitance State variables and control Noise margin (Vdd-Vth)/ KT/q > 5 QM limit Tuneling: Band to Band, Source-to- Drain Performance Potential This section comprises an extrapolation of the technology to about the year 2020, stipulating F=14 nm. Provide best estimate numerical values. Switching speed and energy Intrinsic speed of single elementLchan/v ~ 0.1ps Self Gain gm/gd ~ Vdd/DIBL Proposed clock rate xxx Switching Energy per gate or gate proposed clock rate 0.5*Cload*Vdd^2 Static Power Dissipation per gate or gate equivalentVdd*Ioff*(2/5) InterconnectInterconnect delay per micron RC Interconnect energy as a function of distance at proposed clock rate CV^2

Outline Why Strive for Zettaflops? –Global Warming Mission How Far Will CMOS Go? “Beyond CMOS” Alternatives The Zettaflops Workshops: Political Landscape International Technology Roadmap for Semiconductors (ITRS) Activity My Answers on “How to Reach Zettaflops”

How to Reach Zettaflops (I/III) End-user application: Understanding and mitigating global warming to save the Earth –The climate modeling community can supply representatives that say 1 Zettaflops is needed Computer required: anything >2 Exaflops –DARPA IPTO is preparing a plan for 1 Exaflops but that looks like a stretch goal for mature CMOS –Reference Zettaflops workshop that there is no CMOS solution beyond 1 Exaflops

How to Reach Zettaflops (II/III) Perform physical science research and discover a new computing device –ITRS calls this the “new switch” we can guarantee it won’t be a switch –NRI, NSF, maybe national labs have infrastructure in place and can distribute research funds –Downselect competition complete12/31/2008 may I be your sponsor? –Generate funds from Government and industry through appeal to end-user application

How to Reach Zettaflops (III/III) You will need a –Sub kT classical information processing device Quantum computer won’t solve problems of the necessary public interest –Compatible interconnect –Compatible memory and storage –A new computer architecture Post von Neumann, post parallel von Neumann, must be some sort of heterogeneous accelerated quasi- special purpose architecture