Design Methodology.

Name: Design Methodology.
Uploaded: 2017-10-05T03:14:17+00:00
Duration: PTM15S57
Description: Design Methodology.

Design Methodology

The Design Productivity Challenge
58%/Yr. compound Complexity growth rate 21%/Yr. compound Productivity growth rate 1981 Logic Transistors per Chip (K) Productivity (Trans./Staff-Month) 100 1,000 10,000 100,000 1,000,000 10,000,000 100,000,000 10 1985 1989 1993 1997 2001 2005 2009 Logic Transistors per Chip (K) Productivity (Trans./Staff-Month) 1981 1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 A growing gap between design complexity and design productivity Source: ITRS’97

The Custom Approach Intel 4004 Courtesy Intel

Transition to Automation and Regular Structures
Intel 4004 (‘71) Intel 8080 Intel 8085 Intel 80286 Intel 80486 Courtesy Intel

Automating Design Exploitation By Algorithms
Regular Structures Logic Synthesis Regularization of Connection Floorplanning (Localization of function) System Level Performance/Power/Cost Allocation of Physical Resources Communication/Interconnect Hierarchy based on Sensitivity to Latency Wires to Link Protocols

A System-on-a-Chip: Example
Courtesy: Philips

Design Methodology Design process traverses iteratively between three abstractions: behavior, structure, and geometry More and more automation for each of these steps

Floorplanning A Protocol Processor for Wireless

Implementation Choices
Custom Standard Cells Compiled Cells Ma cro Cells Cell-based Pre-diffused (Gate Arrays) Pre-wired (FPGA's) Array-based Semicustom Digital Circuit Implementation Approaches

Impact of Implementation Choices
Domain-specific processor (e.g. DSP) 10-100 Embedded microprocessor Energy Efficiency (in MOPS/mW) 1-10 Hardwired custom Configurable/Parameterizable 0.1-1 Somewhat flexible Fully flexible Flexibility (or application scope) None

Implementation Strategies
PLA Technology confined in cell macros (tiling) Cell based logic Technology confined to cells (area) Both 1-d and 2-d solutions Transistor Arrays (Gate arrays) Technology confined to layers (Below M1 fixed)

PLA: Programmable Logic Array
x 1 2 AND plane Product terms OR f

Two-Level Logic minterm
Every logic function can be expressed in sum-of-products format (AND-OR) minterm Inverting format (NOR-NOR) more effective

PLA Layout – Exploiting Regularity
V DD GND f And-Plane Or-Plane

Breathing Some New Life in PLAs
River PLAs A cascade of multiple-output PLAs. Adjacent PLAs are connected via river routing. No placement and routing needed. Output buffers and the input buffers of the next stage are shared. Courtesy B. Brayton

Experimental Results Also: RPLAs are regular and predictable
Area: RPLAs (2 layers) 1.23 SCs (3 layers) , NPLAs (4 layers) 1.31 Delay RPLAs 1.04 SCs 1.00 NPLAs 1.09 Synthesis time: for RPLA , synthesis time equals design time; SCs and NPLAs still need P&R. Also: RPLAs are regular and predictable Layout of C2670 Standard cell, 2 layers channel routing Standard cell, 3 layers OTC Network of PLAs, 4 layers OTC River PLA, 2 layers no additional routing

2-d Cell Based: “Hard” Modules
25632 (or 8192 bit) SRAM Generated by hard-macro module generator

1-d Cell-based Design (standard cells)
Feedthrough cell Logic cell Routing channel Rows of cells Functional Routing channel requirements are reduced by presence of more interconnect layers module (RAM, multiplier, )

Standard Cell — Example
[Brodersen92]

Standard Cell – The New Generation
Cell-structure hidden under interconnect layers

Standard Cell - Example
3-input NAND cell (from ST Microelectronics): C = Load capacitance T = input rise/fall time

“Soft” MacroModules Synopsys DesignCompiler

The “Design Closure” Problem
Iterative Removal of Timing Violations (white lines) Courtesy Synopsys

Gate Array — Sea-of-gates
Uncommited Cell Committed Cell (4-input NOR)

Sea-of-gate Primitive Cells
Using oxide-isolation Using gate-isolation

Sea-of-gates Random Logic Memory Subsystem LSI Logic LEA300K
(0.6 mm CMOS) Courtesy LSI Logic

The return of gate arrays?
Via programmable gate array (VPGA) Via-programmable cross-point metal-5 metal-6 programmable via Exploits regularity of interconnect [Pileggi02]

Pre-wired Arrays: Classification of prewired arrays (or field-programmable devices): Based on Programming Technique Fuse-based (program-once) Non-volatile EPROM based RAM based Programmable Logic Style Array-Based Look-up Table Programmable Interconnect Style Channel-routing Mesh networks

Fuse-Based FPGA Open by default, closed by applying current pulse
antifuse polysilicon ONO dielectric n + antifuse diffusion 2 l Open by default, closed by applying current pulse From Smith’97

Array-Based Programmable Logic
5 4 O 3 2 1 Programmable OR array O I 3 2 1 Fixed AND array Programmable OR array I 5 4 O 3 2 1 Fixed OR array Programmable AND array Programmable AND array O O O 3 2 1 O 3 O 2 O 1 PLA (flexible – sizing) PROM (dense) PAL (uniform load) Indicates programmable connection Indicates fixed connection

Programming a PROM f 1 X 2 NA : programmed node

2-input mux as programmable logic block
Configuration A B S F= X 1 Y XY X +Y X Y A F B 1 S

Logic Cell of Actel Fuse-Based FPGA
SA Y C D SB 1 S1 S0

LUT-Based Logic Cell Xilinx 4000 Series Courtesy Xilinx D C ....C
1 ....C clock 3 2 F Logic function H G Din F’ G’ H’ H1 H2 H0 EC S/R control Multiplexer Controlled by Configuration Program Y Q SD YQ Bypass X XQ RD Xilinx 4000 Series Courtesy Xilinx

Array-Based Programmable Wiring
Interconnect Point Programmed interconnection Input/output pin Cell Horizontal tracks Vertical tracks

Mesh-based Interconnect Network
Switch Box Connect Box Interconnect Point Courtesy Dehon and Wawrzyniek

Transistor Implementation of Mesh
Courtesy Dehon and Wawrzyniek

Hierarchical Mesh Network
Use overlayed mesh to support longer connections Reduced fanout and reduced resistance Courtesy Dehon and Wawrzyniek

Altera MAX From Smith97

Altera MAX Interconnect Architecture
column channel row channel LAB2 PIA LAB1 LAB6 t LAB Array-based (MAX ) Mesh-based (MAX 9000) Courtesy Altera

Field-Programmable Gate Arrays Fuse-based
Standard-cell like floorplan

Xilinx 4000 Interconnect Architecture
12 Quad 8 Single 4 Double 3 Long Direct CLB 2 Connect 3 Long 12 4 4 8 4 8 4 2 Quad Long Global Long Double Single Global Carry Direct Clock Clock Chain Connect Courtesy Xilinx

RAM-based FPGA Xilinx XC4000ex Courtesy Xilinx

Design at a crossroad System-on-a-Chip
RAM 500 k Gates FPGA + 1 Gbit DRAM Preprocessing Multi- Spectral Imager mC system +2 Gbit DRAM Recog- nition Analog 64 SIMD Processor Array + SRAM Image Conditioning 100 GOPS Embedded applications: cost, performance, and energy are the issues! DSP and control intensive Mixed-mode Software design is crucial

Addressing the Design Complexity Issue Architecture Reuse
Reuse comes in generations

Heterogeneous Programmable Platforms
FPGA Fabric Embedded memories Embedded PowerPc Hardwired multipliers Xilinx Vertex-II Pro High-speed I/O Courtesy: Xilinx

Summary Design Choice forced by System Tradeoffs
Deep Sub-micron Challenges Regularity (Design flexibility at smallest scales) Power consumption! Interconnection Parasitics Nanoscopic Devices/Modules New circuit solutions are bound to emerge Who can afford design in the years to come?

Design Methodology.

Similar presentations

Presentation on theme: "Design Methodology."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Design Methodology.

Similar presentations

Presentation on theme: "Design Methodology."— Presentation transcript:

Similar presentations

About project

Feedback