Download presentation
1
Design Methodology
2
The Design Productivity Challenge
58%/Yr. compound Complexity growth rate 21%/Yr. compound Productivity growth rate 1981 Logic Transistors per Chip (K) Productivity (Trans./Staff-Month) 100 1,000 10,000 100,000 1,000,000 10,000,000 100,000,000 10 1985 1989 1993 1997 2001 2005 2009 Logic Transistors per Chip (K) Productivity (Trans./Staff-Month) 1981 1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 A growing gap between design complexity and design productivity Source: ITRS’97
3
The Custom Approach Intel 4004 Courtesy Intel
4
Transition to Automation and Regular Structures
Intel 4004 (‘71) Intel 8080 Intel 8085 Intel 80286 Intel 80486 Courtesy Intel
5
Automating Design Exploitation By Algorithms
Regular Structures Logic Synthesis Regularization of Connection Floorplanning (Localization of function) System Level Performance/Power/Cost Allocation of Physical Resources Communication/Interconnect Hierarchy based on Sensitivity to Latency Wires to Link Protocols
6
A System-on-a-Chip: Example
Courtesy: Philips
7
Design Methodology Design process traverses iteratively between three abstractions: behavior, structure, and geometry More and more automation for each of these steps
8
Floorplanning A Protocol Processor for Wireless
9
Implementation Choices
Custom Standard Cells Compiled Cells Ma cro Cells Cell-based Pre-diffused (Gate Arrays) Pre-wired (FPGA's) Array-based Semicustom Digital Circuit Implementation Approaches
10
Impact of Implementation Choices
Domain-specific processor (e.g. DSP) 10-100 Embedded microprocessor Energy Efficiency (in MOPS/mW) 1-10 Hardwired custom Configurable/Parameterizable 0.1-1 Somewhat flexible Fully flexible Flexibility (or application scope) None
11
Implementation Strategies
PLA Technology confined in cell macros (tiling) Cell based logic Technology confined to cells (area) Both 1-d and 2-d solutions Transistor Arrays (Gate arrays) Technology confined to layers (Below M1 fixed)
12
PLA: Programmable Logic Array
x 1 2 AND plane Product terms OR f
13
Two-Level Logic minterm
Every logic function can be expressed in sum-of-products format (AND-OR) minterm Inverting format (NOR-NOR) more effective
14
PLA Layout – Exploiting Regularity
V DD GND f And-Plane Or-Plane
15
Breathing Some New Life in PLAs
River PLAs A cascade of multiple-output PLAs. Adjacent PLAs are connected via river routing. No placement and routing needed. Output buffers and the input buffers of the next stage are shared. Courtesy B. Brayton
16
Experimental Results Also: RPLAs are regular and predictable
Area: RPLAs (2 layers) 1.23 SCs (3 layers) , NPLAs (4 layers) 1.31 Delay RPLAs 1.04 SCs 1.00 NPLAs 1.09 Synthesis time: for RPLA , synthesis time equals design time; SCs and NPLAs still need P&R. Also: RPLAs are regular and predictable Layout of C2670 Standard cell, 2 layers channel routing Standard cell, 3 layers OTC Network of PLAs, 4 layers OTC River PLA, 2 layers no additional routing
17
2-d Cell Based: “Hard” Modules
25632 (or 8192 bit) SRAM Generated by hard-macro module generator
18
1-d Cell-based Design (standard cells)
Feedthrough cell Logic cell Routing channel Rows of cells Functional Routing channel requirements are reduced by presence of more interconnect layers module (RAM, multiplier, )
19
Standard Cell — Example
[Brodersen92]
20
Standard Cell – The New Generation
Cell-structure hidden under interconnect layers
21
Standard Cell - Example
3-input NAND cell (from ST Microelectronics): C = Load capacitance T = input rise/fall time
22
“Soft” MacroModules Synopsys DesignCompiler
23
The “Design Closure” Problem
Iterative Removal of Timing Violations (white lines) Courtesy Synopsys
24
Gate Array — Sea-of-gates
Uncommited Cell Committed Cell (4-input NOR)
25
Sea-of-gate Primitive Cells
Using oxide-isolation Using gate-isolation
26
Sea-of-gates Random Logic Memory Subsystem LSI Logic LEA300K
(0.6 mm CMOS) Courtesy LSI Logic
27
The return of gate arrays?
Via programmable gate array (VPGA) Via-programmable cross-point metal-5 metal-6 programmable via Exploits regularity of interconnect [Pileggi02]
28
Pre-wired Arrays: Classification of prewired arrays (or field-programmable devices): Based on Programming Technique Fuse-based (program-once) Non-volatile EPROM based RAM based Programmable Logic Style Array-Based Look-up Table Programmable Interconnect Style Channel-routing Mesh networks
29
Fuse-Based FPGA Open by default, closed by applying current pulse
antifuse polysilicon ONO dielectric n + antifuse diffusion 2 l Open by default, closed by applying current pulse From Smith’97
30
Array-Based Programmable Logic
5 4 O 3 2 1 Programmable OR array O I 3 2 1 Fixed AND array Programmable OR array I 5 4 O 3 2 1 Fixed OR array Programmable AND array Programmable AND array O O O 3 2 1 O 3 O 2 O 1 PLA (flexible – sizing) PROM (dense) PAL (uniform load) Indicates programmable connection Indicates fixed connection
31
Programming a PROM f 1 X 2 NA : programmed node
32
2-input mux as programmable logic block
Configuration A B S F= X 1 Y XY X +Y X Y A F B 1 S
33
Logic Cell of Actel Fuse-Based FPGA
SA Y C D SB 1 S1 S0
34
LUT-Based Logic Cell Xilinx 4000 Series Courtesy Xilinx D C ....C
1 ....C clock 3 2 F Logic function H G Din F’ G’ H’ H1 H2 H0 EC S/R control Multiplexer Controlled by Configuration Program Y Q SD YQ Bypass X XQ RD Xilinx 4000 Series Courtesy Xilinx
35
Array-Based Programmable Wiring
Interconnect Point Programmed interconnection Input/output pin Cell Horizontal tracks Vertical tracks
36
Mesh-based Interconnect Network
Switch Box Connect Box Interconnect Point Courtesy Dehon and Wawrzyniek
37
Transistor Implementation of Mesh
Courtesy Dehon and Wawrzyniek
38
Hierarchical Mesh Network
Use overlayed mesh to support longer connections Reduced fanout and reduced resistance Courtesy Dehon and Wawrzyniek
39
Altera MAX From Smith97
40
Altera MAX Interconnect Architecture
column channel row channel LAB2 PIA LAB1 LAB6 t LAB Array-based (MAX ) Mesh-based (MAX 9000) Courtesy Altera
41
Field-Programmable Gate Arrays Fuse-based
Standard-cell like floorplan
42
Xilinx 4000 Interconnect Architecture
12 Quad 8 Single 4 Double 3 Long Direct CLB 2 Connect 3 Long 12 4 4 8 4 8 4 2 Quad Long Global Long Double Single Global Carry Direct Clock Clock Chain Connect Courtesy Xilinx
43
RAM-based FPGA Xilinx XC4000ex Courtesy Xilinx
44
Design at a crossroad System-on-a-Chip
RAM 500 k Gates FPGA + 1 Gbit DRAM Preprocessing Multi- Spectral Imager mC system +2 Gbit DRAM Recog- nition Analog 64 SIMD Processor Array + SRAM Image Conditioning 100 GOPS Embedded applications: cost, performance, and energy are the issues! DSP and control intensive Mixed-mode Software design is crucial
45
Addressing the Design Complexity Issue Architecture Reuse
Reuse comes in generations
46
Heterogeneous Programmable Platforms
FPGA Fabric Embedded memories Embedded PowerPc Hardwired multipliers Xilinx Vertex-II Pro High-speed I/O Courtesy: Xilinx
47
Summary Design Choice forced by System Tradeoffs
Deep Sub-micron Challenges Regularity (Design flexibility at smallest scales) Power consumption! Interconnection Parasitics Nanoscopic Devices/Modules New circuit solutions are bound to emerge Who can afford design in the years to come?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.