User Programmable Logic Device: Architecture and CAD Tools TingTing Hwang and C. L. Liu.

Slides:



Advertisements
Similar presentations
Technology Mapping. Perform the final gate selection from a particular library Two basic approaches 1. ruled based technique 2. graph covering technique.
Advertisements

Presentation of Designing Efficient Irregular Networks for Heterogeneous Systems-on-Chip by Christian Neeb and Norbert Wehn and Workload Driven Synthesis.
A Graph-Partitioning-Based Approach for Multi-Layer Constrained Via Minimization Yih-Chih Chou and Youn-Long Lin Department of Computer Science, Tsing.
Altera FLEX 10K technology in Real Time Application.
Timing Optimization. Optimization of Timing Three phases 1globally restructure to reduce the maximum level or longest path Ex: a ripple carry adder ==>
ECE 506 Reconfigurable Computing Lecture 6 Clustering Ali Akoglu.
BSPlace: A BLE Swapping technique for placement Minsik Hong George Hwang Hemayamini Kurra Minjun Seo 1.
Ch.7 Layout Design Standard Cell Design TAIST ICTES Program VLSI Design Methodology Hiroaki Kunieda Tokyo Institute of Technology.
Power Visualization, Analysis, and Optimization Tools for FPGAs Matthew French, Li Wang and Michael Wirthlin USC, Information Sciences Institute, BYU.
Architecture Design Methodology. 2 The effects of architecture design on metrics:  Area (cost)  Performance  Power Target market:  A set of application.
Lecture 26: Reconfigurable Computing May 11, 2004 ECE 669 Parallel Computer Architecture Reconfigurable Computing.
Chapter 2 – Netlist and System Partitioning
Lecture 3: Field Programmable Gate Arrays II September 10, 2013 ECE 636 Reconfigurable Computing Lecture 3 Field Programmable Gate Arrays II.
Reconfigurable Computing (EN2911X, Fall07)
Physical Design Outline –What is Physical Design –Design Methods –Design Styles –Analysis and Verification Goal –Understand physical design topics Reading.
NanoMap: An Integrated Design Optimization Flow for a Hybrid Nanotube/CMOS Dynamically Reconfigurable Architecture Wei Zhang†, Li Shang‡ and Niraj K. Jha†
EDA (CS286.5b) Day 3 Clustering (LUT Map and Delay) N.B. no lecture Thursday.
CS294-6 Reconfigurable Computing Day 15 October 13, 1998 LUT Mapping.
Lecture 9: Multi-FPGA System Software October 3, 2013 ECE 636 Reconfigurable Computing Lecture 9 Multi-FPGA System Software.
The Memory/Logic Interface in FPGA’s with Large Embedded Memory Arrays The Memory/Logic Interface in FPGA’s with Large Embedded Memory Arrays Steven J.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 3: January 27, 2008 Clustering (LUT Mapping, Delay) Please work preclass example.
1 Circuit Partitioning Presented by Jill. 2 Outline Introduction Cut-size driven circuit partitioning Multi-objective circuit partitioning Our approach.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 17: March 30, 2009 Clustering (LUT Mapping, Delay)
1. 2 FPGAs Historically, FPGA architectures and companies began around the same time as CPLDs FPGAs are closer to “programmable ASICs” -- large emphasis.
1 CSC 6001 VLSI CAD (Physical Design) January
ECE 506 Reconfigurable Computing Lecture 7 FPGA Placement.
1 DIGITAL DESIGN I DR. M. MAROUF FPGAs AUTHOR J. WAKERLY.
Global Routing.
POWER-DRIVEN MAPPING K-LUT-BASED FPGA CIRCUITS I. Bucur, N. Cupcea, C. Stefanescu, A. Surpateanu Computer Science and Engineering Department, University.
CAD for Physical Design of VLSI Circuits
Ch.9 CPLD/FPGA Design TAIST ICTES Program VLSI Design Methodology Hiroaki Kunieda Tokyo Institute of Technology.
March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,
© 2003 Xilinx, Inc. All Rights Reserved For Academic Use Only Xilinx Design Flow FPGA Design Flow Workshop.
1 Wire Length Prediction-based Technology Mapping and Fanout Optimization Qinghua Liu Malgorzata Marek-Sadowska VLSI Design Automation Lab UC-Santa Barbara.
University of British Columbia Dept. of Electrical and Computer Engineering November 30, 2007 A Combined Clustering and Placement Algorithm for FPGAs Mark.
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
Julien Lamoureux and Steven J.E Wilton ICCAD
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
Massachusetts Institute of Technology 1 L14 – Physical Design Spring 2007 Ajay Joshi.
10/25/ VLSI Physical Design Automation Prof. David Pan Office: ACES Lecture 3. Circuit Partitioning.
L11: Lower Power High Level Synthesis(2) 성균관대학교 조 준 동 교수
Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.
Impact of Interconnect Architecture on VPSAs (Via-Programmed Structured ASICs) Usman Ahmed Guy Lemieux Steve Wilton System-on-Chip Lab University of British.
Han Liu Supervisor: Seok-Bum Ko Electrical & Computer Engineering Department 2010-Feb-2.
This material exempt per Department of Commerce license exception TSU Xilinx Tool Flow.
1 A Min-Cost Flow Based Detailed Router for FPGAs Seokjin Lee *, Yongseok Cheon *, D. F. Wong + * The University of Texas at Austin + University of Illinois.
ECE 260B – CSE 241A /UCB EECS Kahng/Keutzer/Newton Physical Design Flow Read Netlist Initial Placement Placement Improvement Cost Estimation Routing.
Timing-Driven Routing for FPGAs Based on Lagrangian Relaxation
Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 ECE 697F Reconfigurable Computing Lecture 6 Mapping to Embedded Memory and PLAs.
1 Synthesizing Datapath Circuits for FPGAs With Emphasis on Area Minimization Andy Ye, David Lewis, Jonathan Rose Department of Electrical and Computer.
FPGA CAD 10-MAR-2003.
1 Field-programmable Gate Array Architectures and Algorithms Optimized for Implementing Datapath Circuits Andy Gean Ye University of Toronto.
FPGA Logic Cluster Design Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
1 WireMap FPGA Technology Mapping for Improved Routability Stephen Jang, Xilinx Inc. Billy Chan, Xilinx Inc. Kevin Chung, Xilinx Inc. Alan Mishchenko,
Architecture and algorithm for synthesizable embedded programmable logic core Noha Kafafi, Kimberly Bozman, Steven J. E. Wilton 2003 Field programmable.
ASIC/FPGA design flow. Design Flow Detailed Design Detailed Design Ideas Design Ideas Device Programming Device Programming Timing Simulation Timing Simulation.
Interconnect Characteristics of 2.5-D System Integration Scheme Yangdong (Steven) Deng & Wojciech P. Maly
ECE 506 Reconfigurable Computing Lecture 5 Logic Block Architecture Ali Akoglu.
1 Architecture of Datapath- oriented Coarse-grain Logic and Routing for FPGAs Andy Ye, Jonathan Rose, David Lewis Department of Electrical and Computer.
Programmable Hardware: Hardware or Software?
ASIC Design Methodology
Placement study at ESA Filomena Decuzzi David Merodio Codinachs
ESE535: Electronic Design Automation
FlowMap Algorithm Perform clustering on the following 2-bounded network Intra-cluster and node delay = 0, inter-cluster = 1 Pin constraint = 3 Practical.
Chapter 2 – Netlist and System Partitioning
FPGA Glitch Power Analysis and Reduction
ESE535: Electronic Design Automation
Fast Min-Register Retiming Through Binary Max-Flow
Chapter 3b Leakage Efficient Chip-Level Dual-Vdd Assignment with Time Slack Allocation for FPGA Power Reduction Prof. Lei He Electrical Engineering Department.
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

User Programmable Logic Device: Architecture and CAD Tools TingTing Hwang and C. L. Liu

Design Decisions Mid-size Capacity For control logic Hierarchical Interconnection Structure Predictable delay PLA-based Cell Structure

New Architecture Cell Structure Interconnection Architecture

Single-Output PLA Cells Investigate the tradeoff between chip area/delay and cell granularity for single-output PLA cells.

Conclusions for Cell Structure Area model: 3-4 inputs, 2 products PLA-based FPGA area versus cell granularity by Kouloheris and Gamal: 8-10 inputs, products, 3-4 outputs Area model plays an important role in the architecture evaluation process

Interconnect Architecture A B 2 2 A B PI=6 level2 A B level3level4 PO=6 4 2 S=2 level

PI switch logic cell Hierarchical Interconnect Structure

Graph Model of Interconnections pin1pin2 PI cell3 input Level3_input Level2_input Level1_input cell1cell2 output

Development Flow Input Technology Mapping Hierarchical Clustering Placement & Routing Output  TV-Pack  U of Toronto  SIS- TEMPLA  陳世梁  Placement & 100 % routing completion  洪偉綸  Performance driven hierarchical clustering  謝禎鋐  Hierarchical clustering for area  林稚忠

Algorithm for Technology Mapping Area minimization and Delay (Depth) minimization Select k Mapping Product term check LUT-blocks Single-output PLA blocks

Hierarchical Clustering for Area Given pin and capacity constraints of block at each level, minimize the area required (the number of top-level blocks)

Min-Cut Based Hierarchical Clustering Step1 :Min-cut (net) based K-way partition by hMetis (initial partition) Step2 :Refine the initial partitions to satisfy pin and area constraints. Step3 :If pin and area constraints can’t be satisfied, then abort, re-start and set K=K+1. Step4 :If it reaches leave block, then outputs the result else do the next level partition

Performance Driven Hierarchical Clustering Given timing constraint (required time), pin and block capacity constraints, Minimize the area required (the number of top blocks)

Algorithm 1.Clustering Labeling Multiple fan-out node duplication 2.Merging 3-Level hierarchical merge Area constraint Pin constraint

Delay & Area Comparisons 174 (1) 198 (1) 1277 (1) 253 (1) 151 (1) 6 52% 6 40% 6 53% 7 40% 8 60% 214 (1.23) 227 (1.15) 1479 (1.16) 286 (1.13) 172 (1.14) 5 63% 5 48% 5 61% 6 47% 7 68% 319 (1.83) 4 75% pair 332 (1.67) 4 75% C (1.36) 4 75% C (1.41) 5 60% C (1.40) 6 83% alu4 delay#block #PO#PI#PLBCir. P-BasedNo-DupB-DelayP-BasedNo-DupB-Delay

Placement and Routing Architecture File Clustered Net-lists P & R Output Placement & Routing on Graph

Algorithm for Placement and Routing While( not all nodes placed ) { pick the most critical net-list from the net-list set; for the selected net do compute priority of node for the selected net; select one node based on node ’ s priority; select location for the node based on the current placed nodes; place the node at the selected location; update priority of nets; mark placed nodes; }

Future Work Architecture Study Area Speed Expandability Design Tools Technology Mapping Placement and Routing Architecture Design Tools