Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 ECE 697F Reconfigurable Computing Lecture 6 Mapping to Embedded Memory and PLAs.

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Technology Mapping. Perform the final gate selection from a particular library Two basic approaches 1. ruled based technique 2. graph covering technique.
Spartan-3 FPGA HDL Coding Techniques
ECE 667 Synthesis & Verificatioin - FPGA Mapping 1 ECE 667 Synthesis and Verification of Digital Systems Technology Mapping for FPGAs D.Chen, J.Cong, DAOMap.
FPGA Technology Mapping Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Floating-Point FPGA (FPFPGA) Architecture and Modeling (A paper review) Jason Luu ECE University of Toronto Oct 27, 2009.
FPGA Latency Optimization Using System-level Transformations and DFG Restructuring Daniel Gomez-Prado, Maciej Ciesielski, and Russell Tessier Department.
➢ Performing Technology Mapping and Optimization by DAG Covering: A Review of Traditional Approaches Evriklis Kounalakis.
Combining Technology Mapping and Retiming EECS 290A Sequential Logic Synthesis and Verification.
1 DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen, Jacon Cong ICCAD 2004 Presented by: Wei Chen.
Technology Mapping.
Lecture 3: Field Programmable Gate Arrays II September 10, 2013 ECE 636 Reconfigurable Computing Lecture 3 Field Programmable Gate Arrays II.
Physical Design Outline –What is Physical Design –Design Methods –Design Styles –Analysis and Verification Goal –Understand physical design topics Reading.
Rewiring – Review, Quantitative Analysis and Applications Matthew Tang Wai Chung CUHK CSE MPhil 10/11/2003.
EDA (CS286.5b) Day 3 Clustering (LUT Map and Delay) N.B. no lecture Thursday.
CS294-6 Reconfigurable Computing Day 15 October 13, 1998 LUT Mapping.
Lecture 9: Multi-FPGA System Software October 3, 2013 ECE 636 Reconfigurable Computing Lecture 9 Multi-FPGA System Software.
EDA (CS286.5b) Day 19 Covering and Retiming. “Final” Like Assignment #1 –longer –more breadth –focus since assignment #2 –…but ideas are cummulative –open.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 3: January 27, 2008 Clustering (LUT Mapping, Delay) Please work preclass example.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 17: March 30, 2009 Clustering (LUT Mapping, Delay)
FPGA Technology Mapping Algorithms
FPGA Technology Mapping. 2 Technology mapping:  Implements the optimized nodes of the Boolean network to the target device library.  For FPGA, library.
Register-Transfer (RT) Synthesis Greg Stitt ECE Department University of Florida.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Lecture 2: Field Programmable Gate Arrays September 13, 2004 ECE 697F Reconfigurable Computing Lecture 2 Field Programmable Gate Arrays.
Philip Brisk 2 Paolo Ienne 2 Hadi Parandeh-Afshar 1,2 1: University of Tehran, ECE Department 2: EPFL, School of Computer and Communication Sciences Efficient.
An automatic tool flow for the combined implementation of multi-mode circuits Brahim Al Farisi, Karel Bruneel, João Cardoso, Dirk Stroobandt.
Electrical and Computer Engineering Archana Rengaraj ABC Logic Synthesis basics ECE 667 Synthesis and Verification of Digital Systems Spring 2011.
05/04/06 1 Integrating Logic Synthesis, Tech mapping and Retiming Presented by Atchuthan Perinkulam Based on the above paper by A. Mishchenko et al, UCAL.
Tools - Implementation Options - Chapter15 slide 1 FPGA Tools Course Implementation Options.
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
1 EECS 219B Spring 2001 Timing Optimization Andreas Kuehlmann.
A Routing Approach to Reduce Glitches in Low Power FPGAs Quang Dinh, Deming Chen, Martin D. F. Wong Department of Electrical and Computer Engineering University.
L11: Lower Power High Level Synthesis(2) 성균관대학교 조 준 동 교수
Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 24: April 18, 2011 Covering and Retiming.
DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen and Jason Cong Computer Science Department University of California,
Lecture 13: Logic Emulation October 25, 2004 ECE 697F Reconfigurable Computing Lecture 13 Logic Emulation.
CprE / ComS 583 Reconfigurable Computing Prof. Joseph Zambreno Department of Electrical and Computer Engineering Iowa State University Lecture #4 – FPGA.
1 - CPRE 583 (Reconfigurable Computing): VHDL to FPGA: A Tool Flow Overview Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture 5: 9/7/2011.
1 A Min-Cost Flow Based Detailed Router for FPGAs Seokjin Lee *, Yongseok Cheon *, D. F. Wong + * The University of Texas at Austin + University of Illinois.
Timing-Driven Routing for FPGAs Based on Lagrangian Relaxation
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 3: January 12, 2004 Clustering (LUT Mapping, Delay)
1 Synthesizing Datapath Circuits for FPGAs With Emphasis on Area Minimization Andy Ye, David Lewis, Jonathan Rose Department of Electrical and Computer.
Technology Mapping. 2 Technology mapping is the phase of logic synthesis when gates are selected from a technology library to implement the circuit. Technology.
CALTECH CS137 Spring DeHon 1 CS137: Electronic Design Automation Day 5: April 12, 2004 Covering and Retiming.
Optimality Study of Logic Synthesis for LUT-Based FPGAs Jason Cong and Kirill Minkovich.
Give qualifications of instructors: DAP
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
Altera Technical Solutions Seminar Schedule OpeningIntroduction FLEX ® 10KE Devices APEX ™ 20K & Quartus ™ Overview Design Integration EDA Integration.
FPGA Logic Cluster Design Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
1 WireMap FPGA Technology Mapping for Improved Routability Stephen Jang, Xilinx Inc. Billy Chan, Xilinx Inc. Kevin Chung, Xilinx Inc. Alan Mishchenko,
DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen, Jason Cong , Computer Science Department , UCLA Presented.
Lecture 17: Dynamic Reconfiguration I November 10, 2004 ECE 697F Reconfigurable Computing Lecture 17 Dynamic Reconfiguration I Acknowledgement: Andre DeHon.
SEMI-SYNTHETIC CIRCUIT GENERATION FOR TESTING INCREMENTAL PLACE AND ROUTE TOOLS David GrantGuy Lemieux University of British Columbia Vancouver, BC.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2015 Clustering (LUT Mapping, Delay)
A Survey of Fault Tolerant Methodologies for FPGA’s Gökhan Kabukcu
Resource Sharing in LegUp. Resource Sharing in High Level Synthesis Resource Sharing is a well-known technique in HLS to reduce circuit area by sharing.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 25: April 17, 2013 Covering and Retiming.
CS137: Electronic Design Automation
ESE535: Electronic Design Automation
Reconfigurable Computing
Standard-Cell Mapping Revisited
Alan Mishchenko University of California, Berkeley
SAT-Based Optimization with Don’t-Cares Revisited
Xilinx CPLD Fitter Advanced Optimization
ESE535: Electronic Design Automation
Measuring the Gap between FPGAs and ASICs
CS137: Electronic Design Automation
Fast Min-Register Retiming Through Binary Max-Flow
Presentation transcript:

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 ECE 697F Reconfigurable Computing Lecture 6 Mapping to Embedded Memory and PLAs

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Outline °Overview Targeted to existing hybrid PLA/LUT FPGAs Area and timing-constrained mapping °HybridMap Graph-based approach Post-map product term estimation °Results Comparisons with existing approaches Mapping to Apex20KE °Acknowledgement: Srini Krishnamoorthy

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Hybrid FPGAs with LUTs and PLAs °LUTs Dense logic, small number of inputs Special circuitry for arithmetic logic °PLAs For wide fanin, low logic density structures Control-logic (e.g. Finite State Machines) °Device containing both resources Move maximum logic to PLAs Implement remaining logic in LUTs Inputs Reset Output

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Hybrid FPGAs °Resources LUT blocks and Pterm macrocells °Objective Minimize 4-LUT area subject to design performance constraints Potential use: smaller devices °Key factor Efficient partitioning of design components

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Hybrid FPGA (Similar to APEX20KE Megalab) Global Interconnect LE 1 LE 2 LE 3 LE 4 LE 5 LE 6 LE 7 LE 8 LE 1 LE 2 LE 3 LE 4 LE 5 LE 6 LE 7 LE 8 PLA LUTs Local Interconnects

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 HybridMap Flow SIS Tech. Independent Optimization Node Partitioning Design Entry LUT Identification Subgraph Generation Pterm Estimation Area Estimation Subgraph Merging Pterm Estimation Area Estimation LUT Mapping PLA Mapping Place and Route PLA Subgraphs LUT Partition Mapped Design Vendor-Specific CAD

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 LUT Identification I1 I2 I3 I4 I5 I6 I7 O1 O2 O3  Design initially reduced to 2-input gates  Topological traversal starting from primary inputs  Input count to any LUT cluster is less than K  A node always belongs to the cluster that minimizes delay  LUT clusters are preliminary  Used to estimate LUT count  Similar to DAG-map (Lawler’s alg) LUT Cluster

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Subgraph Generation Maximum Fanout Free Cone (MFFC) Maximum Fanout Free Subgraph (MFFS) - All edges except output edge(s) stay within the cone O O1 O2

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Subgraph Generation V Search Root-set determination (Forward BFS) Condition: Nodes at each level <= O pla  Extract a multi-input, multi-output subgraph  Root set determination  Identify subgraph outputs  Forward traversal  Subgraph identification  Identify the subgraph inputs  Backward traversal

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Subgraph Identification V I3 I2I1I0 Search Subgraph determination (Backward BFS) O1 O2 O3 O4 O5 Condition: Inputs <= I pla, Backward breadth first search until input constraint met

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Reconvergent Paths  Hill-climbing approach  Start basic subgraph search algorithm from v  If nodes encountered > O pla  Neglect PLA output constraint  Continue search  When nodes encountered < O pla  Revert back to basic search algorithm V I1 I2 I3 I4 O1 O2 basic hill climbing basic Search

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Subgraph Pruning  Hill-climbing subgraph may violate PLA I/O constraints  Need to prune  Pruning steps  Collapse subgraph to two- level form  Remove outputs requiring least inputs  Outputs requiring K inputs (single LUT removal)  Minimal multi-LUT removal PLA

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Subgraph Combining Smaller subgraphs can be packed onto a single PLA Combine based on cost function -LUT savings due to merging -Input sharing -Pterm count Choose 2 subgraphs for merging based on combined feasibility in terms of PLA inputs, outputs and Pterms -Invoke Pterm estimator SG 1 SG 2

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Results: Effect of Initial Graph Node Size Resources: 4-LUTs, 32 Input 16 Output 32 Pterm Macrocells  2-Input case  Identify PLA Partitions  Map rest to LUTs using Flowmap  4-Input case  Map initially to LUTs  Extract PLA Partitions  13 MCNC benchmarks (8939 LUTs initially) Total LUT Savings R=10 R=5 R=10 R=5 R=1 R Number of PLAs 2-input nodes 4-input nodes

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Altera software: Quartus v Timing Constrained Case : 8% LUT Reduction Unconstrained Case : 14% LUT Reduction Mapping to APEX20KE-1

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Summary Hybrid FPGAs: A challenging problem Subgraph based approach to technology mapping Pterm based macrocells, Pterm estimation Unconstrained and timing constrained mapping -Delay and area estimation an important part of the mapping process Results -Pterm estimation improves LUT coverage by about 12% - Apex20KE devices –Unconstrained : 14% 4-LUT savings –Timing constrained : 8% 4-LUT savings

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Notes for Transcription Assignment °Focus on one main point °Write an outline °Check English and spelling

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Transcription Introduction Lecture summary Summary of papers Contrast of papers Conclusion References

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Overview Paper should be focused on a primary goal -May not be obvious until after reading -What will the reader take away from this work -Is there a common theme Introduction -Should summarize the entire paper. -Each paragraph presents a main idea -Ideas should be detailed but sufficiently high-level. -Results paragraph should present a result to support the theme.

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Tackling the Body Lecture summary -Brief review of the lecture -Focus on main point -Should flow seamlessly into/with papers -I can make powerpoint available Summary of papers -Discussion of main approaches -Important new techniques -Main point made by author -Main result/results

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Contrasting the Papers Pick categories for comparison -Should be three to four Each section should contrast a different category. -Try to focus on specific details of each paper Consider using a table to supplement text. -Helps clarify detailed comparison Focus on comparison rather than restating the details

Lecture 6: Mapping to Embedded Memory and PLAs September 27, 2004 Other Tips Have a friend review your work. Try to keep sentences focused -Subject – verb – description Avoid using first person. Write a detailed outline before you write any text. Give yourself plenty of time You should have at least four or five references.