Penn ESE535 Spring 2015 -- DeHon 1 ESE535: Electronic Design Automation Day 11: February 25, 2015 Placement (Intro, Constructive)

Slides:



Advertisements
Similar presentations
Penn ESE 535 Spring DeHon 1 ESE535: Electronic Design Automation Day 12: February 23, 2011 Routing 2 (Pathfinder)
Advertisements

CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 14: March 3, 2004 Scheduling Heuristics and Approximation.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 11: March 4, 2008 Placement (Intro, Constructive)
Circuit Retiming with Interconnect Delay CUHK CSE CAD Group Meeting One Evangeline Young Aug 19, 2003.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 20: April 13, 2009 Placement II (Simulated Annealing)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 9: February 20, 2008 Partitioning (Intro, KLFM)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 2: January 23, 2008 Covering.
EDA (CS286.5b) Day 5 Partitioning: Intro + KLFM. Today Partitioning –why important –practical attack –variations and issues.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 21: April 15, 2009 Routing 1.
Penn ESE Fall DeHon 1 ESE (ESE534): Computer Organization Day 19: March 26, 2007 Retime 1: Transformations.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 13: February 26, 2007 Interconnect 1: Requirements.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 7: February 11, 2008 Static Timing Analysis and Multi-Level Speedup.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 11: February 14, 2007 Compute 1: LUTs.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 19: April 9, 2008 Routing 1.
EDA (CS286.5b) Day 11 Scheduling (List, Force, Approximation) N.B. no class Thursday (FPGA) …
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2009 Architecture Synthesis (Provisioning, Allocation)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 16: March 23, 2009 Covering.
EDA (CS286.5b) Day 7 Placement (Simulated Annealing) Assignment #1 due Friday.
Penn ESE525 Spring DeHon 1 ESE535: Electronic Design Automation Day 10: February 18, 2009 Partitioning 2 (spectral, network flow)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 15: March 18, 2009 Static Timing Analysis and Multi-Level Speedup.
Penn ESE Spring DeHon 1 ESE (ESE534): Computer Organization Day 14: February 28, 2007 Interconnect 2: Wiring Requirements and Implications.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2009 Architecture Synthesis (Provisioning, Allocation)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 8: February 13, 2008 Retiming.
Introduction to Routing. The Routing Problem Apply after placement Input: –Netlist –Timing budget for, typically, critical nets –Locations of blocks and.
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 13: February 4, 2005 Interconnect 1: Requirements.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 4: January 28, 2015 Partitioning (Intro, KLFM)
ESE Spring DeHon 1 ESE534: Computer Organization Day 19: April 7, 2014 Interconnect 5: Meshes.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 2: January 26, 2015 Covering Work preclass exercise.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 12: February 13, 2002 Scheduling Heuristics and Approximation.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 2: January 21, 2015 Heterogeneous Multicontext Computing Array Work Preclass.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 9: February 16, 2015 Scheduling Variants and Approaches.
Massachusetts Institute of Technology 1 L14 – Physical Design Spring 2007 Ajay Joshi.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 9: February 14, 2011 Placement (Intro, Constructive)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 24: April 18, 2011 Covering and Retiming.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 10: February 18, 2015 Architecture Synthesis (Provisioning, Allocation)
Modern VLSI Design 3e: Chapter 10 Copyright  1998, 2002 Prentice Hall PTR Topics n CAD systems. n Simulation. n Placement and routing. n Layout analysis.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 6: January 23, 2002 Partitioning (Intro, KLFM)
Penn ESE525 Spring DeHon 1 ESE535: Electronic Design Automation Day 6: February 4, 2014 Partitioning 2 (spectral, network flow)
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 10: February 6, 2002 Placement (Simulated Annealing…)
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 7: February 3, 2002 Retiming.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 23: April 20, 2015 Static Timing Analysis and Multi-Level Speedup.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 13: February 20, 2002 Routing 1.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 13: March 3, 2015 Routing 1.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 18: April 2, 2014 Interconnect 4: Switching.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 2: January 20, 2010 Universality, Gates, Logic Work Preclass Exercise.
CALTECH CS137 Winter DeHon CS137: Electronic Design Automation Day 9: February 9, 2004 Partitioning (Intro, KLFM)
CALTECH CS137 Spring DeHon 1 CS137: Electronic Design Automation Day 5: April 12, 2004 Covering and Retiming.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 10: February 16, 2011 Placement II (Simulated Annealing)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 6: January 30, 2013 Partitioning (Intro, KLFM)
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 20: April 4, 2011 Static Timing Analysis and Multi-Level Speedup.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 15: March 13, 2013 High Level Synthesis II Dataflow Graph Sharing.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 5: February 2, 2015 Clustering (LUT Mapping, Delay)
Caltech CS184 Winter DeHon 1 CS184a: Computer Architecture (Structure and Organization) Day 12: February 5, 2003 Interconnect 2: Wiring Requirements.
CALTECH CS137 Fall DeHon 1 CS137: Electronic Design Automation Day 2: September 28, 2005 Covering.
Penn ESE534 Spring DeHon 1 ESE534: Computer Organization Day 17: March 29, 2010 Interconnect 2: Wiring Requirements and Implications.
CALTECH CS137 Winter DeHon 1 CS137: Electronic Design Automation Day 8: January 27, 2006 Cellular Placement.
CALTECH CS137 Fall DeHon 1 CS137: Electronic Design Automation Day 21: November 28, 2005 Routing 1.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 25: April 17, 2013 Covering and Retiming.
CS137: Electronic Design Automation
CS137: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
ESE535: Electronic Design Automation
Presentation transcript:

Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 11: February 25, 2015 Placement (Intro, Constructive)

Penn ESE535 Spring DeHon 2 Today 2D Placement Problem Partitioning  Placement Quadrisection Refinement Behavioral (C, MATLAB, …) RTL Gate Netlist Layout Masks Arch. Select Schedule FSM assign Two-level, Multilevel opt. Covering Retiming Placement Routing

Penn ESE535 Spring DeHon 3 Placement Problem: Pick locations for all building blocks –minimizing energy, delay, area –really: minimize wire length minimize channel density

Penn ESE535 Spring DeHon 4 Bad Placement How bad can it be? –Area –Delay –Energy

Preclass Channel Widths Channel Width for Problem 1? Penn ESE535 Spring DeHon 5

Preclass Channel Widths Channel Width for Problem 2? Penn ESE535 Spring DeHon 6

7 Bad: Area All wires cross bisection O(N 2 ) area good: O(N)

Delay How bad can delay be? Penn ESE535 Spring DeHon 8

Delay How good can delay be? Penn ESE535 Spring DeHon 9

10 Bad: Delay All critical path wires cross chip Delay =O(|PATH|*2*L side ) –[and L side is O(N)] good: O(|PATH|* L g ) compare 10ps gates to many nanoseconds to cross chip

Penn ESE535 Spring DeHon 11 Clock Cycle Radius Radius of logic can reach in one cycle (45 nm) –1 Cycle Radius = 10 Few hundred PEs –Chip side 1,000 PE million PEs –100s of cycles to cross

Penn ESE535 Spring DeHon 12 Bad: Energy All wires cross chip: O(L side ) long  O(L side ) capacitance per wire Recall Area  O(N 2 ) So L side  O(N)  O(N) wires  O(N 2 ) capacitance Good: O(1) long wires  O(N) capacitance

Penn ESE535 Spring DeHon 13 Manhattan

Penn ESE535 Spring DeHon 14 Manhattan Distance Horizontal and Vertical Routing: Manhattan distance |X i -X j |+|Y i -Y j | Contrast: Euclidean distance

Penn ESE535 Spring DeHon 15 Distance Can we place everything close?

Penn ESE535 Spring DeHon 16 Illustration Consider a complete tree – nand2’s, no fanout –N nodes Logical circuit depth? Circuit Area? Side Length? Average wire length between nand gates? (lower bound)

Penn ESE535 Spring DeHon 17 “Closeness” Try placing “everything” close

Preclass unit side, 4 unit × unit gates Wire length lower bound? Penn ESE535 Spring DeHon 18

Penn ESE535 Spring DeHon 19 Generalizing Preclass 5 What’s minimum length for longest wires? ?

Penn ESE535 Spring DeHon 20 Generalizing Interconnect Lengths P>0.5 Side is  (N) IO crossing it is N p What’s minimum length for longest wires? Implication: –Wire lengths grow at least as fast as N (p-0.5) ?

Penn ESE535 Spring DeHon 21 Generalizing Interconnect Lengths Large cut widths imply long wires ?

Penn ESE535 Spring DeHon 22 Placement Problem Characteristics Familiar –NP Complete –local, greedy not work –greedy gets stuck in local minima

Penn ESE535 Spring DeHon 23 Constructive Placement

Penn ESE535 Spring DeHon 24 Basic Idea Partition (bisect) to define halves of chip –minimize wire crossing Recurse to refine When get down to single component, done

Penn ESE535 Spring DeHon 25 Adequate? Does recursive bisection capture the primary constraints of two-dimensional placement?

Penn ESE535 Spring DeHon 26 Problems Greedy, top-down cuts –maybe better pay cost early? Two-dimensional problem –(often) no real cost difference between H and V cuts Interaction between subtrees –not modeled by recursive bisect

Example Think of this (right) as logical graph. Assume we find the “right” bisection (shown) Where do A and B go? How does recursive partitioning enforce/encourage this? Penn ESE535 Spring DeHon 27 ABABA

Penn ESE535 Spring DeHon 28 Interaction

Penn ESE535 Spring DeHon 29 Example Ideal split (not typical) “Equivalent” split ignoring external constraints Practically -- makes all H cuts also be V cuts

Penn ESE535 Spring DeHon 30 Interaction

Penn ESE535 Spring DeHon 31 Problem Need to keep track of where things are –outside of current partition –include costs induced by above …but don’t necessarily know where things are –still solving problem

Penn ESE535 Spring DeHon 32 Improvement: Ordered Order operations Keep track of existing solution Use to constrain or pass costs to next subproblem B A

Penn ESE535 Spring DeHon 33 Improvement: Ordered Order operations Keep track of existing solution Use to constrain or pass costs to next subproblem Flow cut –use existing in src/sink –A nets = src, B nets = sink B A S T

Penn ESE535 Spring DeHon 34 Improvement: Ordered Order operations Keep track of existing solution Use to constrain or pass costs to next subproblem Flow cut –use existing in src/sink –A nets = src, B nets = sink FM: start with fixed, unmovable nets for side-biased inputs B A S T

Penn ESE535 Spring DeHon 35 Improvement: Constrain Partition once Constrain movement within existing partitions Account for both H and V crossings Partition next –(simultaneously work parallel problems) –easy modification to FM

Penn ESE535 Spring DeHon 36 Constrain Partition Solve AB and CD concurrently. C D A B

Penn ESE535 Spring DeHon 37 Improvement: Quadrisect Solve more of problem at once Quadrisection: –partition into 4 bins simultaneously –keep track of costs all around

Penn ESE535 Spring DeHon 38 Quadrisect Modify FM to work on multiple buckets k-way has: –k(k-1) buckets –|from|  |to| –quad  12 reformulate gains update still O(1)

Penn ESE535 Spring DeHon 39 Quadrisect Cases (15): –(1 partition)  4 –(2 part)  6 = (4 choose 2) –(3 part)  4 = (4 choose 3) –(4 part)  1

Penn ESE535 Spring DeHon 40 Recurse Keep outside constraints –(cost effects) Problem? –Don’t know detail place What can we do? –Model as at center of unrefined region

Penn ESE535 Spring DeHon 41 Option: Terminal Propagation Abstract inputs as terminals Partition based upon Represent cost effects on placement/refinement decisions

Penn ESE535 Spring DeHon 42 Option: Refine Keep refined placement Use in cost estimates

Penn ESE535 Spring DeHon 43 Problem Still have ordering problem What is the problem? Earlier subproblems solved with weak constraints from later –(cruder placement estimates) Solved previous case by flattening Why might not be satisfied with that? In extreme give up divide and conquer Alternative?

Penn ESE535 Spring DeHon 44 Iterate After solve later problems “Relax” solution Solve earlier problems again with refined placements (cost estimates) Repeat until converge

Penn ESE535 Spring DeHon 45 Iteration/Cycling General technique to deal with phase-ordering problem –what order do we perform transformations, make decisions? –How get accurate information to everyone Still basically greedy

Penn ESE535 Spring DeHon 46 Refinement Relax using overlapping windows Deal with edging effects Huang&Kahng claim 10-15% improve –cycle –overlap

Penn ESE535 Spring DeHon 47 Possible Refinement Allow unbalanced cuts –most things still work –just distort refinement groups –allowing unbalance using FM quadrisection looks a bit tricky –gives another 5-10% improvement

Penn ESE535 Spring DeHon 48 Runtime Each gain update still O(1) –(bigger constants) –so, FM partition pass still O(N) O(1) iterations expected assume O(1) overlaps exploited O(log(N)) levels Total: O(N log(N)) –very fast compared to typical annealing (annealing next time)

Penn ESE535 Spring DeHon 49 Quality: Area [Huang&Kahng/ISPD1997] Gordian-L: Analytic global placer DOMINO: network flow detail

Penn ESE535 Spring DeHon 50 Quality: Delay Weight edges based on criticality –Periodic, interleaved timing analysis

Penn ESE535 Spring DeHon 51 Uses Good by self Starting point for simulated annealing –speed convergence With synthesis (both high level and logic) –get a quick estimate of physical effects –(play role in estimation/refinement at larger level) Early/fast placement –before willing to spend time looking for best For fast placement where time matters –FPGAs, online placement?

Penn ESE535 Spring DeHon 52 Summary Partition to minimize cut size Additional constraints to do well –Improving constant factors Quadrisection Keep track of estimated placement Relax/iterate/Refine

Penn ESE535 Spring DeHon 53 Big Ideas: Potential dominance of interconnect Divide-and-conquer Successive Refinement Phase ordering: estimate/relax/iterate

Penn ESE535 Spring DeHon 54 Admin Reading for Monday –Online (JSTOR): classic paper on Simulated Annealing