Download presentation
Presentation is loading. Please wait.
1
1 General Iterative Heuristics for VLSI Multiobjective Partitioning by Dr. Sadiq M. Sait Dr. Aiman El-Maleh Mr. Raslan Al Abaji King Fahd University Computer Engineering Department
2
2 Introduction Problem Formulation Cost Functions Proposed Approaches Experimental results Conclusion Outline ….
3
3 Design Characteristics 0.13M 12MHz 1.5um CAE Systems, Silicon compilation 7.5M 333MHz 0.25um Cycle-based simulation, Formal Verification 3.3M 200MHz 0.6um Top-Down Design, Emulation 1.2M 50MHz 0.8um HDLs, Synthesis 0.06M 2MHz 6um SPICE Simulation Key CAD Capabilities The Challenges to sustain such an exponential growth to achieve gigascale integration have shifted in a large degree, from the process of manufacturing technologies to the design technology. VLSI Technology Trend
4
4 Technology0.1 um Transistors200 M Logic gates40 M Size520 mm 2 Clock2 - 3.5 GHz Chip I/O’s4,000 Wiring levels 7 - 8 Voltage0.9 - 1.2 Power160 Watts Supply current~160 Amps Performance Power consumption Noise immunity Area Cost Time-to-market Tradeoffs!!! The VLSI Chip in 2006
5
5 1.System Specification 2.Functional Design 3.Logic Design 4.Circuit Design 5.Physical Design 6.Design Verification 7.Fabrication 8.Packaging Testing and Debugging VLSI design process is carried out at a number of levels. VLSI Design Cycle
6
6 Physical Design converts a circuit description into a geometric description. This description is used to manufacture a chip. 1.Partitioning 2.Floorplanning and Placement 3.Routing 4.Compaction The physical design cycle consists of: Physical Design
7
7 Decomposition of a complex system into smaller subsystems. Each subsystem can be designed independently speeding up the design process (divide-and conquer-approach). Decompose a complex IC into a number of functional blocks, each of them designed by one or a team of engineers. Decomposition scheme has to minimize the interconnections between subsystems. Why we need Partitioning ?
8
8 System Level Partitioning Board Level Partitioning Chip Level Partitioning System PCBs Chips Subcircuits / Blocks Levels of Partitioning
9
9 Need for Power optimization Portable devices. Power consumption is a hindrance in further integration. Increasing clock frequency. Need for delay optimization In current sub micron design wire delay tend to dominate gate delay. Larger die size imply long on-chip global routes, which affect performance. Optimizing delay due to off-chip capacitance. Motivation
10
10 Objective Design a class of iterative algorithms for VLSI multi objective partitioning. Explore partitioning from a wider angle and consider circuit delay, power dissipation and interconnect in the same time, under balance constraint.
11
11 Objectives : Power cost is optimized AND Delay cost is optimized AND Cutset cost is optimized Constraint Balanced partitions to a certain tolerance degree. (10%) Problem formulation
12
12 Based on hypergraph model H = (V, E) Cost 1: c(e) = 1 if e spans more than 1 block Cutset = sum of hyperedge costs Efficient gain computation and update cutset = 3 Cutset
13
13 path : SE 1 C 1 C 4 C 5 SE 2. Delay = CD SE1 + CD C1 + CD C4 + CD C5 + CD SE2 CD C1 = BD C1 + LF C1 * ( Coffchip + CINP C2 + CINP C3 + CINP C4 ) Delay Model
14
14 Delay(Pi) = Pi: is any path Between 2 cells or nodes P : set of all paths of the circuit. Delay
15
15 The average dynamic power consumed by CMOS logic gate in a synchronous circuit is given by: Ni : is the number of output gate transition per cycle( switching Probability) : Is the Load Capacitance Power
16
16 : Load Capacitances driven by a cell before Partitioning : additional Load due to off chip capacitance.( cut net) Total Power dissipation of a Circuit: Power
17
17 : Can be assumed identical for all nets :Set of Visible gates Driving a load outside the partition. Power
18
18 The Balance as constraint is expressed as follows: However balance as a constraint is not appealing because it may prohibits lots of good moves. Objective : |Cells(block1) – Cells( block2)| Balance
19
19 Imprecise values of the objectives – best represented by linguistic terms that are basis of fuzzy algebra Conflicting objectives Operators for aggregating function Fuzzy logic for cost function
20
20 1.The cost to membership mapping. 2.Linguistic fuzzy rule for combining the membership values in an aggregating function. 3.Translation of the linguistic rule in form of appropriate fuzzy operators. Use of fuzzy logic for Multi- objective cost function
21
21 And-like operators –Min operator = min ( 1, 2 ) –And-like OWA = * min ( 1, 2 ) + ½ (1- ) ( 1 + 2 ) Or-like operators –Max operator = max ( 1, 2 ) –Or-like OWA = * max ( 1, 2 ) + ½ (1- ) ( 1 + 2 ) Where is a constant in range [0,1] Some fuzzy operators
22
22 WhereO i and C i are lower bound and actual cost of objective “i” i (x) is the membership of solution x in set “good ‘i’ ” g i is the relative acceptance limit for each objective. Membership functions
23
23 A good partitioning can be described by the following fuzzy rule IF solution has small cutset AND low power AND short delay AND good Balance. THEN it is a good solution Fuzzy linguistic rule
24
24 The above rule is translated to AND-like OWA Represent the total Fuzzy fitness of the solution, our aim is to Maximize this fitness. Respectively (Cutset, Power, Delay, Balance ) Fitness. Fuzzy cost function
25
25 Algorithm (Genetic_Algorithm) Construct_Population(N p ); For j = 1 to N p Evaluate_Fitness (Population[j]) End For; For i = 1 to N g For j = 1 to N o (x,y) Choose_parents; offspring[j] Crossover(x,y) EndFor; Population Select ( Population, offspring, N p ) For k = 1 to N p Apply Mutation (Population[k]) EndFor; GA for multiobjective Partitioning
26
26 Solution representation
27
27 Different population sizes Parent selection: Roulette wheel –The probability of selecting a chromosome for crossover is N p is the population size GA implementation
28
28 Simple single point crossover: Selection before mutation Roulette wheel (rlt) Elitism random (ernd) GA implementation
29
29 Algorithm Tabu_Search Start with an initial feasible solution S Initialize tabu list and aspiration level For fixed number of iterations Do Generate neighbor solutions V* N(S) Find best S* V* If move S to S* is not in T Then Accept move and update best solution Update T and AL Else If Cost(S*) < AL Then Accept move and update best solution Update T and AL End If End For Tabu Search
30
30 Neighbor solution –Change the block of a randomly selected cells. The Tabu list size depends on the circuit size. TS implementation
31
31 Tabu list Store index of one of the swapped cell. Various sizes for tabu list. Aspiration Level The best neighbor is better than the global best. TS implementation
32
32 Experimental Results ISCAS 85-89 Benchmark Circuits
33
33 GA Vs Tabu Multi-objective
34
Circuit S13207 GA
35
Circuit S13207 TS
36
36 Circuit S13207 GA Vs TS time
37
37 Conclusion The present work successfully addressed the important issue of reducing power and delay consumption in VLSI circuits. The present work successfully formulate and provide solutions to the problem of multiobjective VLSI partitioning TS partitioning algorithm outperformed GA in terms of quality of solution and execution time
38
38 Thank you.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.