Presentation is loading. Please wait.

Presentation is loading. Please wait.

Smart Hill Climbing for Agile Dynamic Mapping in Many- Core Systems Design Automation Conference(DAC), pp.1-6, May 29-June 7 2013, Austin, TX, USA M. Fattah,

Similar presentations


Presentation on theme: "Smart Hill Climbing for Agile Dynamic Mapping in Many- Core Systems Design Automation Conference(DAC), pp.1-6, May 29-June 7 2013, Austin, TX, USA M. Fattah,"— Presentation transcript:

1 Smart Hill Climbing for Agile Dynamic Mapping in Many- Core Systems Design Automation Conference(DAC), pp.1-6, May 29-June 7 2013, Austin, TX, USA M. Fattah, M. Daneshtalab, P. Liljeberg, J. Plosila Reporter Hsuan-Ru Li

2 Outline  Introduction  Related Work  Definitions  Smart First Node Selection  Results And Analysis  Conclusions 2

3 Introduction  Many-core systems will feature an extremely dynamic workload.  An unpredictable sequence of different applications enter and leave the system at run-time.  A run-time system manager is required to efficiently map an incoming application onto the system resources. 3

4 Introduction(cont.)  Central manager (CM) of the system decides on the appropriate node for each task.  The system performance is significantly influenced by the utilized mapping approach.  Consider a contiguous application mapping:  Relatively close nodes.  No fragmentation. 4

5 Introduction(cont.)  Finding a convex region of nodes is a polynomial, O(n 3 ), problem.  CoNA method significantly decreased complexity. 5

6 Introduction(cont.)  CoNA starts from a first node and attempts to map the application tasks onto a set of contiguous nodes around it.  Select a first node leads to least fragmentation of remaining nodes.  Hill climbing search heuristic is adapted in order to find the optimum first node rapidly among all the available nodes. 6

7 Introduction(cont.)  Smart Hill Climbing: HiC with intelligence.  n is the given network size.  Best case: O(√n)  Worst case: O(n 2 ) 7

8 Outline  Introduction  Related Work  Definitions  Smart First Node Selection  Results And Analysis  Conclusions 8

9 Related Work  Different first node or generally the mapping area selection methods.  First node selection:  Nearest Neighbor (NN)  Best Neighbor (BN)  Region selection :  Incremental (INC) approach  CoNA  VIP-supported approach 9

10 Outline  Introduction  Related Work  Definitions  Smart First Node Selection  Results And Analysis  Conclusions 10

11 Definitions  Consider a homogenous mesh-based NoC in definitions and experiments.  Define several evaluation metrics as assessment tools to compare different algorithms.  Mapping algorithms try to allocate system resources in an optimal way. 11

12 Definitions(cont.)  Each application in the system is represented by a directed graph denoted as a task graph A p =TG(T, E), t i ∈ T, edge e i,j ∈ E, w i,j = amount of data transferred. 12

13 Definitions(cont.)  Architecture graph AG(N, L) is a simple M×M 2D-mesh NoC with the XY routing.  A set of nodes n x,y ∈ N, connected together through communication links l k ∈ L.  Each node n x,y contains a 5-port router r x,y connected to the local processing element pe x,y by its local port. 13

14 Definitions(cont.) 14

15 Definitions(cont.) 15

16 Definitions(cont.)  One-to-one mapping function from the set of application tasks T, to the set of NoC nodes N: 16

17 Definitions(cont.)  Mapping function is started if and only if there are enough available nodes to map onto them. 17

18 Definitions(cont.)  A task t i is mapped onto as nt i and the packet corresponding to the edge e i,j as pck i,j  The packet sent from nt i to nt j. The set of running applications on the system is also denoted by APPS.  |APPS| means number of running applications. 18

19 Definitions(cont.) 19 Average Weighted Manhattan Distance (AWMD)

20 Definitions(cont.)  Congestion increases the network latency dramatically and also increases the network dynamic power consumption.  External congestion occurs when a network channel is contented by the packets of different applications.  Internal congestion is related to the packets of the same application. 20

21 Definitions(cont.)  To decrease congestion  The mapped area of an application should be as convex as possible and minimally fragmented.  The allocated nodes as a metric to assess the Mapped Region Dispersion. 21

22 Definitions(cont.)  The area with the smallest MRD is almost circular.  However, a circular region will generate more area fragmentation in long term.  The best mapped area would be square. 22

23 Definitions(cont.)  MRD of a square with |T| nodes  Normalized MRD metric 23

24 Definitions(cont.)  The NMRD value of 1 means a squared area.  NMRD increases  Mapped area is getting more fragmented.  Less similar to a square shape. 24

25 Outline  Introduction  Related Work  Definitions  Smart First Node Selection  Results And Analysis  Conclusions 25

26 Smart First Node Selection 26  Approximate model estimates the number of available nodes in a square shape around a given node(First node).  Proposed model is utilized in the adapted hill climbing search hubristic  Find the appropriate first node in an agile and smart manner.

27 Smart First Node Selection(cont.) 27  Square factor of a given node, SF(n i,j ), is the estimated number of contiguous, almost square-shaped, available nodes around that node.

28 Smart First Node Selection(cont.) 28  First find the largest square centered on n i,j, SQ max = (n i,j, r max )  Some more nodes beyond the square borders not belonging to system rectangles, as marked with asterisk.

29 Smart First Node Selection(cont.) 29  The algorithm also calculates a direction, called open direction (openDir).  Indicates one of the neighbors of the node estimated to have a larger SF.  An openDir of value zero  No specific direction is predicted to result in a larger square factor.

30 Smart First Node Selection(cont.) 30  SF calculation has a linear time complexity of O (|APPS|).  However, this will take O(M 2 |APPS|) time. (exhaustive search)

31 Smart First Node Selection(cont.) 31  Smart Hill Climbing(SHiC) starts from a randomly selected node and walks smartly through the network nodes.  This significantly reduces the amount of traversed nodes, resulting in an agile mapping algorithm.

32 Smart First Node Selection(cont.) 32  SHiC looks for the node with the optimum SF according to a preference function. (SF(n cur ) SF(n cur )) OR (SF(n next ) ≥ |T| AND SF(n next ) < SF(n cur ))

33 Smart First Node Selection(cont.) 33

34 Smart First Node Selection(cont.) 34 Find the Largest SF

35 Smart First Node Selection(cont.) 35

36 Smart First Node Selection(cont.) 36

37 Smart First Node Selection(cont.) 37  Stochastic hill climbing approach is executed several times starting from different randomly chosen nodes.  The algorithm is repeated (2+√|APPS|) times (line 1).  The SHiC outer loop is executed O(√|APPS|) times, the while loop is executed O(M) times.

38 Smart First Node Selection(cont.) 38  SF(n next ) is calculated in each iteration in O(|APPS|).  The time complexity of the SHiC will be O(M×|APPS| 3/2 ).  This is significantly faster than the exhaustive search.

39 Outline  Introduction  Related Work  Definitions  Smart First Node Selection  Results And Analysis  Conclusions 39

40 Results And Analysis  Applications with 4 to 35 tasks are generated using TGG.  The communication volumes (w i,j ) are randomly distributed between 2 to 16 flits of data.  SystemC many-core platform which utilizes a pruned version of Noxim, as its communication architecture. 40

41 Results And Analysis(cont.)  Different mapping and first node selection are evaluated.  The network size varying from 8×8 to 20×20 nodes.  A random sequence of applications is entered into the scheduler FIFO according to the desired rate, λ.  The maximum possible scheduling rate is called λ full. 41

42 Results And Analysis(cont.)  An allocation request for the scheduled application is sent to the CM of the platform residing in the node n 0,0. 42

43 Results And Analysis(cont.) 43

44 Results And Analysis(cont.)  In order to assess the square factor accuracy.  Perform exhaustive search to select the node.  Network size: 16×16.  Applications enter the system with 0.8λ full rate. 44

45 Results And Analysis(cont.) 45

46 Results And Analysis(cont.)  Search on the SF, exhaustive search is the best result.  SHiC on dispersion (NMRD) and power (AWMD) metrics show only 4% of increase.  SHiC approach significantly enhances the performance of the system under the same utilized mapping algorithm. 46

47 Results And Analysis(cont.)  CoNA/SHiC verse CoNA/NN case.  Both the dispersion and power dissipation of applications increase versus increase of λ.  SHiC keeps its preeminence over other approaches and outperforms by 10 to 30 percent. 47

48 Results And Analysis(cont.)  Study the λ effect on the system performance. 48

49 Results And Analysis(cont.)  CoNA/SHiC verse NN/NN case.  λ is kept 0.8λ full.  SHiC scales well as the network size increases and keeps the system performance at the same level. 49

50 Results And Analysis(cont.) 50

51 Outline  Introduction  Related Work  Definitions  Smart First Node Selection  Results And Analysis  Conclusions 51

52 Conclusions  This algorithm utilized an approximate model which quickly estimates the available area around a given node.  The provided open direction aided the climbing algorithm to reach the optimum node faster by taking smart steps. 52

53 Conclusions(cont.)  Results emphasized the significant impact of convex mapping on congestion reduction of the network. 53

54 Thank You 54


Download ppt "Smart Hill Climbing for Agile Dynamic Mapping in Many- Core Systems Design Automation Conference(DAC), pp.1-6, May 29-June 7 2013, Austin, TX, USA M. Fattah,"

Similar presentations


Ads by Google