Download presentation
Presentation is loading. Please wait.
Published byMelvin McGee Modified over 9 years ago
1
University of Rostock Institute of Applied Microelectronics and Computer Engineering Monitoring and Control of Temperature in Networks-on- Chip Tim Wegner, Claas Cornelius, Andreas Tockhorn, Dirk Timmermann; MEMICS 2010, Mikulov, Czech Republic, October 22-24
2
2 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs Outline 1. Introduction 2. Networks-on-Chip (NoCs) 3. Impact of Temperature on Reliability 4. Monitoring & Control of Temperature in NoCs 5. Summary
3
3 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 1. Introduction Increasing integration density → rising complexity, shrinking device sizes NoCs able to deal with arising requirements (e.g. for communication) But: Reliability becomes a dominant factor for chip design Goal: Increase reliability in NoC-based systems Increasing integration density → rising complexity, shrinking device sizes NoCs able to deal with arising requirements (e.g. for communication) But: Reliability becomes a dominant factor for chip design Goal: Increase reliability in NoC-based systems Impacts of technological development
4
4 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs Outline 1. Introduction 2. Networks-on-Chip (NoCs) 3. Impact of Temperature on Reliability 4. Monitoring & Control of Temperature in NoCs 5. Summary
5
5 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 2. Networks-on-Chip Infrastructure for on-chip interconnection Point-to-point links replace long global busses Parallel packet-based communication Separation of communication & computation Globally asynchronous locally synchronous (GALS) Modularity of IP cores (not part of actual NoC) reusability, high abstraction level Infrastructure for on-chip interconnection Point-to-point links replace long global busses Parallel packet-based communication Separation of communication & computation Globally asynchronous locally synchronous (GALS) Modularity of IP cores (not part of actual NoC) reusability, high abstraction level Properties NoCs are able to satisfy requirements of modern VLSI systems
6
6 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs Outline 1. Introduction 2. Networks-on-Chip (NoCs) 3. Impact of Temperature on Reliability 4. Monitoring & Control of Temperature in NoCs 5. Summary
7
7 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 3. Impact of Temperature on Reliability Increasing integration densities, progress of nanotechnology Growing number of transistors per chip = raised probability of failure decreasing structural size of ICs = higher susceptibility to environmental influences & deterioration Increasing integration densities, progress of nanotechnology Growing number of transistors per chip = raised probability of failure decreasing structural size of ICs = higher susceptibility to environmental influences & deterioration Impacts of technological progress Intel 8086 (1978): ≈879 transistors/mm² Intel Bloomfield (2008): ≈2,78 Mio. transistors/mm²
8
8 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 3. Impact of Temperature on Reliability Particular physical effects (e.g. TDDB, EM) contribute to deterioration Abetted by high temperatures Correlation between temperature & failure mechanisms established by Arrhenius model Exponential decrease of IC lifetime with temperature Particular physical effects (e.g. TDDB, EM) contribute to deterioration Abetted by high temperatures Correlation between temperature & failure mechanisms established by Arrhenius model Exponential decrease of IC lifetime with temperature Why is thermal awareness important? Growing influence of on-chip temperature distribution on lifetime, operability, performance etc.
9
9 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs Outline 1. Introduction 2. Networks-on-Chip (NoCs) 3. Impact of Temperature on Reliability 4. Monitoring & Control of Temperature in NoCs 5. Summary
10
Mitigate effects contributing to deterioration & delay occurrence of failures Control of on-chip temperature distribution Mitigate effects contributing to deterioration & delay occurrence of failures Control of on-chip temperature distribution 10 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 4. Monitoring and Control of Temperature for NoCs Objective: Effective mechanisms to monitor & control on-chip temperature Integration into existing NoC Preservation of modularity & reusability Minimum costs (area, frequency) Maximum performance of monitoring and control Minimum impact on system performance Effective mechanisms to monitor & control on-chip temperature Integration into existing NoC Preservation of modularity & reusability Minimum costs (area, frequency) Maximum performance of monitoring and control Minimum impact on system performance Requirements:
11
11 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 4.1 Mechanisms for monitoring Concept: attach physical monitoring probes to every IP core temperature variation ∆T Continuous checking of T IPC |T IPC,old - T IPC,new | ≥ ∆T ? Report T IPC,new Area: 66 LUT/FF pairs Frequency: 227 MHz temperature variation ∆T Continuous checking of T IPC |T IPC,old - T IPC,new | ≥ ∆T ? Report T IPC,new Area: 66 LUT/FF pairs Frequency: 227 MHz Event-driven: Period of time ∆t Report T IPC,new every ∆t Area: 80 LUT/FF pairs Frequency: 338 MHz Period of time ∆t Report T IPC,new every ∆t Area: 80 LUT/FF pairs Frequency: 338 MHz Time-driven:
12
12 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 4.2 Mechanisms for control Reception & interpretation of probe packets Instructions for Dynamic Frequency Scaling to probes (if necessary) Area: 507 LUT/FF pairs Frequency: 165 MHz Reception & interpretation of probe packets Instructions for Dynamic Frequency Scaling to probes (if necessary) Area: 507 LUT/FF pairs Frequency: 165 MHz Central Control Unit (CCU): !!! Not the smartest approach, but suffices to test functionality !!!
13
Area penalty: 30,5% Freq. penalty: 8,2% Area penalty: 30,5% Freq. penalty: 8,2% Area penalty: 7,3% Freq. penalty: / (but Mux/Demux) Area penalty: 7,3% Freq. penalty: / (but Mux/Demux) Area penalty: / Freq. penalty: / Area penalty: / Freq. penalty: / 13 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 4.3 Integration of monitoring 3 approaches Different impact on performance & costs Into IP core:Router port of IP core:Extra router port:
14
14 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 4.4 Impact on system performance
15
15 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 4.5 Performance of monitoring & control
16
16 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs 5. Summary Event-driven approach preferable (situational monitoring, better performance, no redundant traffic, lower area costs) Integration into NoC using router port of IP core best trade-off between costs & preservation of modularity/non-intrusiveness Event-driven approach preferable (situational monitoring, better performance, no redundant traffic, lower area costs) Integration into NoC using router port of IP core best trade-off between costs & preservation of modularity/non-intrusiveness Conclusion Implementation of 2 approaches for monitoring on-chip temperature + 3 methods for integration into NoC Investigation of: Costs (area, frequency) Impact on system performance Performance of monitoring & control
17
Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Thanks for your attention! Any questions? tim.wegner@uni-rostock.de www.networks-on-chip.com University of Rostock, Germany Institute of Applied Microelectronics and Computer Engineering Contact: Homepage:
18
Establishes relationship between temperature and failure mechanisms Describes dependence of chemical reactions on temperature changes Assumption: all other parameters constant 18 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Arrhenius Model Lifetime of ICs decreases exponentially with temperature Monitoring and Control of Temperature in NoCs
19
19 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Monitoring and Control of Temperature in NoCs Inoperability of transistor through gate oxide breakdown (long-term) Time Dependent Dielectric Breakdown (TDDB) Formation of charge traps Current flow !!! HEAT !!! More charge traps Conducting path through gate oxide
20
20 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Transport of material in conductors (i.e. wires) Cause: ion movement induced by current flow (ions’ mobility increases with temperature) Effects: Hillocks short circuits Voids interruption of current paths Electromigration (EM) Monitoring and Control of Temperature in NoCs
21
21 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Intel Bloomfield: Year: 2008 731 Mio. Transistors 263mm² 2779467 Tr./mm2 Intel 8086: Year: 1978 29k transistors 33mm² 879 Tr./mm² Intel Processors Monitoring and Control of Temperature in NoCs
22
22 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Impact on system performance Monitoring and Control of Temperature in NoCs
23
23 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Performance of monitoring & control Monitoring and Control of Temperature in NoCs
24
24 Tim Wegner - 23 October 2010 MEMICS 2010, Mikulov, Czech Republic, October 22-24 Synthesis results for monitoring & control ComponentIntegration method Event- driven probe Time- driven probe Central Control Unit Into IP core Using IP core port Extra port Frequency [MHz] 227338165122119112 Area [LUT/FF pairs] 6680507190118962312 Unmodified NoC router: 1771 LUT/FF pairs, 122 MHz Monitoring and Control of Temperature in NoCs
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.