Rabi Mahapatra Department of Computer Science & Engineering Texas A&M University.

Slides:



Advertisements
Similar presentations
A Novel 3D Layer-Multiplexed On-Chip Network
Advertisements

Thread Criticality Predictors for Dynamic Performance, Power, and Resource Management in Chip Multiprocessors Abhishek Bhattacharjee Margaret Martonosi.
International Symposium on Low Power Electronics and Design Energy-Efficient Non-Minimal Path On-chip Interconnection Network for Heterogeneous Systems.
Dr. Kalpakis CMSC 621, Advanced Operating Systems. Fall 2003 URL: Distributed System Architectures.
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
CSE 6590 Department of Computer Science & Engineering York University 1 Introduction to Wireless Ad-hoc Networking 5/4/2015 2:17 PM.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
NoC Modeling Networks-on-Chips seminar May, 2008 Anton Lavro.
June 20 th 2004University of Utah1 Microarchitectural Techniques to Reduce Interconnect Power in Clustered Processors Karthik Ramani Naveen Muralimanohar.
OCIN Workshop Wrapup Bill Dally. Thanks To Funding –NSF - Timothy Pinkston, Federica Darema, Mike Foster –UC Discovery Program Organization –Jane Klickman,
IP I/O Memory Hard Disk Single Core IP I/O Memory Hard Disk IP Bus Multi-Core IP R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R R Networks.
MINIMISING DYNAMIC POWER CONSUMPTION IN ON-CHIP NETWORKS Robert Mullins Computer Architecture Group Computer Laboratory University of Cambridge, UK.
What Great Research ?s Can RAMP Help Answer? What Are RAMP’s Grand Challenges ?
1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.
Orion: A Power-Performance Simulator for Interconnection Networks Presented by: Ilya Tabakh RC Reading Group4/19/2006.
HW/SW Co-Synthesis of Dynamically Reconfigurable Embedded Systems HW/SW Partitioning and Scheduling Algorithms.
NoC, NoC! Who's there? Rabi N. Mahapatra Texas A&M University.
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
1 Energy Efficient Communication in Wireless Sensor Networks Yingyue Xu 8/14/2015.
Exploring the Tradeoffs of Configurability and Heterogeneity in Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable.
Yao Wang, Yu Wang, Jiang Xu, Huazhong Yang EE. Dept, TNList, Tsinghua University, Beijing, China Computing System Lab, Dept. of ECE Hong Kong University.
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
2017/4/21 Towards Full Virtualization of Heterogeneous Noc-based Multicore Embedded Architecture 2012 IEEE 15th International Conference on Computational.
Power Issues in On-chip Interconnection Networks Mojtaba Amiri Nov. 5, 2009.
High-Performance Networks for Dataflow Architectures Pravin Bhat Andrew Putnam.
“Low-Power, Real-Time Object- Recognition Processors for Mobile Vision Systems”, IEEE Micro Jinwook Oh ; Gyeonghoon Kim ; Injoon Hong ; Junyoung.
Low-Power Wireless Sensor Networks
N. GSU Slide 1 Chapter 02 Cloud Computing Systems N. Xiong Georgia State University.
Computer Science Open Research Questions Adversary models –Define/Formalize adversary models Need to incorporate characteristics of new technologies and.
SMART: A Single- Cycle Reconfigurable NoC for SoC Applications -Jyoti Wadhwani Chia-Hsin Owen Chen, Sunghyun Park, Tushar Krishna, Suvinay Subramaniam,
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
By Omar Y. Tahboub Multimedia and Networking Lab MediaNet Computer Science Department Kent State University.
Flow of presentation:  Kind of attacks on embedded systems.  Most relevant security threats faced by NOC.  Solutions/ways suggested so far to deal.
Presenter: Min-Yu Lo 2015/10/19 Asit K. Mishra, N. Vijaykrishnan, Chita R. Das Computer Architecture (ISCA), th Annual International Symposium on.
Salim Hariri HPDC Laboratory Enhanced General Switch Management Protocol Salim Hariri Department of Electrical and Computer.
Page 1 Reconfigurable Communications Processor Principal Investigator: Chris Papachristou Task Number: NAG Electrical Engineering & Computer Science.
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
Embedded Runtime Reconfigurable Nodes for wireless sensor networks applications Chris Morales Kaz Onishi 1.
Heavy and lightweight dynamic network services: challenges and experiments for designing intelligent solutions in evolvable next generation networks Laurent.
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
Department of Computer Science and Engineering The Pennsylvania State University Akbar Sharifi, Emre Kultursay, Mahmut Kandemir and Chita R. Das Addressing.
80-Tile Teraflop Network-On- Chip 1. Contents Overview of the chip Architecture ▫Computational Core ▫Mesh Network Router ▫Power save features Performance.
CS 8501 Networks-on-Chip (NoCs) Lukasz Szafaryn 15 FEB 10.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Lecture 10: Logic Emulation October 8, 2013 ECE 636 Reconfigurable Computing Lecture 13 Logic Emulation.
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
Axel Jantsch 1 Networks on Chip Axel Jantsch 1 Shashi Kumar 1, Juha-Pekka Soininen 2, Martti Forsell 2, Mikael Millberg 1, Johnny Öberg 1, Kari Tiensurjä.
R ECONFIGURABLE SECURITY SUPPORT FOR EMBEDDED SYSTEMS 1 AKSHATA VARDHARAJ.
Networks-on-Chip (NoC) Suleyman TOSUN Computer Engineering Deptartment Hacettepe University, Turkey.
Tufts Wireless Laboratory School Of Engineering Tufts University Paper Review “An Energy Efficient Multipath Routing Protocol for Wireless Sensor Networks”,
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
Chapter 8 System Management Semester 2. Objectives  Evaluating an operating system  Cooperation among components  The role of memory, processor,
Page 1 2P13 Week 1. Page 2 Page 3 Page 4 Page 5.
A seminar Presentation on NETWORK- ON- CHIP ARCHITECTURE EXPLORATION FRAMEWORK Under the supervision of Presented by Mr.G.Naresh,M.Tech., V.Sairamya Asst.
Runtime Reconfigurable Network-on- chips for FPGA-based systems Mugdha Puranik Department of Electrical and Computer Engineering
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
COMPSCI 110 Operating Systems
Working at a Small-to-Medium Business or ISP – Chapter 6
ElasticTree Michael Fruchtman.
Azeddien M. Sllame, Amani Hasan Abdelkader
Storage Virtualization
Shadow: Scalable and Deterministic Network Experimentation
A High Performance SoC: PkunityTM
Introduction to Embedded Systems
RECONFIGURABLE NETWORK ON CHIP ARCHITECTURE FOR AEROSPACE APPLICATIONS
Department of Electrical Engineering Joint work with Jiong Luo
Working at a Small-to-Medium Business or ISP – Chapter 6
Presentation transcript:

Rabi Mahapatra Department of Computer Science & Engineering Texas A&M University

Embedded Systems and Codesign Laboratory Lot of Cores on a BoardA Single Chip Many Core SoC have hundreds of IP cores on a single chip. Multi Core SoC have a handful of IP cores on a single chip.

Performance Demand is ever increasing Frequency is not increasing More transistors available on single die –50 billion transistors Soon! Embedded Systems and Codesign Laboratory

Power and Thermal Management Testing Operating System Design Modeling and Benchmarks Programming Model Memory Bandwidth Fault Tolerance Virtualization Support Embedded System and Codesign Laboratory

Embedded Systems and Codesign Laboratory Secure SoC Infrastructure HW Security Protocols Simulation Platform Test and Debug Software Support Benchmarks Test Infrastructure Reliability Aware Design Low Power Core Design Low Power Communication System Level Power Management Thermal Management Power & Energy Testing & Reliability SecurityDevelopment

Challenge #1

Embedded Systems and Codesign Laboratory Communication Accounts for Significant Power Application OS Level System Level Circuit Level 35% Various Levels of Power Management

As the number of cores per chip scales, on-chip busses will no longer meet performance needs Route packets not wires NoC Components –routers –core-network interfaces (CNI) –links Embedded Systems and Codesign Laboratory An Example NoC with Mesh Topology

What NoC Has to Offer? High Bandwidth Less sensitive to Wire Delay Versatile Infrastructure Scalable Communication Challenges Router Architecture? Power Management? Routing Algorithm Quality of Service Embedded Systems and Codesign Laboratory

Routers are primary component in NoC Buffers consume the most power in Router Embedded Systems and Codesign Laboratory Up to 79% Power Consumption Efficient Management of Buffer can save power

Reduce Active Buffers: Dynamic Buffer Management –Buffers are organized in blocks –Flows are monitored –Excess blocks are powered down based on traffic flow Use Energy Efficient Storage Encoding –SRAMs can be efficient buffer solution –Storing 0 ≠ Storing 1 –Encode the bits in packet to preserve energy Embedded Systems and Codesign Laboratory 20% Energy Savings can be achieved!

Dynamic Peak Power Budget Satisfaction Local Power Consumption is computed Neighbors Power is shared Non deterministic algorithm is used to calculate available budget Embedded Systems and Codesign Laboratory Calculate Local Power Estimate Share Exchange Information Update Power Budget 25% Performance Improvement

Intelligent Power budget Distribution Ant System inspired power budget distribution approach Power ants are sent from surplus region Beggar ants are sent from starving region Power is shared from surplus to starving region Embedded Systems and Codesign Laboratory Sink Source 20% Improvement in Power Budget Utilization Sharing Path

Pfair is an optimal scheduling algorithm for multiprocessor task scheduling Integrated DVFS (Dynamic voltage and frequency scaling) into the Pfair scheduling algorithm Feedback controller based allocation of additional job copies to manage reliability. Embedded Systems and Codesign Laboratory Task set DVS enabled Pfair Scheduler Feedback control Observed Reliability Additional job copies Reduced Failure rate compared to Pfair Up to 50% savings in energy

Implemented the Pfair scheduling algorithm (for MPSoC) in hardware Transformed floating point computations to integer domain. Embedded Systems and Codesign Laboratory Exponential savings in Energy consumption compared to software based scheduler

Temperature aware energy management (TA-DVS) at run time using novel slack reclamation Embedded Systems and Codesign Laboratory MF-DVSTA-DVS Worst Case Energy Consumption Increase Limited to 4% 19% Lower Temperature Violation No Increase in Energy Consumption Low Energy Consumption High Temperature Violation Task set EDF Scheduler Feedback control Temperature Estimation Slack Temperature constrained slack

What We Addressed A Low Power Router Buffer Architecture Peak Power Management Heuristic Intelligent Dynamic Power Budget Distribution Reliability and Temperature aware task scheduling Other Research Challenges Novel flow control to reduce power consumption further Context/Application aware power management System wide power policy management Embedded Systems and Codesign Laboratory

Challenge #2

Challenges –Shrinking feature sizes –Power density and temperature Advantages –NoC can be used as test delivery platform –Redundancy Many Core SoC present us with challenges and advantages in achieving reliable computing Reduced operational lifetime: the “bathtub” curve is getting shallower and narrower Embedded Systems and Codesign Laboratory Enhanced testing is necessary to meet application reliability requirements Graceful degradation Adaptable architectures

Many Core SoC will contain: –Processing, Memory, and I/O IP –Infrastructure IP For Debug, Yield, and Testing (TI-IP) TI-IP provides reliability and availability –No longer necessary to take chip off-line to test –Allows for fast, high-coverage testing –Already being deployed in commercial automotive SoC (Freescale MPC564xL) Embedded Systems and Codesign Laboratory

TI-IP becomes limited by scalability for Many Core –Many TI-IPs distributed across the SoC Each SoC Tile can have TI-IP and Test Vectors –Test vectors can be optimally divided across SoC based on NoC topology –This solves the problem of testing deeply embedded cores Embedded Systems and Codesign Laboratory TI-IP composed of: Test Controller Test Vector Memory Test Vector Set x5 2D-Torus Example

Experimental Analysis: –Observe time to test SoC with and without distributed TI-IP –Measured over a variety of SoC sizes for scalability –Test time independent of SoC size for distributed TI-IP Embedded Systems and Codesign Laboratory 85% reduction in test time 94% reduction in test time

What We Addressed TI-IP: On-Line Testing Test Vector Storage Test Scheduling Open Problems Life Time Reliability? Diagnosis & Recovery? Fault Resilience? Embedded Systems and Codesign Laboratory

Challenge #3

SoC Security Attack Modeling HW Security Solution Protocol Development Security IP Design Threat Response Mechanism Embedded Systems and Codesign Laboratory

Weakness in SoC –Heterogeneous system: IP from different vendors –Tradeoff between security and performance –Decrease visibility and control –Improved attack techniques Sample Attacks –Denial of Service (DoS) –Bandwidth reduction –Draining or Sleep Deprivation –Extraction of secret info –Hijacking of programmable components

Divide the system into secure and unsecure areas –Secure area (ASIC) –Unsecure area (FPGA etc) Secure Bus Design –Extend conventional arbitration using Trojan Detection and Access Control Security IP Based Solution –Transaction Monitoring –Sandboxing Embedded Systems and Codesign Laboratory

Central Security Core Dedicated communication channel for security protocol Secure agents at each Core Distributed Response protocol is necessary Embedded Systems and Codesign Laboratory

State of Art Bio Inspired Modeling of Threat Response Mechanism Security Monitor Core Core Level Anomaly Detector Open Problems Attack Model Development Security Infrastructure Development Protocol Design Design of Security Core/IP Embedded Systems and Codesign Laboratory

Challenge #4

Major road block in advent of many core SoC Need better platform simulator and debugger Rapid development cycle Suitable Benchmarks to effectively evaluate many core SoC Embedded Systems and Codesign Laboratory Design Space Exploration Performance Evaluation Test and Debug Benchmarking and Standardization

Full System SoC Simulation Platform Built using SystemC Generic and Extensible Embedded Systems and Codesign Laboratory

NoCBench System ModelComponents of NoCBench System Kernel –Provides scheduling –Task management Core Library –Processor cores –Memory core –Other IP Network On Chip –NoC backbone with routers and CNIs Embedded Systems and Codesign Laboratory

Configuration Parameters Network on Chip –Router Details –Topology –Injection Limit –Power Model System –Scheduler –Task Configurations –Core Types Reported Metrics Network –Throughput –Latency –Power Application –Execution Time –Cycles –Power Embedded Systems and Codesign Laboratory

VPNoC Scalable architecture cores support Run applications using traces Suitable for data analysis accelerators Can meet performance, power, protocol and security analysis Challenges Slow when using large number of nodes Fast Model is essential Embedded Systems and Codesign Laboratory

Benchmarking is a Challenge –Application Benchmark –Communication Benchmark –Large optimization problems –Social media data analysis –Many other large data analytic problems What kind of setup –How future “many core SoC” will look like Embedded Systems and Codesign Laboratory

What We Have VPNoC SoC simulation Micro Kernel Simple Scheduler Basic core library Limited IPC What Do We Need More core support Thread Library Support Application Benchmarks Embedded Systems and Codesign Laboratory

Many Core SoC is the future Challenges –Performance & Power –Test, Reliability & Security –Benchmarking More Information – and-nochttp://codesign.cs.tamu.edu/index.php/research/soc- and-noc – time-systemshttp://codesign.cs.tamu.edu/index.php/research/real- time-systems

Embedded Systems and Codesign Laboratory