Multi-core SoC Design is the Challenge! What is the Solution? Drew Wingard CTO Sonics, Inc.

Slides:



Advertisements
Similar presentations
Chapter 3 Embedded Computing in the Emerging Smart Grid Arindam Mukherjee, ValentinaCecchi, Rohith Tenneti, and Aravind Kailas Electrical and Computer.
Advertisements

7 December 2011Powering Smartphone SoCs with Multichannel TSV DRAMs1 3D Mania: Powering Smartphone SoCs with Multichannel TSV DRAMs Drew Wingard, Sonics.
3D Graphics Content Over OCP Martti Venell Sr. Verification Engineer Bitboys.
Intelligent Interconnects for Multicore SoCs Drew Wingard, CTO, Sonics, Inc. OCIN06: December 6, 2006.
1 Copyright © 2012 Oracle and/or its affiliates. All rights reserved. Convergence of HPC, Databases, and Analytics Tirthankar Lahiri Senior Director, Oracle.
Efficient Simulation of Agent-based Models on Multi-GPU & Multi-Core Clusters Kalyan S. Perumalla, Ph.D. Senior R&D Manager Oak Ridge National Laboratory.
1 System Level Verification of OCP-IP based SoCs using OCP-IP eVC Himanshu Rawal eInfochips, Inc.,4655 Old Ironsides Drive, Suite 385,Santa Clara, CA
Energy-efficient Task Scheduling in Heterogeneous Environment 2013/10/25.
© 2004 Wayne Wolf Topics Task-level partitioning. Hardware/software partitioning.  Bus-based systems.
Distributed Systems CS
Computer Science, University of Oklahoma Reconfigurable Versus Fixed Versus Hybrid Architectures John K. Antonio Oklahoma Supercomputing Symposium 2008.
AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.
International Symposium on Low Power Electronics and Design Energy-Efficient Non-Minimal Path On-chip Interconnection Network for Heterogeneous Systems.
PERFORMANCE ANALYSIS OF MULTIPLE THREADS/CORES USING THE ULTRASPARC T1 (NIAGARA) Unique Chips and Systems (UCAS-4) Dimitris Kaseridis & Lizy K. John The.
Supercharging PlanetLab : a high performance, Multi-Application, Overlay Network Platform Written by Jon Turner and 11 fellows. Presented by Benjamin Chervet.
*time Optimization Heiko, Diego, Thomas, Kevin, Andreas, Jens.
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
Define Embedded Systems Small (?) Application Specific Computer Systems.
By Lauren Felton. The electric grid delivers electricity from points of generation to consumers, and the electricity delivery network functions via two.
Power Aware Solutions for NoC Architecture Yaniv Ben-Itzhak Noc Seminar Winter 08.
From Miles to Millimeter: On-chip Communication Networks Alessandro Pinto U.C. Berkeley
Intel ® Research mote Ralph Kling Intel Corporation Research Santa Clara, CA.
Single-Chip Multi-Processors (CMP) PRADEEP DANDAMUDI 1 ELEC , Fall 08.
A Scalable, Cache-Based Queue Management Subsystem for Network Processors Sailesh Kumar, Patrick Crowley Dept. of Computer Science and Engineering.
New Direction Proposal: An OpenFabrics Framework for high-performance I/O apps OFA TAC, Key drivers: Sean Hefty, Paul Grun.
OCP: Open Core Protocol Marta Posada ESA/ESTEC June 2006.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
CASTNESS‘11 Computer Architectures and Software Tools for Numerical Embedded Scalable Systems Workshop & School: Roma January 17-18th 2011 Frédéric ROUSSEAU.
Tufts Wireless Laboratory School Of Engineering Tufts University “Network QoS Management in Cyber-Physical Systems” Nicole Ng 9/16/20151 by Feng Xia, Longhua.
Presenter : Cheng-Ta Wu Pieter van der Wolf and Jeroen Geuzebroek Synopsys Eindhoven, The Netherlands Design, Automation & Test in Europe Conference &
“Low-Power, Real-Time Object- Recognition Processors for Mobile Vision Systems”, IEEE Micro Jinwook Oh ; Gyeonghoon Kim ; Injoon Hong ; Junyoung.
1 Hardware Support for Collective Memory Transfers in Stencil Computations George Michelogiannakis, John Shalf Computer Architecture Laboratory Lawrence.
Reflections on 10 Years as a Commercial On-chip Interconnect Provider Drew Wingard CTO, Sonics, Inc.
Seaborg Cerise Wuthrich CMPS Seaborg  Manufactured by IBM  Distributed Memory Parallel Supercomputer  Based on IBM’s SP RS/6000 Architecture.
Standard based Instrumentation schemes for 3D SoC Neal Stollon Chairman, Nexus 5001 Forum
Network-on-Chip Energy-Efficient Design Techniques for Interconnects Suhail Basit.
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
11 Copyright © 2009 Juniper Networks, Inc. ANDY INGRAM VP FST PRODUCT MARKETING & BUSINESS DEVELOPMENT.
March 9, 2015 San Jose Compute Engineering Workshop.
Veronica Eyo Sharvari Joshi. System on chip Overview Transition from Ad hoc System On Chip design to Platform based design Partitioning the communication.
Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015.
University of Toronto at Scarborough © Kersti Wain-Bantin CSCC40 system architecture 1 after designing to meet functional requirements, design the system.
Shashwat Shriparv InfinitySoft.
System-level power analysis and estimation September 20, 2006 Chong-Min Kyung.
VLSI Algorithmic Design Automation Lab. THE TI OMAP PLATFORM APPROACH TO SOC.
Networks-on-Chip (NoC) Suleyman TOSUN Computer Engineering Deptartment Hacettepe University, Turkey.
Revision - 01 Intel Confidential Page 1 Intel HPC Update Norfolk, VA April 2008.
Multi-objective Topology Synthesis and FPGA Prototyping Framework of Application Specific Network-on-Chip m Akram Ben Ahmed Xinyu LI, Omar Hammami.
Virtual-Channel Flow Control William J. Dally
Jamie Unger-Fink John David Eriksen.  Allocation and Scheduling Problem  Better MPSoC optimization tool needed  IP and CP alone not good enough  Communication.
Is MPI still part of the solution ? George Bosilca Innovative Computing Laboratory Electrical Engineering and Computer Science Department University of.
SRA Key Topics Reference Architectures for Cyber-Physical Systems Dr. Christian El Salloum AVL List GmbH.
System on a Programmable Chip (System on a Reprogrammable Chip)
Lynn Choi School of Electrical Engineering
Microarchitecture.
OCP: High Performance Computing Project
Architecture & Organization 1
Energy Efficient Computing in Nanoscale CMOS
Intel’s Core i7 Processor
Large Scale Test of a storage solution based on an Industry Standard
On-Time Network On-chip
Low Latency Analytics HPC Clusters
Israel Cidon, Ran Ginosar and Avinoam Kolodny
postgrad. Sergiy Korotunov prof. Galyna Tabunshchyk
Architecture & Organization 1
CS 258 Reading Assignment 4 Discussion Exploiting Two-Case Delivery for Fast Protected Messages Bill Kramer February 13, 2002 #
Managing GPU Concurrency in Heterogeneous Architectures
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
Introduction to Heterogeneous Parallel Computing
ICS 252 Introduction to Computer Design
Presentation transcript:

Multi-core SoC Design is the Challenge! What is the Solution? Drew Wingard CTO Sonics, Inc.

DAC 2008: Multi-core Panel 8B2June 10, 2008 Multi-core SoCs Look Alike! Heterogeneous multi-core designs –Optimize for performance, cost, power –Incorporate legacy Built from processor-based tiles –Encapsulate entire subsystem function, including control –Incorporate processing and storage to localize communication –Delivered together with associated firmware Sources: IVA

DAC 2008: Multi-core Panel 8B3June 10, 2008 Multi-core SoC Performance is Determined by Interconnects and Memory Subsystems Tiles need external memory bandwidth due to data set size Tiles communicate with each other via external memory SoC function is dominated by tile processing SoC performance is dominated by DRAM communications –Must use DRAM bandwidth efficiently (too expensive to over design) –Must satisfy real time tile throughput & latency constraints Managing communications performance is tougher each generation –Substantial new features, requiring much more bandwidth –Larger scale systems –More stringent power constraints

DAC 2008: Multi-core Panel 8B4June 10, 2008 Multi-core SoC Design Solutions Use a communications architecture that supports: –Mixing legacy and new tiles – decoupled network –Flexible interface semantics – socket protocols (such as OCP) –High throughput and concurrency – deep pipelines with threads –High efficiency and predictability – non-blocking flow control –Guaranteed bandwidth and latency – end-to-end QoS scheduling –Minimum energy and power – GALS and voltage domains –Earliest performance analysis – TLM-based data flow models –Predictable implementation path – physically optimized network Example (available today!) –SonicsMX SMART Interconnect with MemMax memory scheduler