1 Presenter: Min Yu,Lo 2015/10/9 Lauri Matilainen, Erno Salminen, Timo D. Hamalainen, and Marko Hannikainen. 2011 International Conference on Embedded.

Slides:



Advertisements
Similar presentations
Network II.5 simulator ..
Advertisements

Nios Multi Processor Ethernet Embedded Platform Final Presentation
1 VIRTUAL MACHINES By: Sai Siddharth Kumar Dantu.
Chap 2 System Structures.
AMLAPI: Active Messages over Low-level Application Programming Interface Simon Yau, Tyson Condie,
Addressing the System-on-a-Chip Interconnect Woes Through Communication-Based Design N. Vinay Krishnan EE249 Class Presentation.
© ABB Group Jun-15 Evaluation of Real-Time Operating Systems for Xilinx MicroBlaze CPU Anders Rönnholm.
MPI and RDMA Yufei 10/15/2010. MPI over uDAPL: abstract MPI: most popular parallel computing standard MPI needs the ability to deliver high performace.
MPI in uClinux on Microblaze Neelima Balakrishnan Khang Tran 05/01/2006.
Handheld TFTP Server with USB Andrew Pangborn Michael Nusinov RIT Computer Engineering – CE Design 03/20/2008.
Virtual Memory Virtual Memory Management in Mach Labels and Event Processes in Asbestos Ingar Arntzen.
Supporting Systolic and Memory Communication in iWarp (Borkar et al. 1990) presented by Vasily Volkov CS258, Spring 2008, UC Berkeley.
Introduction to Systems Architecture Kieran Mathieson.
Presenter: Jyun-Yan Li Multiprocessor System-on-Chip Profiling Architecture: Design and Implementation Po-Hui Chen, Chung-Ta King, Yuan-Ying Chang, Shau-Yin.
Chapter 13 Embedded Systems
1 EE249 Discussion A Method for Architecture Exploration for Heterogeneous Signal Processing Systems Sam Williams EE249 Discussion Section October 15,
Modern trends in computer architecture and semiconductor scaling are leading towards the design of chips with more and more processor cores. Highly concurrent.
The importance of switching in communication The cost of switching is high Definition: Transfer input sample points to the correct output ports at the.
AMMPI - Summary Active Messages–2 (AM) implementation over MPI version 1.1 –Porting is trivial - works on virtually any platform that has MPI 1.1 –Often.
Embedded Transport Acceleration Intel Xeon Processor as a Packet Processing Engine Abhishek Mitra Professor: Dr. Bhuyan.
Presenter : Cheng-Ta Wu Antti Rasmus, Ari Kulmala, Erno Salminen, and Timo D. Hämäläinen Tampere University of Technology, Institute of Digital and Computer.
1 What is an operating system? CSC330Patricia Van Hise.
Building an Application Server for Home Network based on Android Platform Yi-hsien Liao Supervised by : Dr. Chao-huang Wei Department of Electrical Engineering.
Leveling the Field for Multicore Open Systems Architectures Markus Levy President, EEMBC President, Multicore Association.
Interconnection Protocol Mustafa Kara Term Work.
Efficient Hardware dependant Software (HdS) Generation using SW Development Platforms Frédéric ROUSSEAU CASTNESS‘07 Computer Architectures and Software.
New Direction Proposal: An OpenFabrics Framework for high-performance I/O apps OFA TAC, Key drivers: Sean Hefty, Paul Grun.
Parallel Processing LAB NO 1.
OpenMP in a Heterogeneous World Ayodunni Aribuki Advisor: Dr. Barbara Chapman HPCTools Group University of Houston.
UNIX System Administration OS Kernal Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept Kernel or MicroKernel Concept: An OS architecture-design.
CASTNESS‘11 Computer Architectures and Software Tools for Numerical Embedded Scalable Systems Workshop & School: Roma January 17-18th 2011 Frédéric ROUSSEAU.
LIGO-G Z 8 June 2001L.S.Finn/LDAS Camp1 How to think about parallel programming.
LOGO OPERATING SYSTEM Dalia AL-Dabbagh
 What is an operating system? What is an operating system?  Where does the OS fit in? Where does the OS fit in?  Services provided by an OS Services.
1b.1 Types of Parallel Computers Two principal approaches: Shared memory multiprocessor Distributed memory multicomputer ITCS 4/5145 Parallel Programming,
Chapter 2: Operating-System Structures. 2.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 14, 2005 Operating System.
Improving Network I/O Virtualization for Cloud Computing.
2012/03/06 匡建慈. goals  To build a multi-core platform with Hadoop environment.  Hardware architecture  What is Hadoop ?  What to do and what we have.
Symmetric multiprocessing
Android is a trademark of Google Inc. Use of this trademark is subject to Google Permissions. Linux® is the registered trademark of Linus Torvalds in the.
Heterogeneous Multikernel OS Yauhen Klimiankou BSUIR
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
ESC499 – A TMD-MPI/MPE B ASED H ETEROGENEOUS V IDEO S YSTEM Tony Zhou, Prof. Paul Chow April 6 th, 2010.
Operating Systems David Goldschmidt, Ph.D. Computer Science The College of Saint Rose CIS 432.
Processes Introduction to Operating Systems: Module 3.
Simics: A Full System Simulation Platform Synopsis by Jen Miller 19 March 2004.
Types of Operating Systems 1 Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015.
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
1 Presenter: Min Yu,Lo 2015/12/21 Kumar, S.; Jantsch, A.; Soininen, J.-P.; Forsell, M.; Millberg, M.; Oberg, J.; Tiensyrja, K.; Hemani, A. VLSI, 2002.
OpenMP for Networks of SMPs Y. Charlie Hu, Honghui Lu, Alan L. Cox, Willy Zwaenepoel ECE1747 – Parallel Programming Vicky Tsang.
Chapter 1 Basic Concepts of Operating Systems Introduction Software A program is a sequence of instructions that enables the computer to carry.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 4: Threads.
CS4315A. Berrached:CMS:UHD1 Introduction to Operating Systems Chapter 1.
Parallel Computing Presented by Justin Reschke
Background Computer System Architectures Computer System Software.
The World Leader in High Performance Signal Processing Solutions Heterogeneous Multicore for blackfin implementation Open Platform Solutions Steven Miao.
PERFORMANCE OF THE OPENMP AND MPI IMPLEMENTATIONS ON ULTRASPARC SYSTEM Abstract Programmers and developers interested in utilizing parallel programming.
Computer System Structures
Operating System Structures
Distributed and Parallel Processing
CS6401- OPERATING SYSTEMS L T P C
Performance Tuning Team Chia-heng Tu June 30, 2009
Virtual Machines.
by Manuel Saldaña, Daniel Nunes, Emanuel Ramalho, and Paul Chow
Chapter 4: Threads.
Chapter 2: System Structures
LAB 01 Installation of VIRTUAL MACHINE and LINUX
MPJ: A Java-based Parallel Computing System
Prof. Leonardo Mostarda University of Camerino
Presentation transcript:

1 Presenter: Min Yu,Lo 2015/10/9 Lauri Matilainen, Erno Salminen, Timo D. Hamalainen, and Marko Hannikainen International Conference on Embedded Computer Systems (SAMOS)

2015/10/92 The design and implementation of an application programming interface (API) is a trade-off between abstraction it provides and overheads it causes. This paper presents an implementation of Multicore Communications API (MCAPI) on a heterogeneous platform consisting of FPGA-based multiprocessor system-on-chip (MPSoC) connected via PCIe to an external CPU board. The purpose is to provide a unified programming API to different processor and OS types as well as hardware IP-blocks.

2015/10/93 MCAPI is shown to meet these requirements. We show the MCAPI transport implementation on three processors and two buses, measure the overhead cost, and analyze the effort required to port an application from a PC to the MPSoC. The measured library memory footprint is less than 25KB and the roundtrip communication latency is diminishing low - only few dozen clock cycles - compared to non-MCAPI implementation.

2015/10/94 Multicore Communications API Specification(MCAPI) [3] Reduced MPI[7][10][14][15] Cell SDK[8][13][10] Msg[8] MPI This Paper Comparing API performance Message - passing This paper major method

2015/10/95  Solve heterogeneous architecture ◦ Different processor ◦ Different OS type  Feasible performance ◦ Memory footprintd ◦ Performance overhead

2015/10/96  This paper major method ◦ Adopted MCAPI for an FPGA-based MPSOC environment.  The FUNCAPI is divided into two layers. ◦ Top layers ◦ Transport layers  Using virtual nodes for fixed-function units ◦ ex : Discrete cosine transform(DCT)

2015/10/97  MCAPI defines three communications types: ◦ Messages  We don’t have to establish a connection first. ◦ Packet channels and Scalar channels  We have to establishing a connection first.

2015/10/98  An example sequence of MCAPI execution in Funbase software platform.

2015/10/99  Comparison to other API implementations ◦ Performance ◦ memory comparison

2015/10/910  Conclusions ◦ we separated the platform specific parts in a transport layer, thereby porting MCAPI conveniently to NIOS without OS and PC with Linux.  My Comments ◦ This paper is not clear in comparing API architecture individually. ◦ I think the implementation of FUNCAPI is divided into two layers is very good idea.