Electronics Lab, Physics Dept., Aristotle Univ. of Thessaloniki, Greece 17th IEEE International Conference on Electronics, Circuits, and Systems ICECS.

Slides:



Advertisements
Similar presentations
Multiprocessor Architecture for Image processing Mayank Kumar – 2006EE10331 Pushpendre Rastogi – 2006EE50412 Under the guidance of Dr.Anshul Kumar.
Advertisements

System-level Trade-off of Networks-on-Chip Architecture Choices Network-on-Chip System-on-Chip Group, CSE-IMM, DTU.
August 8 th, 2011 Kevan Thompson Creating a Scalable Coherent L2 Cache.
Reporter :LYWang We propose a multimedia SoC platform with a crossbar on-chip bus which can reduce the bottleneck of on-chip communication.
1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
 Understanding the Sources of Inefficiency in General-Purpose Chips.
CSC457 Seminar YongKang Zhu December 6 th, 2001 About Network Processor.
Extensible Networking Platform 1 Liquid Architecture Cycle Accurate Performance Measurement Richard Hough Phillip Jones, Scott Friedman, Roger Chamberlain,
CSC 4250 Computer Architectures December 8, 2006 Chapter 5. Memory Hierarchy.
Multithreaded FPGA Acceleration of DNA Sequence Mapping Edward Fernandez, Walid Najjar, Stefano Lonardi, Jason Villarreal UC Riverside, Department of Computer.
Evaluation of Data-Parallel Splitting Approaches for H.264 Decoding
Performed by: Moshe Emmer, Harar Meir Instructor: Alkalay Daniel Cooperated with: AE faculty המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory.
An ATCA and FPGA-Based Data Processing Unit for PANDA Experiment H.XU, Z.-A. LIU,Q.WANG, D.JIN, Inst. High Energy Physics, Beijing, W. Kühn, J. Lang, S.
1 Controllers-system for APS – CubeSat nano-satellite Instructor: Daniel Alkalay Students: Moshe Emmer & Meir Harar Technion – Israel Institute of Technology.
PhD/Master course, Uppsala  Understanding the interaction between your program and computer  Structuring the code  Optimizing the code  Debugging.
The Effect of Data-Reuse Transformations on Multimedia Applications for Different Processing Platforms N. Vassiliadis, A. Chormoviti, N. Kavvadias, S.
Enhancing Embedded Processors with Specific Instruction Set Extensions for Network Applications A. Chormoviti, N. Vassiliadis, G. Theodoridis, S. Nikolaidis.
GCSE Computing - The CPU
Virtual Architecture For Partially Reconfigurable Embedded Systems (VAPRES) Architecture for creating partially reconfigurable embedded systems Module.
General Purpose FIFO on Virtex-6 FPGA ML605 board midterm presentation
Final presentation Encryption/Decryption on embedded system Supervisor: Ina Rivkin students: Chen Ponchek Liel Shoshan Winter 2013 Part A.
Sept EE24C Digital Electronics Project Design of a Digital Alarm Clock.
Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.
Students: Oleg Korenev Eugene Reznik Supervisor: Rolf Hilgendorf
General Purpose FIFO on Virtex-6 FPGA ML605 board Students: Oleg Korenev Eugene Reznik Supervisor: Rolf Hilgendorf 1 Semester: spring 2012.
1 Presenter: Ming-Shiun Yang Sah, A., Balakrishnan, M., Panda, P.R. Design, Automation & Test in Europe Conference & Exhibition, DATE ‘09. A Generic.
Digital signature using MD5 algorithm Hardware Acceleration
1 Design and Implementation of an Efficient MPEG-4 Interactive Terminal on Embedded Devices Yi-Chin Huang, Tu-Chun Yin, Kou-Shin Yang, Yan-Jun Chang, Meng-Jyi.
October 26, 2006 Parallel Image Processing Programming and Architecture IST PhD Lunch Seminar Wouter Caarls Quantitative Imaging Group.
A New Reference Design Development Environment for JPEG 2000 Applications Bill Finch CAST, Inc. Warren Miller AVNET Design Services DesignCon 2003 January.
1 Electronics Lab, Physics Dept., Aristotle Univ. of Thessaloniki, Greece 2 Micro2Gen Ltd., NCSR Demokritos, Greece 17th IEEE International Conference.
Matrix Multiplication on FPGA Final presentation One semester – winter 2014/15 By : Dana Abergel and Alex Fonariov Supervisor : Mony Orbach High Speed.
Operating Systems for Reconfigurable Systems John Huisman ID:
Chapter 4 Threads, SMP, and Microkernels Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design.
Hartman1P1004 Leo Hartman Canadian Space Agency A VHDL Implementation of an On-board Autonomy Solution.
2010 IEEE ICECS - Athens, Greece, December1 Using Flash memories as SIMO channels for extending the lifetime of Solid-State Drives Maria Varsamou.
© 2007 Xilinx, Inc. All Rights Reserved This material exempt per Department of Commerce license exception TSU Hardware Design INF3430 MicroBlaze 7.1.
SJSU SPRING 2011 PARALLEL COMPUTING Parallel Computing CS 147: Computer Architecture Instructor: Professor Sin-Min Lee Spring 2011 By: Alice Cotti.
Hybrid Prototyping of MPSoCs Samar Abdi Electrical and Computer Engineering Concordia University Montreal, Canada
ESC499 – A TMD-MPI/MPE B ASED H ETEROGENEOUS V IDEO S YSTEM Tony Zhou, Prof. Paul Chow April 6 th, 2010.
A User-Lever Concurrency Manager Hongsheng Lu & Kai Xiao.
RISC By Ryan Aldana. Agenda Brief Overview of RISC and CISC Features of RISC Instruction Pipeline Register Windowing and renaming Data Conflicts Branch.
A Configurable High-Throughput Linear Sorter System Jorge Ortiz Information and Telecommunication Technology Center 2335 Irving Hill Road Lawrence, KS.
A Monte Carlo Simulation Accelerator using FPGA Devices Final Year project : LHW0304 Ng Kin Fung && Ng Kwok Tung Supervisor : Professor LEONG, Heng Wai.
Processor Architecture
Design and Implementation of a Scalable System Architecture for Embedded Multimedia Terminal Electrical and Control Engineering (ICECE), 2011 International.
VAPRES A Virtual Architecture for Partially Reconfigurable Embedded Systems Presented by Joseph Antoon Abelardo Jara-Berrocal, Ann Gordon-Ross NSF Center.
The Central Processing Unit (CPU)
An Automated Development Framework for a RISC Processor with Reconfigurable Instruction Set Extensions Nikolaos Vassiliadis, George Theodoridis and Spiridon.
Different Microprocessors Tamanna Haque Nipa Lecturer Dept. of Computer Science Stamford University Bangladesh.
Fundamentals of Programming Languages-II
Fast Lookup for Dynamic Packet Filtering in FPGA REPORTER: HSUAN-JU LI 2014/09/18 Design and Diagnostics of Electronic Circuits & Systems, 17th International.
DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester:
Self-Tuned Distributed Multiprocessor System Xiaoyan Bi CSC Operating Systems Dr. Mirela Damian.
1 of 14 Lab 2: Design-Space Exploration with MPARM.
 Computers are classified according to  Purpose  Data hiding  Functionality  Size.
Runtime Temporal Partitioning Assembly to Reduce FPGA Reconfiguration Time Abelardo Jara-Berrocal, Ann Gordon-Ross HCS Research Laboratory College of Engineering.
Relational Query Processing on OpenCL-based FPGAs Zeke Wang, Johns Paul, Hui Yan Cheah (NTU, Singapore), Bingsheng He (NUS, Singapore), Wei Zhang (HKUST,
GCSE Computing - The CPU
MPOC: A Chip Multiprocessor for Embedded Systems
Basic Computer Organization and Design
Adaptive Block Coding Order for Intra Prediction in HEVC
Presented by Munezero Immaculee Joselyne PhD in Software Engineering
Components of Computer
Ming Liu, Wolfgang Kuehn, Zhonghai Lu, Axel Jantsch
Hyperthreading Technology
Improving cache performance of MPEG video codec
Anne Pratoomtong ECE734, Spring2002
Computer Evolution and Performance
GCSE Computing - The CPU
Presentation transcript:

Electronics Lab, Physics Dept., Aristotle Univ. of Thessaloniki, Greece 17th IEEE International Conference on Electronics, Circuits, and Systems ICECS 2010 – Athens - Greece

 Motivation  Architecture Platform  Design Space Exploration  Results  Conclusions C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab 2

 Motivation  Architecture Platform  Design Space Exploration  Results  Conclusions C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab 3

 Modern multimedia applications › Increased need for computational power  High resolution/throughput imaging/digital signal processing › Need for larger memory space  Modern FPGA devices › Larger › More powerfull › Offer a variety of memory architectures › MPSoC capabilities  Formulate the complete Design Space Exploration problem 4 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

 Design space of MPSoC FPGA platforms taking into account: › The number of processors › Data/task level parallelism › Different interconnection strategies › Different memory architectures offered 5 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

 Motivation  Architecture Platform  Design Space Exploration  Results  Conclusions C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab 6

 MPSoC based on the Microblaze processor › Exploration of systems with one to four processors › Interconnection of the processors chosen  Fast Simplex Links (FSL)  FIFO based therefore can also serve as a data buffer  Different memory architectures used › External memory  DDR2 on the xupv5-lx110t (Virtex-5 board) › Local BRAM › Combination of both for all architectures 7 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

 Motivation  Architecture Platform  Design Space Exploration  Results  Conclusions C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab 8

 Algorithm selection › A widely used streaming multimedia application › Different types of parallelism  The Powerstone JPEG decoder › 4 stages  1-D DC prediction stage (DC)  Entropy Decoder (AC)  DeQuantization (DeQ)  2-D IDCT (IDCT) 9 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

 Single Microblaze implementations for profiling the application (execution cycles) › With extrernal DDR2 memory › With BRAM Single MB JPEG Decoding Stage DC prediction Entropy Decoding DeQuantization2D-IDCT DDR23,81%28,83%10,81%48,69% BRAM4,11%29,92%8,50%51,00% 10 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

 8 x 8 pixel blocks are used in every calculation stage  There is no data dependence between the blocks apart from the DC prediction stage  Architecture with only external memory  FSL depth = 4 only pointers are propagated  Architecture with use of BRAMs  FSL depth = 64 they are also used as data buffers 11 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

12  One Microblaze serves as a master  FSL depth of 4  for synchronization purposes C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

13  One Microblaze serves as a master  External memory  FSL depth of 4 (pointer propagation)  Internal memory  FSL depth of 64 (data propagation) C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

14  Microblaze 0 serves as a master  Data equally divided between Microblaze 0 and 1 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

 Motivation  Architecture Platform  Design Space Exploration  Results  Conclusions C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab 15

 Calculations of speed up and efficiency with reference to the single Microblaze architectures  Introducing a new parameter hardware efficiency to associate the area increase of the design with the speed up 16 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

17 C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

18 External MemoryExternal Memory + Local BRAMLocal BRAM C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

19  There is a limitation in the speed up when only external memory is used due to simultaneous memory requests of the processors  This is overcome by using both external and internal memory  The greatest speed up is achieved by the system with both data and task level parallelism, both external memory/local BRAM and 4MB (x3.27) C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

20  The systems with the greatest hardware efficiency are the systems which use only internal BRAMs  The greatest HW_efficiency (3.27) is achieved by the system with 2MB and local BRAMs, followed the system with 3MB (2.8)  HW_efficiency demonstrates the revenue gained at a certain hardware cost (area) C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

 Motivation  Architecture Platform  Design Space Exploration  Results  Conclusions C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab 21

22  A design space exploration for FPGA-based multiprocessing and memory architecture based on the JPEG algorithm  20 different system implementations with 3 different memory approaches and 4 different processor architectures were examined C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

23  Higher speed ups were observed for the architectures which use both internal and external memories and have 4 processors  Higher hardware efficiencies are achieved for architectures that use only internal memories at the expense of total BRAM usage  Our goal is to formulate a methodology for an optimum MPSoC architecture selection based on the performance (speed up) and the cost- effectiveness (HW_efficiency) chosen by the user C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab

The research activities that led to these results, were co-financed by Hellenic Funds and by the European Regional Development Fund (ERDF) under the Hellenic National Strategic Reference Framework (ESPA) , according to Contract no. MICRO2-49-project LoC. 24 Thank you very much for your attention! C.-L. Sotiropoulou – Design Space Exploration for FPGA-based Multiprocessing Systems – AUTH-eLab