Development of a Compact Cluster with Embedded CPUs Sritrusta Sukaridhoto, Yoshifumi Sasaki, Koichi Ito and Takafumi Aoki.

Slides:



Advertisements
Similar presentations
RFID Access Control System March, 2003 Softrónica.
Advertisements

Nios Multi Processor Ethernet Embedded Platform Final Presentation
Practical techniques & Examples
EMS ROMC Environment monitoring system for ROMC..
Seeking prime numbers quickly through parallel-computing Daniel J. Wright.
Beowulf Supercomputer System Lee, Jung won CS843.
Tutorial on MPI Experimental Environment for ECE5610/CSC
Computer Basics 1 Computer Basic 1 includes two lessons:
Linux Clustering A way to supercomputing. What is Cluster? A group of individual computers bundled together using hardware and software in order to make.
Introduction to MPI. What is Message Passing Interface (MPI)?  Portable standard for communication  Processes can communicate through messages.  Each.
High Availability (HA) May 03, Motivation  New Technology  The opportunity to create a cluster  Exploring with Linux Operating system.
CSCI 4550/8556 Computer Networks Comer, Chapter 10: LAN Wiring, Physical Topology, and Interface Hardware.
Introduction to Systems Architecture Kieran Mathieson.
Networks & telecommunications Part 2 July 2, 2012.
1 Parallel Computing—Introduction to Message Passing Interface (MPI)
CS211 Data Structures Sami Rollins Fall 2004.
1 Performance Evaluation of Load Sharing Policies with PANTS on a Beowulf Cluster James Nichols Mark Claypool Worcester Polytechnic Institute Department.
CS 101 Problem Solving and Structured Programming in C Sami Rollins Spring 2003.
UNIX chapter 03 Getting Started Mr. Mohammad Smirat.
07/14/08. 2 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic.
Interactive Face Recognition (IFR) Nishanth Vincent Fairfield University Advisor: Professor Douglas A. Lyon, Ph.D.
What is Router? Router is a device which makes communication between two or more networks present in different geographical locations. Routers are data.
 Chasis / System cabinet  A plastic enclosure that contains most of the components of a computer (usually excluding the display, keyboard and mouse)
Chapter 6 Advanced Installation. Objectives  Describe the types and structure of SCSI devices  Explain the different levels of RAID and types of RAID.
EMBEDDED SYSTEMS G.V.P.COLLEGE OF ENGINEERING Affiliated to J.N.T.U. By By D.Ramya Deepthi D.Ramya Deepthi & V.Soujanya V.Soujanya.
Local Area Networks: Software © Abdou Illia, Spring 2007 School of Business Eastern Illinois University (Week 8, Thursday 3/1/2007)
RSC Williams MAPLD 2005/BOF-S1 A Linux-based Software Environment for the Reconfigurable Scalable Computing Project John A. Williams 1
1 A web enabled compact flash card reader eeble. 2 Weeble Team Chris Foster Nicole DiGrazia Mike Kacirek Website
Parallel Processing LAB NO 1.
Lecture 6: Introduction to Distributed Computing.
ITED 328 Lecture 4 12 Feb 2004 Loosely covering Chapter 5 Internet and LAN Technology.
Electronic Visualization Laboratory, University of Illinois at Chicago MPI on Argo-new Venkatram Vishwanath Electronic Visualization.
Multi-media Computers and Computer Networks. Questions ? Media is used for ………………… Multimedia computer is capable of integrating ………………………………….. OCR stands.
Linux+ Guide to Linux Certification, Third Edition Chapter 6 Advanced Installation.
Executing Message-Passing Programs Mitesh Meswani.
1 Distributed Systems: an Introduction G53ACC Chris Greenhalgh.
1 COMPSCI 110 Operating Systems Who - Introductions How - Policies and Administrative Details Why - Objectives and Expectations What - Our Topic: Operating.
10/12/ Embedded XINU and WRT54GL. 10/12/ Topics Logic and shift operators Data-driven vs function-driven Embedded XINU and WRT54GL.
An Introduction to Parallel Programming and MPICH Nikolaos Hatzopoulos.
1 Lecture 7 LAN Wiring, Physical Topology, and Interface Hardware Computer Networks CS 4316.
Message Passing Programming Model AMANO, Hideharu Textbook pp. 140-147.
Beowulf Software. Monitoring and Administration Beowulf Watch 
PP Lab MPI programming II. Program#1 Write a program that prints hello from every created process. Like: Hello World from process 0 of 5 Hello World from.
1 Message Passing Models CEG 4131 Computer Architecture III Miodrag Bolic.
Chapter 4 Message-Passing Programming. The Message-Passing Model.
Access Method. “ ” A key is usually intended to operate one specific lock or a small number of locks that are keyed alike, so each lock requires a unique.
Minimalist’s Linux Cluster Changyoung Choi, Jeonghyun Kim, Seyong Kim Department of Physics Sejong University.
Running on GCB part1 By: Camilo Silva. Simple steps to run MPI 1.Use putty or the terminal 2.SSH to gcb.fiu.edu 3.Loggin by providing your username and.
Local Area Networks School of Business Eastern Illinois University © Abdou Illia, Spring 2007 (Week 8, Tuesday 2/27/2007)
By Jack Stewart. Cloud computing, or something being in the cloud, is a colloquial expression used to describe a variety of different types of computing.
The 2001 Tier-1 prototype for LHCb-Italy Vincenzo Vagnoni Genève, November 2000.
Digital Literacy: Computer Basics
3/12/2013Computer Engg, IIT(BHU)1 MPI-1. MESSAGE PASSING INTERFACE A message passing library specification Extended message-passing model Not a language.
Message Passing Interface Using resources from
Business-logic Layer Presentation Layer Network Layer Digital Signal Processing Layer SmartHome API SmartHome Software Architecture SH mobile application.
Intro to Distributed Systems Hank Levy. 23/20/2016 Distributed Systems Nearly all systems today are distributed in some way, e.g.: –they use –they.
COMPUTER NETWORKS CS610 Lecture-11 Hammad Khalid Khan.
Nguyen Thi Thanh Nha HMCL by Roelof Kemp, Nicholas Palmer, Thilo Kielmann, and Henri Bal MOBICASE 2010, LNICST 2012 Cuckoo: A Computation Offloading Framework.
System Models Advanced Operating Systems Nael Abu-halaweh.
AMOEBA study of distributed system
Hands on training session for core skills
Advanced Topics: MPI jobs
MPI Message Passing Interface
Introduction to Quantum ESPRESSO and Computer System
CS 584.
Message Passing Models
AI Stick Easy to learn and use, accelerate the industrialization of artificial intelligence, and let the public become an expert in AI.
Hardware Environment VIA cluster - 8 nodes Blade Server – 5 nodes
Parallel Processing - MPI
Some codes for analysis and preparation for programming
Presentation transcript:

Development of a Compact Cluster with Embedded CPUs Sritrusta Sukaridhoto, Yoshifumi Sasaki, Koichi Ito and Takafumi Aoki

Development of a Compact Cluster with Embedded CPUs Introduction Ubiquitous environment Electronics equipment in our surrounding ( equipped with embedded CPU ) Connected by network ducts/secu_ftouchm.html Network Navigation System Mobiles Home Automation Embed ded CPU Distributed cooperation / parallel processing Embed ded CPU Embed ded CPU

Development of a Compact Cluster with Embedded CPUs Introduction Development Prototyping Environment Embedded CPUs Ubiquitous Distributed / Parallel Computing Ubiquitous Computing Cluster (UCC) Low Cost

Development of a Compact Cluster with Embedded CPUs Contents Introduction Cluster Computer Structures Implementations Performance Evaluations Application: Fingerprint Verifications Conclusions and Future Plans

Development of a Compact Cluster with Embedded CPUs Cluster Computer Structures HUB Terminal 100Mbps LAN Connections Node#3 Node#2 Node#1 Node#0 Embedded Devices Power consumption: 60W (TYP) Ubiquitous Computing Cluster Hardware

Development of a Compact Cluster with Embedded CPUs Cluster Computer Structures Specification of Calculation Node Embedded Network Attached Storage (NAS) Include: Embedded CPU ( SH4 ), memory , USB I/F , network I/F , HDD. Able to act as network computer Logically have function as general computer . Small space, low power consumption CPUSH4 (SH7751R, 266MHz) Memory64MB SDRAM HDD120GB, ATA133, 5400rpm NIC10/100 BASE-T (RTL-8139C+) I/FUSB 2.0×2port Power 14W(TYP)

Development of a Compact Cluster with Embedded CPUs Cluster Computer Structures Ubiquitous Computing Cluster Software Operating System Debian GNU Linux for SH4 Stable inter-processor communication Compact kernel and daemons Servers and daemons Inter-processor communication ( rsh, rexec, rcp ) login ( telnet ), file transfer ( FTP ) Network File sharing ( NFS ), Network Information Services (NIS) Development environment compiler (GNU gcc-3.0.4, g++, Fortran77) editor (GNU Emacs, vi) Parallel process interface (MPI, PVM)

Development of a Compact Cluster with Embedded CPUs Cluster Computer Structures How it works ??? UCC Node#3 HUB Node#2Node#1Node#0 Inter-process communication (rsh ) Login, File Transfer ( telnet, FTP ) Terminal Node#0 is also working as administrator server (NIS, NFS ) 100Mbps Fast Ethernet

Development of a Compact Cluster with Embedded CPUs Cluster Computer Structures Computing node is embedded CPU Suitable for prototyping the next generation computer Using COTS product Low cost system Using Linux as Operating System A stable inter-processor communication Open Source COTS: Commercial Off-The-Shelf Embedded Devices UCC Node#3 HUB Node#2 Node#1 Terminal 100Mbps Fast Ethernet Node#0

Development of a Compact Cluster with Embedded CPUs Features Size : 390mm×280mm×150mm , Power Consumption : 60W(TYP)

Development of a Compact Cluster with Embedded CPUs Implementation UCC Node#3 HUB Node#2Node#1 Terminal Node#0 Login to Node#0 Write a parallel program: hello.c #include "mpi.h" #include void main(int argc, char *argv[]) { int numprocs, prognum; /* Initialize MPI */ MPI_Init(&argc, &argv); MPI_Comm_rank(MPI_COMM_WORLD, &procnum); MPI_Comm_size(MPI_COMM_WORLD, &numprocs); printf ("Hello world! from %d of %d", procnum, numprocs); MPI_Finalize(); return; } Compile: mpicc –o hello hello.c Run: mpirun –np 4 hello Hello world! from 0 of 4Hello world! from 1 of 4Hello world! from 2 of 4Hello world! from 3 of 4

Development of a Compact Cluster with Embedded CPUs Advance Application USB port  connect to another devices (Fingerprint sensor, USB-Audio, Camera, etc) Fingerprint Verification System Speech / Voice Recognition System Image Processing System

Development of a Compact Cluster with Embedded CPUs Performance Evaluations Pallas MPI Benchmark (PMB)* :Performance evaluations for MPI communication Ping-Pong measuring delay time when transferring data between 2 processors Broadcast measuring the biggest delay time when transferring from node#0 to the other nodes. *

Development of a Compact Cluster with Embedded CPUs Performance Evaluations (cont.) ping-pong communication test The biggest transfer ability is 70 Mbps It gives enough performance using 100 Mbps ethernet

Development of a Compact Cluster with Embedded CPUs Performance Evaluations (cont.) Communication broadcast test The biggest broadcast communication ability is around 36Mbps It gives enough performance using ordinary HUB

Development of a Compact Cluster with Embedded CPUs Application: Fingerprint Verifications Distributed processing for verifying the fingerprint in a database. Fingerprint matching algorithm→ POC Node#3Node#2Node#1Node#0 Fingerprint Sensor Fingerprint matching in each node using POC Registered Fingerprint Input Fingerprint

Development of a Compact Cluster with Embedded CPUs What is Phase-Only Correlation (POC) ? correlation using only image phase component according to similarity degree of the image, sharp peak produced Algorithm based on signal processing Standard image Input image 2 DFT amplitude Example of Phase-Only Correlation Input image 1 DFT phase correlation phase amplitude

Development of a Compact Cluster with Embedded CPUs Fingerprint Matching Algorithm Using POC Function 128×128 FFT phasing Registered fingerprint ( phase ) × IFFT Peak extraction Peak comparison #Node 0 Fingerprint image Check result × IFFT Peak extraction #Node 3 Registered fingerprint ( phase ) × IFFT Peak extraction #Node 2 Registered fingerprint ( phase ) × IFFT Peak extraction #Node 1 Registered fingerprint ( phase )

Development of a Compact Cluster with Embedded CPUs Fingerprint Verification Performance Evaluation The number of computing nodes: 1, 2, or 4 (can be changed ) The number of registered finger print: 12 (3 images per node ) Evaluation of the matching time on input fingerprint and registered fingerprint Registered finger print Node #0 Node #1 Node #2 Node #3 Image from sensor

Development of a Compact Cluster with Embedded CPUs Result Processing time: less than 2 seconds with 4 nodes. Enough performance with embedded CPUs

Development of a Compact Cluster with Embedded CPUs Conclusion Development of a Ubiquitous Computing Cluster with embedded CPUs World smallest cluster computer in size, power consumption and cost Suitable for prototyping the next generation ubiquitous application Application: Fingerprint verification performance evaluation shows satisfactory result UCC is capable for advance applications

Development of a Compact Cluster with Embedded CPUs Future Plans Ubiquitous Computing Cluster Fingerprint verification Computer vision Voice/Speech recognition Next Generation Ubiquitous Application Cipher Fault tolerant system Image tracking Redundant Server system RFID certification Robot System Face recognition

Development of a Compact Cluster with Embedded CPUs Thank you