Research Achievements Kenji Kaneda. Agenda Research background and goal Research background and goal Overview of my research achievements Overview of.

Slides:



Advertisements
Similar presentations
Multiple Processor Systems
Advertisements

Distributed Processing, Client/Server and Clusters
Ravi Sankar Technology Evangelist | Microsoft
Protocols and software for exploiting Myrinet clusters Congduc Pham and the main contributors P. Geoffray, L. Prylli, B. Tourancheau, R. Westrelin.
An Overview of the Amoeba Distributed Operating System Mallikarjuna Reddy Srinivas Vadlamani University of California Irvine.
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts Amherst Operating Systems CMPSCI 377 Lecture.
Classification of Distributed Systems Properties of Distributed Systems n motivation: advantages of distributed systems n classification l architecture.
Job submission architectures in GRID environment Masamichi Ando M1 Student Taura Lab. Department of Information Science and Technology.
Distributed Processing, Client/Server, and Clusters
City University London
Distributed Hardware How are computers interconnected ? –via a bus-based –via a switch How are processors and memories interconnected ? –Private –shared.
Lesson 17 – UNDERSTANDING OTHER NETWARE SERVICES.
Yousuf Surmust Instructor: Marius Soneru Course: CS550 Fall 2001
Lesson 20 – OTHER WINDOWS 2000 SERVER SERVICES. DHCP server DNS RAS and RRAS Internet Information Server Cluster services Windows terminal services OVERVIEW.
Hardware Firewall Feature © N. Ganesan, Ph.D.. Chapter Objectives Show the configuration of a hardware firewall such as Dlink DI 604 Illustrate the sharing.
Workload Management Massimo Sgaravatto INFN Padova.
Hussain Ali Department of Computer Engineering KFUPM, Dhahran, Saudi Arabia Microsoft Networking.
Windows 2008 Overview Lecture 1. Windows Networking Evolution Windows for Workgroups – peer-to-peer networking built into the OS Windows NT – separate.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Distributed Systems Early Examples. Projects NOW – a Network Of Workstations University of California, Berkely Terminated about 1997 after demonstrating.
About the Presentations The presentations cover the objectives found in the opening of each chapter. All chapter objectives are listed in the beginning.
Computer System Architectures Computer System Software
Parallel Processing LAB NO 1.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems.
Introduction to Networking Concepts. Introducing TCP/IP Addressing Network address – common portion of the IP address shared by all hosts on a subnet/network.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Prof. Heon Y. Yeom Distributed Computing Systems Lab. Seoul National University FT-MPICH : Providing fault tolerance for MPI parallel applications.
A brief overview about Distributed Systems Group A4 Chris Sun Bryan Maden Min Fang.
1. Introduction to Distributed Systems. 1. Introduction Two advances in computer technology: A. The development of powerful microprocessors. B. The invention.
1 Lecture 20: Parallel and Distributed Systems n Classification of parallel/distributed architectures n SMPs n Distributed systems n Clusters.
 What is an operating system? What is an operating system?  Where does the OS fit in? Where does the OS fit in?  Services provided by an OS Services.
Remote Access Chapter 4. Learning Objectives Understand implications of IEEE 802.1x and how it is used Understand VPN technology and its uses for securing.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems.
Appendix B Planning a Virtualization Strategy for Exchange Server 2010.
Module 3: Designing IP Addressing. Module Overview Designing an IPv4 Addressing Scheme Designing DHCP Implementation Designing DHCP Configuration Options.
1 Next Few Classes Networking basics Protection & Security.
A Virtual Machine Monitor for Utilizing Non-dedicated Clusters Kenji Kaneda Yoshihiro Oyama Akinori Yonezawa (University of Tokyo)
G-JavaMPI: A Grid Middleware for Distributed Java Computing with MPI Binding and Process Migration Supports Lin Chen, Cho-Li Wang, Francis C. M. Lau and.
1 The Roadmap to New Releases Todd Tannenbaum Department of Computer Sciences University of Wisconsin-Madison
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
Database Architectures Database System Architectures Considerations – Data storage: Where do the data and DBMS reside? – Processing: Where.
Internetworking Concept and Architectural Model
NGS Innovation Forum, Manchester4 th November 2008 Condor and the NGS John Kewley NGS Support Centre Manager.
Multiprossesors Systems.. What are Distributed Databases ? “ A Logically interrelated collection of shared data ( and a description of this data) physically.
PARALLEL COMPUTING overview What is Parallel Computing? Traditionally, software has been written for serial computation: To be run on a single computer.
Page 1 Process Migration & Allocation Paul Krzyzanowski Distributed Systems Except as otherwise noted, the content of this.
Virtual Private Grid (VPG) : A Command Shell for Utilizing Remote Machines Efficiently Kenji Kaneda, Kenjiro Taura, Akinori Yonezawa Department of Computer.
Types of Operating Systems 1 Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015.
Virtualization Technology and Microsoft Virtual PC 2007 YOU ARE WELCOME By : Osama Tamimi.
GRID activities in Wuppertal D0RACE Workshop Fermilab 02/14/2002 Christian Schmitt Wuppertal University Taking advantage of GRID software now.
Introduction Why are virtual machines interesting?
Globus Grid Tutorial Part 2: Running Programs Across Multiple Resources.
High Performance LU Factorization for Non-dedicated Clusters Toshio Endo, Kenji Kaneda, Kenjiro Taura, Akinori Yonezawa (University of Tokyo) and the future.
Hands-On Virtual Computing
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 1.
Background Computer System Architectures Computer System Software.
Primitive Concepts of Distributed Systems Chapter 1.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
SYSTEM MODELS FOR ADVANCED COMPUTING Jhashuva. U 1 Asst. Prof CSE
COMP1321 Digital Infrastructure Richard Henson March 2016.
Distributed Processing, Client/Server and Clusters
Chapter 1 Introduction to Networking
Windows 2008 Overview Lecture 1.
Chapter 1: Introduction
Chapter 17: Database System Architectures
P3:Personal Power Plant
A Virtual Machine Monitor for Utilizing Non-dedicated Clusters
Lecture Topics: 11/1 Hand back midterms
Presentation transcript:

Research Achievements Kenji Kaneda

Agenda Research background and goal Research background and goal Overview of my research achievements Overview of my research achievements Phoenix Phoenix Virtual Private Grid Virtual Private Grid Summary and recent activities Summary and recent activities

Research Background and Goal

Background Grid computing Grid computing Parallel computing with harnessing many widely-distributed resources Parallel computing with harnessing many widely-distributed resources E.g.) aggregate of PC clusters spread over multiple LANs

Traditional Parallel Computing vs. Grid Computing Traditional parallel computing Reliable processors Reliable processors Single-LAN resources Single-LAN resources Grid computing Unreliable processors Unreliable processors Multi-LANs resources Multi-LANs resources

Difficulty in Grid computing Frequent machine/network failures Frequent machine/network failures E.g.) 1 machine failure per a day Restricted Connectivity Restricted Connectivity Administrative policies restrict communications between machines Administrative policies restrict communications between machines E.g.) firewall, NAT, DHCP Gateway TCP Gateway Firewall

Research Goal Allow a user to harness a computational grid like traditional parallel computing Allow a user to harness a computational grid like traditional parallel computing Fault tolerance Fault tolerance Transparent communication on WANs Transparent communication on WANs

My Research Achievements Design/implementation of middlewares Phoenix Phoenix Parallel programming library for accommodating dynamically joining/leaving resources Parallel programming library for accommodating dynamically joining/leaving resources Virtual Private Grid Virtual Private Grid Command shell for utilizing hundreds of computers spread over multiple LANs Command shell for utilizing hundreds of computers spread over multiple LANs

Phoenix

Phoenix Parallel programming library for accommodating dynamically joining/leaving resources Parallel programming library for accommodating dynamically joining/leaving resources Programming model for supporting migration of application states Programming model for supporting migration of application states Transparent communication mechanism for WANs Transparent communication mechanism for WANs

Programming Model for Supporting Migration of Application States Subsumes a regular message passing model Subsumes a regular message passing model Provides a namespace that does not depend on physical machines Provides a namespace that does not depend on physical machines Programmer uses this name to specify a message destination Programmer uses this name to specify a message destination Programmer can write a program without being aware of physical machines Programmer can write a program without being aware of physical machines

Transparent Communication mechanism for WANs Overlay network construction Overlay network construction Application-level message routing Application-level message routing Processes can communicate with one another Processes can communicate with one another even if networks are not fully connected even if networks are not fully connected even if connection topologies change dynamically even if connection topologies change dynamically

Demonstration Boot processes on 3 subnets Boot processes on 3 subnets Add processes dynamically Add processes dynamically

Demonstration

Experiments (1/3) Speedup with fixed resources Speedup with fixed resources POV-Ray: 78 speedup using 104 processors on 3 LANs POV-Ray: 78 speedup using 104 processors on 3 LANs LU: comparable to MPICH (on a single LAN) LU: comparable to MPICH (on a single LAN)

Experiments (2/3) Speedup with dynamic resources Speedup with dynamic resources POV-Ray takes advantage of dynamically added resources quickly POV-Ray takes advantage of dynamically added resources quickly

Experiments (3/3) Parallel shogi (Japanese chess) program on 720 laptop PCs Parallel shogi (Japanese chess) program on 720 laptop PCs 7~8 speedup 7~8 speedup

Related Work Grid enabled MPIs Grid enabled MPIs E.g.) MPICH-G [G. Bosilca et al. SC ’ 02] Based on a traditional message passing model Based on a traditional message passing model Difficult to support dynamic changes of resources Difficult to support dynamic changes of resources Communications libraries for Grids Communications libraries for Grids E.g.) Ibis [A. Denis et al. HPDC ’ 04] Static message routing Static message routing

Summary ~ Phoenix ~ Parallel programming library for dynamically changing resources Parallel programming library for dynamically changing resources Good speedup with a large number of machines on multiple LANs Good speedup with a large number of machines on multiple LANs

Virtual Private Grid

Virtual Private Grid (VPG) Command shell for utilizing hundreds of computers spread over multiple LANs Command shell for utilizing hundreds of computers spread over multiple LANs

Features (1/2) User can submit jobs without caring administrative restrictions User can submit jobs without caring administrative restrictions E.g.) | > Firewall host1host2 Firewall NAT host3 Execute cmd1 Write to file3 Execute cmd2

Features (2/2) Fault tolerance Fault tolerance VPG can continue to run even if some machines are added/deleted dynamically VPG can continue to run even if some machines are added/deleted dynamically No central server is required No central server is required

Demonstration Environment Environment 3 LANs 3 LANs CPU: Sparc, x86, MIPS, PowerPC CPU: Sparc, x86, MIPS, PowerPC OS: Solaris, Linux, IRIX OS: Solaris, Linux, IRIX

Demonstration

Related Work Grid job submission tools Grid job submission tools E.g.) Globus, Condor-G Difficult to submit jobs to machines under administrative restrictions Difficult to submit jobs to machines under administrative restrictions

Summary ~ Virtual Private Grid ~ Command shell for utilizing hundreds of computers spread over multiple LANs Command shell for utilizing hundreds of computers spread over multiple LANs Fast job submission to more than 100 machines Fast job submission to more than 100 machines

Summary and Recent activities

Summary ~ My Research Achievements ~ Middlewares for Grid computing Middlewares for Grid computing Phoenix Phoenix Virtual Private Grid Virtual Private Grid

Recent Activities (1/2) Virtual SMP Virtual SMP Emulates a multi-processor machines on a loosely-coupled computes Emulates a multi-processor machines on a loosely-coupled computers Virtual dual processor machine on two single processor machines

Recent Activities (2/2) Virtual SMP Virtual SMP Easy utilization of distributed resources with a common OS (e.g., Windows, Linux) Easy utilization of distributed resources with a common OS (e.g., Windows, Linux)