Exploiting SCI in the MultiOS management system Ronan Cunniffe Brian Coghlan SCIEurope’2000 29-AUG-2000.

Slides:



Advertisements
Similar presentations
Windows® Deployment Services
Advertisements

High Performance Cluster Computing Architectures and Systems Hai Jin Internet and Cluster Computing Center.
Capacity Planning and Predicting Growth for Vista Amy Edwards, Ezra Freeloe and George Hernandez University System of Georgia 2007.
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
Leveraging WinPE and Linux Preboot for Effective Provisioning Jonathan Richey | Director of Development | Altiris, Inc.
BigBed/bigWig remote file access Hiram Clawson UCSC Center for Biomolecular Science & Engineering.
Other File Systems: LFS and NFS. 2 Log-Structured File Systems The trend: CPUs are faster, RAM & caches are bigger –So, a lot of reads do not require.
Encouraging the Unexpected: Cluster Management for OS and Systems Research Ronan Cunniffe Brian Coghlan Europar’ AUG-2000.
ACAT 2002, Moscow June 24-28thJ. Hernández. DESY-Zeuthen1 Offline Mass Data Processing using Online Computing Resources at HERA-B José Hernández DESY-Zeuthen.
MobiDesk: Mobile Virtual Desktop Computing Ricardo A. Baratto, Shaya Potter, Gong Su, Jason Nieh Network Computing Laboratory Columbia University September.
Chapter 15 Chapter 15: Network Monitoring and Tuning.
University of Pennsylvania 11/21/00CSE 3801 Distributed File Systems CSE 380 Lecture Note 14 Insup Lee.
07/14/08. 2 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic.
File Systems (2). Readings r Silbershatz et al: 11.8.
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing Kai Hwang, Hai Jin, and Roy Ho.
Paper on Best implemented scientific concept for E-Governance Virtual Machine By Nitin V. Choudhari, DIO,NIC,Akola By Nitin V. Choudhari, DIO,NIC,Akola.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 14: Problem Recovery.
Federico Calzolari 1, Silvia Arezzini 2, Alberto Ciampa 2, Enrico Mazzoni 2 1 Scuola Normale Superiore - Pisa, Italy 2 National Institute of Nuclear Physics.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
1 The SpaceWire Internet Tunnel and the Advantages It Provides For Spacecraft Integration Stuart Mills, Steve Parkes Space Technology Centre University.
Hands-On Virtual Computing
1 Guide to Novell NetWare 6.0 Network Administration Chapter 13.
IOS110 Introduction to Operating Systems using Windows Session 9 1.
Distributed Systems. Interprocess Communication (IPC) Processes are either independent or cooperating – Threads provide a gray area – Cooperating processes.
Chapter Fourteen Windows XP Professional Fault Tolerance.
High Performance User-Level Sockets over Gigabit Ethernet Pavan Balaji Ohio State University Piyush Shivam Ohio State University.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
Benefits: Increased server utilization Reduced IT TCO Improved IT agility.
The NE010 iWARP Adapter Gary Montry Senior Scientist
VirtualBox What you need to know to build a Virtual Machine.
A study of introduction of the virtualization technology into operator consoles T.Ohata, M.Ishii / SPring-8 ICALEPCS 2005, October 10-14, 2005 Geneva,
InstantGrid: A Framework for On- Demand Grid Point Construction R.S.C. Ho, K.K. Yin, D.C.M. Lee, D.H.F. Hung, C.L. Wang, and F.C.M. Lau Dept. of Computer.
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
Large Scale Parallel File System and Cluster Management ICT, CAS.
CHEP04 Performance Analysis of Cluster File System on Linux Yaodong CHENG IHEP, CAS
Server Performance, Scaling, Reliability and Configuration Norman White.
Terminal Servers in Schools A second life for your older computers.
Microsoft Management Seminar Series SMS 2003 Change Management.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 12: Planning and Implementing Server Availability and Scalability.
Minimalist’s Linux Cluster Changyoung Choi, Jeonghyun Kim, Seyong Kim Department of Physics Sejong University.
Page 1 Printing & Terminal Services Lecture 8 Hassan Shuja 11/16/2004.
ITGS Network Architecture. ITGS Network architecture –The way computers are logically organized on a network, and the role each takes. Client/server network.
Fs Streaming Media a presentation by Florian Seidel.
FAT File Allocation Table
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
COMP381 by M. Hamdi 1 Clusters: Networks of WS/PC.
The 2001 Tier-1 prototype for LHCb-Italy Vincenzo Vagnoni Genève, November 2000.
The Hungarian ClusterGRID Project Péter Stefán research associate NIIF/HUNGARNET
1 Farm Issues L1&HLT Implementation Review Niko Neufeld, CERN-EP Tuesday, April 29 th.
Virtual Machines Module 2. Objectives Define virtual machine Define common terminology Identify advantages and disadvantages Determine what software is.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Using Deduplicating Storage for Efficient Disk Image Deployment Xing Lin, Mike Hibler, Eric Eide, Robert Ricci University of Utah.
Virtualised DICE image - scoping Research and Teaching Unit Work in progress.
BY: SALMAN 1.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 12: Planning and Implementing Server Availability and Scalability.
BY: SALMAN.
WP18, High-speed data recording Krzysztof Wrona, European XFEL
Diskless Networks By Sam Morris.
Optimizing the Migration of Virtual Computers
5.0 : Windows Operating System
Windows Azure Migrating SQL Server Workloads
Virtualization overview
Migration Strategies – Business Desktop Deployment (BDD) Overview
Chapter 4.
NFS and AFS Adapted from slides by Ed Lazowska, Hank Levy, Andrea and Remzi Arpaci-Dussea, Michael Swift.
NFS.
Chapter 15: Network Monitoring and Tuning
File system : Disk Space Management
Presentation transcript:

Exploiting SCI in the MultiOS management system Ronan Cunniffe Brian Coghlan SCIEurope’ AUG-2000

Trinity College Dublin; Page 2 The Problem Those OS researchers crashed our cluster !

SCIEurope’ AUG-2000 Trinity College Dublin; Page 3 The Desire Crashes may be illuminating ! OS research environment: may not be stable may be missing features Separate cluster per project: is inefficient is expensive Mmmm …. As clusters become more common, problem gets more acute

SCIEurope’ AUG-2000 Trinity College Dublin; Page 4 Existing solution all candidate environments must be there no easy way to add another environment assumes environments will not over-write others only the current image is accessible i.e. Dual/Multiple Boot Partition the local disk

SCIEurope’ AUG-2000 Trinity College Dublin; Page 5 MultiOS solution - import & export every time can support any number of environments no assumptions about stability or good behaviour environments are accessible when off-line places great demands on network and remote disk Import the environment each time (from remote disk) …. OS #2OS #1 Time OS #1 out OS #2 in

SCIEurope’ AUG-2000 Trinity College Dublin; Page 6 MultiOS standard mechanism for diskless workstations BOOTP: Query remote server for file to download TFTP: Download indicated file Pass control to downloaded file alternate between two types of session Management sessions: node runs management software obtained over network this loads user image to local disk User sessions: node runs whatever environment is on the local disk MultiOS considers this a black box How is it implemented?

SCIEurope’ AUG-2000 Trinity College Dublin; Page 7 Use a full OS as management environment can run from a RAM disk can use network-mounted filesystems Advantages of a full OS SCI drivers raw disk I/O support standard UNIX tools: dd, diff, gzip, rcp, etc. new tools can be written as necessary Management software is Linux

SCIEurope’ AUG-2000 Trinity College Dublin; Page 8 MultiOS Server Scheduler Overall MultiOS architecture Console (WWW) Cluster Nodes Image Server SCI Network BOOTP Server Linux FS (+MultiOS client) Architecture: standard servers: BOOTP, TFTP, HTTP isolated web console TFTP Server

SCIEurope’ AUG-2000 Trinity College Dublin; Page 9 Disk images are big - hard disks are slow Our cluster: 16 nodes x 2GB disks 2GB raw over 100Mbps Ethernet: Nodes Network traffic Time 4 8MB/s 17.1 mins 8 8MB/s 34.1 mins 16 8MB/s 68.3 mins 32 8MB/s mins 2GB raw over SCI: Nodes Network traffic Time 4 40MB/s 3.3 mins 8 50MB/s 5.5 mins 16 50MB/s 10.7 mins 32 50MB/s 21.3 mins

SCIEurope’ AUG-2000 Trinity College Dublin; Page 10 Making the numbers smaller Compression Loading less than a disk Multicast reference image + patching 3 ways to do it: For 100MB differential information for each compute node: Nodes Network traffic Storage Time GB 3.5 mins GB 3.6 mins GB 4.0 mins GB 4.4 mins

SCIEurope’ AUG-2000 Trinity College Dublin; Page 11 If disks were faster Assuming 50MB/s disks : For 100MB differential information for each compute node: Nodes Network traffic Storage Time GB 48 secs GB 56 secs GB 72 secs GB 104 secs

SCIEurope’ AUG-2000 Trinity College Dublin; Page 12 SCI Multicast vs. Ethernet Multicast SCI Multicast-by-propagation end-to-end latency small compared to total time little extraneous traffic Difference becomes important in partitioned clusters IP Multicast multicast = broadcast image traffic disturbs everyone

SCIEurope’ AUG-2000 Trinity College Dublin; Page 13 Partitionable clusters MultiOS traffic can easily saturate the image server fewer points of entry than partitions transport scripts can provide traffic shaping

SCIEurope’ AUG-2000 Trinity College Dublin; Page 14 Summary MultiOS allows a cluster to be shared for any number of different environments for any type of research MultiOS via SCI exploits: high bandwidth ceiling low protocol overheads

SCIEurope’ AUG-2000 Trinity College Dublin; Page 15 Conclusion NB: OS research is an equal opportunity employer ! Our vision of the future Normal user OS researcher