InstantGrid: A Framework for On- Demand Grid Point Construction R.S.C. Ho, K.K. Yin, D.C.M. Lee, D.H.F. Hung, C.L. Wang, and F.C.M. Lau Dept. of Computer.

Slides:



Advertisements
Similar presentations
Remus: High Availability via Asynchronous Virtual Machine Replication
Advertisements

Windows® Deployment Services
Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
1 Dynamic DNS. 2 Module - Dynamic DNS ♦ Overview The domain names and IP addresses of hosts and the devices may change for many reasons. This module focuses.
INSTALLING LINUX.  Identify the proper Hardware  Methods for installing Linux  Determine a purpose for the Linux Machine  Linux File Systems  Linux.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Distributed Storage March 12, Distributed Storage What is Distributed Storage?  Simple answer: Storage that can be shared throughout a network.
Managing Computers With Intel AMT Greg Rusu
CS-550: Distributed File Systems [SiS]1 Resource Management in Distributed Systems: Distributed File Systems.
XSEDE 13 July 24, Galaxy Team: PSC Team:
Scale-out Central Store. Conventional Storage Verses Scale Out Clustered Storage Conventional Storage Scale Out Clustered Storage Faster……………………………………………….
14.1 © 2004 Pearson Education, Inc. Exam Planning, Implementing, and Maintaining a Microsoft Windows Server 2003 Active Directory Infrastructure.
Exploiting SCI in the MultiOS management system Ronan Cunniffe Brian Coghlan SCIEurope’ AUG-2000.
Homework 2 In the docs folder of your Berkeley DB, have a careful look at documentation on how to configure BDB in main memory. In the docs folder of your.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 11 Managing and Monitoring a Windows Server 2008 Network.
Welcome Course 20410B Module 0: Introduction Audience
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
Frangipani: A Scalable Distributed File System C. A. Thekkath, T. Mann, and E. K. Lee Systems Research Center Digital Equipment Corporation.
Minerva Infrastructure Meeting – October 04, 2011.
Vincenzo Vagnoni LHCb Real Time Trigger Challenge Meeting CERN, 24 th February 2005.
System Architecture & Hardware Configurations Dr. D. Bilal IS 592 Spring 2005.
System Center 2012 Setup The components of system center App Controller Data Protection Manager Operations Manager Orchestrator Service.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Windows ® Deployment Services Infrastructure Planning and Design Published: February 2008 Updated: January 2012.
Chapter-4 Windows 2000 Professional Win2K Professional provides a very usable interface and was designed for use in the desktop PC. Microsoft server system.
Configuration of Linux Terminal Server Group: LNS10A6 Thebe Laxmi, Sharma Prabhakar, Patrick Appiah.
MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter Two Deploying Windows Servers.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Research Achievements Kenji Kaneda. Agenda Research background and goal Research background and goal Overview of my research achievements Overview of.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
High Performance Computing Cluster OSCAR Team Member Jin Wei, Pengfei Xuan CPSC 424/624 Project ( 2011 Spring ) Instructor Dr. Grossman.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
Microsoft Active Directory(AD) A presentation by Robert, Jasmine, Val and Scott IMT546 December 11, 2004.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Distributed File Systems
OSG Site Provide one or more of the following capabilities: – access to local computational resources using a batch queue – interactive access to local.
Software.
Module 1: Installing and Configuring Servers. Module Overview Installing Windows Server 2008 Managing Server Roles and Features Overview of the Server.
Dynamic Host Configuration Protocol Engr. Mehran Mamonai.
Guide to Linux Installation and Administration, 2e1 Chapter 2 Planning Your System.
Sandor Acs 05/07/
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
Laboratório de Instrumentação e Física Experimental de Partículas GRID Activities at LIP Jorge Gomes - (LIP Computer Centre)
OS2- Sem ; R. Jalili Introduction Chapter 1.
1 Week #10Business Continuity Backing Up Data Configuring Shadow Copies Providing Server and Service Availability.
The II SAS Testbed Site Jan Astalos - Institute of Informatics Slovak Academy of Sciences.
OPERATING SYSTEM SUPPORT DISTRIBUTED SYSTEMS CHAPTER 6 Lawrence Heyman July 8, 2002.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Definition of a Distributed System (1) A distributed system is: A collection of independent computers that appears to its users as a single coherent system.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
LCG LCG-1 Deployment and usage experience Lev Shamardin SINP MSU, Moscow
Communications & Networks National 4 & 5 Computing Science.
Construction methods and monitoring in meta-cluster systems LIT, JINR Korenkov V.V, Mitsyn V.V, Chkhaberidze D.V, Belyakov D.V.
 Introduction  Architecture NameNode, DataNodes, HDFS Client, CheckpointNode, BackupNode, Snapshots  File I/O Operations and Replica Management File.
Windows 2000 Ronnie Park Jarod Nozawa Joe Stones Yassir Mhdhroui.
The 2001 Tier-1 prototype for LHCb-Italy Vincenzo Vagnoni Genève, November 2000.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
The Hungarian ClusterGRID Project Péter Stefán research associate NIIF/HUNGARNET
Grid testing using virtual machines Stephen Childs*, Brian Coghlan, David O'Callaghan, Geoff Quigley, John Walsh Department of Computer Science Trinity.
Distributed Systems: Distributed File Systems Ghada Ahmed, PhD. Assistant Prof., Computer Science Dept. Web:
Windows Certification Paths OR MCSA Windows Server 2012 Installing and Configuring Windows Server 2012 Exam (20410) Administering Windows Server.
System Models Advanced Operating Systems Nael Abu-halaweh.
BIG DATA/ Hadoop Interview Questions.
A Quick Tour of Ceedo Deliver environments to un-managed PCs.
BY: SALMAN 1.
Chapter 6: Securing the Cloud
BY: SALMAN.
Solutions: Backup & Restore
Lecture 15 Reading: Bacon 7.6, 7.7
Presentation transcript:

InstantGrid: A Framework for On- Demand Grid Point Construction R.S.C. Ho, K.K. Yin, D.C.M. Lee, D.H.F. Hung, C.L. Wang, and F.C.M. Lau Dept. of Computer Science, The University of Hong Kong Grid point construction a difficult task Different grid users/applications demand different execution environments (EE’s) Managing - and switching between - different EE’s incur much system administration overheads E.g. Computing grid (MPICH-G2, etc.) vs. service grid (GT3); different OS distributions/versions, libraries, etc. Our solution – InstantGrid A framework for efficient construction of grid point Convenient system administration for multiple EE’s Instant EE construction in remote nodes Complete transparency to user applications Supports in-memory execution – protects HD’s data from malicious access

The InstantGrid Framework All EE’s are installed, configured, and managed in central InstantGrid servers Cluster/grid nodes obtain customized EE’s through network (i.e., the “dissemination” process) Framework consists of the following key elements: Application-centric software grouping Proactive software configuration Discriminative file sharing mechanisms Options for file storage in compute nodes An EE dissemination service Single Linux Image Management (SLIM): The infrastructure for EE dissemination SLIM is able to deliver customized EE’s for: HPC cluster/grid systems Linux desktops Diskless Linux nodes

Application-centric Software Grouping (a)A service-oriented grid point (b)A frontend node for HPC job submission (c)A typical cluster node which processes jobs dispatched from the frontend node (b)+(c): A single EE group indicating the software requirement of a cluster- based grid point, which includes a gatekeeper and a number of compute nodes Software are grouped together to match the specific requirements of applications An EE is a collection of software components, which include an OS, system libraries, grid/cluster middleware, applications, and the user data Customized EE “images” for different users/applications Facilitates software management and dissemination Sample EE’s:

Proactive Software Configuration Discrimitive File Sharing Mechanism Full replication is impractical due to large size of typical EE’s Updating files through NFS is slow InstantGrid adopts a hybrid approach: Replicate (frequently- updated files) + NFS (other files) Traditionally, software are installed/configured incrementally InstantGrid advocates “configuration before dissemination” Try to configure all software in the central server if possible The EE’s disseminated are (almost) ready-to-run Option for File Storage in Compute Nodes “Full-copy to RAM” – files stored entirely in physical memory “Full-copy to HD” – files stored in hard disk “Copy-if-needed” – files stored in HD; only new files are copied EE Dissemination Service Service is offered through a DHCP server, a TFTP server and an NFS server When a client machine boots up, it obtains its IP address and the kernel from the DCHP and TFTP servers respectively Constructs the pre-defined EE by replicating writable files to local storage and mounting the read-only directories through the NFS

Example – Constructing a service- oriented grid point 1. Software installation at SLIM server 2. Client boots and obtains kernel 3. OS image/App disseminated4. Process to generate certificates

Performance evaluation Future Work To devise standard protocols for communicating EE specifications between the InstantGrid servers and compute nodes To optimize InstantGrid’s performance in WAN A 256-node cluster-based grid point can be constructed from scratch in three (copy-if-needed) to five (full-copy to hard disk) minutes Standalone grid points take longer time to construct. The bottleneck mainly lies on the process to generate host certificates Conducted in HKU CS’s Gideon Cluster (Pentium 4 x 300; fast ethernet; each node has 512MB ram, 40GB IDE hard disk) Two tests: (a) a cluster-based grid point, and (b) standalone grid points