Open Source Cluster Applications Resources. Overview What is O.S.C.A.R.? History Installation Operation Spin-offs Conclusions.

Slides:



Advertisements
Similar presentations
Chapter 20 Oracle Secure Backup.
Advertisements

Windows Deployment Services WDS for Large Scale Enterprises and Small IT Shops Presented By: Ryan Drown Systems Administrator for Krannert.
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
Leveraging WinPE and Linux Preboot for Effective Provisioning Jonathan Richey | Director of Development | Altiris, Inc.
Linux+ Guide to Linux Certification, Second Edition Chapter 3 Linux Installation and Usage.
14.1 © 2004 Pearson Education, Inc. Exam Planning, Implementing, and Maintaining a Microsoft Windows Server 2003 Active Directory Infrastructure.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 11 Managing and Monitoring a Windows Server 2008 Network.
Installing software on personal computer
Cluster Computing - GCB1 Cluster Computing Javier Delgado Grid-Enabledment of Scientific Applications Professor S. Masoud Sadjadi.
© 2010 VMware Inc. All rights reserved VMware ESX and ESXi Module 3.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
11 MAINTAINING THE OPERATING SYSTEM Chapter 5. Chapter 5: MAINTAINING THE OPERATING SYSTEM2 CHAPTER OVERVIEW Understand the difference between service.
11 MAINTAINING THE OPERATING SYSTEM Chapter 5. Chapter 5: MAINTAINING THE OPERATING SYSTEM2 CHAPTER OVERVIEW  Understand the difference between service.
VMWare Workstation Installation. Starting Vmware Workstation Go to the start menu and start the VMware Workstation program. *Note: The following instructions.
DB2 (Express C Edition) Installation and Using a Database
Linux Operations and Administration
Operating Systems Operating System
Computer Concepts 2013 Chapter 4 Operating Systems and File Management.
SSI-OSCAR A Single System Image for OSCAR Clusters Geoffroy Vallée INRIA – PARIS project team COSET-1 June 26th, 2004.
Tutorial 11 Installing, Updating, and Configuring Software
SUSE Linux Enterprise Server Administration (Course 3037) Chapter 4 Manage Software for SUSE Linux Enterprise Server.
By: Paul Hill Technology Coordinator Gwinn Area Community Schools.
VMWare Workstation Installation. Starting Vmware Workstation Go to the start menu and start the VMware Workstation program. *Note: The following instructions.
High Performance Computing Cluster OSCAR Team Member Jin Wei, Pengfei Xuan CPSC 424/624 Project ( 2011 Spring ) Instructor Dr. Grossman.
Weekly Report By: Devin Trejo Week of May 30, > June 5, 2015.
Bright Cluster Manager Advanced cluster management made easy Dr Matthijs van Leeuwen CEO Bright Computing Mark Corcoran Director of Sales Bright Computing.
Honeypot and Intrusion Detection System
Presented by Open Source Cluster Application Resources (OSCAR) Stephen L. Scott Thomas Naughton Geoffroy Vallée Network and Cluster Computing Computer.
Oak Ridge National Laboratory — U.S. Department of Energy 1 The ORNL Cluster Computing Experience… Stephen L. Scott Oak Ridge National Laboratory Computer.
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Introduction to HP Availability Manager.
Chapter 13 Users, Groups Profiles and Policies. Learning Objectives Understand Windows XP Professional user accounts Understand the different types of.
Client – Server Application Can you create a client server application: The server will be running as a service: does not have a GUI The server will run.
O.S.C.A.R. Cluster Installation. O.S.C.A.R O.S.C.A.R. Open Source Cluster Application Resource Latest Version: 2.2 ( March, 2003 )
MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter Four Windows Server 2008 Remote Desktop Services,
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Open Source Evaluation - FileZilla Michael Nye ITEC 400 Assignment 14-1 Professor D’Andrea Franklin University April 10, 2008.
OS and System Software for Ultrascale Architectures – Panel Jeffrey Vetter Oak Ridge National Laboratory Presented to SOS8 13 April 2004 ack.
Kickstart Installation
ABone Architecture and Operation ABCd — ABone Control Daemon Server for remote EE management On-demand EE initiation and termination Automatic EE restart.
VMWare Workstation Installation. Starting Vmware Workstation Go to the start menu and start the VMware Workstation program. *Note: The following instructions.
 Load balancing is the process of distributing a workload evenly throughout a group or cluster of computers to maximize throughput.  This means that.
Microsoft Management Seminar Series SMS 2003 Change Management.
Cluster Software Overview
How to for compiling and running MPI Programs. Prepared by Kiriti Venkat.
Module 10: Windows Firewall and Caching Fundamentals.
Linux Operations and Administration
VMWare Workstation Installation. Starting Vmware Workstation Go to the start menu and start the VMware Workstation program. *Note: The following instructions.
Process Manager Specification Rusty Lusk 1/15/04.
2: Operating Systems Networking for Home & Small Business.
LINUX Presented By Parvathy Subramanian. April 23, 2008LINUX, By Parvathy Subramanian2 Agenda ► Introduction ► Standard design for security systems ►
Basic Setup of OpenMeetings Version Three Styles for Setup Style1 : Starting from Scratch – Starting from svn check out and compiling with ant.
C Copyright © 2006, Oracle. All rights reserved. Oracle Secure Backup Additional Installation Topics.
Scientific Linux Inventory Project (SLIP) Troy Dawson Connie Sieh.
OSCAR Symposium – Quebec City, Canada – June 2008 Proposal for Modifications to the OSCAR Architecture to Address Challenges in Distributed System Management.
BY: SALMAN 1.
Andrea Chierici Virtualization tutorial Catania 1-3 dicember 2010
Architecture Review 10/11/2004
Fundamental of Databases
VMware ESX and ESXi Module 3.
Installation The Intercompany Integration Solution for SAP Business One Version 2.0 for SAP Business One 9.1 Welcome to the course on the installation.
Computing Clusters, Grids and Clouds Globus data service
BY: SALMAN.
GWE Core Grid Wizard Enterprise (
Computing Experience…
Deploying and Configuring SSIS Packages
Installation The Intercompany Integration Solution for SAP Business One Version 2.0 for SAP Business One 9.1 Welcome to the course on the installation.
CCR Advanced Seminar: Running CPLEX Computations on the ISE Cluster
Presentation transcript:

Open Source Cluster Applications Resources

Overview What is O.S.C.A.R.? History Installation Operation Spin-offs Conclusions

History CCDK (Community Cluster Development Kit) OCG (Open Cluster Group) OSCAR (the Open Source Cluster Application Resource) IBM, Dell, SGI and Intel working closely together ORNL – Oak Ridge National Laboratory

First Meeting Tim Mattson and Stephen Scott Decided on these: That the adoption of clusters for mainstream, high-performance computing is inhibited by a lack of well-accepted software stacks that are robust and easy to use by the general user. That the group embraces the open-source model of software distribution. Anything contributed to the group must be freely distributable, preferably as source code under the Berkeley open-source license. That the group can accomplish its goals by propagating best-known practices built up through many years of hard work by cluster computing pioneers.

Initial Thoughts Differing architectures (small, medium, large) Two paths of progress, R&D and ease of use Primarily for non-computer-savvy users. Scientists Academics Homogeneous system

Timeline Initial meeting in 2000 Beta development started the same year First distribution, OSCAR 1.0 in 2001 at LinuxWorld Expo in New York City Today up to OSCAR 5.1 Heterogeneous system Far more robust More user friendly

Supported Distributions – 5.0 Distribution and ReleaseArchitectureStatus Red Hat Enterprise Linux 4x86Fully supported Red Hat Enterprise Linux 4x86_64Fully supported Red Hat Enterprise Linux 4ia64Fully supported Fedora Core 4x86Fully supported Fedora Core 4x86_64Fully supported Fedora Core 5x86Fully supported Fedora Core 5x86_64Fully supported Mandriva Linux 2006x86Fully supported SUSE Linux 10.0x86Fully supported

Installation Detailed Installation notes Detailed User guide Basic idea: Configure head node (server) Configure image for client nodes Configure network Distribute node images Manage your own cluster!!

Head Node Install by running./install_cluster eth1 script GUI will auto-launch Chose desired step in GUI, make sure each step is complete before proceeding onto next one All the configuration can be done from this system from now on

Download Subversion is used Default is the OSCAR SVN Can set up custom SVN Allows for up to date installation Allows for controlled rollouts of multiple clusters OPD also has powerful command line functionality (LWP for proxy servers)

Select & Configure OSCAR packages Customize server up to your liking/needs Some packages can be customized This step is very crucial, choice of packages can affect performance as well as compatibility

Installation of Server Node Simply installs packages which were selected Automatically configures the server node Now the Head or Server is ready to manage, administer and schedule jobs for it’s client nodes

Build Client Image Choose name Specify packages within the package file Specify distribution Be wary of automatic reboot if network boot is manually selected as default

Building the Client Image …

Define Clients This step creates the network structure of the nodes It’s advisable to assign IP based on physical links GUI short-comings regarding multiple IP spans Incorrect setup can lead to an error during node installation

Define Clients

Setup Networking SIS – System Installation Suite SystemImager MAC addresses are scanned for Must link a MAC to a node Must select network boot method (rsync, multicast, bt) Must make sure clients support PXE boot or create boot CDs Own Kernel can be used if the one supplied with SIS does not work

Client Installation and Test After the network is properly configured, installation can begin All nodes are installed and rebooted Once the system imaging is complete, a test can be run to ensure the cluster is working properly At this point, the cluster is ready to begin parallel job scheduling

Operation Admin packages are: Torque Resource Manager Maui Scheduler C3 pfilter System Imager Suite Switcher Environment Manager OPIUM Ganglia

Operation Library packages: LAM/MPI OpenMPI MPICH PVM

Torque Resource Manager Server on Head node “mom” daemon on clients Handles job submission and execution Keeps track of cluster resources Has own scheduler but uses Maui by default Commands are not intuitive, documentation must be read From OpenPBS dministration_guide:ch4.1.1_torque_overview dministration_guide:ch4.1.1_torque_overview

Maui Scheduler Handles job scheduling Sophisticated algorithms Customizable Much literature on it’s algorithms Has a commercial gen. of Maui called Moab Accepted as the unofficial HPC standard for scheduling cumentation.php cumentation.php

C3 - Cluster Command Control Developed by ORNL Collection of tools for cluster administration Commands: cget, cpush, crm, cpushimage cexec, cexecs, ckill, cshutdown cnum, cname, clist Cluster Configuration Files dministration_guide:ch4.3.1_c3_overview dministration_guide:ch4.3.1_c3_overview

pfilter Cluster traffic filter Default is that client nodes can only send outgoing communications, outside the scope of the cluster If it is desirable to open up client nodes, pfilter config file must be modified

System Imager Suite Tool for network Linux installations Image based, can even chroot into image Also has database which contains cluster configuration information Tied in with C3 Can handle multiple images per cluster Completely automated once image is created

Switcher Environment Manager Handles “dot” files Does not limit advanced users Designed to help non-savvy users Has guards in place that prevent system destruction Which MPI to use – per user basis Operates on two levels: user and system Modules package is included for advanced users (and used by switcher)

OPIUM Login is handled by the Head node Once connection is established, client nodes do not require authentication Synchronization run by root, at intervals It stores hash values of the password in.shh folder along with a “salt” Password changes must be done at the Head node as all changes propagate from there

Ganglia Distributed Monitoring System Low overhead per node XML for data representation Robust Used in most cluster and grid solutions

LAM/MPI LAM - Local Area Multicomputer LAM initializes the runtime environment on a select number of nodes MPI 1 and some of MPI 2 MPICH2 can be used if installed Two tiered debugging system exists: snapshot and communication log Daemon based

Open MPI Replacement for LAM/MPI Same team working on it LAM/MPI relegated to upkeep only, all new development in Open MPI Much more robust (OS, schedulers) Full MPI-2 compliance Much higher performance

PVM – Parallel Virtual Machine Same as LAM/MPI Can be run outside of the scope of Torque and Maui Supports Windows nodes as well Much better portability Not as robust and powerful as Open MPI

Spin-offs HA-OSCAR - VMware with OSCAR SSI-OSCAR - SSS-OSCAR -

Conclusions Future Direction Open MPI Windows, Mac OS?