Brain Campbell Bryce Carmichael Unquiea Wade Mentor: Dr. Eric Akers.


Abstract
The International Polar Year was designed to study and better understand the current state of climatic changes to the world's ice sheets. For the last few decades, automated weather stations and satellites in geosynchronous orbit have created data sets. Today, large amounts of this data remain unexplored due to insufficient funding and scarce resources. For this reason, the Polar Grid concept was proposed to delegate the analysis of the existing data sets. The goal of Elizabeth City State University's Polar Grid Team was to construct a model network to serve as a base for a supercomputing pool. The supercomputing pool will be constructed on the university's campus and linked to the overall Polar Grid system. Numerous software packages and protocols currently in use at other institutions around the nation were researched. From the possible options, the Condor software, created and developed at the University of Wisconsin, was chosen because of its ease of use and its capacity for expansion. An eighteen-node computing pool was constructed and tested in the second-floor lab of Dixon Hall using Condor. The pool comprised seventeen desktops running on a Windows NT platform, with the pool's master, a Linux-based server, housed in Lane Hall.

• The goal was to utilize all of our computers.
• Gain knowledge about supercomputing.
• Set up a pool of computers that can be accessed by Polar Grid.
• Familiarize team members with job submission and the overall operation of Condor.

What is Supercomputing?
Supercomputing is a term given to systems capable of processing at speeds much greater than commercially available CPUs. High-throughput computing is the term used to describe systems with intermediate processing abilities.

Distributed vs. Parallel
Distributed computing utilizes a network of many computers, each accomplishing a portion of an overall task, to achieve a computational result much more quickly than a single computer could. Distributed computing also allows many users to interact and connect openly.
Parallel processing is the simultaneous processing of the same task on two or more microprocessors in order to obtain faster results. The computing resources can be as small as a single computer with multiple processors.
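The idea of each worker accomplishing a portion of an overall task can be sketched in a few lines of Python. This is only an illustration under stated assumptions: threads on one machine stand in for the nodes of a pool, and the function names (`split_into_chunks`, `sum_of_squares`, `run_divided`) are hypothetical, not part of Condor or of the team's setup.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(items, n_chunks):
    """Split items into n_chunks nearly equal contiguous pieces."""
    size, extra = divmod(len(items), n_chunks)
    chunks, start = [], 0
    for i in range(n_chunks):
        # The first `extra` chunks take one additional item each.
        end = start + size + (1 if i < extra else 0)
        chunks.append(items[start:end])
        start = end
    return chunks


def sum_of_squares(chunk):
    """The portion of the overall task handled by one worker."""
    return sum(x * x for x in chunk)


def run_divided(items, n_workers=4):
    """Hand one chunk to each worker, then combine the partial results."""
    chunks = split_into_chunks(items, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(sum_of_squares, chunks))
    return sum(partials)


if __name__ == "__main__":
    print(run_divided(list(range(1, 101))))
```

In a real pool the chunks would be separate jobs sent to separate machines; the split, compute, and combine steps stay the same.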

Parallel processing allows closer communication between nodes, increasing efficiency. As the size of the network grows, communication takes up a greater share of CPU time. This overhead can be limited by using more than one type of protocol in a system.

Condor is a specialized workload management system for compute-intensive jobs. Like other full-featured batch systems, Condor provides a job queueing mechanism, scheduling policy, priority scheme, resource monitoring, and resource management.
Beowulf is a design for high-performance parallel computing clusters built on inexpensive personal computer hardware. A Beowulf cluster is a group of usually identical PCs running a Free and Open Source Software (FOSS) Unix-like operating system, such as BSD, Linux, or Solaris.
BOINC is a software platform for volunteer computing and desktop grid computing. BOINC is designed to support applications that have large computation requirements, storage requirements, or both.

History of Condor
• The Condor project was started in ... Condor was built from the results of the Remote Unix project and from continued research in the area of Distributed Resource Management (DRM).
• Condor was created at the University of Wisconsin-Madison (UW-Madison), and it was first installed as a production system in the UW-Madison Department of Computer Science.

• Versatility
• Capable of switching between distributed and parallel computing
• Supports multiple programming languages for simple execution of jobs
• Operates on multiple platforms

• Availability: open source software
• Easy expansion: any number of nodes can be added to an existing pool
• Cost efficiency: any CPU meeting the base requirements can be used efficiently

Windows
Condor for Windows requires Windows 2000 (or better) or Windows XP. 300 megabytes of free disk space is recommended. Significantly more disk space may be needed to run jobs with large data files. Condor for Windows will operate on either an NTFS or FAT file system. However, for security purposes, NTFS is preferred.
Unix
The size requirements for the downloads currently vary from about 20 Mbytes (statically linked HP Unix on a PA RISC) to more than 50 Mbytes (dynamically linked Irix on an SGI). In addition, a large amount of disk space is needed in the local directory of any machine that submits jobs to Condor.

Installation
The Condor software can be accessed through the project's main website. Condor can be downloaded for various platforms, such as Solaris, Linux/Unix, Windows, and Mac. Administrative and user manuals are also available on the website.

Installation: overseen through the Windows installation wizard
Changes to the defaults:
Pool master node: a Linux-based machine in Lane Hall. Having a Linux-based master will allow the eventual use of the full array of Condor options.
Read and write access: parameters changed to include 10.*.*.* to allow feedback and access from different nodes.
Due to the use of the CERSER labs during class hours, each node is required to be idle for 15 minutes before it is available to perform tasks. If a task is interrupted, it will be restarted on a different machine if the original node is not freed within ten minutes.
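The policy described above (hosts matching 10.*.*.* may read and write, a node must be idle for 15 minutes, and an interrupted task moves after ten minutes) can be modelled in Python as a rough sketch. This is not Condor configuration syntax and does not call any Condor API; the constant and function names are hypothetical and only mirror the numbers given on the slide.

```python
from fnmatch import fnmatch

ALLOWED_PATTERN = "10.*.*.*"   # read/write access pattern from the slide
IDLE_MINUTES_REQUIRED = 15     # idle time before a node accepts tasks
RESTART_AFTER_MINUTES = 10     # wait before moving an interrupted task


def host_allowed(ip_address, pattern=ALLOWED_PATTERN):
    """Return True if the address matches the wildcard access pattern.

    This is a loose text match on the dotted string, not a full
    validation of the IP address.
    """
    return fnmatch(ip_address, pattern)


def node_available(idle_minutes):
    """A lab machine may run tasks only after it has been idle long enough."""
    return idle_minutes >= IDLE_MINUTES_REQUIRED


def should_restart_elsewhere(minutes_interrupted):
    """Move the task if the original node has not been freed in time."""
    return minutes_interrupted >= RESTART_AFTER_MINUTES


if __name__ == "__main__":
    print(host_allowed("10.24.7.3"), host_allowed("192.168.1.5"))
    print(node_available(5), node_available(20))
    print(should_restart_elsewhere(3), should_restart_elsewhere(12))
```

In the actual pool these rules live in the Condor configuration files rather than in user code; the sketch only makes the thresholds explicit.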

Job Submission and Tracking
Jobs can be submitted in any executable file format through the condor/bin directory. Jobs are submitted from that directory using the command condor_submit filename. The status of the nodes within the system can be checked using the command condor_status.
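As a minimal sketch of the submission step, the Python helper below builds the text of a submit description file, which is the kind of file condor_submit expects as its filename argument. The helper name `build_submit_description`, the job name, and the executable name `hello.exe` are assumptions made for illustration; the keywords written to the file (universe, executable, output, error, log, queue) are standard submit commands.

```python
def build_submit_description(executable, job_name="job"):
    """Return the text of a simple submit description file.

    The file names derived from job_name are illustrative choices.
    """
    lines = [
        "universe   = vanilla",
        f"executable = {executable}",
        f"output     = {job_name}.out",   # standard output of the job
        f"error      = {job_name}.err",   # standard error of the job
        f"log        = {job_name}.log",   # event log kept for the job
        "queue",                          # place one copy of the job in the queue
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical executable name; the result would be saved to a file
    # such as hello.sub and passed to condor_submit.
    print(build_submit_description("hello.exe", job_name="hello"))
```

Saving the printed text as hello.sub and running condor_submit hello.sub from the condor/bin directory would then queue the job, assuming the executable exists on the submitting machine.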

Condor Status Menu
The condor_status command lists the current platform and availability of each node. Availability is indicated by the one-word state in the fourth column:
• Unclaimed: the node is available to run jobs but is not currently running one.
• Matched: the node is available and has been matched to a specific job, but has not yet been claimed.
• Claimed: the node has been claimed and is running a specified task.
• Owner: the node has a local user demanding its attention and is not available to run jobs.
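Reading the fourth column by hand becomes tedious as the pool grows, so a short script can tally the states instead. The sketch below is an illustration only: the sample text is invented, the machine names are hypothetical, and the real condor_status output contains header and summary lines whose exact layout is not reproduced here. Lines whose fourth field is not one of the four states listed above are skipped.

```python
from collections import Counter

KNOWN_STATES = {"Owner", "Unclaimed", "Matched", "Claimed"}

# Invented sample in the spirit of condor_status output (not real data).
SAMPLE_STATUS = """\
Name      OpSys    Arch   State      Activity
lab-pc01  WINNT51  INTEL  Unclaimed  Idle
lab-pc02  WINNT51  INTEL  Owner      Idle
lab-pc03  WINNT51  INTEL  Claimed    Busy
lab-pc04  WINNT51  INTEL  Unclaimed  Idle
"""


def count_states(status_text):
    """Count nodes per state, using the fourth whitespace-separated field."""
    counts = Counter()
    for line in status_text.splitlines():
        fields = line.split()
        if len(fields) >= 4 and fields[3] in KNOWN_STATES:
            counts[fields[3]] += 1
    return counts


if __name__ == "__main__":
    for state, number in sorted(count_states(SAMPLE_STATUS).items()):
        print(state, number)
```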

Job Submission and Tracking
After submission, a task can be traced through the pool using the condor_q command. The results of a task can be seen in the output files created by the executable or in the .log file that is created automatically for each task.

A Condor pool composed of 17 nodes running on the Windows NT platform has been established in the Dixon Hall laboratory, operating under a Linux-based master housed in the Lane Hall offices. To date, simple tasks written in C++ have been submitted and have run successfully through the pool. Diagnostic assessment showed two CPUs unconnected to the network and naming redundancies that hindered the installation of the Condor system.

• Installation of Condor was a success.
• Expansion of the cluster is easy and can be done efficiently with minimal cost in resources.
• Management and programming with Condor can be done at an undergraduate level and is encouraged.

• Familiarize more of the CERSER teams with the Condor software.
• Continue the expansion of the Condor pool.
• Link ECSU to the Polar Grid network.
• Encourage the development of programs to aid future CERSER research projects.
