CSS434 Grid Computing1 Textbook No Corresponding Chapters Professor: Munehiro Fukuda A portion of these slides were compiled from The Grid: Blueprint for.

Slides:



Advertisements
Similar presentations
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
Advertisements

Agreement-based Distributed Resource Management Alain Andrieux Karl Czajkowski.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Task Scheduling and Distribution System Saeed Mahameed, Hani Ayoub Electrical Engineering Department, Technion – Israel Institute of Technology
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.
Application-specific Tools Netsolve, Ninf, and NEOS CSE 225 Chas Wurster.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Legion Worldwide virtual computer. About Legion Made in University of Virginia Object-based metasystems software project middleware that connects computer.
Workload Management Massimo Sgaravatto INFN Padova.
Inter-cluster Job Deployment by AgentTeamwork Sentinel Agents Emory Horvath CSS497 Spring 2006 Advisor: Dr. Munehiro Fukuda.
Grids and Globus at BNL Presented by John Scott Leita.
Introduction to Grid Computing Ann Chervenak Carl Kesselman And the members of the Globus Team.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
The Globus Toolkit: Description and Applications Review Steve Tuecke & Ian Foster Argonne National Laboratory The University of Chicago Globus Co-PI: Carl.
Distributed Systems Early Examples. Projects NOW – a Network Of Workstations University of California, Berkely Terminated about 1997 after demonstrating.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
Nimrod/G GRID Resource Broker and Computational Economy David Abramson, Rajkumar Buyya, Jon Giddy School of Computer Science and Software Engineering Monash.
DISTRIBUTED COMPUTING
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Service Architecture of Grid Faults Diagnosis Expert System Based on Web Service Wang Mingzan, Zhang ziye Northeastern University, Shenyang, China.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
CS 390- Unix Programming Environment CS 390 Unix Programming Environment Topics to be covered: Distributed Computing Fundamentals.
Job Submission Condor, Globus, Java CoG Kit Young Suk Moon.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
A performance evaluation approach openModeller: A Framework for species distribution Modelling.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
The Globus Project: A Status Report Ian Foster Carl Kesselman
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The Grid the united computing power Jian He Amit Karnik.
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Globus Toolkit Massimo Sgaravatto INFN Padova. Massimo Sgaravatto Introduction Grid Services: LHC regional centres need distributed computing Analyze.
8/25/2005IEEE PacRim The Design Concept and Initial Implementation of AgentTeamwork Grid Computing Middleware Munehiro Fukuda Computing & Software.
Economic and On Demand Brain Activity Analysis on Global Grids A case study.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Chapter 14 Advanced Architectural Styles. Objectives Describe the characteristics of a distributed system Explain how middleware supports distributed.
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
CSS497 Undergraduate Research Performance Comparison Among Agent Teamwork, Globus and Condor By Timothy Chuang Advisor: Professor Munehiro Fukuda.
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
MSF and MAGE: e-Science Middleware for BT Applications Sep 21, 2006 Jaeyoung Choi Soongsil University, Seoul Korea
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Workload Management Workpackage
TensorFlow– A system for large-scale machine learning
Clouds , Grids and Clusters
Duncan MacMichael & Galen Deal CSS 534 – Autumn 2016
OpenMosix, Open SSI, and LinuxPMI
Grid Computing.
The Client/Server Database Environment
University of Technology
CSS490 Grid Computing Textbook No Corresponding Chapter
Mobile Agents.
Basic Grid Projects – Condor (Part I)
Future EU Grid Projects
Wide Area Workload Management Work Package DATAGRID project
Resource and Service Management on the Grid
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

CSS434 Grid Computing1 Textbook No Corresponding Chapters Professor: Munehiro Fukuda A portion of these slides were compiled from The Grid: Blueprint for a New Computer Infrastructure.

CSS434 Grid Computing2 Network Infrastructure Users login their organizational systems first locally or remotely. If they are affiliated with other organizations, They can login from the system of their main use to some other systems. (They are given an opportunity to use those resources in parallel). Problems: They must orchestrate job execution among the resources they use. Should those resources be limited to such a handful number of researchers? High-speed Information high way

CSS434 Grid Computing3 Purposes of Computational Grid Use computing resource connected to high-speed information highway as if we use electric power grid Only 30% utilization in academic/commercial environments. Many applications have only episodic requirements. So, why don ’ t we share computation resource? Computational results and data should be also made available to all users. Users: Computational scientists and engineers Experimental scientists Association and corporations Training and education Consumers (E-commerce)

CSS434 Grid Computing4 Grid Applications CategoryExamplesCharacteristics Distributed supercomputing DIS and Stellar dynamics Very large problems needing lots of computing resource at a time High throughputChip design and parameter studies Harnessing many idle resources to increase aggregate throughput On demandMedical instrumentation Allocating special resource dynamically Data intensiveSky surveyUsing distributed data and needing high-volume data flows CollaborativeCollaborative design Education Support communication or collaborative work

CSS434 Grid Computing5 Grid Services Architecture from slide Applications Grid Services Layer InformationResource mgmt SecurityData accessFault detection... Grid Fabric Layer TransportMulticast InstrumentationControl interfacesQoS mechanisms... High-energy physics data analysis Regional climate studies Collaborative engineering Parameter studies On-line instrumentation Application Toolkit Layer Distributed computing Data- intensive Collab. design Remote viz Remote control

CSS434 Grid Computing6 Programming Model Uniform Access Paradigm Bag of task or master workers (Condor-MW) Client server (NetSolve) Object oriented (Legion) Synchronous applications (Not suited for massively parallel computation.) Language Support MPI-G – message passing (Globus) Open MP – shared memory Math Library – remote procedure (NetSolve)

CSS434 Grid Computing7 Resource Management Discovery, Allocation, and Scheduling Centralized resource manager +: easy to manage – : a bottleneck Decentralized resource manager A collection of centralized manager (Condor ’ s gate flocking) A combination of meta and local schedulers. SystemsResource descriptions Front-end process Resource manager Job launcher GlobusRSL: resource spec. language Broker and MDSGRAM CondorClassAd and DAGMan Schedd AgentMatchmaker and startd Sandbox (Starter) LegionIDL: interface def. language SchedulerCollectionEnactor

CSS434 Grid Computing8 Fault Tolerance Check-pointing At the master (Condor) At each node but collected at the master (Catalina) Use a whiteboard (Optimal Grid) Re-execution of fault worker jobs from the beginning (Bayanihan, Optimal Grid) Error code (NetSolve) User is responsible to handle errors.

CSS434 Grid Computing9 Security Resources covered with security layers Legion (Message/MayI layers) Entropia (Intercepting all system calls) A use of commodity tools SSL Public key Security Certificate Java sandbox Kerberos

CSS434 Grid Computing10 NetSolve RPC-based approach Clients Include a set of APIs called as (asynchronous) RPCs Agents Match client ’ s requests for services with servers Servers Encapsulates remotely accessed numerical libraries Agent Network of servers Client MPP servers Scalar server request choice reply

CSS434 Grid Computing11 Legion Legion classes Act as managers and make policy Core objects Provide mechanisms that classes use to implement policies: hosts (processors), vaults(memory), context, binding agents, etc. Per-Program Scheduling Participating sites can assure their local policies. User can choose a scheduling policy. Class Host Class tty Resources Resource database Scheduler Prog Converted Legion object ID By context objects Converted Logion object address By binding agents Enactor collection search request reserve

CSS434 Grid Computing12 Condor A: User ’ s local agent R: Each computer resource M: Central manager I/O forwarded to a user ’ s home

CSS434 Grid Computing13 AgentTeamwork at UWB Architecture

CSS434 Grid Computing14 Paper Review by Students Globus Legion Condor Netsolve Discussions What programming or execution model is each system based on? What resource allocation and scheduling algorithm does each system use? Are they fault-tolerant? Did they any special security features for their own?