CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei

Presentation transcript:

CSF4 Meta-Scheduler
  Name: Zhaohui Ding, Xiaohui Wei
  College of Computer Science & Technology, Jilin University

2 Agenda
  Meta-Scheduler & CSF4 Introduction
  CSF4 Architecture
  CSF4 Functionalities
  Future Work
  Demo and Practice

3 What is a Meta-Scheduler
  Resource Allocation & Management
    Heterogeneous
    Distributed
    Dynamic
  Local Scheduler VS Meta-Scheduler

4 Local Scheduler VS Meta-Scheduler
  Aspect | Local Scheduler | Meta-Scheduler
  Administrative scope | Cluster, single domain | Grid, multiple domains, virtual organizations
  Hardware & software (OS) | Homogeneous | Heterogeneous, OS-independent
  Data management | LAN file system (NFS, FTP, scp) | Global file system (GridFTP, Gfarm)
  Certificates management | Not supported | Supported
  Resource management protocol | Specified, private protocols for different local schedulers | Standard, open, general-purpose protocols (GRAM)
  Scheduling mode | Centralized | Centralized / Distributed

5 Meta-Scheduler VS Local Scheduler
  Local Scheduler
    LSF (Load Sharing Facility)
    PBS (Portable Batch System)
    SGE (Sun Grid Engine)
    Condor
    IBM LoadLeveler
  Meta-Scheduler
    CSF
    Maui (Silver)
    GridWay
    Nimrod-G
    Condor-G

6 What is CSF4
  What is the CSF Meta-Scheduler
    Full name: Community Scheduler Framework
    CSF4 contains a group of grid services hosted in GT4
    CSF4 is a fully WSRF-compliant meta-scheduler
    Open source project and can be accessed at
    Developed by Jilin University, China
    CSF4 has been added to Globus Toolkit 4 as an Execution Component

7 CSF4 in Globus Toolkit 4

8 Meta-Scheduler VS Local Scheduler A typical deployment for Meta-Scheduler and Local Schedulers

9 What Can CSF4 Do?
  Basic Functionalities
    Submit jobs to the Grid without specifying a cluster
    Monitor and control jobs
    Provide a queuing service
    Schedule jobs and resources by custom-built policies
    CSF4 Portlet (a Web browser based user interface)
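
Submitting a job without naming a cluster means the meta-scheduler chooses one on the user's behalf. The Python sketch below illustrates that idea only. The `Cluster` record, its fields, and the "most free slots" rule are assumptions made for illustration, not CSF4's data model or its actual policy.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Cluster:
    """Hypothetical view of one local cluster as seen by a meta-scheduler."""
    name: str
    local_scheduler: str  # e.g. "LSF", "PBS", "SGE", "Condor"
    free_slots: int


def pick_cluster(clusters: Sequence[Cluster], slots_needed: int) -> Optional[Cluster]:
    """Return the cluster with the most free slots that can fit the job.

    Assumed selection rule, for illustration only. Returns None when no
    cluster has enough free slots, in which case the job would stay queued.
    """
    candidates = [c for c in clusters if c.free_slots >= slots_needed]
    if not candidates:
        return None
    return max(candidates, key=lambda c: c.free_slots)


# Made-up cluster names and slot counts.
clusters = [
    Cluster("cluster-a", "SGE", free_slots=4),
    Cluster("cluster-b", "PBS", free_slots=16),
    Cluster("cluster-c", "LSF", free_slots=0),
]
chosen = pick_cluster(clusters, slots_needed=8)
print(chosen.name if chosen else "job stays queued")
```

The user only states what the job needs; which local scheduler ends up running it is decided by the selection function.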

10 What Can CSF4 Do? (cont.)
  Advanced Functionalities
    Multiple-domain resource information sharing
    Automatic user credential delegation
    Automatic data staging
    Extensible scheduling policies
    Support for grid parallel jobs (MPICH-G2)

11 CSF4 – Architecture

12 CSF4 – Architecture User view

13 CSF4 – Architecture (cont.)
  Resource view
  NOTE: One RM GRAM is for one cluster only
  Local clusters

14 CSF4 – Functionalities
  Local schedulers supported
    LSF
    PBS
    SGE
    Condor
  Infrastructure supported
    Globus Toolkit 4
    Globus Toolkit 2

15 CSF4 – Functionalities
  Scheduling Plug-in Framework
    Designed for the Queuing Service
    Provides a set of policies
    Customizable
    Extensible

16 Existing Scheduling Policies
  FCFS (First Come First Served), round-robin
    Default policy
  Throttle
    Restricts the number of jobs in a scheduling cycle
  Array Job Plug-in
    Designed for life science applications (such as AutoDock, BLAST)
  MPICH-G2 Plug-in
    By using VJM, the plug-in guarantees that the synchronized resource allocation succeeds
  Data-intensive applications plug-in (under development)
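
The first two policies can be pictured together as one scheduling cycle. In the sketch below, jobs leave the queue in arrival order, clusters are handed out in rotation, and a throttle caps how many jobs are dispatched per cycle. The function name, the string job and cluster identifiers, and the choice to restart the rotation on every cycle are assumptions for illustration, not CSF4's implementation.

```python
from collections import deque
from itertools import cycle
from typing import Deque, List, Sequence, Tuple


def schedule_cycle(
    pending: Deque[str],
    clusters: Sequence[str],
    throttle: int,
) -> List[Tuple[str, str]]:
    """Run one scheduling cycle and return (job, cluster) assignments.

    Jobs leave the queue in arrival order (FCFS), clusters are handed out in
    rotation (round-robin), and at most `throttle` jobs are dispatched.
    Jobs beyond the limit stay in `pending` for the next cycle.
    """
    if throttle < 0:
        raise ValueError("throttle must be non-negative")
    if not clusters:
        return []
    assignments: List[Tuple[str, str]] = []
    rotation = cycle(clusters)  # simplification: rotation restarts each cycle
    while pending and len(assignments) < throttle:
        assignments.append((pending.popleft(), next(rotation)))
    return assignments


# Made-up job and cluster names.
queue = deque(["job1", "job2", "job3", "job4", "job5"])
for job, cluster in schedule_cycle(queue, ["cluster-a", "cluster-b"], throttle=3):
    print(job, "->", cluster)
```

Calling `schedule_cycle` again on the same queue dispatches the jobs that were held back by the throttle.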

17 Scheduling plug-ins & scheduling policies
  Each policy is implemented inside a scheduling plug-in module
  A queue can load multiple plug-in modules
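
One way to picture a queue that loads several plug-in modules is as a chain of functions, each applying one policy to the candidate job list. The sketch below assumes that model. The `Plugin` signature, the `Queue` class, and the job dictionaries are hypothetical and do not reflect CSF4's real plug-in interface.

```python
from typing import Callable, Dict, Iterable, List

Job = Dict[str, object]
# Hypothetical plug-in interface: take the candidate jobs, return the jobs
# (possibly reordered or shortened) that survive this policy.
Plugin = Callable[[List[Job]], List[Job]]


def fcfs_plugin(jobs: List[Job]) -> List[Job]:
    """Order jobs by submission time, earliest first."""
    return sorted(jobs, key=lambda job: job["submitted_at"])


def make_throttle_plugin(limit: int) -> Plugin:
    """Build a plug-in that keeps at most `limit` jobs per cycle."""
    def throttle_plugin(jobs: List[Job]) -> List[Job]:
        return jobs[:limit]
    return throttle_plugin


class Queue:
    """A queue that applies its loaded plug-ins in order."""

    def __init__(self, name: str, plugins: Iterable[Plugin]) -> None:
        self.name = name
        self.plugins = list(plugins)

    def select(self, jobs: List[Job]) -> List[Job]:
        for plugin in self.plugins:
            jobs = plugin(jobs)
        return jobs


# Made-up jobs with made-up submission times.
jobs = [
    {"id": "a", "submitted_at": 3},
    {"id": "b", "submitted_at": 1},
    {"id": "c", "submitted_at": 2},
]
queue = Queue("default", [fcfs_plugin, make_throttle_plugin(2)])
print([job["id"] for job in queue.select(jobs)])
```

Adding a policy then means writing one more function and listing it when the queue is created, which is the kind of extensibility a plug-in framework aims for.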

18 Array Job
  AutoDock and BLAST-like applications
    A large number of sub-jobs
    Execute the same binary
    Different input/output files

19 Array Job (cont.)
  Advantages
    Submit the job only once
    Save submission time and memory storage
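
The array-job idea (one submission, many sub-jobs, same binary, different files) can be sketched as a single description expanded into sub-jobs. The `SubJob` record, the `{index}` placeholder convention, and the file names below are assumptions for illustration, not CSF4's array-job syntax.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class SubJob:
    """Hypothetical record for one element of an array job."""
    index: int
    executable: str
    input_file: str
    output_file: str


def expand_array_job(
    executable: str,
    input_pattern: str,
    output_pattern: str,
    count: int,
) -> List[SubJob]:
    """Expand one array-job description into `count` sub-jobs.

    Every sub-job runs the same executable; only the input and output file
    names differ, derived by filling `{index}` in the two patterns.
    """
    if count < 1:
        raise ValueError("count must be at least 1")
    return [
        SubJob(
            index=i,
            executable=executable,
            input_file=input_pattern.format(index=i),
            output_file=output_pattern.format(index=i),
        )
        for i in range(1, count + 1)
    ]


# Made-up executable and file names.
for sub in expand_array_job("my_app", "input_{index}.dat", "output_{index}.dat", 3):
    print(sub.index, sub.executable, sub.input_file, sub.output_file)
```

The user describes the job once; the expansion into sub-jobs happens on the scheduler side, which is where the saving in submission time comes from.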

20 Resource Information Sharing
  An MDS information provider for CSF4
  Multiple CSF4 instances can share resource information
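
Sharing resource information between several CSF4 instances implies combining several views of the same clusters. The sketch below shows one plausible merge rule, keeping the most recently updated record per cluster. The `ResourceRecord` fields and the newest-wins rule are assumptions for illustration and do not describe the MDS information provider's actual behaviour.

```python
from typing import Dict, Iterable, NamedTuple


class ResourceRecord(NamedTuple):
    """Hypothetical resource record published by one meta-scheduler instance."""
    cluster: str
    free_slots: int
    updated_at: float  # seconds since the epoch


def merge_views(views: Iterable[Iterable[ResourceRecord]]) -> Dict[str, ResourceRecord]:
    """Merge several resource views, keeping the newest record per cluster."""
    merged: Dict[str, ResourceRecord] = {}
    for view in views:
        for record in view:
            current = merged.get(record.cluster)
            if current is None or record.updated_at > current.updated_at:
                merged[record.cluster] = record
    return merged


# Made-up views from two instances that both know about cluster-b.
view_one = [ResourceRecord("cluster-a", 4, 100.0), ResourceRecord("cluster-b", 8, 100.0)]
view_two = [ResourceRecord("cluster-b", 2, 160.0), ResourceRecord("cluster-c", 16, 150.0)]
for name, record in sorted(merge_views([view_one, view_two]).items()):
    print(name, record.free_slots)
```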

21 CSF4 – Functionalities (cont.)
  Deploy multiple CSF4 instances in a Grid community

22 Automatic user credential delegation
  GT4 Delegation Service
  Full/Limited Delegation
  User Credential Delegation

23 Automatic user credential delegation

24 Data Staging
  Manual Data Staging
    Which clusters can I use?
    Which clusters will my jobs run on?
    Where is the output data?
    When will the job finish, so that I can stage out the output data?

25 Automatic Data Staging Stage-in and Stage-out without Meta-Scheduler

26 Automatic Data Staging With CSF4 Automatic Data Staging
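
With automatic data staging, the questions a user has to answer by hand (which cluster, where the output is, when to fetch it) become steps the scheduler plans once it has chosen a cluster. The sketch below builds such a plan as an ordered list of steps. The step tuple, the `host:path` notation, and the `user-host` default are assumptions for illustration, not the transfer mechanism CSF4 uses.

```python
from typing import List, Sequence, Tuple

Step = Tuple[str, str, str]  # (action, source, destination)


def staging_plan(
    cluster: str,
    inputs: Sequence[str],
    outputs: Sequence[str],
    home: str = "user-host",
) -> List[Step]:
    """Return the ordered steps for one job on the chosen cluster.

    Inputs are staged in before the job runs and outputs are staged out
    afterwards, so the user never has to know which cluster was picked.
    """
    steps: List[Step] = [
        ("stage-in", f"{home}:{path}", f"{cluster}:{path}") for path in inputs
    ]
    steps.append(("run", cluster, cluster))
    steps.extend(
        ("stage-out", f"{cluster}:{path}", f"{home}:{path}") for path in outputs
    )
    return steps


# Made-up cluster and file names.
for step in staging_plan("cluster-b", ["input.dat"], ["result.out"]):
    print(step)
```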

27 CSF4 – User interface (cont.)
  Command line interface - Services available for grid users:
    JobService
      csf-job-create, csf-job-start, csf-job-submit, csf-job-status, csf-job-stop, csf-job-resume, csf-job-list, csf-resource-list, csf-mpi-run
    ReservationService
      csf-rsv-create, csf-rsv-status, csf-rsv-cancel, csf-job-submit, …
    QueuingService
      csf-queue-create, csf-queue-conf, csf-queue-data

28 CSF4 Portlet
  Web browser based interface
  Friendly interface

29 View Available Clusters

30 Submit a job

31 View Jobs history

32 View Job output

33 How to get CSF4?
  Homepage:
  Release version (recommended):
  Development version (CVS, the newest but not stable)
  Globus Toolkit 4 package:
    In the “contrib” directory

34 Future Work
  Plug-in framework improvement
  More scheduling policies
  Optimization for parallel jobs
  Optimization for data-intensive jobs
  Workflow support

35 Practices
  View credential info
  Query available resources
  Submit a simple job
  Submit an array job
  Query job history
  View job output

36 Thank you