1 Platform LSF6 What’s new in LSF6

Slides:



Advertisements
Similar presentations
The Moab Grid Suite CSS´ 06 – Bonn – July 28, 2006.
Advertisements

Complete Event Log Viewing, Monitoring and Management.
Windows IT Pro magazine Datacenter solution with lower infrastructure costs and OPEX savings from increased operational efficiencies. Datacenter.
Agreement-based Distributed Resource Management Alain Andrieux Karl Czajkowski.
SLA-Oriented Resource Provisioning for Cloud Computing
System Center 2012 R2 Overview
HP Quality Center Overview.
Cloud Computing to Satisfy Peak Capacity Needs Case Study.
Introduction to DBA.
Windows HPC Server 2008 Presented by Frank Chism Windows and Condor: Co-Existence and Interoperation.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
6/2/20071 Grid Computing Sun Grid Engine (SGE) Manoj Katwal.
Microsoft Virtual Server 2005 Product Overview Mikael Nyström – TrueSec AB MVP Windows Server – Setup/Deployment Mikael Nyström – TrueSec AB MVP Windows.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Workload Management Massimo Sgaravatto INFN Padova.
Xuan Guo Chapter 1 What is UNIX? Graham Glass and King Ables, UNIX for Programmers and Users, Third Edition, Pearson Prentice Hall, 2003 Original Notes.
Condor Overview Bill Hoagland. Condor Workload management system for compute-intensive jobs Harnesses collection of dedicated or non-dedicated hardware.
Kaspersky Open Space Security: Release 2 World-class security solution for your business.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting February 24-25, 2003.
Windows ® Powered NAS. Agenda Windows Powered NAS Windows Powered NAS Key Technologies in Windows Powered NAS Key Technologies in Windows Powered NAS.
Bologna Aprile Atempo Product Suite Atempo Time Navigator™ Secure, highly scalable protection of heterogeneous data in complex, mission-critical.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Copyright © 2008 Altair Engineering, Inc. All rights reserved. PBS GridWorks - Efficient Application Scheduling in Distributed Environments Dr. Jochen.
Extreme Networks Confidential and Proprietary. © 2010 Extreme Networks Inc. All rights reserved.
1 Autonomic Computing An Introduction Guenter Kickinger.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
Microsoft Active Directory(AD) A presentation by Robert, Jasmine, Val and Scott IMT546 December 11, 2004.
1 Integrated Workload Management for Beowulf Clusters Bill DeSalvo – April 14, 2004
ARGENT SOFTWARE Product Presentation ARGENT. ARGENT SOFTWARE Argent – Company Overview Argent Software is one of the world's leading systems management.
Computing and the Web Operating Systems. Overview n What is an Operating System n Booting the Computer n User Interfaces n Files and File Management n.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Business Plug-In B17 Organizational Architecture Trends.
©2002 Allen Systems Group, Inc. All Rights Reserved. by Scott Webb, ASG Senior Sales Engineer by Scott Webb, ASG Senior Sales Engineer ASG-sys*ADMIRAL.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
How to create DNS rule that allow internal network clients DNS access Right click on Firewall Policy ->New- >Access Rule Right click on Firewall.
AlphaServer UNIX Resource Consolidation.
When the Grid Comes to Town Chris Smith, Senior Product Architect Platform Computing
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Chapter 5 McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
1 Integrated Workload Management for Beowulf Clusters Bill DeSalvo – August 18, 2004
Server Performance, Scaling, Reliability and Configuration Norman White.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
1 Alexandru V Staicu 1, Jacek R. Radzikowski 1 Kris Gaj 1, Nikitas Alexandridis 2, Tarek El-Ghazawi 2 1 George Mason University 2 George Washington University.
Internet2 AdvCollab Apps 1 Access Grid Vision To create virtual spaces where distributed people can work together. Challenges:
Capacity and Capability Computing using Legion Anand Natrajan ( ) The Legion Project, University of Virginia (
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Hosting Websites and Web Applications with Microsoft ® SQL Server ® 2008.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Microsoft Azure and ServiceNow: Extending IT Best Practices to the Microsoft Cloud to Give Enterprises Total Control of Their Infrastructure MICROSOFT.
CSF. © Platform Computing Inc CSF – Community Scheduler Framework Not a Platform product Contributed enhancement to The Globus Toolkit Standards.
HUAWEI TECHNOLOGIES CO., LTD. Huawei FusionSphere Key Messages – Virtualization, Cloud Data Center, and NFVI.
Open Source and Business Issues © 2004 Northrop Grumman Corp. All rights reserved. 1 Grid and Beowulf : A Look into the Near Future NorthNorth F-18C Weapons.
Peter Idoine Managing Director Oracle New Zealand Limited.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
I/Watch™ Weekly Sales Conference Call Presentation (See next slide for dial-in details) Andrew May Technical Product Manager Dax French Product Specialist.
Workload Management Workpackage
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
CIM Modeling for E&U - (Short Version)
Introduction to Operating System (OS)
Management of Virtual Execution Environments 3 June 2008
Windows Azure 講師: 李智樺, Ruddy Lee
Example of usage in Micron Italy (MIT)
Time Gathering Systems Secure Data Collection for IBM System i Server
STATEL an easy way to transfer data
Presentation transcript:

1 Platform LSF6 What’s new in LSF6

© Platform Computing Inc What is the Platform LSF Family of Products?

© Platform Computing Inc How It Works - Platform LSF Load Information Manager Host Workload Manager LSF Web Services Broker Web Application Job Submission API Plugin Schedulers Cluster Workload Manager Cluster Workload Manager Job Queue Intelligent Scheduler Fairshare Preemption Resource Reservation Advance Reservation License Scheduling SLA Scheduling Service Level Agreement MultiCluster Other Scheduling Modules

© Platform Computing Inc Key Features - Platform LSF High Performing, Open, Scalable Architecture Scalable Scheduler Architecture External executable support OGSI compliance Intelligent Scheduling Policies Fairshare (user & project-based) Policy-based preemption Goal oriented SLA scheduling Job Groups Advanced Self-Management Flexible, comprehensive resource definitions Job-level exception management Automatic job migration and requeue Master scheduler failover Heterogeneous Platform Support Extensive Application Support Comprehensive, Extensible and Standards-based Security

© Platform Computing Inc Key Features – Platform LSF Intelligent Scheduling Policies Advanced Self-Management Heterogeneous Platform Support Extensive Application Support Comprehensive, Extensible and Standards-based Security High Performing, Open, Scalable Architecture

© Platform Computing Inc Scalable Scheduler Architecture Modularized into manager and scheduler plug-ins Supports over 500,000 active jobs per cluster More than 2,000 multi-processor host per cluster - with multiple processors in each host Process 5x more work Achieve 100% utilization Scale with your challenges Intelligent Scheduler Fairshare Preemption Resource Reservation Advance Reservation SLA Scheduling Service Level Agreement MultiCluster Other Scheduling Modules Plugin Schedulers License Scheduling

© Platform Computing Inc External Executable Support Collect information from multiple external resources to track site specific local and global resources Extends out-of-the-box capabilities to manage additional resources and customer application execution Differentiation Multiple vs single external resource collector

© Platform Computing Inc Job Groups Organize jobs into higher level work units - hierarchical tree Similar to the directory structure of a file system Easy to manage and control work Increases manageability by reducing complexity

© Platform Computing Inc OGSI Compliance CSF - “OGSI-compliant” & “Web Services enabled” Future-proof & protect grid investment using standards- based solutions Standardized approach to access Platform LSF Interoperate with third-party systems Differentiation First to comply

© Platform Computing Inc Key Features – Platform LSF Intelligent Scheduling Policies High Performing, Open, Scalable Architecture Advanced Self-Management Heterogeneous Platform Support Extensive Application Support Comprehensive, Extensible and Standards-based Security

© Platform Computing Inc Fairshare (User & Project-based) Ensure job resources are used for the right work Guarantees resource allocation among users and projects are met Co-ordinate access to the right number of resources for different users and projects according to pre-defined shares Differentiation Hierarchal & guaranteed Intelligent Scheduler Fairshare Preemption Resource Reservation Advance Reservation SLA Scheduling Service Level Agreement MultiCluster Other Scheduling Modules Plugin Schedulers License Scheduling

© Platform Computing Inc Policy-based Preemption Maximizes throughput of high priority critical work based on priority and load conditions Prevents starvation of lower priority work Differentiation Platform LSF supports multiple preemption policies Intelligent Scheduler Fairshare Preemption Resource Reservation Advance Reservation License Scheduling SLA Scheduling Service Level Agreement MultiCluster Other Scheduling Modules Plugin Schedulers

© Platform Computing Inc Goal-oriented SLA driven policies Based on customer SLA driven goals: Deadline, Velocity, Throughput Guarantees projects are completed on time Reduces projects and administration costs Provides visibility into the progress of projects Allows the admin focus on “What work and When” needs to be done, not “how” the resources are to be allocated

© Platform Computing Inc Key Features – Platform LSF Advanced Self-Management High Performing, Open, Scalable Architecture Intelligent Scheduling Policies Heterogeneous Platform Support Extensive Application Support Comprehensive, Extensible and Standards-based Security

© Platform Computing Inc Flexible, Comprehensive Resource Definitions Resources defined on a node basis across an entire cluster or subset of the nodes in a cluster Auto-detectable or user defined resources Adaptive membership – nodes join and leave Platform LSF clusters dynamically and automatically without administration effort Dynamic or static resources Heterogeneous support Enables dynamic scheduling

© Platform Computing Inc Job Level Exception Management Exception-based error detection to take automatic, configurable, corrective actions Increased job reliability & predictability Improved visibility on job and system errors Reduced administration overhead and costs

© Platform Computing Inc Automatic Job Migration and Requeue Automatically migrate and requeue jobs based on policies in the event of host or network failures Reduce user and administrator overhead in managing failures Reduce risk of running critical workloads

© Platform Computing Inc Master Scheduler Failover Automatically fail over to another host if the master host is unavailable Continuous scheduling service and execution of jobs Eliminate manual intervention

© Platform Computing Inc Key Features – Platform LSF High Performing, Open, Scalable Architecture Intelligent Scheduling Policies Advanced Self-Management Extensive Application Support Comprehensive, Extensible and Standards-based Security Heterogeneous Platform Support

© Platform Computing Inc Heterogeneous Platform Support UNIX Compaq - Alpha Tru64 IBM – AIX HP – HP-UX SGI – IRIX Sun – Solaris Linux Debian Caldera RedHat SuSE TurboLinux IA32/64 & AMD64 Windows 98, 2000, NT, XP Other NEC, Mac OS, Cray Mainframe Linux on IBM zSeries DCE, AFS, DFS, environments

© Platform Computing Inc Key Features – Platform LSF High Performing, Open, Scalable Architecture Intelligent Scheduling Policies Advanced Self-Management Heterogeneous Platform Support Comprehensive, Extensible and Standards-based Security Extensive Application Support

© Platform Computing Inc Extensive Application Support Electronics Industrial Manufacturing Life Sciences

© Platform Computing Inc Other Application integration initiative Offer to the market a competent Grid vision Grid Computing solutions showroom Allow customers to “test drive” the Grid Computing and Itanium2 from their own desk

© Platform Computing Inc Applications are the key to Grid success!!! We are involved in multiple convergent efforts in Research and Industry: EGEE/LCG/GENIUS Life Sciences, Chemistry, Rendering, Earth Observation, etc. GridAge Engineering EnginFrame integrations Engineering, Oil&Gas, Electronics, Telecom, etc. LSF integrations Electronics, Engineering, Oil&Gas, Finance, etc. What about working together on a joint, pragmatic, research- and industry-proven, common (and open?) standard?

© Platform Computing Inc What for? Standardized user interface  Relocable between GENIUS and EnginFrame (inherits layout, AAA, user mapping, etc.)  GridML to allow generic job & resource monitoring Standardized scripting kit  Relocable submission to LSF, LCG, GLOBUS, etc. (needs work)  Relocable job control via Grid plug-ins in EnginFrame Standardized application packaging?  Is the LCG work reusable?

© Platform Computing Inc Key Features – Platform LSF Comprehensive, Extensible and Standards-based Security High Performing, Open, Scalable Architecture Intelligent Scheduling Policies Advanced Self-Management Heterogeneous Platform Support Extensive Application Support

© Platform Computing Inc Additional New Features in Platform LSF V6.0 Administrator action messages Scheduler dynamic debug Administration & Diagnostics Resource allocation limit display Non-normalized job run limit Job Limit Enhancements Job starvation prevention plug-in Queue priority-based user Fairshare Queue-based Fairshare Scheduling Additional Features

© Platform Computing Inc

Q & A