Managing a growing campus pool Eric Sedore

Slides:



Advertisements
Similar presentations
1/17/20141 Leveraging Cloudbursting To Drive Down IT Costs Eric Burgener Senior Vice President, Product Marketing March 9, 2010.
Advertisements

Capacity Planning in a Virtual Environment
University of Notre Dame
Xrootd and clouds Doug Benjamin Duke University. Introduction Cloud computing is here to stay – likely more than just Hype (Gartner Research Hype Cycle.
Profit from the cloud TM Parallels Dynamic Infrastructure AndOpenStack.
SAM SPENCER Server Virtualization. Agenda Introduction History Server Virtualization Software Server Virtualization Hardware Determining Server Hardware.
Pankaj Kumar Qinglan Zhang Sagar Davasam Sowjanya Puligadda Wei Liu
Office of Technology Operations & Planning Unlocking the Power of Server Virtualization Rebecca Astin Office of Technology Operations and Planning National.
Duke Atlas Tier 3 Site Doug Benjamin (Duke University)
The Budget Crunch And Our Virtual Datacenter Presented By Joe Lanager.
Click to add text Introduction to the new mainframe: Large-Scale Commercial Computing © Copyright IBM Corp., All rights reserved. Chapter 3: Scalability.
Virtualization on the Intel Platform Scott Elliott Senior Systems Network Specialist Christie Digital A Customer Implementation with VMware and IBM.
Chapter 21: Mobile Virtualization Infrastracture and Related Security Issues Guide to Computer Network Security.
Building a Virtualized Desktop Grid Eric Sedore
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
SOFTWARE AS A SERVICE PLATFORM AS A SERVICE INFRASTRUCTURE AS A SERVICE.
1 Virtualization Services. 2 Cloud Hosting –Shared Virtual Servers –Dedicated Servers Managed Server Options Multiple Access Methods –EarthLink Business.
Tier-1 experience with provisioning virtualised worker nodes on demand Andrew Lahiff, Ian Collier STFC Rutherford Appleton Laboratory, Harwell Oxford,
VMware vSphere 4 Introduction. Agenda VMware vSphere Virtualization Technology vMotion Storage vMotion Snapshot High Availability DRS Resource Pools Monitoring.
Enterprise Storage Our Journey Thus Far John D. Halamka MD CIO, Harvard Medical School and Beth Israel Deaconess Medical Center.
CLOUD COMPUTING 2.0 Finally, the promise of the cloud has arrived v 1.8.
How to Resolve Bottlenecks and Optimize your Virtual Environment Chris Chesley, Sr. Systems Engineer
XenDesktop Built on FlexPod Flexible IT Infrastructure for Desktop Virtualization.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
PCGRID ‘08 Workshop, Miami, FL April 18, 2008 Preston Smith Implementing an Industrial-Strength Academic Cyberinfrastructure at Purdue University.
9/16/2000Ian Bird/JLAB1 Planning for JLAB Computational Resources Ian Bird.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
CDF data production models 1 Data production models for the CDF experiment S. Hou for the CDF data production team.
Ian Alderman A Little History…
©2013 – JouleX Enterprise Energy Management Quickly identify energy waste Reduce energy usage and costs Lower carbon emissions Agentless, network-based.
1 Moshe Shadmon ScaleDB Scaling MySQL in the Cloud.
Applications Requirements Working Group HENP Networking Meeting June 1-2, 2001 Participants Larry Price Steven Wallace (co-ch)
VMware View built on FlexPod Flexible IT Infrastructure for Desktop Virtualization.
HTCondor and Beyond: Research Computer at Syracuse University Eric Sedore ACIO – Information Technology and Services.
Taiwan APT OSM Sizing. THE SIZING ESTIMATES CONTAINED IN THIS DOCUMENT ARE BASED UPON THE ASSUMPTIONS OF PROPER APPLICATION CONFIGURATION AND TUNING,
Instruction Set Virtualization
Toward Green Data Center Computing Gregor von Laszewski Lizhe Wang.
Summer 2012 A Brief Review of Technology Upgrades at Mead Hall Technology Committee Meeting.
Factors affecting ANALY_MWT2 performance MWT2 team August 28, 2012.
The impacts of climate change on global hydrology and water resources Simon Gosling and Nigel Arnell, Walker Institute for Climate System Research, University.
IT Priorities Minimize CAPEX Maximize employee productivity Grow the business Add new compute resources real- time to support growth Meet compliance requirements.
Stairway to the cloud or can we take the highway? Taivo Liik.
Copyright © 2005 VMware, Inc. All rights reserved. How virtualization can enable your business Richard Allen, IBM Alliance, VMware
UKI-SouthGrid Overview and Oxford Status Report Pete Gronbech SouthGrid Technical Coordinator HEPSYSMAN – RAL 10 th June 2010.
Azure in a Day Training: Windows Azure Module 1: Windows Azure Overview Module 2: Development Environment / Portal – DEMO: Signing up for Windows Azure.
Atlas Software Structure Complicated system maintained at CERN – Framework for Monte Carlo and real data (Athena) MC data generation, simulation and reconstruction.
Windows Azure Overview for IT Pros Anton Boyko. Intro to Cloud Computing Intro to Windows Azure Cloud Services Web Sites Virtual Machines Workload Options.
Jaime Frey Computer Sciences Department University of Wisconsin-Madison Condor and Virtual Machines.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
Virtual Cluster Computing in IHEPCloud Haibo Li, Yaodong Cheng, Jingyan Shi, Tao Cui Computer Center, IHEP HEPIX Spring 2016.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Architecture of a platform for innovation and research Erik Deumens – University of Florida SC15 – Austin – Nov 17, 2015.
Condor on Dedicated Clusters Peter Couvares and Derek Wright Computer Sciences Department University of Wisconsin-Madison
VM Layout. Virtual Machine (Ubuntu Server) VM x.x You can putty into this machine from on campus. Or you can use vSphere to control the hardware.
NFV Group Report --Network Functions Virtualization LIU XU →
Dynamic Extension of the INFN Tier-1 on external resources
Virtualization OVERVIEW
Diskpool and cloud storage benchmarks used in IT-DSS
HTCondor at Syracuse University – Building a Resource Utilization Strategy Eric Sedore Associate CIO HTCondor Week 2017.
ATLAS Cloud Operations
Building a Virtual Infrastructure
Windows Azure Migrating SQL Server Workloads
”The Ball” Radical Cloud Resource Consolidation
Chapter 21: Virtualization Technology and Security
TYPES OFF OPERATING SYSTEM
Chapter 22: Virtualization Security
Managing Clouds with VMM
Traditional Virtualized Infrastructure
Cloud Security AWS as an example.
Cloud Security AWS as an example.
Presentation transcript:

Managing a growing campus pool Eric Sedore

Quick context  Scavenging CPU time from 2000 desktops on campus  Growth from 3000 to cores over the last year  Varied types of research (in all dimensions, run time, data needed per job, data access type (HTCondor transfer/NFS))  Challenge feeding data to the compute nodes  VPN tunnel bandwidth  SAN Storage / Virtual server environment  Campus network  Exploring steering jobs via average availability of each node

Steering jobs  More Traditional – CPU/Memory/Application requirements  Jobs that are not easily checkpointed and have longer run times  Dynamic average giving the job a view into the amount of scavenging time (in hours) of a node – publish via classad  sessionlength = "44"  sessionlength = "9"  sessionlength = "10"  sessionlength = "41"  Add to job requirements or ranking: TARGET.sessionlength >= "4“  Data gathered through parsing logs from the last seven days (from Condor VM Coordinator)

Dealing with growth  Data delivery infrastructure - horizontal and vertical scaling  Network infrastructure upgrades – upgrading 1 Gb links to 10 Gb links  Scaling out VPN end points – working toward a model, per endpoint  Throttling vs. bottleneck (protecting network links where there are high compute node populations)

Pushing NFS  ~6300 nodes supported on a single NFS server ( open files)  Can run out of CPU at peak times (current server has 16 cores) – context switching / I/O wait  Virtual Machine running in ESX 5.x – SAN storage (15K spindles on an IBM 5300)  virtual #35-Ubuntu SMP Fri May 25 18:35:12 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux

Pool up and running – supporting the load