Grid Means Business OGF-20, Manchester, May 2007

Slides:



Advertisements
Similar presentations
 What Is Desktop Virtualization?  How Does Application Virtualization Help?  How does V3 Systems help?  Getting Started AGENDA.
Advertisements

Windows Deployment Services WDS for Large Scale Enterprises and Small IT Shops Presented By: Ryan Drown Systems Administrator for Krannert.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Two Broad Categories of Software
6/2/20071 Grid Computing Sun Grid Engine (SGE) Manoj Katwal.
Microsoft Virtual Server 2005 Product Overview Mikael Nyström – TrueSec AB MVP Windows Server – Setup/Deployment Mikael Nyström – TrueSec AB MVP Windows.
4/17/2017 7:07 AM © 2007 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Agenda Master Expert Associat e Microsoft Certified Solutions Master (MCSM) Microsoft Certified Solutions Expert (MCSE) Microsoft Certified Solutions.
Cloud Computing Saneel Bidaye uni-slb2181. What is Cloud Computing? Cloud Computing refers to both the applications delivered as services over the Internet.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Introducing, Installing, and Upgrading Windows 7 Lesson 7.
Using the WDK for Windows Logo and Signature Testing Craig Rowland Program Manager Windows Driver Kits Microsoft Corporation.
Copyright © 2011 EMC Corporation. All Rights Reserved. MODULE – 6 VIRTUALIZED DATA CENTER – DESKTOP AND APPLICATION 1.
Simplifying Resource Sharing in Voluntary Grid Computing with the Grid Appliance David Wolinsky Renato Figueiredo ACIS Lab University of Florida.
© 2008 Quest Software, Inc. ALL RIGHTS RESERVED. Perfmon and Profiler 101.
Hosting an Enterprise Financial Forecasting Application with Terminal Server Published: June 2003.
Satisfy Your Technical Curiosity Specialists Enterprise Desktop -
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Linux Operations and Administration
© 2010 VMware Inc. All rights reserved Why Virtualize? Beng-Hong Lim, VMware, Inc.
BY: SALMAN 1.
Won Huh Product Marketing Manager
Overview – TI Services September-2016.
Building ARM IaaS Application Environment
Let's talk about Linux and Virtualization in 'vLAMP'
ReportWorX vs. ReportWorX Express
Chapter 1: Introduction
BY: SALMAN.
Docker Birthday #3.
Overview – SOE PatchTT November 2015.
Overview – SOE PatchTT December 2013.
Chapter 1: Introduction
Chapter 1: Introduction
Platform as a Service.
Objectives Differentiate between the different editions of Windows Server 2003 Explain Windows Server 2003 network models and server roles Identify concepts.
Configuration Management with Azure Automation DSC
Chapter 1: Introduction
CernVM Status Report Predrag Buncic (CERN/PH-SFT).
Chapter 1: Introduction
GGF15 – Grids and Network Virtualization
Chapter 1: Introduction
Grid Computing in the Enterprise: Micron Technology, Inc.
Migration Strategies – Business Desktop Deployment (BDD) Overview
Brooklin J. Gore Senior Fellow May
11/11/2018 Desktop Virtualization Corey Hynes Kyle Rosenthal President Technical Lead HynesITe Inc Spider Consulting @windowspcguy.
Semiconductor Manufacturing (and other stuff) with Condor
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Microsoft Ignite NZ October 2016 SKYCITY, Auckland.
Benefits of Condor-Enabled Process Control Micron Technology, Inc.
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
Ch 4. The Evolution of Analytic Scalability
Example of usage in Micron Italy (MIT)
Follow-the-moon optimization with Condor-enabled genetic algorithms
Microsoft Virtual Academy
Chapter 1: Introduction
Follow-the-moon optimization with Condor-enabled genetic algorithms
Increase and Improve your PC management with Windows Intune
MDC-B203 Deploying Applications in Microsoft System Center Virtual Machine Manager Using Services John Messec Program Manager Microsoft.
Chapter 1: Introduction
Chapter 1: Introduction
Chapter 1: Introduction
General Purpose Computing with Condor
TechEd /23/2019 9:23 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
Software - Operating Systems
Chapter 1: Introduction
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
Chapter 1: Introduction
Chapter 1: Introduction
Bethesda Cybersecurity Club
Presentation transcript:

Grid Means Business OGF-20, Manchester, May 2007 Scaling up to the Enterprise Level: Using 15,000 CPUs 66% of the Time Micron Technology, Inc. Brooklin J. Gore Senior Fellow, Advanced Computing 2004 Micron Technology, Inc. All rights reserved. Information is subject to change without notice. 1 1

Using 15,000 CPUs 66% of the Time Agenda Micron Overview Micron Grid Overview Grid Application Overview System Management Best Practices Conclusions 11/8/2018

Using 15,000 CPUs 66% of the Time Micron Overview …move… Capture… …Store 11/8/2018

Using 15,000 CPUs 66% of the Time Micron Overview 11/8/2018

Using 15,000 CPUs 66% of the Time Micron Grid Overview Idaho-3 Idaho-2 Virginia Idaho-1 Italy Japan Utah Singapore-1 Singapore-2 11/8/2018

Using 15,000 CPUs 66% of the Time Micron Grid Overview 14077 Processors, 8.053 TFlops, 63rd Top500 Rank 529 TeraBytes Disk 102 user accounts in all pools 1,281,636 job hours, 1780 Processor-months Primarily Windows, plus Linux, some Solaris Condor system managed in-house Centralized governance, distributed management 11/8/2018

Using 15,000 CPUs 66% of the Time Micron Grid Overview 11/8/2018

Using 15,000 CPUs 66% of the Time Grid-Enabled Application Processing Models Push Data processors are pushed to dynamically allocated grid resources Pull Data is pulled to processors on pre-allocated grid resources Portable Processors may be dynamically deployed on resources far away from source data 11/8/2018

Using 15,000 CPUs 66% of the Time The Push Processing Model Good for ‘long running’ jobs -- mitigate ~30s grid process activation delay One unit of work associated with one grid job Grid resources dynamically allocated as needed to run jobs Data (pointer) is pushed along with job to grid resource. A1 , A2 , … AL Work Queue P1 P2 Px P3 P4 Grid Resources A There is work to do (Schedule job to do work) Work In Out Allocate job to available resource Grid Scheduler 11/8/2018

Using 15,000 CPUs 66% of the Time The Pull Processing Model Good for many very short jobs -- avoids ~30s grid process activation delay Work queue size not equal to grid job queue size Work processors are pre-provisioned onto grid resources Work processors pull work from queue SETI@Home is clasic example A1 , A2 , … AL Work Queue P1 P2 Px P3 P4 Grid Resources A High Water (Add Processor) Low (Remove Processor) Work In Out (Un-) Provision processor (from) on resource Grid Scheduler 11/8/2018

Using 15,000 CPUs 66% of the Time The Portable Processing Model Portable Grid applications: Low data in/out Compute bound So, follow-the-moon: Direct jobs to sites Where the workers aren’t 11/8/2018

Using 15,000 CPUs 66% of the Time Grid Applications Overview Don’t believe in one killer grid application… …but many general purpose grid applications: Manufacturing applications (widget processing) Engineering applications (repetitive tasks) Reporting applications (chart generation) Data mining (log file processing) Software development (build, test, package, deploy) Security (proactive port scanning) Grid-enabled script engines (MATLAB, JMP, R, etc.) 11/8/2018

Using 15,000 CPUs 66% of the Time Grid System Management Software deployment and upgrades Unix systems use common system image on shared file system Windows systems use Altiris to deploy/install/upgrade sotware on local file system Grid host configuration Three-tier configuration files: global, pool-wide, host Unix systems utilize shared file system Windows ‘cron job’ checks for updates to central files and copies to local system. Today file-based, want web-based. Compute host and job management Global web interface 11/8/2018

Using 15,000 CPUs 66% of the Time Grid System Management 11/8/2018

Using 15,000 CPUs 66% of the Time Grid System Management The tricky parts Configuring hosts so grid jobs don’t impact users Run jobs when machine user is idle Evict jobs when user comes back (Don’t worry about CPU -- let the OS do it) Optimizing the grid application data ‘chunk’ size Too big: hard to checkpoint, tough on network Too small: file overhead is high, use messages (Best to be configurable and dial-in with experience) 11/8/2018

Using 15,000 CPUs 66% of the Time Governance Best Practices worker Infrastructure Fast, fat networks Centralized (fast) data stores Common system images Fast (fat) desktops worker worker worker Job dispatch result set “Edge push” Effect work units master (traditional client) database server 11/8/2018

Using 15,000 CPUs 66% of the Time Governance Best Practices People Articulate the value proposition Focus on low-hanging fruit Integrate new grid processes with existing system management processes Educate, educate, educate Technology is Easy, People are Hard 11/8/2018

Using 15,000 CPUs 66% of the Time Governance Best Practices Grid Management Centralize Grid Center of Excellence Global standards Grid tools development Distribute Grid resource management Application support Align Pools with Identity/Data Domains 11/8/2018

Using 15,000 CPUs 66% of the Time Conclusion Large scale grid computing on shared desktop systems in the enterprise is doable today and is… Not that difficult (from a technology perspective) Not that expensive (from a people and money perspective) Practical (from a grid-applications perspective) 11/8/2018

Scaling up to the Enterprise Brooklin J. Gore Open Grid Forum 20 Manchester UK, May, 2007 Grid Means Business Scaling up to the Enterprise Brooklin J. Gore Senior Fellow, Advanced Computing bgore@micron.com Micron and the Micron logo are trademarks and/or service marks of Micron Technology, Inc. All other trademarks are the property of their respective owners.

Using 15,000 CPUs 66% of the Time Abstract This presentation outlines the goal of a global company to increase the utilization of its computing infrastructure to 66%. We show how a grid computing system is used to combine the computing resources of almost 15,000 cpus at 6 sites on three continents. The presentation discusses how the grid software is deployed to the machines, most of which are knowledge worker desktops running Microsoft Windows, and how each machine is configured and managed. The presentation also covers key best practices and touches on the breadth of applications that are running on the system. 11/8/2018