UCB Millennium and the Vineyard Cluster Architecture Phil Buonadonna University of California, Berkeley

Presentation transcript:

UCB Millennium and the Vineyard Cluster Architecture Phil Buonadonna University of California, Berkeley

10/9/99 UC Berkeley Millennium 2
Millennium Project
Hierarchical “Cluster of Clusters”
[Diagram labels: PIII-X 64x4; PII 8x2; PIII 32x2; ½ TB DLIB; PII; PIII; Gigabit Ethernet (GbE); Ninja; Math; Bio; CE; Physics; Astro]

Millennium Agenda
Investigate recent PC technologies in Clusters
  – NT/Linux
  – VI Architecture / GbE / Distributed I/O
Harvest the lessons learned from NOW
  – Robust, flexible remote execution
  – Distributed resource management
Investigate clusters that span administrative units
  – Turn-key cluster deployment
  – Sense of ownership
Investigate the “Computational Economy” Approach
  – Resource management with a natural sense of ownership
  – Enough heterogeneous interests to be worthwhile
Form basis for Sci. Computing, Internet Services, etc.

Vineyard Cluster Architecture
Distributed resource utilization and management in a “Vineyard” of Clusters.
[Layer diagram labels: Applications / Services; MPI; VEXEC; PBS; I/O; Mgmt / Monitoring; REXEC; TOOLS; Rootstock Distribution; VIA / GM, GbE; Multicast; NT / Linux (2.2.x); Stride Scheduler]

Outline
Millennium Project
Vineyard Cluster SW Architecture
Important Component Technologies
  – Rootstock cluster SW distribution facility
  – REXEC: Robust Linux Remote Execution
  – Economic-based Resource allocation
  – CAN communication over VIA
  – IO Rivers
Directions and Discussion

Rootstock
Disseminate easy-to-build PC cluster system software
Variety of cluster designs
  – well-engineered high-performance clusters
  – low-cost casual workgroup clusters
  – server farms
  – scalable internet servers
Root Cluster Server (CS)
  – Provides cluster software stock
Second-level customized distribution within each cluster from its own CS node

Rootstock Cluster
Collection of nodes with IP connectivity
  – can be dedicated subnet, w/ or w/o NAT, or any collection
  – run nfsd (within cluster), httpd, ssl
One node designated as Cluster Root
  – serves as the root of administrative operations and mgmt.
  – may be same or different from other nodes
  – may participate in normal cluster operation or not
  => is trusted by other nodes and has storage for dialtone
May have designated front-end nodes or not
May have dedicated cluster-area-network (e.g., Myrinet) or not.

Rootstock Mechanics
[Diagram labels: cluster stock (build, os, drvrs, mill SW, os mods); leased builds; Cluster System Distribution Center; IP network; Cluster; CS; CAN]
1. Cluster Stock
  - Rootstock build pages
  - Full Current Linux
  - all fixes and pckgs
  - SSL, SSH
  - Cluster Drivers
  - Cluster System Layers
  - rexec, mpe, pbs
  - Optional SW ($)
  - Cluster Kernel Mods
2. Make the CS “graft”
  - specify IP address
  - pckg removes
  - dhcp, dns, nis, ...
  - sanity check and build
  - resolv.conf, /etc/hosts, ...
  - constructs cluster build (lease)
  - download CS build floppy
3. CS power-on build
  - xfer and localize DT
  - add local admin scripts
  - node build floppy
4. Node power-on build
  - local stock from CS
5. Cluster Update button (future)
  - 2nd dialtone, CF engine, rolling update

Computational Economy
Market-based approach to resource allocation
  – Optimizes for user value
[Diagram labels: Apps (Value); Economic F.E.; API; Access Modules; Resource Managers (Time Share, Batch Queue); Resources]

REXEC Remote Execution
Secure, decentralized remote execution environment
Features
  – Decouples resource discovery and selection
  – Multiple Allocation Policies (VEXECs)
  – Decentralized control
      Each client rexec is the root for a distributed task.
  – Dynamic discovery and configuration
      Resource announcements on a cluster multi-cast channel
      All Soft State
  – Simple, well-defined failure and cleanup models
      “They all fall down”
  – Secure
Translates Pricing Mechanism to Resource Allocation
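The "resource announcements" and "all soft state" points can be illustrated with a small table whose entries expire unless a node keeps announcing itself. This is only a sketch of the general soft-state idea, not REXEC's code: the class name `SoftStateTable`, the `ttl` value, and the method names are all hypothetical, and the multicast transport is left out entirely.

```python
import time
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class SoftStateTable:
    """Node announcements that expire unless refreshed (soft state)."""

    ttl: float = 3.0  # hypothetical entry lifetime in seconds
    _seen: Dict[str, float] = field(default_factory=dict)

    def announce(self, node: str, now: Optional[float] = None) -> None:
        # Record (or refresh) the time of the node's latest announcement.
        self._seen[node] = time.monotonic() if now is None else now

    def live_nodes(self, now: Optional[float] = None) -> List[str]:
        # Forget every node whose last announcement is older than ttl.
        now = time.monotonic() if now is None else now
        self._seen = {n: t for n, t in self._seen.items() if now - t <= self.ttl}
        return sorted(self._seen)
```

Because nothing is stored beyond the last announcement time, a crashed node simply disappears from the table once its entry ages out, and no explicit cleanup message is needed.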

REXEC / VEXEC
Components
  – rexecd, rexec & vexecd
[Diagram labels: rexec; rexecd; vexecd (Policy A); vexecd (Policy B); Cluster IP Multicast Channel; Node A; Node B; Node C; Node D]
Example shown in the diagram:
  % rexec -n 2 -r 3 indexer
  policy: minimum $
  “Node A”
  run indexer on Nodes A, B at 3 credits/min
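One way to read the diagram's example is that a policy daemon picks nodes on behalf of the client. The sketch below assumes, from the example text only, that `-n` is the number of nodes requested and that a "minimum $" policy prefers the nodes with the lowest current price. The function `select_min_cost`, the price table, and the tie-breaking rule are hypothetical and are not taken from vexecd.

```python
from typing import Dict, List


def select_min_cost(node_price: Dict[str, float], n: int) -> List[str]:
    """Return the n cheapest nodes, breaking price ties by node name."""
    if n > len(node_price):
        raise ValueError("fewer nodes announced than requested")
    ranked = sorted(node_price.items(), key=lambda kv: (kv[1], kv[0]))
    return [name for name, _ in ranked[:n]]


# Hypothetical prices (credits/min already bid on each node).
prices = {"Node A": 0.0, "Node B": 1.0, "Node C": 5.0, "Node D": 2.0}
chosen = select_min_cost(prices, n=2)
```

With these made-up prices the policy selects Node A and Node B; a different policy daemon could rank the same announcements by another criterion without changing the client.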

Interactive Pricing Mechanism
Most work on “economic mechanisms” focuses on single item or batch case
  – hold auctions (e.g., second-price sealed bid) integrated into Vineyard PBS
  – interactive case needs to be very simple
Bidder i gets \( b_i / \sum_k b_k \) of CPU at rate \( b_i \)
  – enforced by stride scheduler
Running cluster mirror usage experiment
  – two identical clusters for one user community with $ accounts
  – one free and uncontrolled
  – one for bid and controlled
  – which is more desirable to use
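The proportional-share rule and its enforcement by a stride scheduler can be sketched as follows. This is the textbook form of stride scheduling (each client advances a "pass" value by a stride inversely proportional to its bid, and the client with the smallest pass runs next), not the Millennium kernel code. The function names and the use of integer bids are assumptions made for the example.

```python
import heapq
from collections import Counter
from fractions import Fraction
from typing import Dict


def shares(bids: Dict[str, int]) -> Dict[str, Fraction]:
    """CPU fraction for each bidder i: b_i divided by the sum of all bids."""
    total = sum(bids.values())
    return {user: Fraction(b, total) for user, b in bids.items()}


def stride_schedule(bids: Dict[str, int], quanta: int) -> Counter:
    """Hand out `quanta` time slices so that usage tracks the bid ratios."""
    # Each heap entry is (pass value, user, stride); stride = 1 / bid.
    heap = [(Fraction(1, b), user, Fraction(1, b)) for user, b in bids.items()]
    heapq.heapify(heap)
    ran: Counter = Counter()
    for _ in range(quanta):
        pass_value, user, stride = heapq.heappop(heap)  # smallest pass runs
        ran[user] += 1
        heapq.heappush(heap, (pass_value + stride, user, stride))
    return ran


bids = {"a": 3, "b": 1}
print(shares(bids))              # a gets 3/4 of the CPU, b gets 1/4
print(stride_schedule(bids, 8))  # a runs 6 slices, b runs 2
```

Over 8 slices the two bidders receive 6 and 2 slices, matching the 3:1 bid ratio, and the schedule is deterministic, which is what makes the simple interactive pricing rule enforceable.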

Communication / VIA
Multiple Physical Layers
  – Fast Ethernet
  – Gigabit Ethernet (Inter & Intra cluster net)
  – Myrinet w/ Lanai7 (Intra cluster net)
Transports
  – IP, IP Multicast
  – VI Architecture / GM
Explore integrated IPC and distributed I/O

AM Architecture
Components
  – Endpoints
  – Virtual Networks
  – Bundles
Operations
  – Request / Reply
      Short, Med, Long
  – Create, Map, Free
  – Poll, Wait
Credit based flow control
[Diagram labels: Proc A; Proc B; Proc C]

AM-VIA Architecture
VI Queue (VIQ)
  – Logical channel for AM message type
  – VI & independent Send/Receive Queues
  – Independent request credit scheme (counter n)
MAP Object
  – Container for 3 VIQ’s
      Short, Medium, Long
  – Single Registered Memory Region
[Diagram label: MAP Object]
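The per-queue credit scheme can be pictured with a small model: each VIQ holds its own counter of request credits, a request consumes one, and the matching reply returns it. This is a minimal sketch of credit-based flow control in general. The class and method names (`VIQueue`, `MapObject`, `try_send_request`, `on_reply`) and the initial credit count are hypothetical, and no real VI Architecture calls are modeled.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class VIQueue:
    """One logical channel with its own request-credit counter."""

    credits: int  # remaining request credits (the counter n on the slide)

    def try_send_request(self) -> bool:
        # A request may only be sent while a credit is available.
        if self.credits == 0:
            return False
        self.credits -= 1
        return True

    def on_reply(self) -> None:
        # The reply to an earlier request hands its credit back.
        self.credits += 1


@dataclass
class MapObject:
    """Container for the three VIQs, one per message size class."""

    viqs: Dict[str, VIQueue]


def make_map(initial_credits: int) -> MapObject:
    # Hypothetical constructor: the same starting credit for every queue.
    sizes = ("short", "medium", "long")
    return MapObject({size: VIQueue(initial_credits) for size in sizes})
```

Keeping a separate counter per queue means that exhausting the credits for long messages does not block short ones, which is one plausible reason for the "independent" scheme the slide describes.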

AM-VIA Integration
Bundle: Pair of VI Completion Queues
  – Send/Receive
Endpoints: Collection of MAP objects
  – Virtual network emulated by point-to-point connections
[Diagram labels: Proc A; Proc B; Proc C]
