Cluster Infrastructure BoF
Bruce J. Walker

Cluster BoF
Cluster Summits
What is a cluster
Information about Clusters
Types of clusters
Characteristics of clusters
Cluster components
Clusters and Xen
Cluster Issues

Cluster Summits
Walldorf, Germany Summit – June, RH
–Daniel Phillips will discuss in next BoF
OLS Summit – Monday/Tuesday this week
–Set up via OSDL cluster sig (clusters_sig mailing list)

OLS Summit
Attendees:
–HP (OpenSSI), IBM (HA-Linux), ORNL (Oscar, etc.), MontaVista (Virtual Synchrony), Novell (CIM/DMTF), Suse, Cassat, RH (CLVM), Steeleye, …
Notes will be sent out on clusters_sig mailing list
Talks: HA-Linux; Virtual Synchrony; DMTF/CIM
Discussion: CLVM, cman GFS, DLM OCFS2 integration; component interop and substitution; memory pressure

What is a Cluster?
–Multiple machines working together
–Standard computers with an OS kernel per node
–Peers, working together
–NOT client-server
–NOT SMP or NUMA (but have SMP or NUMA nodes)
–Clusters and Grids?
   Grids are loose and can cross administrative lines
   Use a grid only if you can’t set up a cluster
   The best grid would be a collection of SSI clusters

Information about clusters
Clusters_sig mailing list (OSDL)
Linux_clusters mailing list (RedHat)
Sourceforge cluster web site
Projects
–Linux-ha.org, openssi.org, openmosix.org, kerrighed.org, bproc.org, OCFS2, GFS, Lustre
Products
–Scyld, Steeleye, RH, Suse, HP (ServiceGuard), IBM, linux-ha ….

Many types of Clusters
High Performance Clusters
–Beowulf; 1000 nodes; parallel programs; MPI
Load-leveling Clusters
–Move processes around to borrow cycles (e.g. Mosix)
Web-Service Clusters
–LVS; load-level TCP connections; Web pages and applications
Storage Clusters
–Cluster filesystems; same view of data from each node; GFS, OCFS2
Database Clusters
–Oracle RAC
High Availability Clusters
–ServiceGuard, Lifekeeper, Failsafe, ha-linux, failover clusters

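Note on load-leveling: the idea of spreading new TCP connections across nodes can be sketched as a least-connection dispatcher. This is an illustration only, not LVS code; the names `pick_node` and `dispatch` and the node names are hypothetical, and a real director would also track connection teardown and node health.

```python
def pick_node(active_connections):
    """Return the node with the fewest active connections.

    active_connections maps a (hypothetical) node name to its current
    connection count. Ties are broken by node name order.
    """
    if not active_connections:
        raise ValueError("no nodes available")
    return min(sorted(active_connections), key=lambda n: active_connections[n])


def dispatch(active_connections, n_requests):
    """Assign n_requests new connections one at a time.

    Updates active_connections in place and returns the chosen nodes
    in the order the connections were assigned.
    """
    chosen = []
    for _ in range(n_requests):
        node = pick_node(active_connections)
        active_connections[node] += 1
        chosen.append(node)
    return chosen
```

For example, starting from `{"node1": 2, "node2": 0, "node3": 1}`, three new connections go to the least-loaded nodes until every node holds two.
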
[Figure: log-scale comparison along four dimensions (Availability, Manageability, Usability, Scalability) of SMP, Really BIG SMP, Typical HA Cluster, Ideal Linux Cluster, and a HUGE "Perfect Cluster in all dimensions".]

Cluster Components
Membership
Communication
Group Membership and Communication
–Virtual Synchrony
DLM
CLVM
Cluster Filesystems
Networking (Linux Virtual Server)
Cluster Process Management
Scheduling / Workload Mgmt
Remote Fileblock
Remote IPC
Distributed Shared Memory (DSM)
DMTF / CIM

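Note on membership: a minimal sketch of the simplest form of the membership component, assuming heartbeat messages and a fixed timeout. The class `Membership` and its methods are hypothetical names, not the API of any project listed here. It models one node's local view only; a real membership service must also get all nodes to agree on that view, which is where group communication such as virtual synchrony comes in.

```python
class Membership:
    """One observer's view of which nodes are alive, based on heartbeats."""

    def __init__(self, timeout):
        # A node is dropped once no heartbeat has arrived for `timeout` seconds.
        self.timeout = timeout
        self.last_seen = {}  # node name -> time of most recent heartbeat

    def heartbeat(self, node, now):
        """Record a heartbeat from `node` received at time `now`."""
        self.last_seen[node] = now

    def members(self, now):
        """Return the sorted names of nodes heard from within the timeout."""
        return sorted(
            node
            for node, seen in self.last_seen.items()
            if now - seen <= self.timeout
        )
```

Time is passed in explicitly rather than read from a clock so that the behaviour is easy to check by hand.
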
Clusters and Xen
HA across DomU
–On same host / on different hosts
HA across Dom0
–Restart VMs
Load balancing

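Note on HA across Dom0: restarting VMs after a host failure amounts to a placement decision over the surviving hosts. The sketch below is one simple way to make it, assuming a fixed per-host VM limit. It does not call any Xen interface; `restart_plan`, the host names, and the capacity model are all hypothetical.

```python
def restart_plan(placement, failed_host, capacity):
    """Decide where to restart the VMs that ran on a failed host.

    placement: dict mapping VM name -> host name (the state before the failure)
    capacity:  dict mapping host name -> maximum number of VMs it can run
    Returns a new placement dict that covers every VM.
    """
    # Count the VMs already running on each surviving host.
    load = {host: 0 for host in capacity if host != failed_host}
    new_placement = {}
    for vm, host in placement.items():
        if host != failed_host:
            load[host] += 1
            new_placement[vm] = host

    # Restart each orphaned VM on the least-loaded host that has room.
    orphans = sorted(vm for vm, host in placement.items() if host == failed_host)
    for vm in orphans:
        candidates = [h for h in sorted(load) if load[h] < capacity[h]]
        if not candidates:
            raise RuntimeError("no surviving host has room for " + vm)
        target = min(candidates, key=lambda h: load[h])
        load[target] += 1
        new_placement[vm] = target
    return new_placement
```

The same function also hints at the load-balancing bullet: choosing the least-loaded host spreads the restarted VMs rather than piling them onto one machine.
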
Clustering Issues
Different Cluster managers
–APIs and resource mgmt
Interoperability of cluster components
CIM model for clusters / what apps want
Clusters of Clusters
Overlapping membership