Www.openfabrics.org Open MPI Project State of the Union - April 2007 Jeff Squyres Cisco, Inc.

Slides:



Advertisements
Similar presentations
Technology Analysis LINUX Alper Alansal Brian Blumberg Ramank Bharti Taihoon Lee.
Advertisements

Using MapuSoft Instead of OS Vendor’s Simulators.
The Premier Software Usage Analysis and Reporting Toolset CELUG Presentation – May 12, 2010 LT-Live : License Tracker’s License Server Monitor.
OpenFabrics Enterprise Distribution for Windows 1 Stan C. SmithIshai RabinovitzEric Lantz 3/16/2010.
VisIt Software Engineering Infrastructure and Release Process LLNL-PRES Lawrence Livermore National Laboratory, P. O. Box 808, Livermore,
Evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Apache Jakarta Tomcat Suh, Junho. Road Map Tomcat Overview Tomcat Overview History History What is Tomcat? What is Tomcat? Servlet Container.
Sharepoint Portal Server Basics. Introduction Sharepoint server belongs to Microsoft family of servers Integrated suite of server capabilities Hosted.
1 © 2006 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Using the Cisco Technical Support & Documentation Website for Security.
EclipseME Java Micro Edition The Eclipse Way Craig Setera April 19, 2006.
New Direction Proposal: An OpenFabrics Framework for high-performance I/O apps OFA TAC, Key drivers: Sean Hefty, Paul Grun.
DB2 (Express C Edition) Installation and Using a Database
Apache Tomcat Web Server SNU OOPSLA Lab. October 2005.
OFA-IWG - March 2010 OFA Interoperability Working Group Update Authors: Mikkel Hagen, Rupert Dance Date: 3/15/2010.
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
An Introduction to Xen Prof. Chih-Hung Wu
RSC Williams MAPLD 2005/BOF-S1 A Linux-based Software Environment for the Reconfigurable Scalable Computing Project John A. Williams 1
SSI-OSCAR A Single System Image for OSCAR Clusters Geoffroy Vallée INRIA – PARIS project team COSET-1 June 26th, 2004.
1 Reseller Webinar Jay Petersen April 8, Slide 2 Announcing the creation of… A new organization in SCO with the charter to create new licensing.
1 Session Number Presentation_ID © 2001, Cisco Systems, Inc. All rights reserved. Using the Cisco TAC Website for IP Routing Issues Cisco TAC Web Seminar.
OFED 1.x Roadmap & Release Process November 06 Jeff Squyres, Woodruff, Robert J, Betsy Zeller, Tziporet Koren,
Ch.2 – Introduction to Routers
Module 12: Designing High Availability in Windows Server ® 2008.
WORK ON CLUSTER HYBRILIT E. Aleksandrov 1, D. Belyakov 1, M. Matveev 1, M. Vala 1,2 1 Joint Institute for nuclear research, LIT, Russia 2 Institute for.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco PublicOFA Open MPI 1 Open MPI Progress Jeff Squyres.
Tools and Utilities for parallel and serial codes in ENEA-GRID environment CRESCO Project: Salvatore Raia SubProject I.2 C.R. ENEA-Portici. 11/12/2007.
An Introduction to IBM Systems Director
Bright Cluster Manager Advanced cluster management made easy Dr Matthijs van Leeuwen CEO Bright Computing Mark Corcoran Director of Sales Bright Computing.
Open Fabrics BOF Supercomputing 2008 Tziporet Koren, Gilad Shainer, Yiftah Shahar, Bob Woodruff, Betsy Zeller.
Module 7: Fundamentals of Administering Windows Server 2008.
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
Bonrix SMPP Client. Index Introduction Software and Hardware Requirements Architecture Set Up Installation HTTP API Features Screen-shots.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Introduction to HP Availability Manager.
OpenFabrics 2.0 or libibverbs 1.0 Sean Hefty Intel Corporation.
Citrix MPS 3.0 Licensing Douglas A. Brown President
BLU-ICE and the Distributed Control System Constraints for Software Development Strategies Timothy M. McPhillips Stanford Synchrotron Radiation Laboratory.
Update on Database Issues Peter Chochula DCS Workshop, June 21, 2004 Colmar.
March 17, 2005 Roadmap of Upcoming Research, Features and Releases Bart Miller & Jeff Hollingsworth.
Taking the Complexity out of Cluster Computing Vendor Update HPC User Forum Arend Dittmer Director Product Management HPC April,
Copyright © 2002, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners
© 2002 IBM Corporation LANDP 24 th April 2003 LANDP for Linux Web Lecture.
An Overview of Berkeley Lab’s Linux Checkpoint/Restart (BLCR) Paul Hargrove with Jason Duell and Eric.
@ For more details visit : Opportunities for participation Modular Architecture Trace JIT compiler Interpreter Memory manager.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
Derek Wright Computer Sciences Department University of Wisconsin-Madison MPI Scheduling in Condor: An.
OpenFabrics Enterprise Distribution (OFED) Update
Windows OpenFabrics (WinOF) Update Gilad Shainer, Mellanox Technologies November 2007.
SKYPIAX, how to add Skype capabilities to FreeSWITCH (and Asterisk) CHICAGO, USA, September 2009.
Cluster Software Overview
Presented by Open MPI on the Cray XT Richard L. Graham Tech Integration National Center for Computational Sciences.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
iSER update 2014 OFA Developer Workshop Eyal Salomon
Open MPI OpenFabrics Update April 2008 Jeff Squyres.
Connecting LabVIEW to EPICS network
Group Communication Theresa Nguyen ICS243f Spring 2001.
Chapter 1 Basic Concepts of Operating Systems Introduction Software A program is a sequence of instructions that enables the computer to carry.
Mellanox Connectivity Solutions for Scalable HPC Highest Performing, Most Efficient End-to-End Connectivity for Servers and Storage September 2010 Brandon.
© 2007 by Michal Dobisek; made available under the EPL v1.0 | EclipseCon 2007 Michal Dobisek, Inside Subversive The Subversion.
Software Systems Division (TEC-SW) ASSERT process & toolchain Maxime Perrotin, ESA.
Monitoring Dynamic IOC Installations Using the alive Record Dohn Arms Beamline Controls & Data Acquisition Group Advanced Photon Source.
9 Copyright © 2004, Oracle. All rights reserved. Getting Started with Oracle Migration Workbench.
Berkeley Lab Software Distribution Site NLIT Dan Pulsifer - Engineering May 11 th, 2008.
PERFORMANCE OF THE OPENMP AND MPI IMPLEMENTATIONS ON ULTRASPARC SYSTEM Abstract Programmers and developers interested in utilizing parallel programming.
Current Generation Hypervisor Type 1 Type 2.
Fabric Interfaces Architecture – v4
CRESCO Project: Salvatore Raia
CoreXL Lab Exercise Presentation
A Virtual Machine Monitor for Utilizing Non-dedicated Clusters
PyWBEM Python WBEM Client: Overview #2
Presentation transcript:

Open MPI Project State of the Union - April 2007 Jeff Squyres Cisco, Inc.

2 Overview  Project purpose  Sub projects  Current status  Continuing / future directions

3 Why does Open MPI exist?  Maximize all MPI expertise  Research / academia  Industry  …elsewhere  Capitalize on [literally] years of MPI research and implementation experience  The sum is greater than the parts Research / academia Industry

4 Why separate from M[VA]PICH?  Open, inclusive community  Not limited to just Open Fabrics  Common: TCP, shared memory, OFED* (MVAPICH only)  OMPI-specific: Myrinet, Portals, InfiniPath  M[VA]PICH have different project goals  They both chose to remain separate

5 Current membership  14 members, 6 contributors  4 US DOE labs  8 universities  7 vendors  1 individual

6 Not-so-subtle hint  …would love to see an iWARP vendor in the list!  (please come talk to me!)

7 Current projects  “Open MPI Project” is an umbrella organization for multiple projects  OMPI:Open MPI  ORTE:Open Run-Time Environment  PLPA:Portable Linux Processor Affinity  MTT:MPI (Middleware) Testing Tool

8 Project: Open MPI / ORTE  Recently released new 1.2 series  OF-related changes compared to v1.1 series  Better overall performance, lots of bug fixes  Improvements for run-time/launch scalability  Relocate installed MPI (good for ISVs)  Support for fork() with OFED 1.2  Support fixed limits for registered memory  Fixes for heterogeneous network environments  Native InfiniPath support

9 Version history

10 Success stories  OFED + Open MPI  Thunderbird Sandia cluster #6 in Top 500  Road Runner Los Alamos cluster 16k Opteron cores + 16k cell broadband engines  Coyote Los Alamos cluster 2580 Opteron cores  Sun ClusterTools v7

11 OFED involvement  Initially planned on “v1.2ofed”  Included some OF- specific updates  But community released v1.2.1 before OFED 1.2  Therefore, included community OMPI v1.2.1 release in OFED v1.2 OMPI SVN development trunk v1.2 series branch v1.2 v1.2ofed v1.2.1 Today

12 OFED involvement  “MPI Selector”  Menu-based and CLI commands  Trivially set system-wide and per-user default MPI selection  No editing of “dot” files necessary  Displays / select between all installed MPI’s  Works with all MPI’s  Including HP MPI and Intel MPI

13 Ongoing OFA-related work  More flexible OF wireup schemes  Heterogeneous networking scenarios  Multiple QP’s per connection  More flexible resource affinity schemes  Processor / core, HCA / port  Automatic path migration  RDMA CM functionality  Better LMC / multi-LID routing

14 Ongoing OFA-related work  Message coalescing  Asynchronous progress  Exploit new Mellanox HCA capabilities  Better utilization of network resources  Heterogeneity  Multicast, UD

15 Roadmap  1.2 series is current stable  v1.2.1 latest release  1.3 series tentatively targeted at end of year  Checkpoint / restart (and other FT)  Integration with debuggers  Windows support (*)  MPI collectives performance improvements  LSF integration

16 Project: Processor affinity (PLPA)  Linux API for affinity has changed 3 times  Changed number and type of arguments  Used same function name (!)  Both kernel and glibc functions  Installed glibc may not match kernel!  Affinity is critical for performance  Especially with increasing core count per host  Already critical on NUMA machines (locality!)

17 Which API to use?  Compile-time solution not sufficient  Need complex “configure” script to figure it out  Only determines glibc API, not kernel API, so it may not even be sufficient  Does not help for shipping static binaries (ISVs)  Need a run-time solution  Paul Hargrove (LBNL) devised safe kernel probe  PLPA library born

18 PLPA library  Constant API suitable for ISVs  BSD license  Automatically performs the run-time probe  Dispatches to correct back-end kernel function  Bypasses glibc

19 Current status  Releases:  Stable series: 1.0.x  Upcoming series: 1.1  New 1.1 features  Topology information Mapping between (socket,core) tuple, hardware threads, CPU node, and Linux processor ID’s  plpa-taskset(1) command Same as taskset(1), but groks topology information

20 Project: MPI Testing Tool (MTT)  Could be named “Middleware Testing Tool”  Very little (no?) MPI-specific  Not specific to Open MPI  Has been used with LAM, MPICH2, MVAPICH2  Used as primary test mechanism for OMPI  Distributed testing by member organizations

21 Open MPI MTT Usage  Distributed regression testing  Nightly and weekend runs  Results ed every weekday morning  Supports various resource managers  Supports correctness and performance tests  Cornerstone of Open MPI release process  Each member tests the platforms they care about

22 Indiana U. server Member site Tarball Nightly Regression Testing  Nightly tarballs created at Indiana U.  Member sites  Download tarball and tests  Compile and run tests  Members upload results to central DB  sent at 12 and 24 hour intervals  Real-time web querying

23 Indiana U. server Member site Tarball DB Nightly Regression Testing  Nightly tarballs created at Indiana U.  Member sites  Download tarball and tests  Compile and run tests  Members upload results to central DB  sent at 12 and 24 hour intervals  Real-time web querying

24 Indiana U. server Tarball DB Nightly Regression Testing  Nightly tarballs created at Indiana U.  Member sites  Download tarball and tests  Compile and run tests  Members upload results to central DB  sent at 12 and 24 hour intervals  Real-time web querying

25 Usage in Open MPI  Currently available to all OMPI members  Strongly “encouraged”  results examined every day  12 and 24 hour windows  Weekend windows  MTT software to be released publicly later this year

26 The Open MPI Project  More than just MPI  Concerned with real-world HPC  Open community  Come join us!  Solid OpenFabrics support is critical  Many unanswered questions  Plenty of room for academic and industry ongoing work

Thank You