Silicon Graphics, Inc. Presented by: Open|SpeedShop™ Status, Issues, Schedule, Demonstration Jim Galarowicz SGI Core Engineering March 21, 2006.

Slides:



Advertisements
Similar presentations
ORACLE Lecture 1: Oracle 11g Introduction & Installation.
Advertisements

Aneka: A Software Platform for .NET-based Cloud Computing
Figure 1.1 Interaction between applications and the operating system.
Computer Forensics Principles and Practices by Volonino, Anzaldua, and Godwin Chapter 6: Operating Systems and Data Transmission Basics for Digital Investigations.
SECTION 1: INTRODUCTION TO SIMICS Scott Beamer CS152 - Spring 2009.
Automated Tests in NICOS Nightly Control System Alexander Undrus Brookhaven National Laboratory, Upton, NY Software testing is a difficult, time-consuming.
Operating Systems.
–Streamline / organize Improve readability of code Decrease code volume/line count Simplify mechanisms Improve maintainability & clarity Decrease development.
Patch Management Module 13. Module You Are Here VMware vSphere 4.1: Install, Configure, Manage – Revision A Operations vSphere Environment Introduction.
Background History Note 1 Refer to Soltis: Appendix.
Linux Operations and Administration
Operating Systems Operating System
Silicon Graphics, Inc. Presented by: Open|SpeedShop ™ A Dyninst Based Performance Tool Overview & Status Jim Galarowicz Manager - SGI Tools group Bill.
1 Parallel Performance Analysis with Open|SpeedShop Trilab Tools-Workshop Martin Schulz, LLNL/CASC LLNL-PRES
1 Parallel Performance Analysis with Open|SpeedShop NASA NASA Ames Research Center October 29, 2008.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 2: System Structures.
© 2010 VMware Inc. All rights reserved Patch Management Module 13.
An Introduction to Software Architecture
Portions © Intel Corporation | Portions © Hewlett-Packard Corporation * Other brands and names may be claimed as the property of others.
Advanced PI Calculation Engine Makes Complex PI Calculations Easy! Use of EDICTvb for Multi-Plant Advanced PI Calculations Dane OverfieldEXELE Information.
Silicon Graphics, Inc. Presented by: Open|SpeedShop™ An Open Source Performance Debugger Overview Jack Carter MTS SGI Core Engineering.
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
50mm Telescope ACS Course Garching, 15 th to 19 th January 2007 January 2007Garching.
Pegasus Status Update April April 2001 Karl Schopmeyer.
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, SCAPE Scalable Preservation Environments.
04/27/2011 Paradyn Week Open|SpeedShop & Component Based Tool Framework (CBTF) project status and news Jim Galarowicz, Don Maghrak The Krell Institute.
Views The architecture was specifically changed to accommodate multiple views. The used of the QStackedWidget makes it easy to switch between the different.
Linux Audio Mangler Project Design Presentation Yu Chong Hector Urtubia Tony Zuliani.
University of Maryland The DPCL Hybrid Project James Waskiewicz.
Processes and Threads CS550 Operating Systems. Processes and Threads These exist only at execution time They have fast state changes -> in memory and.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Martin Schulz Center for Applied Scientific Computing Lawrence Livermore National Laboratory Lawrence Livermore National Laboratory, P. O. Box 808, Livermore,
Generic API Test tool By Moshe Sapir Almog Masika.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Operating System What is an Operating System? A program that acts as an intermediary between a user of a computer and the computer hardware. An operating.
Metadata Management of Terabyte Datasets from an IP Backbone Network: Experience and Challenges Sue B. Moon and Timothy Roscoe.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
© 2002 IBM Corporation Confidential | Date | Other Information, if necessary June, 2011 Made available under the Eclipse Public License v Mobile.
Chapter 2 Processes and Threads Introduction 2.2 Processes A Process is the execution of a Program More specifically… – A process is a program.
Operating System Principles And Multitasking
1 SciDAC High-End Computer System Performance: Science and Engineering Jack Dongarra Innovative Computing Laboratory University of Tennesseehttp://
Abstract A Structured Approach for Modular Design: A Plug and Play Middleware for Sensory Modules, Actuation Platforms, Task Descriptions and Implementations.
1 CEN 4072 Software Testing PPT3: Making the program fail.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
21 Sep UPC Performance Analysis Tool: Status and Plans Professor Alan D. George, Principal Investigator Mr. Hung-Hsun Su, Sr. Research Assistant.
Copy to Tape TOI. 2 Copy to Tape TOI Agenda Overview1 Technical Feature Implementation2 Q&A3.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Chapter 1 Basic Concepts of Operating Systems Introduction Software A program is a sequence of instructions that enables the computer to carry.
Views The architecture was specifically changed to accommodate multiple views. The used of the QStackedWidget makes it easy to switch between the different.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Tuning Threaded Code with Intel® Parallel Amplifier.
CITA 171 Section 1 DOS/Windows Introduction. DOS Disk operating system (DOS) –Term most often associated with MS-DOS –Single-tasking operating system.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
Advanced Operating Systems CS6025 Spring 2016 Processes and Threads (Chapter 2)
VisIt Project Overview
Linux Optimization Kit Many developers need to get a performance increase from their Linux OS Linux OK allows users to achieve higher performance.
OPERATING SYSTEMS CS3502 Fall 2017
The ETICS Build and Test Service
Introduction to Operating System (OS)
Pegasus Status Update April 2001
Operating System Concepts
From Open|SpeedShop to a Component Based Tool Framework
Section 1: Introduction to Simics
Presentation transcript:

Silicon Graphics, Inc. Presented by: Open|SpeedShop™ Status, Issues, Schedule, Demonstration Jim Galarowicz SGI Core Engineering March 21, 2006

–Intel, Intel Inside (logos) and Itanium are trademarks of Intel Corporation in the United States, other countries, or both. –Linux is a trademark of Linus Torvalds in the United States, other countries or both. –Qt and the Qt logo are trademarks of Trolltech in Norway, the United States and other countries. –Open|SpeedShop, SGI SpeedShop, IRIX, SGI and SGI Altix are trademarks of Silicon Graphics Inc. –IBM is a registered trademarks of International Business Machines Corporation in the United States, other countries, or both. –All other trademarks mentioned herein are the property of their respective owners Trademark Acknowledgements

Open|SpeedShop Presentation Agenda Agenda : –Project Introduction/Overview –Feature Status –Development Issues –Schedule/Milestones –Questions –Demonstration (If we have time) –Supplemental Info Commonly Used Term Definitions Credits

Open|SpeedShop Project Introduction Open|SpeedShop – an Open Source Performance Tool Based on software concepts from SGI® SpeedShop ™ for the IRIX® O/S –Uses a set of experiments to help answer the question: Where is my program (application) spending the most time and why? –Experiment Definition A set of collectors and executables bound together to generate performance metrics. –Same type of experiments (sampling, event counting, call tracing) –No recompilation of application being analysed is needed –Minimal perturbation of the application being analysed –Support for batch performance analysis –Save performance measurement data file for later analysis

Open|SpeedShop Project Introduction (continued) Open Source under GPL and LGPL licenses Uses Dynamic Instrumentation Technology –Dyninst: Universities of Wisconsin and Maryland Cluster Friendly Linux O/S Multi-Platform Support –Depends on availability of Dyninst –SGI target is SGI® Altix® system based on the Intel® Itanium ™ chip –X86 and X86_64 platforms also supported Implemented Using Open Source Components –DPCL – multi-platform cluster capability –PAPI – multi-platform hardware counter access –SQLite – lightweight multi-platform database DOE/NNSA co-funded - as part of ASC PathForward

Open|SpeedShop Design Overview Foundation of Open Source Components –Core Components (shown here) –Build System (Autoconf, Automake, Libtool, Make & GCC) –Source Control (CVS) –Testing (Python) Layered Approach –Open Source OS-Dependent Layer –Non-Monolithic to Encourage Reuse –Higher Levels of Abstraction Going Up Extensible via Plugin Modules –Support New Data & Views –Vendor Differentiation

Open|SpeedShop Feature Status Three modes of operation: batch, interactive command line interface (CLI), and graphical user interface (GUI) Multiple platform single system image (SSI), traditional cluster, and SGI super cluster support Support for large numbers of processes, threads, and ranks* Ability for users to create their own experiments via a well defined plug-in facility Attach to running processes, threads, and ranks to gather performance data* Automatically attach to all processes in a MPI job to gather performance data View performance data for all of application's lifetime View performance data for the application in smaller time slices* *Part or all of feature still in development and not available with Open|SpeedShop release

Open|SpeedShop Feature Status (Continued) View intermediate performance measurement data on demand (while the experiment is running) Save and restore performance experiment data and symbol information to do post experiment performance analysis Compare performance results for processes, threads, ranks, or to compare current experiment results with previous experiment results* GUI Wizard facility and context sensitive help aid in the tools' usability –Multiple usability studies have been conducted Interactive CLI help facility which lists the CLI commands, syntax, and typical usage Python Scripting API which accesses Open|SpeedShop functionality corresponding to CLI commands *Part of all of feature is still in development and not available with Open|SpeedShop release

Open|SpeedShop Development Challenges Development and Support for various Linux O/S variations –Dyninst support required –Various lab environments/tools –Configuration issues/package differences/platform differences (lib64 versus lib) Creating and Managing DPCL changes/fixes –Getting these fixes approved and into DPCL sourceforge –97 patches to DPCL so far Using older versions of Dyninst and transitioning to newer versions –Fixes for “older” versions of Dyninst –Trying to get newer versions of Dyninst to play well with DPCL Time consuming – Dyninst changing rapidly Takes time to verify new versions work with DPCL/openss

Open|SpeedShop Development Challenges Scalability Issues –Threading internally in Open|SpeedShop –User Interface scaling issues – user friendly and fast –Performance Database file scaling issues Make data retrieval and storage as fast as possible –Attaching to running processes in a timely manner Supporting Various MPI Implementations –SGI MPT, MPICH, LAMPI, OPENMPI Development Time –Schedules are tight

Open|SpeedShop Milestones and Schedule Infrastructure deliveries –September and December, 2004 Performance Experiments Delivery Schedule –PC Sampling1 st Quarter 2005 –User Time 2 nd Quarter 2005 –Hardware Counter3 rd Quarter 2005 –MPI call tracing4 th Quarter 2005 –MPI enhanced call tracing1 th Quarter 2006 –Input/Output call tracing1 st Quarter 2006 –Input/Output enhanced call tracing2 nd Quarter 2006 –FPE tracing2 nd Quarter 2006 “Live” cvs repository source is available now: –

Questions Questions: Jim Galarowicz

Open|SpeedShop Demonstration Open|SpeedShop Demonstration

Supplemental Info

Open|SpeedShop Terms and Definitions Experiment: A set of collectors and executables bound together to generate performance metrics. Component (s): A somewhat self-contained section of the Open|SpeedShop performance tool. This section of code does a set of specific related tasks for the tool. For example, the GUI component does all the tasks related to displaying Open|SpeedShop wizards, experiment creation, and results using a graphical user interface. The CLI component does similar functions but uses the interactive command line delivery method. Collector: The portion of the tool containing logic that is responsible for the gathering of the performance metric (information). A collector is a portion of the code that is included in the experiment plugin. Metric : The entity, which the collector/experiment is gathering. A time, occurrence counter, or other entity, which reflects in some way on the applications performance and is gathered by a performance experiment (by the collector). Framework: The set of API functions that allows the user interface to manage the creation and viewing of performance experiments. It is the interface between the user interface and the cluster support and dynamic instrumentation components.

Credits Presentation –William Hachfeld –Helen Stepanov –Al Stipek –Jack Carter Partners –DOE/NNSA via ASC PathForward LLNL, LANL, Sandia National Laboratories

Credits Continued Technology –D yninst: jointly developed by the University of Wisconsin and the University of Maryland - –DPCL : IBM ® Open Source Project -

Credits Continued Technology (Continued) –Python: Open Source Project - –PAPI: University of Tennessee - –SQLite: Open Source Project - –Qt ® : GUI toolkit -