Trilinos I/O Support (TRIOS)

Slides:



Advertisements
Similar presentations
© 2006 DataCore Software Corp SANmotion New: Simple and Painless Data Migration for Windows Systems Note: Must be displayed using PowerPoint Slideshow.
Advertisements

Cloud Computing: Theirs, Mine and Ours Belinda G. Watkins, VP EIS - Network Computing FedEx Services March 11, 2011.
SSRS 2008 Architecture Improvements Scale-out SSRS 2008 Report Engine Scalability Improvements.
University of Chicago Department of Energy The Parallel and Grid I/O Perspective MPI, MPI-IO, NetCDF, and HDF5 are in common use Multi TB datasets also.
SALSA HPC Group School of Informatics and Computing Indiana University.
1 Presentation at SciDAC face-to-face January 2005 Ron A. Oldfield Sandia National Laboratories The Lightweight File System.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Technical Architectures
UNCLASSIFIED: LA-UR Data Infrastructure for Massive Scientific Visualization and Analysis James Ahrens & Christopher Mitchell Los Alamos National.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Introduction To Windows Azure Cloud
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
Data Intensive Computing at Sandia September 15, 2010 Andy Wilson Senior Member of Technical Staff Data Analysis and Visualization Sandia National Laboratories.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
Mainframe (Host) - Communications - User Interface - Business Logic - DBMS - Operating System - Storage (DB Files) Terminal (Display/Keyboard) Terminal.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Paradyn Project Paradyn / Dyninst Week Madison, Wisconsin April 29-May 3, 2013 Mr. Scan: Efficient Clustering with MRNet and GPUs Evan Samanas and Ben.
Towards Exascale File I/O Yutaka Ishikawa University of Tokyo, Japan 2009/05/21.
The HDF Group Milestone 5.1: Initial POSIX Function Shipping Demonstration Jerome Soumagne, Quincey Koziol 09/24/2013 © 2013 The HDF Group.
System-Directed Resilience for Exascale Platforms LDRD Proposal Ron Oldfield (PI)1423 Ron Brightwell1423 Jim Laros1422 Kevin Pedretti1423 Rolf.
Site Report DOECGF April 26, 2011 W. Alan Scott Sandia National Laboratories Sandia National Laboratories is a multi-program laboratory managed and operated.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Tackling I/O Issues 1 David Race 16 March 2010.
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
AZ PASS User Group Azure Data Factory Overview Josh Sivey, Solution Partner October
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
SOFTWARE TESTING TRAINING TOOLS SUPPORT FOR SOFTWARE TESTING Chapter 6 immaculateres 1.
Compute and Storage For the Farm at Jlab
Parallel Virtual File System (PVFS) a.k.a. OrangeFS
AuraPortal Cloud Helps Empower Organizations to Organize and Control Their Business Processes via Applications on the Microsoft Azure Cloud Platform MICROSOFT.
Developing IoT endpoints with mbed Client
Big Data is a Big Deal!.
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
Sushant Ahuja, Cassio Cristovao, Sameep Mohta
CIIT-Human Computer Interaction-CSC456-Fall-2015-Mr
Database Architectures and the Web
Status and Challenges: January 2017
GWE Core Grid Wizard Enterprise (
Open Source distributed document DB for an enterprise
Spark Presentation.
HDF5 October 8, 2017 Elena Pourmal Copyright 2016, The HDF Group.
Joseph JaJa, Mike Smorul, and Sangchul Song
What is SharePoint and why you should care
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Current status and future work
Structural Simulation Toolkit / Gem5 Integration
Unstructured Grids at Sandia National Labs
#01 Client/Server Computing
Chapter 3: Windows7 Part 4.
NSF : CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science PI: Geoffrey C. Fox Software: MIDAS HPC-ABDS.
Lock Ahead: Shared File Performance Improvements
SDM workshop Strawman report History and Progress and Goal.
Concept of VLAN (Virtual LAN) and Benefits
Scalable Parallel Interoperable Data Analytics Library
Polly Baker Division Director: Data, Mining, and Visualization
Introduction to Apache
Appcelerator Arrow: Build APIs in Minutes. Connect to Any Data Source
Overview of big data tools
DAT381 Team Development with SQL Server 2005
Software Acceleration in Hybrid Systems Xiaoqiao (XQ) Meng IBM T. J
Big-Data Analytics with Azure HDInsight
Mark Quirk Head of Technology Developer & Platform Group
#01 Client/Server Computing
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Trilinos I/O Support (TRIOS) Proposed I/O Software for FY11 Approved for General Release SAND 2010-7732P November 2010 Ron Oldfield and Gregory Sjaardema Sandia National Laboratories

Trios Overview and Goals Parallel I/O Support for required I/O libraries Exodus, Nemesis, IOSS TPLs: netCDF, HDF5 POC: Greg Sjaardema Co-Design Vehicle for I/O Research Provides “brand name” and common distribution point for rapid deployment and testing Planned Software Products for Trios Scalable I/O Libraries for sparse and dense matrices (from NGC project) In-Transit Data Services (from CSSE/SIO Research) Network Scalable Service Interface (Nessie) netCDF caching/staging service Fragment detection and tracking for CTH HPC Database services (from CSRF: HPC System Support for Informatics) SQL Services Rapid Ingest Capabilities POC: Ron Oldfield Trios Software Review November 2010

Impact of Efficient I/O Libraries Scaling Challenges for Trilinos-Based Document Clustering Strong scaling exposes weaknesses Original methods for loading were not designed for production use. Improvements to enable large scale Sparse Matrix Reads Keep track of mapping Parallel I/O Dense Matrix Reads Convert to binary format Data ordering Memory efficient algorithms Multi-pass dense-matrix multiply for cosine-similarity allows calculation of full dataset Previous version could not cluster 400K docs (one matrix had to be resident in memory on each process) Early Performance Results: Bible Dataset NSSI only Trios Software Review November 2010

Multilingual Document Clustering Performance on JaguarPF With I/O improvements, application is compute bound even at large scale Trios Software Review November 2010

In-Transit Data Services Data Processing Between Simulation and Storage Motivation Improve “effective” I/O performance by injecting data services between app and storage Leverage available compute/service nodes Network Scalable Service Interface (Nessie) Developed for the Lightweight FS Project Framework for HPC client/server development Designed for scalable data movement Asynchronous RPC-like API Examples Preprocessing for seismic imaging (ca.1995) netCDF staging service CTH particle detection/tracking SQL Proxy for HPC/Database Integration Sparse-matrix viz, real-time network analysis Client Application (compute nodes) I/O Service (compute/service nodes) Lustre File System Processed Data Raw Data Visualization Client Cache/aggregate/process Trios Software Review November 2010

In-Transit Data Services NetCDF Staging Service Motivation Synchronous I/O libraries require app to wait until data is on storage device Most of application I/O is “bursty” Not enough cache on compute nodes for typical async I/O approaches NetCDF is basis of important I/O libs at Sandia (no code changes required) NetCDF Caching Service Service aggregates/caches data and pushes data to storage Async I/O allows overlap of I/O and computation Client Application (compute nodes) NetCDF Service NetCDF requests Processed Data Lustre File System Cache/aggregate Trios Software Review November 2010

In-Transit Data Services CTH Fragment Detection Motivation Fragment detection requires data from every time step (I/O intensive) Detection process takes 30% of time-step calculation (scaling issues) Integrating detection software with CTH is intrusive on developer CTH fragment detection service Extra compute nodes provide in-line processing (overlap fragment detection with time step calculation) Only output fragments to storage (reduce I/O) Non-intrusive Looks like normal I/O (spyplot interface) Can be configured out-of-band Status Developing client/server stubs Developing Paraview proxy service CTH (compute nodes) Fragment-Detection Service (compute nodes) Raw Data Fragment Data Lustre File System spyplot Detect Fragments Visualization Client Fragment detection service provides on-the-fly data analysis with no modifications to CTH. Trios Software Review November 2010

Client Application on XT/XMT Database Service Provides Fast Access to Remote Database from HPC Application Client Application on XT/XMT (compute nodes) Storage Arrays Database Service (service node) Portals ODBC/NZLoad © Netezza Corporation Database Service Features Provides “bridge” between parallel apps and external DWA Runs on Cray XT/XMT network nodes Applications communicate with DB service using Nessie (over Portals or IB) Service-level access to Netezza through standard interface (e.g., ODBC, nzload) Trios Software Review November 2010

Motivating Application: NISAC/N-ABLE Modeling Economic Security Model economic impact of disruptions in infrastructure Changes in U.S. Border Security technologies Terrorist acts on commodity futures markets Transportation disruptions on regional agriculture and food supply chains Optimized military supply chains Electric power and rail transportation disruptions on chemical supply chains Compute and data challenges Models economy to the level of the individual firm Model transactions from 10s of millions of companies Simulation data ingested into DB for analysis DB ingest is bottleneck (10x time to simulate data) Time to solution is critical… want answers in hours Trios Software Review November 2010

Performance-Based Motivation for Database Service Bytes Written Time (sec) Database Ingest Performance ODBC is the only available interface for remote access. It’s the interface and protocols, not the network that’s the bottleneck Service can use NZLoad from the Netezza host Trios Software Review November 2010

Summary Trios is a perfect vehicle for I/O co-design Rapid deployment for efficient production-quality I/O libraries Exodus, Nemesis, IOSS (from Seacas) Sparse/Dense Matrix I/O (from NGC) Research vehicle for in-transit data services Goal is to make efficient use of platform to reduce burden on file system Already demonstrated value for Seismic (Salvo) Enables exploration of new functionality for Trilinos codes on HPC systems netCDF staging to manage bursts of I/O In-transit fragment detection/tracking: reduce storage system requirements Integration with Data Warehouse Appliances Interactive observation/control of HPC application Trios Software Review November 2010

Trios Software Products Planned Timeline for Integration and Release Seacas Exodus, Nemesis, IOSS (in Trilinos release 10.6) Plans to move out of Trios subdirectory to a new package named “Seacas” Sparse/Dense matrix I/O libraries Installed and tested: November 30, 2011 Research software Network-Scalable Service Interface (Nessie) Installed and testedJan 1, 2011 I’m not sure how to test services using the Trilinos testing framework netCDF staging service Installed with basic tests: Feb 1, 2011 Link with Exodus and evaluate performance: March 2011 CTH tracking service In development, need working demo for ASC level III milestone: Summer 2011 Type of release depends on copyright resolution: currently for internal release only Trios Software Review November 2010