High Volume Batch Submission System for Earthquake Engineering ( Batchsubmit ) By Anup Thomas Hacker Gregory Rodgers.

Slides:



Advertisements
Similar presentations
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Advertisements

SACNAS, Sept 29-Oct 1, 2005, Denver, CO What is Cyberinfrastructure? The Computer Science Perspective Dr. Chaitan Baru Project Director, The Geosciences.
JUAN M. CAICEDO UNIVERSITY OF SOUTH CAROLINA USING SAP2000 ON NEESHUB 1.
A Computation Management Agent for Multi-Institutional Grids
Ellen M. Rathje University of Texas at Austin NEEScomm IT Development Team.
The Vision, Process, and Requirements for Creating EarthCube Presentation at Second EarthCube WebEx Aug 22, 2011.
NEESGrid: Early Experiences in CyberInfrastructure Charles Severance University of Michigan Sakai Project, NEESGrid Project, OGCE Project
Teleoperation and Teleparticipation of Instructional Shake Tables Using the NEES Cyberinfrastructure S.J. Dyke 1, Z. Jiang 2, R. Christenson 2, X. Gao.
The NEESacademy: A strong foundation for education and outreach Sean Brophy, Thalia Anagnos, Barbara Cooper, Jason Lambert and Jared West.
NEESGrid - A Collaborative Portal Study Charles Severance University of Michigan NEESGrid SI Team
Brian Rohler Rajesh Thyagarajan Hubbub 2012 IUPUI Conference Center, Indianapolis, IN September 25 th, 2012.
An Internet-based Educational Platform for Earthquake Engineering Laboratory Experiments Chencheng Wu, Le Yu, Shirley J. Dyke, Christopher Beeler and Jian.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
NEEShub Simulation Capabilities February 17, 2012 Webinar George E Brown, JR. Network for Earthquake Engineering Simulation.
NEES, NEES2, and OSG Thomas Hacker Purdue University.
Thomas Hacker Barb Fossum Matthew Lawrence Open Science Grid May 19, 2011.
Ellen M. Rathje University of Texas at Austin NEEScomm IT Development Team.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
George E. Brown, Jr. Network for Earthquake Engineering Simulation Publication of Research Data in the NEEShub Stanislav Pejša, NEEScomm Data Curator HUBbub.
Dr. Thomas Hacker Co-Leader for Information Technology George E. Brown Network for Earthquake Engineering Simulation Assistant Professor, Computer & Information.
Jeopardy Final Jeopardy MATHNEES Engineering Physical Earth Science Science Version 3 July 16, 2011.
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
National Center for Supercomputing Applications The Computational Chemistry Grid: Production Cyberinfrastructure for Computational Chemistry PI: John Connolly.
NEESgrid: Where Infrastructure Meets Cyberinfrastructure Kim Mish Presidential Professor of Structural Engineering Director, Donald G. Fears Structural.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
OpenSees on NEEShub Frank McKenna UC Berkeley. Bell’s Law Bell's Law of Computer Class formation was discovered about It states that technology.
Page 1 Simulation in NEEShub NEES & MCEER Annual Meeting 2011 Session #11: High Performance Computing and Simulation Gregory P. Rodgers Ph.D. NEESComm.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Teleoperation and Teleparticipation of Instructional Shake Tables Using the NEES Cyberinfrastructure S.J. Dyke 1, Z. Jiang 2, R. Christenson 2, X. Gao.
October 21, 2015 XSEDE Technology Insertion Service Identifying and Evaluating the Next Generation of Cyberinfrastructure Software for Science Tim Cockerill.
Our challenge is to “reduce the impact of earthquakes and tsunamis on society through Research, Engineering, Science, and Education...” Dr. Barbara Cooper,
Rachelle Howell and Ellen M. Rathje University of Texas at Austin NEEScomm IT Development Team.
Kurt Mueller San Diego Supercomputer Center NPACI HotPage Updates.
The Academic Impact of NEES Ian Buckle University of Nevada Reno.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
George E. Brown, Jr. Network for Earthquake Engineering Simulation Opening Remarks Prof. Julio Ramirez NEEScomm Center Director and NEES Chief Officer.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation GEON Speaker.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation Alternative.
GPO’s Federal Digital System December 10, 2009 U.S. Government Printing Office.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation Analyzing.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation Enabling.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
Interactive Workflows Branislav Šimo, Ondrej Habala, Ladislav Hluchý Institute of Informatics, Slovak Academy of Sciences.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation NEES TeraGrid.
1 NSF/TeraGrid Science Advisory Board Meeting July 19-20, San Diego, CA Brief TeraGrid Overview and Expectations of Science Advisory Board John Towns TeraGrid.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
George E. Brown, Jr. Network for Earthquake Engineering Simulation 4 th regular meeting of the NEES preservation advisory committee Stanislav Pejša
NEES Networking Needs The NEES MRE: Where the Infrastructure Community Meets the Cyberinfrastructure Community Kyran (Kim) Mish, Director Center for Computational.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
2011 Calendar Important Dates/Events/Homework. SunSatFriThursWedTuesMon January
Internet2 Applications Group: Renater Group Presentation T. Charles Yun Internet2 Program Manager, Applications Group 30 October 2001.
Experiences Running Seismic Hazard Workflows Scott Callaghan Southern California Earthquake Center University of Southern California SC13 Workflow BoF.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
The Gateway Computational Web Portal Marlon Pierce Indiana University March 15, 2002.
HUBbub 2013: Developing hub tools that submit HPC jobs Rob Campbell Purdue University Thursday, September 5, 2013.
A Research Collaboratory for Open Source Software Research Yongqin Gao, Matt van Antwerp, Scott Christley, Greg Madey Computer Science & Engineering University.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Get Data to Computation eudat.eu/b2stage B2STAGE How to shift large amounts of data Version 4 February 2016 This work is licensed under the.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
Simulation Production System
Integrating Scientific Tools and Web Portals
NEESGrid - A Collaborative Portal Study
Overview of Workflows: Why Use Them?
2015 January February March April May June July August September
Presentation transcript:

High Volume Batch Submission System for Earthquake Engineering ( Batchsubmit ) By Anup Thomas Hacker Gregory Rodgers

Outline  Overview of NEES  Need for Batchsubmit  Batchsubmit Features  Batchsubmit Examples  Batchsubmit Architecture  Project Warehouse  Lessons Learned

NSF Network for Earthquake Engineering Simulation (NEES)  Safer buildings and civil infrastructure are needed to reduce damage and loss from earthquakes and tsunamis  To facilitate research to improve seismic design of buildings and civil infrastructure, the National Science Foundation established NEES  NEES Objectives  Develop a national, multi-user, research infrastructure to support research and innovation in earthquake and tsunami loss reduction  Create an educated workforce in hazard mitigation  Conduct broader outreach and lifelong learning activities

Vision for NEES  Facilitate access to the world's best integrated network of state-of- the art physical simulation facilities  Build a cyber-enabled community that shares ideas, data, and computational tools and models.  Promote education and training for the next generation of researchers and practitioners.  Cultivate partnerships with other organizations to disseminate research results, leverage cyberinfrastructure, and reduce risk by transferring results into practice.

NEES Research Facilities  NEES has a broad set of experimental facilities  Each type of equipment produces unique data  Located at 14 sites across the United States  Shake Table, Tsunami Wave Basin  Large-Scale Testing Facilities  Centrifuge, Field and Mobile Facilities  Large-Displacement Facility  Cyberinfrastructure

University of California Santa Barbara University of California San Diego University of California Los Angeles University of California Davis Lehigh University Rensselaer Polytechnic Institute 0 Cornell University University of Buffalo University of Minnesota University of Illinois- Urbana Oregon State University University of California Berkeley University of Nevada Reno University of Texas Austin

NEEShub ( )

21 st Century Science  Theory and Experiment  Computational Simulation  Third leg of science  Past 50 years or so  Data  Fourth “leg” of science  Researchers are flooded with data  Tremendous quantity and multiple scales of data  Difficult to collect, store, and manage  How can we distill meaningful knowledge from data?

Data is the 4 th Paradigm  Producing an avalanche of high resolution digital data  All (or most) of the data needs to be accessible over a long period of time  Much of the data is not reproducible  Example – NEES project  Structure or sample destroyed through testing  Very expensive to rebuild for more tests

Need for High Performance Computing(HPC) Simulation  Earthquake engineering problems:  Involves intense computation on large volumes of data  Will take days to complete simulation on normal computers  Earthquake engineering problems on an HPC resource takes hours to complete  Using HPC resources for solving computationally intensive problems results in huge time savings  Different HPC resources include:  XSEDE: Kraken, Stampede  Purdue: Hansen, Carter  Open Science Grid (OSG)

Why Batchsubmit? Different steps of job submission on an HPC resource includes:  Login to the remote venue and copy all input files  Ensure that the simulation program is available on the remote venue and copy program to the venue if necessary  Prepare venue specific script file specifying parameters for job submission  Submit the job to the correct job class and wait for the result  Copy the results to a local system for analysis and visualization

Why Batchsubmit?  Repeating this process for multiple simulations is tedious and time consuming  Batchsubmit automatically performs all the steps for submitting a job in an HPC resource  User can submit jobs using batchsubmit to multiple venues  Batchsubmit will intimate the user when job finishes execution and all results will be available in user job directory  Batchsubmit makes job submission easier and results in huge time savings

Batchsubmit Features  Simple command line interface and GUI interface  Easy to do parallel processing  Asynchronous job submission  Select HPC resource(venue) to run the job  Send executables to the venue, if required

Batchsubmit Features  Automatic retrieval of results  notification once results are available  Monitor job status  Monitor queue traffic at different venues  Cancel a job

Batchsubmit Examples  batchsubmit date  batchsubmit --venue carter --ncpus 16 OpenSeesMP /apps/opensees/NEEShubExamples/SmallMP/Example.tcl  batchsubmit --venue stampede --ncpus 64 --appdir /apps/share64/opensees/stampede --rcopyindir OpenSeesMP /LargeMP/Example.tcl

Architecture

Architecture: Workspace

Architecture: Batchsubmit Client JOB STATUS = PRESUBMIT

Architecture: Batchsubmit Server JOB STATUS = SUBMIT

Architecture: HPC Resource JOB STATUS = COMPLETE

Architecture: Batchsubmit Server JOB STATUS = COMPLETE

Architecture: Workspace JOB STATUS = COMPLETE

Accessing Batchsubmit  Workspace Tool  Linux Desktop in the browser  OpenSees Laboratory  Parallel Job Submission mode uses Batchsubmit

Project Warehouse  NEES online data repository  Allows researchers to upload, archive, and disseminate data from their physical, cyber, and hybrid experiments and simulations.  Provides user with tools to organize data into projects, experiments, trials, and results.  The data in the Project Warehouse can be cited and shared with researchers across the world, or only within a research group.  Facility to curate the data in the repository to ensure the long-term preservation of valuable NEES data.

Usage Statistics YearMonthSubmitted Number of Jobs 2012January1453 February1525 March1466 April1191 May92461 June18519 July1849 August4553 September3341 October6352 November2712 December January5260 February2898

Lessons Learned  Need for community gateway  Selection of correct venue  Specifying correct walltime limit  Not exceeding the venue specific parameter limits  Need for data backup

Lessons Learned  Asynchronous method, hence notifications are necessary  Listing available venues  Showing how busy are the queues of different venues  Need for proper documentation and tutorials

Summary  Batchsubmit makes parallel job submission easy for the user  Batchsubmit is Asynchronous – does not require the session to be alive till job finishes execution  Batchsubmit provides easy access to multiple supercomputers.  Batchsubmit system could be easily extended for use by other science and engineering domains.