LLNL-PRES-679957 This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract DE-AC52-07NA27344.

Slides:



Advertisements
Similar presentations
CLIENT SERVICE, IT(IL) BEST PRACTICES & REQUEST TRACKER ON A FEDERATED IT CAMPUS CLICK CLICK
Advertisements

The Datacenter Needs an Operating System Matei Zaharia, Benjamin Hindman, Andy Konwinski, Ali Ghodsi, Anthony Joseph, Randy Katz, Scott Shenker, Ion Stoica.
Lawrence Livermore National Laboratory ROSE Compiler Project Computational Exascale Workshop December 2010 Dan Quinlan Chunhua Liao, Justin Too, Robb Matzke,
1 Slides presented by Hank Childs at the VACET/SDM workshop at the SDM Center All-Hands Meeting. November 26, 2007 Snoqualmie, Wa Work performed under.
The Joel Test: 12 Steps to Better Code By Tim Denton.
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
Presented by Scalable Systems Software Project Al Geist Computer Science Research Group Computer Science and Mathematics Division Research supported by.
DataFoundry: An Approach to Scientific Data Integration Terence Critchlow Ron Musick Ida Lozares Center for Applied Scientific Computing Tom SlezakKrzystof.
1 A Web-Based Integral Evaluator: A Demonstration of the Successful Integration of WebEQ, Maple, and Java Wanda M. Kunkle Department of Mathematics & Computer.
Making the Most of What We Know: Towards Effective Use of Genomics Data Terence Critchlow Center for Applied Scientific Computing Lawrence Livermore National.
VisIt Software Engineering Infrastructure and Release Process LLNL-PRES Lawrence Livermore National Laboratory, P. O. Box 808, Livermore,
Computing and Data Infrastructure for Large-Scale Science Deploying Production Grids: NASA’s IPG and DOE’s Science Grid William E. Johnston
21 21 Web Content Management Architectures Vagan Terziyan MIT Department, University of Jyvaskyla, AI Department, Kharkov National University of Radioelectronics.
Computer Science 1620 Programming & Problem Solving.
Connecting Diverse Web Search Facilities Udi Manber, Peter Bigot Department of Computer Science University of Arizona Aida Gikouria - M471 University of.
Service Broker Lesson 11. Skills Matrix Service Broker Service Broker, provides a solution to common problems with message delivery and consistency that.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
2012 National BDPA Technology Conference Creating Rich Data Visualizations using the Google API Yolanda M. Davis Senior Software Engineer AdvancED August.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Lawrence Livermore National Laboratory A system for strong local account management. SLAM David Frye Lawrence Livermore National Laboratory, P. O. Box.
Database Update Kaveh Ranjbar Database Department Manager, RIPE NCC.
Practical Configuration Management Seattle Delphi Users Group June 6, 2002.
The Cluster Computing Project Robert L. Tureman Paul D. Camp Community College.
Integrating HPC into the ATLAS Distributed Computing environment Doug Benjamin Duke University.
Cloud Computing Computer Science Innovations, LLC.
Summary of distributed tools of potential use for JRA3 Dugan Witherick HPC Programmer for the Miracle Consortium University College.
Real World Case Study KM Summer Institute June Rano Joshi, Vorsite.
Sage ACT! 2013 SDK Update Brian P. Mowka March 23, 2012 Template date: October 2010.
The Right OS for Your Job Major: Computer Science Instructor: Dr Anvari Presenter: Ke Huang Student ID:
New perfSonar Dashboard Andy Lake, Tom Wlodek. What is the dashboard? I assume that everybody is familiar with the “old dashboard”:
Robin Mullinix Systems Analyst GeorgiaFIRST Financials PeopleSoft Query: The Next Step.
Turning science problems into HTC jobs Wednesday, July 29, 2011 Zach Miller Condor Team University of Wisconsin-Madison.
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
Derek Wright Computer Sciences Department University of Wisconsin-Madison MPI Scheduling in Condor: An.
LLNL-PRES-?????? This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Mtivity Client Support System Quick start guide. Mtivity Client Support System We are very pleased to announce the launch of a new Client Support System.
Martin Schulz Center for Applied Scientific Computing Lawrence Livermore National Laboratory ASC STAT Team: Greg Lee, Dong Ahn (LLNL), Dane Gardner (LANL)
Lawrence Livermore National Laboratory Centralized Desktop Management at LLNL A Major Paradigm Shift CDM David Frye This work performed under the auspices.
A Data Access Framework for ESMF Model Outputs Roland Schweitzer Steve Hankin Jonathan Callahan Kevin O’Brien Ansley Manke.
JINI Coordination-Based System By Anthony Friel * David Kiernan * Jasper Wood.
Datalayer Notebook Allows Data Scientists to Play with Big Data, Build Innovative Models, and Share Results Easily on Microsoft Azure MICROSOFT AZURE ISV.
Adapting the Electronic Laboratory Notebook for the Semantic Era Tara Talbott, Michael Peterson, Jens Schwidder, James D. Myers 2005 International Symposium.
National Aeronautics and Space Administration ESGF & UV-CDAT Face-to-Face December 2015 Overview of the ESGF Compute Working Team (CWT) and.
LLNL’s Data Center and Interoperable Services 5 th Annual ESGF Face-to-Face Conference ESGF 2015 Monterey, CA, USA Dean N. Williams, Tony Hoang, Cameron.
SSS Build and Configuration Management Update February 24, 2003 Narayan Desai
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Midterm Exam Review Notes William Cohen 1. General hints in studying Understand what you’ve done and why – There will be questions that test your understanding.
Comprehensive Project Management Solutions with the.NET Server family.
ESGF-SWT PART 2 ESGF-SWT Members Katharina Berger Nicolas Careton Prashanth Dqarakanath Matthew Harris Georgi Kostov Torsten Rathmann Karl Taylor Frank.
LLNL-PRES-XXXXXX This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract DE-AC52-07NA27344.
Improving User Access to Metadata for Public and Restricted Use US Federal Statistical Files William C. Block Jeremy Williams Lars Vilhuber Carl Lagoze.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
LLNL-PRES-XXXXXX This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract DE-AC52-07NA27344.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
INFO 344 Web Tools And Development CK Wang University of Washington Spring 2014.
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
CS Class 04 Topics  Selection statement – IF  Expressions  More practice writing simple C++ programs Announcements  Read pages for next.
A S P. Outline  The introduction of ASP  Why we choose ASP  How ASP works  Basic syntax rule of ASP  ASP’S object model  Limitations of ASP  Summary.
Advanced Higher Computing Science
Autonomy Paradigm Warning: This document is a part of my “Responsible Programming” theme. All docs related to that theme just gather some of my ideas.
Network Controllable MP3 Player
Big Data is a Big Deal!.
Topic: Functions – Part 1
The Client-Server Model
Topic: Functions – Part 2
PHY 114 A General Physics II 11 AM-12:15 PM TR Olin 101
Building and running HPC apps in Windows Azure
PyWBEM Python WBEM Client: Overview #2
Lecture 6 - Recursion.
Presentation transcript:

LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under contract DE-AC52-07NA Lawrence Livermore National Security, LLC Web Services Processing, Application Programming Interface Charles Doutriaux Sasha Ames Tom Maxwell Dan Duffy Dean Williams December 9 th, 2015

LLNL-PRES Overview  As computer power goes up, so does data Volume  Scientists generating and analyzing these data are many and dispersed  BUT the scientific need is greater than ever  ESGF solved the first part of the equation: universal, distributed access  Bringing all needed data to your computer or even to your facility is no longer feasible  Now we need to solve the analysis part.

LLNL-PRES  The ESGF-CWT is putting together an infrastructure for WPS  This talk is about the API part  API is two fold: — Developers: Common ground for creating new tools — Users: Standard way to querying/using resources  Goal: Ease things as much as possible for user, i.e. — What services are here? — Can I get their doc? — Let’s use it  As much decision as possible made for the user (but we still let these to be known to and forced by the user) 3 ESGF-CWT Solution

LLNL-PRES Basic Architecture Server Side Services Client Side ESGF API

LLNL-PRES  Documented at: climate.atlassian.net/wiki/display/ESGF/API+Standards+and+Re quirementshttps://acme- climate.atlassian.net/wiki/display/ESGF/API+Standards+and+Re quirements  First pass, will likely be tweaked/enhanced as more developers and users get involved  Focusing on JSON input data.  First problems we’re trying to solve: — Model Average — Model Ensemble — Multi-models Ensemble  Cater very basic needs so far, needs to grow as more features are required. Hint: That’s YOU here. 5 API?

LLNL-PRES API (excerpts) request=Execute&identifier=averager &datainputs=[domain={'id':'glbl','longitude’:{'start':% ,%20'end':% },'time’:{'start’:'1980’,'end’:'1982'}}; variable={'uri':'file://opt/nfs/cwt/uvcdat/latest/share/uvcdat/sample_data/tas_dnm-95a.xml','id':'tas','domain':'glbl'}]

LLNL-PRES   VERY BASIC Demo serve — Django-based — Uses UV-CDAT for computation  Will probably grow into a real full blown pretty server  Code is at: please fork and issue as many PR as possible and/or use issue tracker to give us feedbacks.  Also take a look at what others presenting here have already done. Let’s try to leverage from each other. 7 Where do I start?

LLNL-PRES Example? (stick this in “process” directory of server) class Process(esgfcwtProcess): def __init__(self): """Process initialization""" WPSProcess.__init__(self, identifier=os.path.split(__file__)[-1].split('.')[0], title='averager', version=0.1, abstract='Average a variable over a (many) dimension', storeSupported='true', statusSupported='true') self.domain = self.addComplexInput(identifier='domain', title='domain over which to average', formats=[{'mimeType': 'text/json', 'encoding': 'utf-8', 'schema': None}]) self.dataIn = self.addComplexInput(identifier='variable', title='variable to average', formats=[{'mimeType': 'text/json'}], minOccurs=1, maxOccurs=1) self.download = self.addLiteralInput(identifier='download', type=bool, title='download output', default=False) self.average = self.addComplexOutput(identifier='average', title='averaged variable', formats=[{'mimeType': 'text/json'}]) def execute(self): dataIn=self.loadData()[0] data,cdms2keyargs = self.loadVariable(dataIn) dims = "".join(["(%s)" % x for x in cdms2keyargs.keys()]) data = cdutil.averager(data,axis=dims) data.id=self.getVariableName(dataIn) self.saveVariable(data,self.average,"json") return

LLNL-PRES  No. The API is designed to be backend agnostic  But: — ESGF-CWT will use UV-CDAT where appropriate — UV-CDAT will be officially supported and will be part of the “compute node stack” — No, your preferred application is not guaranteed to be fully supported and/or part of the esgf stack  Yes the API team will listen to you even if you do not use UV- CDAT  But really… You “should” be using it ;) It’s so much simpler and it makes sense to have everybody using the same tools 9 Do I have to use UV-CDAT?

LLNL-PRES  Tom Maxwell -> NASA  Maarten Plieger -> sort of  ACME 10 Anybody is using this?

LLNL-PRES  LOTS!  Tighter integration with ESGF — result search as URI? — esgf:// new uri type?  Testing! — We need some basic dataset to run tests on — We need a mechanism to document “correct” solution to a problem  Once this is in place we can move to distributed analysis — Which nodes carry my diagnostic? — Which one should I use? (is it close to my data, is it overloaded, etc…) — Resource management  Multiple implementation of same diagnostics: — MPI vs SLURM vs MPI+SLURM vs HADOOP vs SPARK, vs combinations, etc… Which one to trust which one is faster for me? 11 So… What’s next?

LLNL-PRES  Still in its infancy but crystalizing  The time is NOW, the more you wait the harder it will be to get your voice heard. 12 Summary