Interactive Data Analysis on the Grid with JAS and Globus

Presentation transcript:

Interactive Data Analysis on the Grid with JAS and Globus
David Alexander, Brian Miller, & John Exby, Tech-X Corporation, Boulder, Colorado
Tony Johnson, Massimiliano Turri, & Booker Bense, Stanford Linear Accelerator Center, Menlo Park, California
Supported by U.S. Department of Energy Small Business Innovative Research Grant DE-FG03-02ER83556 and Stanford Linear Accelerator Center
TechXHome.com

Project Overview
Started with Java Analysis Studio (JAS)
– Has a distributed analysis system based on RMI
Set up test grids on Linux clusters
– Used Globus Toolkit 2.0
– Each node had GRAM & GridFTP servers and a Java Runtime Environment
Wrote a JAS grid plug-in
– Used Java CoG Kit 0.9
Demonstrated at SC2002
– Ran against both a remote and an on-site cluster

Java Analysis Studio (JAS)
jas.freehep.org
Open source application
– Built for interactive data analysis, but flexible & modularized
Publication quality plotting facilities
User writes Java code to analyze data

Java Analysis Studio (JAS)
jas.freehep.org
Abstracted data source interface
– Modules are written to work with a variety of file formats (PAW, HIPPO, AIDA, Root, ODBC, flat files, SIO, HEP)
Distributed System Available
Versatile & Well used in high energy physics
– Pure Java (Portable, Web Start installation & upgrade)
– Flexible topology (stand-alone, client/server, cluster)
– Integration w/ BaBar, Geant4, Wired
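
The idea of an abstracted data source can be sketched as follows. This is a minimal illustration, not the actual JAS API: the names RecordSource, FlatTextSource, and ColumnStats are hypothetical. Analysis code is written against one interface, and each file format would be supported by its own module implementing it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical interface (not the JAS API): analysis code sees only named columns and rows.
interface RecordSource extends Iterable<double[]> {
    String[] columnNames();
}

// One module per format. This one reads whitespace-separated flat-file text
// whose first line holds the column names.
class FlatTextSource implements RecordSource {
    private final String[] columns;
    private final List<double[]> rows = new ArrayList<>();

    FlatTextSource(String text) {
        String[] lines = text.split("\\R");
        columns = lines[0].trim().split("\\s+");
        for (int i = 1; i < lines.length; i++) {
            if (lines[i].trim().isEmpty()) continue; // skip blank lines
            String[] fields = lines[i].trim().split("\\s+");
            double[] row = new double[fields.length];
            for (int j = 0; j < fields.length; j++) {
                row[j] = Double.parseDouble(fields[j]);
            }
            rows.add(row);
        }
    }

    public String[] columnNames() { return columns.clone(); }

    public Iterator<double[]> iterator() { return rows.iterator(); }
}

// Analysis code that works with any RecordSource, whatever the underlying format.
class ColumnStats {
    static double mean(RecordSource source, String column) {
        int index = Arrays.asList(source.columnNames()).indexOf(column);
        if (index < 0) throw new IllegalArgumentException("No such column: " + column);
        double sum = 0;
        int n = 0;
        for (double[] row : source) {
            sum += row[index];
            n++;
        }
        return n == 0 ? Double.NaN : sum / n;
    }
}
```

Supporting another format would then mean adding another RecordSource implementation, with ColumnStats left unchanged.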

Design Ideas & Added Features
Goal: clustered deployment, launch, & federation
Special JAS Job use
Minimal prerequisites:
– Bare grid: Globus, Java, nothing else
– Heterogeneous cluster
– Off-grid (or not) client, data, codebase
– Clients don’t need to be superusers
Optional background deployment
Single sign on

About Resource Discovery
Resource discovery
– Software needs location of data files
– Software needs location of Java-enabled hosts
– Pluggable LDIF source (MDS, URL of text file)
Community Authorization Service
– Fine-grained access control
– Is itself a form of resource discovery, in a way
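
A rough sketch of what reading an LDIF source for Java-enabled hosts might look like is given below. The attribute names hostName and javaEnabled are assumptions made for illustration, not the MDS schema, and the parser handles only plain "attribute: value" lines. Because the parser takes text, the same code would serve whether the LDIF came from an MDS query or from a text file at a URL.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class LdifHosts {
    // Parses simple LDIF: "attribute: value" lines, with a blank line between entries.
    // Folded lines, base64 values, and repeated attributes are not handled in this sketch.
    static List<Map<String, String>> parse(String ldif) {
        List<Map<String, String>> entries = new ArrayList<>();
        Map<String, String> current = new LinkedHashMap<>();
        for (String line : ldif.split("\\R", -1)) {
            if (line.trim().isEmpty()) {
                // A blank line closes the current entry.
                if (!current.isEmpty()) {
                    entries.add(current);
                    current = new LinkedHashMap<>();
                }
                continue;
            }
            if (line.startsWith("#")) continue; // LDIF comment line
            int colon = line.indexOf(':');
            if (colon > 0) {
                current.put(line.substring(0, colon).trim(), line.substring(colon + 1).trim());
            }
        }
        if (!current.isEmpty()) entries.add(current);
        return entries;
    }

    // Selects hosts whose (hypothetical) javaEnabled attribute is "true".
    static List<String> javaEnabledHosts(List<Map<String, String>> entries) {
        List<String> hosts = new ArrayList<>();
        for (Map<String, String> entry : entries) {
            if ("true".equalsIgnoreCase(entry.get("javaEnabled")) && entry.containsKey("hostName")) {
                hosts.add(entry.get("hostName"));
            }
        }
        return hosts;
    }
}
```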

Move code to data with GridFTP
Location transparency
– User sees data sets
– Could also have user choice
Automatic deployment of JAS
– Multi-threaded task set
– Verification of code version, GridFTP codebase to node if new
– GridFTP/link data to user sandbox
– Deploy control and catalog servers only on cluster head node
– Worker nodes wait for catalog server to run
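
The "verify version, copy only if new" step, run as a multi-threaded task set, might be sketched like this. The NodeTransfer interface is a hypothetical stand-in for the GridFTP layer (a real plug-in would wrap GridFTP calls there), and FakeTransfer exists only so the sketch can run without a grid.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical stand-in for the transfer layer; not part of Globus or the Java CoG Kit.
interface NodeTransfer {
    String installedVersion(String host);           // null if nothing is installed
    void copyCodebase(String host, String version);
}

class Deployer {
    // Checks every node in parallel and copies the codebase only where the version differs.
    // Returns the hosts that received a fresh copy, in the order they were given.
    static List<String> deploy(List<String> hosts, String version, NodeTransfer transfer, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<String>> pending = new ArrayList<>();
            for (String host : hosts) {
                Callable<String> task = () -> {
                    if (version.equals(transfer.installedVersion(host))) {
                        return null; // already up to date, nothing to copy
                    }
                    transfer.copyCodebase(host, version);
                    return host;
                };
                pending.add(pool.submit(task));
            }
            List<String> updated = new ArrayList<>();
            for (Future<String> future : pending) {
                String host = future.get(); // a failure on one node surfaces here
                if (host != null) updated.add(host);
            }
            return updated;
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException("Deployment failed", e);
        } finally {
            pool.shutdown();
        }
    }
}

// In-memory NodeTransfer used only to exercise the sketch.
class FakeTransfer implements NodeTransfer {
    final Map<String, String> installed = new ConcurrentHashMap<>();

    public String installedVersion(String host) { return installed.get(host); }

    public void copyCodebase(String host, String version) { installed.put(host, version); }
}
```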

Launch Application with GlobusRun
Automatic launch of Java servers
– Java Data Servers are run on specified JRE-enabled nodes
Special Grid Job is now started (exit the Wizard)
Code loaded into client or written in editor
– compiled
– automatically distributed to Java Data Servers
– results (std out, std err, & histograms) sent back
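
Jobs submitted through GRAM are described in RSL, so launching a Java server comes down to building a job description like the one below. This is only a sketch: the paths, jar name, and class name in the usage line are invented, and the actual job description used by the plug-in is not given in the slides. The helper refuses values containing double quotes rather than attempting to escape them.

```java
import java.util.List;

class RslBuilder {
    // Wraps a value in double quotes for use in an RSL relation.
    static String quote(String value) {
        if (value.indexOf('"') >= 0) {
            throw new IllegalArgumentException("Embedded quotes are not handled in this sketch");
        }
        return "\"" + value + "\"";
    }

    // Builds an RSL string that runs a Java executable with the given arguments.
    static String javaServerJob(String javaPath, String directory, List<String> arguments) {
        StringBuilder rsl = new StringBuilder("&");
        rsl.append("(executable=").append(quote(javaPath)).append(")");
        rsl.append("(directory=").append(quote(directory)).append(")");
        rsl.append("(arguments=");
        for (int i = 0; i < arguments.size(); i++) {
            if (i > 0) rsl.append(' ');
            rsl.append(quote(arguments.get(i)));
        }
        rsl.append(")");
        return rsl.toString();
    }
}
```

For example, javaServerJob("/usr/bin/java", "/tmp/jas", Arrays.asList("-cp", "server.jar", "ExampleDataServer")) yields a single-line description that could be handed to a job submission call or command.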

A few more Impressive Features
User can stop analysis, change code, & restart.
Distributed debugging can catch individual node failures.
Histogram re-bin slider surprisingly responsive
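
One plausible way for a re-bin control to stay responsive is to keep finely binned counts and merge adjacent bins on demand. The slides do not say how JAS implements its slider, so the following is an assumption-laden sketch of the general technique, with a hypothetical class name.

```java
class Rebin {
    // Merges each run of `factor` adjacent bins into one coarser bin.
    // If the bin count is not a multiple of `factor`, the last output bin
    // covers the leftover bins and is therefore narrower than the others.
    static double[] rebin(double[] counts, int factor) {
        if (factor < 1) throw new IllegalArgumentException("factor must be at least 1");
        int coarseBins = (counts.length + factor - 1) / factor; // ceiling division
        double[] merged = new double[coarseBins];
        for (int i = 0; i < counts.length; i++) {
            merged[i / factor] += counts[i];
        }
        return merged;
    }
}
```

Because only the bin counts are touched, the cost of each slider movement depends on the number of bins and not on the number of events analyzed.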

Headaches and Issues
Versions of Globus vs. Java CoG Kit
CoG properties configuration
Client & server clocks disagree
MS-Windows text line breaks
Abandoned jobs
Firewalls
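
The line-break issue is the easiest of these to illustrate. A common workaround, shown here as a sketch with a hypothetical helper name and not as what the authors did, is to normalize text to Unix line endings before sending it from a Windows client to Linux nodes.

```java
class LineEndings {
    // Converts Windows (CR LF) and bare CR line breaks to Unix LF.
    static String toUnix(String text) {
        return text.replace("\r\n", "\n").replace('\r', '\n');
    }
}
```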

Future Ideas
Upgrade to Globus Toolkit 3
Pre-install code on cluster head or portal machine and deploy from there
Use more grid services (Condor, Replica)
Implement interfaces or service descriptions from PPDG CS-11 group.

Further Information on JAS
For the latest on JAS, see the 3pm Category 9 paper, JAS3 - A general purpose data analysis framework for HENP and beyond.
CONTACTS: David Alexander, Brian Miller, Tony Johnson, Massimiliano Turri; Java Analysis Studio