Download presentation
Presentation is loading. Please wait.
1
Interactive Data Analysis on the Grid with JAS and Globus Interactive Data Analysis on the Grid with JAS and Globus David Alexander, Brian Miller, & John Exby Tech-X Corporation (www.techxhome.com) Boulder, Colorado Tony Johnson, Massimiliano Turri, & Booker Bense Stanford Linear Accelerator Center Menlo Park, California Supported by U.S. Department of Energy Small Business Innovative Research Grant DE-FG03-02ER83556 and Stanford Linear Accelerator Center TechXHome.com
2
Project Overview Started with Java Analysis Studio (JAS) –Has distributed analysis system based on RMI Set up test grids on Linux clusters –Used Globus Toolkit 2.0 –Each node had GRAM & GridFTP servers and Java Runtime Environment Wrote a JAS grid plug-in –Used Java CoG Kit 0.9 Demonstrated at SC2002 –Hit remote and on-site cluster TechXHome.com
3
Java Analysis Studio (JAS) jas.freehep.org Open source application –Built for interactive data analysis, but flexible & modularized Publication quality plotting facilities User writes Java code to analyze data TechXHome.com
4
Abstracted data source interface –Modules are written to work with a variety of file formats (PAW, HIPPO, AIDA, Root, ODBC, flat files, SIO, HEP) Distributed System Available Versatile & Well used in high energy physics –Pure Java (Portable, Web Start installation & upgrade) –Flexible topology (stand-alone, client/server, cluster) –Integration w/ BaBar, Geant4, Wired TechXHome.com Java Analysis Studio (JAS) jas.freehep.org
5
Design Ideas & Added Features Goal: clustered deployment, launch, & federationGoal: clustered deployment, launch, & federation Special JAS Job useSpecial JAS Job use Minimal prerequisites:Minimal prerequisites: –Bare grid: Globus, Java, nothing else –Heterogeneous cluster –Off-grid (or not) client, data, codebase –Clients don’t need to be superusers Optional background deploymentOptional background deployment Single sign onSingle sign on TechXHome.com
6
About Resource Discovery Resource discovery –Software needs location of data files –Software needs location of Java-enabled hosts –Pluggable LDIF source (MDS, URL of text file) Community Authorization Service –Fine-grained access control –Is resource discovery in a way
7
Move code to data with GridFTP Location transparencyLocation transparency –User sees data sets –Could also have user choice Automatic deployment of JASAutomatic deployment of JAS –Multi-threaded task set –Verification of code version, GridFTP codebase to node if new –GridFTP/link data to user sandbox –Deploy control and catalog servers only on cluster head node –Worker nodes wait for catalog server to run TechXHome.com
8
Launch Application with GlobusRun Automatic launch of Java serversAutomatic launch of Java servers –Java Data Servers are run on specified JRE-enabled nodes Special Grid Job is now started (exit the Wizard)Special Grid Job is now started (exit the Wizard) Code loaded into client or written in editorCode loaded into client or written in editor -compiled -automatically distributed to Java Data Servers -results (std out, std err, & histograms) sent back TechXHome.com
9
A few more Impressive Features User can stop analysis, change code, & restart.User can stop analysis, change code, & restart. Distributed debugging can catch individual node failures.Distributed debugging can catch individual node failures. Histogram re-bin slider surprisingly responsiveHistogram re-bin slider surprisingly responsive TechXHome.com
10
Headaches and Issues Versions of Globus vs. Java CoG KitVersions of Globus vs. Java CoG Kit CoG properties configurationCoG properties configuration Client & server clocks disagreeClient & server clocks disagree MS-Windows text line breaksMS-Windows text line breaks Abandoned jobsAbandoned jobs FirewallsFirewalls TechXHome.com
11
Future Ideas Upgrade to Globus Toolkit 3Upgrade to Globus Toolkit 3 Pre-install code on cluster head or portal machine and deploy from therePre-install code on cluster head or portal machine and deploy from there Use more grid services (Condor, Replica)Use more grid services (Condor, Replica) Implement interfaces or service descriptions from PPDG CS-11 group.Implement interfaces or service descriptions from PPDG CS-11 group. TechXHome.com
12
Further Information on JAS for the latest on JAS see the 3pm Catogory 9 paper JAS3 - A general purpose data analysis framework for HENP and beyond. CONTACTS David Alexander, alexanda@txcorp.comalexanda@txcorp.com Brian Miller, bmiller@txcorp.combmiller@txcorp.com Tony Johnson, tony_johnson@SLAC.stanford.edutony_johnson@SLAC.stanford.edu Massimiliano Turri, turri@SLAC.stanford.edu Java Analysis Studio, http://jas.freehep.org
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.