Agent Technology for Data Analysis Tony Johnson - SLAC 21 st October 1998 WORKSHOP ON SCIENTIFIC DATA MANAGEMENT PROBLEMS AND SOLUTIONS.

Slides:



Advertisements
Similar presentations
Mobile Agents Mouse House Creative Technologies Mike OBrien.
Advertisements

What is RMI? Remote Method Invocation –A true distributed computing application interface for Java, written to provide easy access to objects existing.
COM vs. CORBA.
M. Muztaba Fuad Masters in Computer Science Department of Computer Science Adelaide University Supervised By Dr. Michael J. Oudshoorn Associate Professor.
RPC Robert Grimm New York University Remote Procedure Calls.
Introduction To Java Objectives For Today â Introduction To Java â The Java Platform & The (JVM) Java Virtual Machine â Core Java (API) Application Programming.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
What iS RMI? Remote Method Invocation. It is an approach where a method on a remote machine invokes another method on another machine to perform some computation.
The road to reliable, autonomous distributed systems
Notes to the presenter. I would like to thank Jim Waldo, Jon Bostrom, and Dennis Govoni. They helped me put this presentation together for the field.
The Application Layer Chapter 7. Electronic Mail Architecture and Services The User Agent Message Formats Message Transfer Final Delivery.
UMass Lowell Computer Science Java and Distributed Computing Prof. Karen Daniels Fall, 2000 Lecture 1 Introduction/Overview Wed. 9/6/00.
Scripting Languages For Virtual Worlds. Outline Necessary Features Classes, Prototypes, and Mixins Static vs. Dynamic Typing Concurrency Versioning Distribution.
Java Environment (CSS444)
JAS – Distributed Data Analysis Grid Enabled Analysis Workshop Caltech - June 23-25, 2003.
27-Jun-15 Profiling code, Timing Methods. Optimization Optimization is the process of making a program as fast (or as small) as possible Here’s what the.
Tony Hoare ¢ Turing Award 1980 ¢ Program Verification ¢ Algol 60 ¢ Axiomatic Semantics ¢ Floyd-Hoare Logic ¢ Concurrent Programs ¢ Communicating Sequential.
Communication in Distributed Systems –Part 2
Victor Serbo, SLAC30 September 2004, Interlaken, Switzerland JASSimApp plugin for JAS3: Interactive Geant4 GUI Serbo, Victor (SLAC) - presenter Donszelmann,
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
Information Technology for Ocean Observations and Climate Research TYKKI Workshop, December 9-11, 1998, Tokyo, Japan Nancy N. Soreide NOAA Pacific Marine.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
Java Analysis Studio Tony Johnson Stanford Linear Accelerator Center CHEP 97 - April 1997.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
Web Based Applications
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
Jaeki Song ISQS6337 JAVA Lecture 16 Other Issues in Java.
Advanced Analysis Environments What is the role of Java in physics analysis? Will programming languages at all be relevant? Can commercial products help.
JAS3 + AIDA LC Simulations Workshop SLAC 19 th May 2003.
Cli/Serv.: rmiCORBA/131 Client/Server Distributed Systems v Objectives –introduce rmi and CORBA , Semester 1, RMI and CORBA.
JAS/Wired + Geant 4 Tony Johnson July Contents What is JAS? What is WIRED? –Future Directions JAS+AIDA+GAG+Wired + Geant 4= ? Making it easy to.
Java Analysis Studio and the Java Framework for Future Linear Colliders. CERN - January 13th 1998 Tony Johnson - SLAC
SUMA: A Scientific Metacomputer Cardinale, Yudith Figueira, Carlos Hernández, Emilio Baquero, Eduardo Berbín, Luis Bouza, Roberto Gamess, Eric García,
1.8History of Java Java –Based on C and C++ –Originally developed in early 1991 for intelligent consumer electronic devices Market did not develop, project.
Algoval: Evaluation Server Past, Present and Future Simon Lucas Computer Science Dept Essex University 25 January, 2002.
Component Technology. Challenges Facing the Software Industry Today’s applications are large & complex – time consuming to develop, difficult and costly.
V. Serbo, SLAC ACAT03, 1-5 December 2003 Interactive GUI for Geant4 by Victor Serbo, SLAC.
A Distributive Server Alberto Pareja-Lecaros. Introduction Uses of distributive computing - High powered applications - Ever-expanding server so there’s.
1CPSD Software Infrastructure for Application Development Laxmikant Kale David Padua Computer Science Department.
ROOT and Federated Data Stores What Features We Would Like Fons Rademakers CERN CC-IN2P3, Nov, 2011, Lyon, France.
The basics of the programming process The development of programming languages to improve software development Programming languages that the average user.
Java Analysis Studio - Status CHEP 98 - September 1998 Tony Johnson - SLAC Jonas Gifford + Kevin Garwood - University of Victoria.
Visualization of Geant4 Data: Exploiting Component Architecture through AIDA, HepRep, JAS and WIRED Geant4 Workshop, CERN - 2 October 2002 Joseph Perl.
Parallelizing Spacetime Discontinuous Galerkin Methods Jonathan Booth University of Illinois at Urbana/Champaign In conjunction with: L. Kale, R. Haber,
CSI 3125, Preliminaries, page 1 SERVLET. CSI 3125, Preliminaries, page 2 SERVLET A servlet is a server-side software program, written in Java code, that.
Remote Method Invocation by James Hunt, Joel Dominic, and Adam Mcculloch.
Interactive Data Analysis on the “Grid” Tech-X/SLAC/PPDG:CS-11 Balamurali Ananthan David Alexander
JAVA Ekapap Julnonyang When it was implemented? Developed by Sun Microsystems. The first public implementation was Java 1.0 in 1995 The language.
Wide-Area Parallel Computing in Java Henri Bal Vrije Universiteit Amsterdam Faculty of Sciences vrije Universiteit.
Features of JAS Plots Plots update in real time. Data for plots can be local or remote (use Java RMI to connect to JAS Data Server). Rich variety of styles.
Distributed Computing in Life Science Research -Presenter: Yijian Yang
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Background Computer System Architectures Computer System Software.
Nguyen Thi Thanh Nha HMCL by Roelof Kemp, Nicholas Palmer, Thilo Kielmann, and Henri Bal MOBICASE 2010, LNICST 2012 Cuckoo: A Computation Offloading Framework.
Сергей Лутай REFACTORING converts single-tier code into distributed RETARGETING converts MSIL code into code for other virtual.
SURENDRA INSTITUTE OF ENGINEERING & MANAGEMENT PRESENTED BY : Md. Mubarak Hussain DEPT-CSE ROLL
January 26, Ann Wollrath Copyright 1999 Sun Microsystems, Inc., all rights reserved. Java ™ RMI Overview Ann Wollrath Senior Staff Engineer Sun Microsystems,
Applications Active Web Documents Active Web Documents.
Alternatives to Mobile Agents
Distributed Computing
Java RMI CS-328 Internet Programming.
What is RMI? Remote Method Invocation
Computers Are Your Future
Tom Rink Tom Whittaker Paolo Antonelli Kevin Baggett.
File Transfer Protocol
Hybrid Programming with OpenMP and MPI
Java Analysis Studio - Status
Java Analysis Studio and the hep.lcd classes
Java Analysis Studio and the hep.lcd classes
Calypso Service Architecture
Presentation transcript:

Agent Technology for Data Analysis Tony Johnson - SLAC 21 st October 1998 WORKSHOP ON SCIENTIFIC DATA MANAGEMENT PROBLEMS AND SOLUTIONS

Motivation and Disclaimer b Many efforts to use supernetworks to link supercomputers to transfer huge datasets b Few efforts to make effective use of existing real-world networks Allow university users to access remote dataAllow university users to access remote data b I am not an agent technology expert We do have a prototype applicationWe do have a prototype application I’m hoping some of you are!I’m hoping some of you are!

Outline b Overview of problem Network restraintsNetwork restraints b Why agent technology? b Why Java For Agent Technology?For Agent Technology? For Data Analysis?For Data Analysis? b Analysis Studio application b More information

What Problem are we trying to solve? b Widely distributed users who need access to petabyte datasets Many university users with mediocre networksMany university users with mediocre networks Most universities have no way to handle petabyte data samplesMost universities have no way to handle petabyte data samples b Physicist needs unfettered access to data Would like effective use of desktop machineWould like effective use of desktop machine Canned analysis wont doCanned analysis wont do b CPU/data access requirements are infinite

Faster networks? Faster networks will not solve our problems anytime soonFaster networks will not solve our problems anytime soon No matter how fast networks are they are always saturated.No matter how fast networks are they are always saturated. As networks become saturated latency becomes highAs networks become saturated latency becomes high

Why Agent Technology? b By encapsulating users analysis code as a “user agent” we can send it to the data, wide-area network bandwidth requirements become trivial Analysis modules are typically small <10’s kBytesAnalysis modules are typically small <10’s kBytes HEP output is typically histograms (binned) and scatterplots, which are both smallHEP output is typically histograms (binned) and scatterplots, which are both small b Possible to do GUI based analysis of large datasets using 28.8 modem connection b Give user the impression his analysis is running locally.

Why Java for Agent Technology? b Java produces machine independent bytecodes Trivial to move from one machine to anotherTrivial to move from one machine to another Network handling and Remote Method Invocation (RMI c.f. Corba) built-inNetwork handling and Remote Method Invocation (RMI c.f. Corba) built-in (Remote) Dynamic loading build-in(Remote) Dynamic loading build-in Multithreaded servers easy to writeMultithreaded servers easy to write Built-in Java “Sandbox” can be used to restrict agentsBuilt-in Java “Sandbox” can be used to restrict agents

Why Java for Data Analysis b Easy to learn yet very powerful, fully OO language Very wide industry supportVery wide industry support Just In Time compilation = FastJust In Time compilation = Fast Dynamic Optimization = FasterDynamic Optimization = Faster Very fast code, load, test, fix cycleVery fast code, load, test, fix cycle Built in debugger, including remote debuggingBuilt in debugger, including remote debugging Numerical functionality goodNumerical functionality good –Java Grande Forum enhancing numerical support

“Java Analysis Studio” Network Data Server DIM Remote Data Desktop Client DIM Local Data Network Data Controller Distributed Data Data Server DIM Data Server DIM Data Server DIM Data Server DIM Data Server DIM Data Server DIM

Demo

Network Performance View (Histogram) Model (Data Source) View AdapterModel Adapter  Caching  Prefetching of data  Data clumping, streaming

More Information b Java b Java Analysis Studio b Java Grande Forum (numeric computing in Java) Desktop access to remote resourcesDesktop access to remote resources –