Presentation is loading. Please wait.

Presentation is loading. Please wait.

Scripting based architecture for Management of Streams and Services in Real-time Grid Applications Authors Harshawardhan Gadgil, Geoffrey Fox, Shrideep.

Similar presentations


Presentation on theme: "Scripting based architecture for Management of Streams and Services in Real-time Grid Applications Authors Harshawardhan Gadgil, Geoffrey Fox, Shrideep."— Presentation transcript:

1 Scripting based architecture for Management of Streams and Services in Real-time Grid Applications Authors Harshawardhan Gadgil, Geoffrey Fox, Shrideep Pallickara, Marlon Pierce Community Grids Lab, Indiana University, Bloomington Robert Granat NASA, Jet Propulsion Lab, Pasadena Presented By Harshawardhan Gadgil hgadgil@cs.indiana.edu

2 May 10, 2005 Community Grids Lab Presentation for CCGrid 20052 Talk Outline Introduction to HPSearch Architecture Introduction to HPSearch Architecture A quick view of NaradaBrokering middleware A quick view of NaradaBrokering middleware System Goals System Goals –RDAHMM Example Related and Future Work Related and Future Work

3 May 10, 2005 Community Grids Lab Presentation for CCGrid 20053 HPSearch A JavaScript based scripting runtime serving as an administration console A JavaScript based scripting runtime serving as an administration console –Currently uses Rhino (http://www.mozilla.org/rhino) implementation of JavaScript. Management viewed as Management viewed as –Setting up the distributed application (Involves setting up the broker network, initializing system components) –Querying Run-time System Metadata  For logging purposes  Monitor metadata to help dynamically rewire system to improve performance

4 May 10, 2005 Community Grids Lab Presentation for CCGrid 20054 HPSearch Architecture Component Summary Binds URIs to a scripting language Binds URIs to a scripting language –Allows us to manage (manipulate) the Resource identified by the URI  E.g. Read / Write to files, sockets, topics  Read from data base, data from FTP/HTTP resource  This data can then be streamed to distributed application OR data can be read from a stream and processed / stored “Host-objects” allow us to dynamically access the host system. “Host-objects” allow us to dynamically access the host system. –Useful for constructing objects that monitor system meta-data, perform management tasks –PerfMetrics gathers system performance data and allows us to query it at run-time.

5 May 10, 2005 Community Grids Lab Presentation for CCGrid 20055 HPSearch Architecture Diagram Request Handler JavaScript Shell Task Scheduler Flow Handler Web Service EP Other Objects HPSearch Kernel URIHandler DBHandler WSDLHandler WSProxyHandler Request Handler HPSearch Kernel HPSearch Kernel Broker Network SOAP/HTTP... DataBase Web Service Files Sockets Topics Network Protocol JDBC SOAP/HTTP WSProxy Service WSProxy Service WSProxy Service HPSearch Control Events using PUB/SUB on predefined topic Data buffers sent / received as Narada Events

6 May 10, 2005 Community Grids Lab Presentation for CCGrid 20056 NaradaBrokering NaradaBrokering NaradaBrokering –Messaging infrastructure for collaboration, peer-to- peer and Grid applications –Implements high-performance protocols (message transit time of 1 to 2 ms per hop) –Order-preserving, optimized message transport with QoS and security profiles for sent and received messages –Support for different underlying protocols such as TCP, UDP, Multicast, RTP –Discovery Service to locate nearest brokers

7 May 10, 2005 Community Grids Lab Presentation for CCGrid 20057 HPSearch + NaradaBrokering Managing Streams HPSearch uses NaradaBrokering to route data streams HPSearch uses NaradaBrokering to route data streams –Each stream is represented by a topic name –Components subscribe / publish to specified topic  The WSProxy component automatically maps topics to Input / Output streams  Each write (byte[] buffer) and byte[] read() call is mapped to a NaradaBrokering event

8 May 10, 2005 Community Grids Lab Presentation for CCGrid 20058 System Goals Investigate the use of HPSearch as a management console to deploy system and application components using scripting Investigate the use of HPSearch as a management console to deploy system and application components using scripting Use HPSearch in scientific / grid applications which can use streaming data Use HPSearch in scientific / grid applications which can use streaming data –Data filtering is essential in most cases and we do not want to transfer entire data set when only a small percentage of data would be used,  E.g. Choosing data satisfying certain input constraints. –Data re-ordering might be required for formatting input data to match the requirements of executable,  E.g. Converting GML to remove XML elements Use publish-subscribe methodologies to connect components. Use publish-subscribe methodologies to connect components.

9 May 10, 2005 Community Grids Lab Presentation for CCGrid 20059 GPS time series have modes caused by unknown underlying physical processes GPS time series have modes caused by unknown underlying physical processes RDAHMM allows us to identify these modes and time periods where physical processes dominated the sequence without any a-priori knowledge of these processes RDAHMM allows us to identify these modes and time periods where physical processes dominated the sequence without any a-priori knowledge of these processes Help to determine Help to determine –The actual physical causes –When is the system entering a new mode (perhaps a hint of some important seismic event) Example RDAHMM

10 May 10, 2005 Community Grids Lab Presentation for CCGrid 200510 Example Application Components GPS database (Surface displacement time series collected by SCIGN, http://www.scign.org) GPS database (Surface displacement time series collected by SCIGN, http://www.scign.org) Data filter (filters data, removes unwanted components, reorders data as required by the actual RDAHMM executable) Data filter (filters data, removes unwanted components, reorders data as required by the actual RDAHMM executable) RDAHMM executable, performs the time series analysis. This data is transferred to the Graph plotting application. RDAHMM executable, performs the time series analysis. This data is transferred to the Graph plotting application. Matlab based Graph plotting application Matlab based Graph plotting application

11 May 10, 2005 Community Grids Lab Presentation for CCGrid 200511 HPSearch Engine Applications Streaming Data Filtering Data Filter Filters the input data to get only the estimate and error values RDAHMM Analyze the data Matlab Plotting Script Output Graph HPSearch Handler GPS Data Sensor Source Sensor Source Handler Each Handler controls operation of 1 service

12 May 10, 2005 Community Grids Lab Presentation for CCGrid 200512 Sample Output Claremont, CA

13 May 10, 2005 Community Grids Lab Presentation for CCGrid 200513 Related Work Scripting languages have been very popular and successful for Rapid Application Deployment (RAD) Scripting languages have been very popular and successful for Rapid Application Deployment (RAD) –Sash by IBM, allow RAD and handlers for various tasks such as reading from databases, LDAP registries, invoking Web Services and providing GUI –Jython and Matlab are popular in many scientific communities. E.g. GeoDISE –XCAT project at IU, Extreme Lab uses Jython for scripting to deploy distributed components –Karajan (Ant Like scripting) is used for deploying applications over grid –WSRF::Lite uses perl to implement Web Services Resource Framework and host grid services

14 May 10, 2005 Community Grids Lab Presentation for CCGrid 200514 Future Work Investigate system and management scaling with increasing number of components. Investigate system and management scaling with increasing number of components. Use of security for streams Use of security for streams Negotiation of optimal (high performance) transport Negotiation of optimal (high performance) transport More handlers for different aspects of NaradaBrokering’s management (broker / topic discovery, security, replay etc…) More handlers for different aspects of NaradaBrokering’s management (broker / topic discovery, security, replay etc…) Investigate interaction with WS Management and WS - Distributed Management based systems Investigate interaction with WS Management and WS - Distributed Management based systems

15 May 10, 2005 Community Grids Lab Presentation for CCGrid 200515 To conclude… Presented a scripting based architecture for management of data streams and distributed services Presented a scripting based architecture for management of data streams and distributed services Shown how we can use publish subscribe methodologies to connect components that process data in a stream Shown how we can use publish subscribe methodologies to connect components that process data in a stream Create and deploy quick data filtering applications Create and deploy quick data filtering applications

16 May 10, 2005 Community Grids Lab Presentation for CCGrid 200516 More Information HPSearch Project Website HPSearch Project Websitehttp://www.hpsearch.org NaradaBrokering Project @ IU NaradaBrokering Project @ IUhttp://www.naradabrokering.org CGL Publications CGL Publicationshttp://grids.ucs.indiana.edu/ptliupages/publications/

17 May 10, 2005 Community Grids Lab Presentation for CCGrid 200517 Questions / Comments Any Questions / Comments ? Any Questions / Comments ?THANKS for attending the presentation


Download ppt "Scripting based architecture for Management of Streams and Services in Real-time Grid Applications Authors Harshawardhan Gadgil, Geoffrey Fox, Shrideep."

Similar presentations


Ads by Google