Using Topic-Based Publish/Subscribe for Managing Real Time GPS Streams Marlon Pierce, Galip Aydin, Zhigang Qi Community Grids Lab Indiana University 1.

Slides:



Advertisements
Similar presentations
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Advertisements

Architecture and Measured Characteristics of a Cloud Based Internet of Things May 22, 2012 The 2012 International Conference.
IU QuakeSim Updates Co-PIs: Marlon Pierce, Geoffrey Fox Students and Staff: Xiaoming Gao, Jun Ji, Jun Wang, Marie Ma, Josh Rosen.
QuakeSim Science Gateway: ACES Update Marlon Pierce, Jun Wang, Yu Ma, Jun Ji, Xiaoming Gao, Geoffrey Fox Indiana University.
E-DECIDER Workshop: QuakeSim Tools and Products Marlon Pierce Indiana University.
T-FLEX DOCs PLM, Document and Workflow Management.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar, Galip Aydin, Mehmet Aktas, Harshawardhan Gadgil.
The Problem: Integrating Data, Applications, and Client Devices The key issue we try to solve is building the distributed computing infrastructure that.
Service Oriented Architecture for Geographic Information Systems Supporting Real Time Data Grids Galip Aydin Department Of Computer Science Indiana University.
Technical Aids for Mirroring Sites and China Popular Science Exhibition(CPSE) Chen Hui CNIC,CAS (Computer Network Information Center, Chinese Academy of.
Indiana University QuakeSim Activities Marlon Pierce, Geoffrey Fox, Xiaoming Gao, Jun Ji, Chao Sun.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
Using AJAX Galip Aydin, Ahmet Sayar, and Marlon Pierce Community Grids Lab Indiana University.
Technical solution presentation AVL System for Fire Brigades.
Tools for Publishing Environmental Observations on the Internet Justin Berger, Undergraduate Researcher Jeff Horsburgh, Faculty Mentor David Tarboton,
Distributed Data Analysis & Dissemination System (D-DADS) Prepared by Stefan Falke Rudolf Husar Bret Schichtel June 2000.
World Renewable Energy Forum May 15-17, 2012 Dr. James Hall.
Managing Data Interoperability with FME Tony Kent Applications Engineer IMGS.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Yili Gong,
QuakeSim: Grid Computing, Web Services, and Portals for Earthquake Science Marlon Pierce Community Grids Lab Indiana University.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar, Galip Aydin, Mehmet Aktas, Harshawardhan Gadgil.
Presented by Abirami Poonkundran.  Introduction  Current Work  Current Tools  Solution  Tesseract  Tesseract Usage Scenarios  Information Flow.
High Performance Web Service Architecture for Sensors and Geographic Information Systems Galip Aydin.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
QuakeSim Work: Web Services, Portlets, Real Time Data Services Marlon Pierce Contributions: Ahmet Sayar,
1 Grids for Real-time and Streaming Applications GCC2005 Beijing China December Geoffrey Fox Computer Science, Informatics, Physics Pervasive Technology.
1 Transport Protocols Relates to Lab 5. An overview of the transport protocols of the TCP/IP protocol suite. Also, a short discussion of UDP.
Implementing Geographical Information System Services for SERVOGrid Marlon Pierce Community Grids Lab Indiana University.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
Community Grids Lab SERVOGrid CCE Review May Geoffrey Fox and Marlon Pierce Indiana University.
SBIR Final Meeting Collaboration Sensor Grid and Grids of Grids Information Management Anabas July 8, 2008.
ESIP Federation 2004 : L.B.Pham S. Berrick, L. Pham, G. Leptoukh, Z. Liu, H. Rui, S. Shen, W. Teng, T. Zhu NASA Goddard Earth Sciences (GES) Data & Information.
Large Scale Nuclear Physics Calculations in a Workflow Environment and Data Provenance Capturing Fang Liu and Masha Sosonkina Scalable Computing Lab, USDOE.
GPS Sensor Web Time Series Analysis Using SensorGrid Technology Robert Granat 1, Galip Aydin 2, Zhigang Qi 2, Marlon Pierce 2 1 Science Data Understanding.
SensorGrid Galip Aydin June SensorGrid A flexible computing environment for coupling real-time data sources to High Performance Geographic Information.
QuakeSim Project: Portals and Web Services for Geo-Sciences Marlon Pierce Indiana University
QuakeSim Project: Portals and Web Services for Geophysics Marlon Pierce Indiana University
Online-Offsite Connectivity Experiments Catalin Meirosu *, Richard Hughes-Jones ** * CERN and Politehnica University of Bucuresti ** University of Manchester.
SensorGrid High Performance Web Service Architecture for Geographic Information Systems Thesis Proposal Galip Aydin
QuakeSim Project: Portals and Web Services for Geo-Sciences Marlon Pierce Indiana University
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
JPL Status Report, SCIGN Annual Meeting, 4/26/2004 JPL Update Danan Dong Michael Heflin Sharon Kedar Brian Newport Frank Webb.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar,
HPSearch for Managing Distributed Services Authors Harshawardhan Gadgil, Geoffrey Fox, Shrideep Pallickara Community Grids Lab Indiana University, Bloomington.
November Geoffrey Fox Community Grids Lab Indiana University Net-Centric Sensor Grids.
1 MESSAGE EXCHANGE FOR Web Service-Based Mapping Services AHMET SAYAR INDIANA UNIVERSITY COMMUNITY GRIDS LAB. COMPUTER SCIENCE DEPARTMENT August 17, 2005.
QuakeSim Project: Portals and Web Services for Geo-Sciences Marlon Pierce Indiana University
Update on GPS/SCIGN REASoN CAN GPS Data Products for Solid Earth Science (GDPSES) Sponsored by NASA F Webb, Y Bock, D Dong, B Newport, P Jamason, M Scharber,
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
CUAHSI SURVEY RESULTS AT VIRGINIA TECH Nimmy Ravindran and Yao Liang Dept. of Electrical and Computer Engineering Virginia Tech.
Review CS File Systems - Partitions What is a hard disk partition?
1 Collaboration Grids GGF16 Athens Greece February Geoffrey Fox Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana.
Chapter – 8 Software Tools.
Grid Builder Status Rui Wang July 16, Grid Builder The Grid Builder uses a management console to deploy grids dynamically and remotely –The user.
12/12/01Fall AGU Vertical Reference Frames for Sea Level Monitoring Thomas Herring Department of Earth, Atmosphere and Planetary Sciences
Climate-SDM (1) Climate analysis use case –Described by: Marcia Branstetter Use case description –Data obtained from ESG –Using a sequence steps in analysis,
Interacting Data Services for Distributed Earthquake Modeling Marlon Pierce, Choonhan Youn, and Geoffrey Fox Community Grids Lab Indiana University.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
Scaling and Fault Tolerance for Distributed Messages in a Service and Streaming Architecture Hasan Bulut Advisor: Prof. Geoffrey Fox Ph.D. Defense Exam.
Online Performance Analysis and Visualization of Large-Scale Parallel Applications Kai Li, Allen D. Malony, Sameer Shende, Robert Bell Performance Research.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
Scripting based architecture for Management of Streams and Services in Real-time Grid Applications Authors Harshawardhan Gadgil, Geoffrey Fox, Shrideep.
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
Integrating Geographical Information Systems and Grid Applications
Integrating Geographical Information Systems and Grid Applications
Oil & Gas Integrated Operations using aspenONE
Transport Protocols Relates to Lab 5. An overview of the transport protocols of the TCP/IP protocol suite. Also, a short discussion of UDP.
Corps Water Management System (CWMS) Modernization
Presentation transcript:

Using Topic-Based Publish/Subscribe for Managing Real Time GPS Streams Marlon Pierce, Galip Aydin, Zhigang Qi Community Grids Lab Indiana University 1

California Real Time Network Network Data Rates Message Format TimeRYOASCIIGML CRTN GPS Site Positions (9 Stations) 1 second1.5KB4.03KB48.7KB 1 hour5.31MB14.18MB171.31MB 1 day127.44MB340.38MB4.01GB 1 month3.8GB9.97GB123.3GB 1 year45.8GB119.67GB1.41TB Entire SCIGN Network (250 stations) 1year1.23TB16.18TB160TB Continuous GPS Stations (CGPS) are depicted as triangles while the Real-Time stations are represented as circles. Image is obtained from SOPAC GPS Explorer at How does one manage all the data generated by the 85 stations? How can you get just the data you want? Note this is fundamentally different from traditional request/response style Web Services.

RDAHMM: GPS Time Series Segmentation Slide Courtesy of Robert Granat, JPL  Complex data with subtle signals is difficult for humans to analyze, leading to gaps in analysis  HMM segmentation provides an automatic way to focus attention on the most interesting parts of the time GPS displacement (3D) length two years. Divided automatically by HMM into 7 classes. Features: Dip due to aquifer drainage (days ) Hector Mine earthquake (day 626) Noisy period at end of time series

Use Case - GPS Sensors  GPS is used to identify long-term tectonic deformation and static displacements. SCIGN has 250 Real-Time GPS Stations.  SOPAC GPS networks:  8 networks for 80 stations produce 1Hz high resolution data.  Socket based real-time binary-RYO format access is available.  We developed filters to provide multiple format (RYO, ASCII, GML) real- time streaming access.  OHIO principle (a general principle required by DOD) and chain of filters.  Our Architecture  Uses publish/subscribe based NaradaBrokering for managing real-time GPS streams  Utilizes topics for hierarchical organization of the sensors  Deploys successive data filters ranging from format translators to data analysis codes  Could potentially be used to run RDAHMM clones to monitor state changes in the entire GPS network  We are partner in a pioneering project to use the real-time GPS data for the first time in this context. 4

Processing Real-Time GPS Streams 5 ryo2nb Raw Data RYO Ports NB Server ryo2asciiascii2gmlascii2pos Single Station Displacement Filter Station Health Filter RDAHMM Filter Scripps RTD Server Scripps RTD Server ryo2nb Raw Data ryo2asciiascii2pos Single Station RDAHMM Filter A Complete Sensor Message Processing Path, including a data analysis application. /SOPAC/GPS/CRTN01/RYO /SOPAC/GPS/CRTN01/ASCII /SOPAC/GPS/CRTN01/POS /SOPAC/GPS/CRTN01/DSME GPS Networks

Application Integration with Real-Time Filters Station Monitor Filter records real-time positions for 10 minutes and calculates position changes Graph Plotter Application creates visual representation of the positions. RDAHMM Filter records real-time positions for 10 minutes and invokes RDAHMM application which determines state changes in the XYZ signal. Graph Plotter Application creates visual representation of the RDAHMM output. 6

GIS Grid Usage Model – Earthquake Science  Supporting geophysical repositories and real-time sensors is essential  To analyze a typical earthquake it is important to access to precise measurements of the initial earthquakes and aftershocks  To support earthquake forecasting and the time and spatial positions of the forecasts  PI can be used with existing data  RDAHMM can be used with the real-time data  Earth Science field is moving from a previously data poor field to a data rich world. We will have thousands of sensors spread around the world. (i.e. GPS sensors, InSAR satellites) 7

Recording and Replaying Sensor Streams  Filters can be used to record and replay scenarios, such as Earthquakes in GPS case.  We developed RYO Recorder and RYO Publisher Filters.  The RYO Recorder creates daily archives of the GPS Streams.  RYO Publisher can be used to play daily or certain segments of the records.  We replayed the 2004 Southern California Earthquake using Parkfield GPS network archive  These filters are used in the performance and scalability tests. 8

SensorGrid Performance Tests  Two Major Goals: System Stability and Scalability  Ensuring stability of the distributed Filter Services for continuous operation.  Finding the maximum number of publishers (sensors) and clients that can be supported with a single broker.  Investigate if system scales for larger number of sensors and clients. 9

Test Methodology  The test system consists of a NaradaBrokering server and a three-filter chain for publishing, converting and receiving RYO messages.  We take 4 timings to calculate mean end-to-end delivery times of GPS measurements.  The tests were run at least for 24 hours.  GridFarm servers are used in these tests. T transfer = (T 2 – T 1 ) + (T 4 – T 3 ) 10 NB Server 1 RYO To ASCII Converter RYO Publisher Simple Filter

1- System Stability Test  The basic system with three filters and one broker.  The figure shows average results for every 30 minutes.  The average transfer time shows the continuous operation does not degrade the system performance. 11

NB Server RYO To ASCII Converter Simple Filter RYO Publisher 1 RYO Publisher 2 RYO Publisher n 2 – Multiple Publishers Test  We add more GPS networks by running more publishers.  The results show that 1000 publishers can be supported with no performance loss. This is an operating system limit. 12 Topic 1A Topic 1B Topic 2 Topic n

3 – Multiple Clients Test  We add more clients by running multiple Simple Filters which subscribe to the same ASCII topic.  The system can support as many as 1000 clients with very low performance decrease. 13 NB Server RYO To ASCII Converter Simple Filter 1 RYO Publisher 1 Simple Filter 2 Simple Filter n Topic 1A Topic 1B 1000 Clients

Extending Scalability  The limit of the basic system appears to be 1000 clients or publishers.  This is due to an Operating System restriction of open file descriptors (1024 for Red Hat Linux) which can be increased by changing OS parameters.  To overcome this limit we create NaradaBrokering networks with linking multiple brokers. NB supports scalable linkage of the brokers for building tree like architectures.  We run 2 brokers to support 1500 clients.  Number of brokers can be increased indefinitely, so we can potentially support any number of publishers and subscribers. 14

4 – Multiple Brokers Test  NaradaBrokering allows creation of Broker networks.  We create a two-broker network.  Messages published to first broker can be received from the second broker.  We take timings on each broker.  We connect 750 clients to each broker and run for 24 hours. We chose 750 clients to stay well below the saturation limit.  The results show that the performance is very good and similar to single broker test. 15 NB Server 1 NB Server 2 RYO To ASCII Converter Simple Filter 1 RYO Publisher Topic 1A Topic 1B Simple Filter 2 Simple Filter 750 Simple Filter 751 Simple Filter 752 Simple Filter 1500 Topic 1B NB Server 2

4 – Multiple Brokers Test 16

Real-Time Filters Test Results  The RYO Publisher filter runs at 1Hz and publishes 24-hour archive of the CRTN_01 GPS network, which contains 9 GPS stations.  The single broker configuration can support 1000 clients or publishers (GPS networks individual stations).  The system can be scaled up by creating NaradaBrokering broker networks.  Message order was preserved in all tests. 17

Acknowledgement  Galip Aydin: Designer and implementer  ZhiGang Qi: SensorGrid Performance Tests  We thank Prof. Yehuda Bock and his group at SIO for their help with real-time GPS data streams.  Robert Granat, JPL: RDAHMM developer 18