High Performance Web Service Architecture for Sensors and Geographic Information Systems Galip Aydin.

Slides:



Advertisements
Similar presentations
Web Service Ahmed Gamal Ahmed Nile University Bioinformatics Group
Advertisements

Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar, Galip Aydin, Mehmet Aktas, Harshawardhan Gadgil.
Service Oriented Sensor Web Xingchen Chu and Rajkumar Buyya University of Melbourne, Australia Presented by: Gerardo I. Simari CMSC828P – Fall 2006 Professor.
The Problem: Integrating Data, Applications, and Client Devices The key issue we try to solve is building the distributed computing infrastructure that.
Service Oriented Architecture for Geographic Information Systems Supporting Real Time Data Grids Galip Aydin Department Of Computer Science Indiana University.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
Technical Architectures
Cesar Martinez Internetworked Virtual Reality October 2002 Cesar Martinez Internetworked Virtual Reality October 2002 A Distributed Architecture for Supporting.
Asper School of Business University of Manitoba Systems Analysis & Design Instructor: Bob Travica System architectures Updated: November 2014.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
Chapter 9: Moving to Design
Using AJAX Galip Aydin, Ahmet Sayar, and Marlon Pierce Community Grids Lab Indiana University.
1 Alternate Title Slide: Presentation Name Goes Here Presenter’s Name Infrastructure Solutions Division Date GIS Perfct Ltd. Autodesk Value Added Reseller.
Client/Server Architecture
Web Service What exactly are Web Services? To put it quite simply, they are yet another distributed computing technology (like CORBA, RMI, EJB, etc.).
Web Services Michael Smith Alex Feldman. What is a Web Service? A Web service is a message-oriented software system designed to support inter-operable.
Principles for Collaboration Systems Geoffrey Fox Community Grids Laboratory Indiana University Bloomington IN 47404
A Scalable Framework for the Collaborative Annotation of Live Data Streams Thesis Proposal Tao Huang
Windows.Net Programming Series Preview. Course Schedule CourseDate Microsoft.Net Fundamentals 01/13/2014 Microsoft Windows/Web Fundamentals 01/20/2014.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Yili Gong,
GIS technologies and Web Mapping Services
Database Architectures and the Web Session 5
Web Services Architecture1 - Deepti Agarwal. Web Services Architecture2 The Definition.. A Web service is a software system identified by a URI, whose.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar, Galip Aydin, Mehmet Aktas, Harshawardhan Gadgil.
Geospatial Systems Architecture Todd Bacastow. GIS Evolution
DISTRIBUTED COMPUTING
An Introduction To Building An Open Standard Web Map Application Joe Daigneau Pennsylvania State University.
material assembled from the web pages at
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Microsoft Visual Studio 2010 Muhammad Zubair MS (FAST-NU) Experience: 5+ Years Contact:- Cell#:
QuakeSim Work: Web Services, Portlets, Real Time Data Services Marlon Pierce Contributions: Ahmet Sayar,
Integrated Collaborative Information Systems Ahmet E. Topcu Advisor: Prof Dr. Geoffrey Fox 1.
1 Grids for Real-time and Streaming Applications GCC2005 Beijing China December Geoffrey Fox Computer Science, Informatics, Physics Pervasive Technology.
Event-Based Hybrid Consistency Framework (EBHCF) for Distributed Annotation Records Ahmet Fatih Mustacoglu Advisor: Prof. Geoffrey.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
Using Topic-Based Publish/Subscribe for Managing Real Time GPS Streams Marlon Pierce, Galip Aydin, Zhigang Qi Community Grids Lab Indiana University 1.
Service Oriented Sensor Web: NOSA Approach Rajkumar Buyya and Xingchen Chu Grid Computing and Distributed Systems (GRIDS) Laboratory Dept. of Computer.
GPS Sensor Web Time Series Analysis Using SensorGrid Technology Robert Granat 1, Galip Aydin 2, Zhigang Qi 2, Marlon Pierce 2 1 Science Data Understanding.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
SensorGrid Galip Aydin June SensorGrid A flexible computing environment for coupling real-time data sources to High Performance Geographic Information.
Web Services and Geologic Data Interchange Simon Cox CSIRO Exploration & Mining
9 Systems Analysis and Design in a Changing World, Fourth Edition.
SensorGrid High Performance Web Service Architecture for Geographic Information Systems Thesis Proposal Galip Aydin
RSISIPL1 SERVICE ORIENTED ARCHITECTURE (SOA) By Pavan By Pavan.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar,
1 MESSAGE EXCHANGE FOR Web Service-Based Mapping Services AHMET SAYAR INDIANA UNIVERSITY COMMUNITY GRIDS LAB. COMPUTER SCIENCE DEPARTMENT August 17, 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
Geospatial Systems Architecture
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Managing Enterprise GIS Geodatabases
Distributed Handler Architecture (DHArch) Beytullah Yildiz Advisor: Prof. Geoffrey C. Fox.
Distributed Handler Architecture Beytullah Yildiz
1 Service Oriented Architecture SOA. 2 Service Oriented Architecture (SOA) Definition  SOA is an architecture paradigm that is gaining recently a significant.
Event-Based Infrastructure for Reconciling Distributed Annotation Records Ahmet Fatih Mustacoglu Advisor: Prof. Geoffrey C. Fox.
Web Services Blake Schernekau March 27 th, Learning Objectives Understand Web Services Understand Web Services Figure out SOAP and what it is used.
1 Web Service Information Systems and Applications GGF16 Semantic Grid Workshop Athens Greece February Geoffrey Fox Computer Science, Informatics,
Scaling and Fault Tolerance for Distributed Messages in a Service and Streaming Architecture Hasan Bulut Advisor: Prof. Geoffrey Fox Ph.D. Defense Exam.
Copyright, Open Geospatial Consortium Making Location Count Peer-to-Peer File Sharing An Answer to the SDI blues North Carolina GIS Conference February,
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Scripting based architecture for Management of Streams and Services in Real-time Grid Applications Authors Harshawardhan Gadgil, Geoffrey Fox, Shrideep.
Integrating Geographical Information Systems and Grid Applications
Integrating Geographical Information Systems and Grid Applications
Information Services for Dynamically Assembled Semantic Grids
Gordon Erlebacher Florida State University
Presentation transcript:

High Performance Web Service Architecture for Sensors and Geographic Information Systems Galip Aydin

Geographic Information Systems A Geographic Information System is a system for creating, storing, sharing, analyzing, manipulating and displaying spatial data and associated attributes. GIS history saw the evolution from mainframe GIS to Desktop GIS to Distributed GIS. Modern GIS require:  Distributed data access for spatial databases  Utilizing remote analysis, simulation or visualization tools.

Traditional Distributed GIS Approach Problems with traditional approaches :  Distributed nature of the geo-data; various client-server models, databases, HTTP, FTP, RDBs, XML DBs etc.  Data format problems, conversion overheads  Data processing issues, hardware and software requirements, COM+/ActiveX, CORBA/IIOP frameworks Which introduce three challenges  Assembling data from distributed repositories  Adoption of universal standards for format interoperability  Interoperable services for better utilization of computational resources

Open Geographic Standards Open GIS Standards bodies aim to make geographic information and services neutral and available across any network, application, or platform. Two major standard bodies: OGC and ISO/TC211, former being most popular OGC Specifications are widely accepted:  Data Format Specs: GML, SensorML, O&M  Service Specs: WFS, WMS, WCS OGC Services are HTTP GET/POST based; limited data transport capabilities (HTTP, FTP, files etc.) Not Web Services; tightly coupled, point to point communication results in centralized, synchronous applications.

Motivations Lack of service orchestration capabilities  Complex problems require GIS applications to collaborate. Coupling data sources to scientific applications Data transport requirements Proliferation of Sensors  Ability to analyze data on-the-fly, continuous streaming support, scalable systems for addition of new sensors. High performance and high rate messaging  Real-time data access, rapid response systems, crisis management etc. From the Grids perspective  To apply general Grid/Distributed computing principles to GIS  Investigate how to integrate with geophysical and other scientific applications

Motivating Use Cases Pattern Informatics  Earthquake forecasting code developed by Prof. John Rundle (UC Davis) and collaborators, uses seismic archives. Regularized Dynamic Annealing Hidden Markov Method (RDAHMM)  Time series analysis code, can be applied to GPS and seismic archives, can be applied to real-time data. Interdependent Energy Infrastructure Simulation System (IEISS)  Models infrastructure networks (e.g. electric power systems and natural gas pipelines) and simulates their physical behavior, interdependencies between systems. SOPAC GPS Networks provide real-time messages.

Research Issues 1 Applying Web Service principles to GIS data services  Orchestration of Services, workflows, simple services are not suitable for large data sets and where quick response is required High Performance s upport in GIS services. Interoperability  The system should bridge GIS and Web Service communities by adapting standards from both.  Other GIS applications should be able to consume data without having to do costly format conversions.

Research Issues 2 Scalability  The system should be able to handle high volume and high rate data transport and processing.  Plugging new sensors, data sources or geoprocessing applications should not degrade system’s overall performance. Flexibility and extendibility  How to develop real-time services to process sensor data on the fly.  Ability to add new filters without system failures. Quality of Service Issues  Is latency introduced by services in processing real-time sensor data acceptable?

SOA for GIS – Geophysical Data Grid We utilize Web Services to realize Service Oriented Architecture, OGC data formats and application interfaces for interoperability at both levels. GIS Data Grid Properties  Based on the sources geospatial data can be seen as archival and real-time data. The architecture provides standard control and access interfaces for both types.  Supports alternate transport and representation schemes, uses topic based messaging infrastructure for large volume data transport.  UDDI based FTHPIS as services registry.  Streaming and non-streaming services to access archived data.  Real-Time and near real-time services for accessing sensor metadata and sensor measurements.

Geophysical Data Grid Architecture Archival Data Grid Real-Time Data Grid

GIS Grid 1 - Archival Data Services Web Feature Service is the default OGC specification for vector data. We have built Web Service version of WFS for accessing geospatial data on distributed databases. The first Web Service version of WFS has been successfully used in several scientific workflows with other services (WMS, HPSearch, FTHPIS). WFS can access multiple distributed databases, can query other WFSs for remote features. Problems with Web Service version of the WFS  Request-response, not asynchronous,  Performance: GI Services are not designed to handle non-trivial data transfers. Large data requests, SOAP overhead.  XML Encoding: Size of the geospatial data increases with GML encoding which increases transfer times, or may cause exceptions

WFS Performance Improvements Streaming WFS To improve performance of the WFS :  Utilized publish/subscribe messaging system for high performance data transfer. Similar to WFS but data and control channel separation, allows one to many data distribution.  Used streaming database connection (MySQL) for faster retrieval of the query results, and lower GML creation overhead.  Binary XML Frameworks are integrated for reducing XML payload size which improves transfer times.  Binding data transfer to Grid messaging middleware reduces SOAP creation overhead.

WFS Interaction with services and data sources

GIS Grid Example – IEISS Integration WMS – Ahmet Sayar UDDI, Context Service – Mehmet Aktas

Streaming WFS Performance  We test the system for up to features  The tests reveal the performance of the streaming service with and without Binary XML integration  We use BNUX and Fast Infoset Binary XML Frameworks for compressing the GML FeatureCollection documents  The BNUX and FI timings include encoding and decoding costs

GIS Grid 2 - Real-Time Data Services Sensors and sensor networks are being deployed for measuring various geo-physical entities. Sensors and GIS are closely related. Sensor measurements are used by GIS for statistical or analytical purposes. With the proliferation of the sensors, data collection and processing paradigms are changing. Most scientific geo-applications are designed to work with archived data. Critical Infrastructure Systems and Crisis Management environments require fast and accurate access to real-time sources and a flexible/pluggable architecture for geoprocessing of the data.

SensorGrid Architecture Major components :  Real-Time filters  Grid Messaging Substrate  Information Service Filters can be run as Web Services to create workflows. Filter Chains can be deployed for complex processing. Streaming messaging provide high-performance transfer options.

Real-Time Filters Real-time data processing is supported by employing filters around publish/subscribe messaging system. The filters are extended from a generic class to inherit publish and subscribe capabilities. They can be connected in parallel or serial as chains to solve complex problems. Input SignalOutput Signal Filter

Filter Metadata and Chains Parallel Operation Serial Operation

Use Case - GPS Sensors A good example for scientific sensors are GPS station networks. GPS measurements are used for determining post- seismic deformation, understanding long-term crustal movement etc. SOPAC GPS networks :  8 networks for 80 stations produce 1Hz high resolution data.  Socket based real-time binary-RYO format access is available, but not utilized!  We developed filters to provide multiple format (RYO, ASCII, GML) real-time streaming access.  OHIO principle and chain of filters. We use publish/subscribe based NaradaBrokering for managing real-time streams, topics for hierarchical organization of the sensors.

SOPAC Real-Time Filters for GPS Streams

Application Integration with Real-Time Filters Station Monitor Filter records real-time positions for 10 minutes and calculates position changes Graph Plotter Application creates visual representation of the positions. RDAHMM Filter records real-time positions for 10 minutes and invokes RDAHMM application which determines state changes in the XYZ signal. Graph Plotter Application creates visual representation of the RDAHMM output.

AJAX and Real-Time positions on Google maps

Recording and Replaying Sensor Streams Filters can be used to record and replay scenarios, such as Earthquakes in GPS case. We developed RYO Recorder and RYO Publisher Filters. The RYO Recorder creates daily archives of the GPS Streams. RYO Publisher can be used to play daily or certain segments of the records. We replayed the 2004 Southern California Earthquake using Parkfield GPS network archive

SensorGrid Performance Tests Two Major Goals: System Stability and Scalability  Ensuring stability of the distributed Filter Services for continuous operation.  Finding the maximum number of publishers (sensors) and clients that can be supported with a single broker. Investigate if system scales for large number of sensors and clients.

Test Methodology The test system consists of a NaradaBrokering server and a three-filter chain for publishing, converting and receiving RYO messages. We take 4 timings for determining mean end-to-end delivery times of GPS measurements. The tests were run at least for 24 hours. GridFarm servers are used in these tests. Ttransfer = (T2 – T1) + (T4 – T3)

1- System Stability Test The basic system with three filters and one broker. The figure shows average results for every 30 minutes. The average transfer time shows the continuous operation does not degrade the system performance.

2 – Multiple Publishers Test We add more GPS networks by running more publishers. The results show that 1000 publishers can be supported with no performance loss. This is an operating system limit.

3 – Multiple Clients Test We add more clients by running multiple Simple Filters which subscribe to the same ASCII topic. The system can support as many as 1000 clients with very low performance decrease. Adding clients 1000 Clients

Extending Scalability The limit of the basic system appears to be 1000 clients or publishers. This is due to an Operating System restriction of open file descriptors (1024 for Red Hat Linux). To overcome this limit we create NaradaBrokering networks with linking multiple brokers. We run 2 brokers to support 1500 clients.  Number of brokers can be increased indefinitely, so we can potentially support any number of publishers and subscribers.

4 – Multiple Brokers Test Messages published to first broker can be received from the second broker. We take timings on each broker. We connect 750 clients to each broker and run for 24 hours. The results show that the performance is very good and similar to single broker test.

4 – Multiple Brokers Test 750 Clients

Real-Time Filters Test Results The RYO Publisher filter runs at 1Hz and publishes 24-hour archive of the CRTN_01 GPS network, which contains 9 GPS stations. The single broker configuration can support 1000 clients or publishers (GPS networks individual stations). The system can be scaled up by creating NaradaBrokering broker networks. Message order was preserved in all tests.

Contributions A SOA approach to create a common platform to support both archival and real-time geospatial data in data-centric Grids. Merging Web Services and Open Geographic Standards for supporting interoperability at both data and application levels. We have shown that the GIS Services can be implemented as streaming services. Integration of Binary XML Frameworks with the Streaming Services shows performance gains for long network distances. We have shown that the Sensor Grids can be built on top of the publish/subscribe middleware. Real-Time continuous data support is realized in a Service Architecture. Scalable architecture implementation for large number of sensor networks.