Managing Dynamic Metadata and Context Mehmet S. Aktas.

Slides:



Advertisements
Similar presentations
Siebel Web Services Siebel Web Services March, From
Advertisements

TSpaces Services Suite: Automating the Development and Management of Web Services Presenter: Kevin McCurley IBM Almaden Research Center Contact: Marcus.
Chapter 10: Designing Databases
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Web Service Ahmed Gamal Ahmed Nile University Bioinformatics Group
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
This product includes material developed by the Globus Project ( Introduction to Grid Services and GT3.
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
The Problem: Integrating Data, Applications, and Client Devices The key issue we try to solve is building the distributed computing infrastructure that.
Distributed components
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
Technical Architectures
Notes to the presenter. I would like to thank Jim Waldo, Jon Bostrom, and Dennis Govoni. They helped me put this presentation together for the field.
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
Software Engineering and Middleware: a Roadmap by Wolfgang Emmerich Ebru Dincel Sahitya Gupta.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
.NET Mobile Application Development Introduction to Mobile and Distributed Applications.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
A Web Services Based Streaming Gateway for Heterogeneous A/V Collaboration Hasan Bulut Computer Science Department Indiana University.
Secure Systems Research Group - FAU Web Services Standards Presented by Keiko Hashizume.
A Scalable Framework for the Collaborative Annotation of Live Data Streams Thesis Proposal Tao Huang
THE NEXT STEP IN WEB SERVICES By Francisco Curbera,… Memtimin MAHMUT 2012.
Managing Service Metadata as Context The 2005 Istanbul International Computational Science & Engineering Conference (ICCSE2005) Mehmet S. Aktas
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
1 HKU CSIS DB Seminar: HKU CSIS DB Seminar: Web Services Oriented Data Processing and Integration Speaker: Eric Lo.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
International Telecommunication Union Geneva, 9(pm)-10 February 2009 ITU-T Security Standardization on Mobile Web Services Lee, Jae Seung Special Fellow,
Distributed Systems: Concepts and Design Chapter 1 Pages
Web Services Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
Managing Dynamic Metadata and Context Mehmet S. Aktas Advisor: Prof. Geoffrey C. Fox.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Web Services Based on SOA: Concepts, Technology, Design by Thomas Erl MIS 181.9: Service Oriented Architecture 2 nd Semester,
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
Dr. Azeddine Chikh IS444: Modern tools for applications development.
Web Services. Abstract  Web Services is a technology applicable for computationally distributed problems, including access to large databases What other.
Web Services Presented By : Noam Ben Haim. Agenda Introduction What is a web service Basic Architecture Extended Architecture WS Stacks.
1 Managing Dynamic Metadata and Context Mehmet S. Aktas Computer Science, Informatics, Pervasive Technology Laboratories Indiana University Bloomington.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
Hwajung Lee.  Interprocess Communication (IPC) is at the heart of distributed computing.  Processes and Threads  Process is the execution of a program.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Web Service Resource Framework WSMO Presentation Jos de Bruijn Digital Enterprise Research Institute http ://
1 G52IWS: Web Services Chris Greenhalgh. 2 Contents The World Wide Web Web Services example scenario Motivations Basic Operational Model Supporting standards.
Course: COMS-E6125 Professor: Gail E. Kaiser Student: Shanghao Li (sl2967)
A Demonstration of Collaborative Web Services and Peer-to-Peer Grids Minjun Wang Department of Electrical Engineering and Computer Science Syracuse University,
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Managing Metadata in Service Architectures Mehmet S. Aktas Advisor: Prof. Geoffrey C. Fox.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Problem On a regular basis we use: –Java applets –JavaScript –ActiveX –Shockwave Notion of ubiquitous computing.
1 Web Service Information Systems and Applications GGF16 Semantic Grid Workshop Athens Greece February Geoffrey Fox Computer Science, Informatics,
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
Scaling and Fault Tolerance for Distributed Messages in a Service and Streaming Architecture Hasan Bulut Advisor: Prof. Geoffrey Fox Ph.D. Defense Exam.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Service Oriented Architecture (SOA) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Chapter 1 Characterization of Distributed Systems
Sabri Kızanlık Ural Emekçi
Computer Science Department
Grid Services B.Ramamurthy 12/28/2018 B.Ramamurthy.
Introduction to Web Services
Distributed System using Web Services
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Information Services for Dynamically Assembled Semantic Grids
Distributed System using Web Services
Presentation transcript:

Managing Dynamic Metadata and Context Mehmet S. Aktas

2 of 29 Outline  Introduction  Problem Statement, Hypothesis, Design Goals  Literature Survey  Research Issues  Milestones  Contributions  Summary

3 of 29 Context  Def: "Context is any information that can be used to characterize the situation of an entity, where an entity can be a person, place, or computational object.“ Dey A. et al, 1999  Context is metadata associated to both services and their activities  Context can be independent of any interaction  static context  Examples: type or endpoint of a service, less likely to change  dynamic context  Examples: throughput of a service, likely to change over time generated as result of interaction  information associated to an activity or session  Examples: session-id, URI of the coordinator of a session

4 of 29 Gaggle of Services  Gaggle of Services are set of actively collaborating managed services put together for a particular functionality, such as collaboration, visualization or sensor Grid collaborate for a particular common goal  Example: emergence preparedness and response are actively generate events as result of interactions are very small part of the whole Grid

5 of 29 Motivation  Current Grid Information Services provide information describing services independent of their interactions.  We need management of all information associated with services for; correlating activities of widely distributed services  workflow-style, SOA based applications management of events especially in multimedia collaboration  distributed session management  for instance; audio, video, audio/video meetings in Chinese Olympics

6 of 29 Motivation II  More reasons for management of Context enabling uniform query capabilities to both dialog or monolog context information  “Give me list of services satisfying C:{a,b,c..} QoS requirements and participating S:{x,y,z..} sessions” enabling real-time replay/playback capabilities in collaboration based sessions enabling session failure recovery

7 of 29 Application Use Domain  Multimedia Collaboration domain: Global MMCS multiple A/V services talk to various collaboration clients and services defines a general session collaboration protocol (XGSP) XSGP enables different collaboration tools to talk to each other e.g. AccessGrid, H.323 needs a distributed session management systems  Characteristics of the domain widely distributed services metadata of events (archival data)  mostly read-only  persistent, but lifetime is bounded to lifetime of events

8 of 29 Application Use Domain - II  Workflow-style distributed application: Geographic Information System Grid sensor grid data services generates events when a certain magnitude event occurs firing off various codes, filtering, analyzing raw data, generating images, maps needs a distributed context management to correlate workflow activities  Characteristics of domain any number of widely distributed services can be involved conversation metadata  transient  multiple writers

9 of 29 1 WMS GUIWFS HP Search Data Filter PI Code Data Filter Context Information Service 2 5,6, , session profile information related WMS user profile shared data for HPSearch activity activity shared state <soap:Header encodingStyle=“WSCTX URL" mustUnderstand="true"> http.. <activity-list mustUnderstand="true" mustPropagate="true"> SOAP header for Context 1.session associated dynamic metadata 2.user profile 3.activity associated dynamic metadata 4.service associated dynamically generated metadata What are the examples of dynamically generated metadata in a real-life example? 3,4: WMS starts a session, invokes HPSearch to run workflow script for PI Code with a session id 5,6,7: HPSearch runs the workflow script and generates output file in GML format (& PDF Format) as result 8: HPSearch writes the URI of the of the output file into Context 9: WMS polls the information from Context Service 10: WMS retrieves the generated output file by workflow script and generates a map HPSearch associated additional data generated during execution of workflow. service associated

10 of 29 Problem Statement What is a novel process of building Information Services, maintaining dynamic session-related metadata of widely distributed services, providing uniform interface to both interaction-independent and conversation-based context?

11 of 29 Hypothesis  A fault-tolerant, high performance, scalable information system maintaining widely distributed dynamically generated metadata for Gaggle of Services providing uniform interface to context information  utilization of existing Grid Information Services for interaction-independent context to improve search capabilities enabling coordination of widely distributed services in Gaggles  workflow-style Grid applications enabling distributed event management and various capabilities for A/V conferencing applications  discovery of entities in a session  enabling playback/replay capabilities,  enabling session failure recovery

12 of 29 Architectural Design Goals  Key Design Goals of our Design scalability  with respect to #  widely distributed services performance  high responsiveness, reduced access latency fault tolerance  high availability of information  robust to replica crashes flexibility  accommodate broad range of application domains  read-dominated, read/write dominated

13 of 29 Literature Survey  Main Stream Grid Information Services MDS, R-GMA, UDDI (Grimories)  Specifications for stateful service interactions WS-CAF, WSRF, WS-Metadata Exchange  Linda TupleSpaces coordination model

14 of 29 MDS4-(GT4)R-GMA (European Data Grid) Grimories UDDI Extension (myGrid) Functionalitymonitoring and discoveryperformance monitoring, information registry and discovery of services and workflows Componentsaggregator services, information sources registry, producersregistry Provided dataapplication-oriented resource-oriented, stateful interaction data application-oriented, resource-oriented application- oriented Distribution, Organizational Structure decentralized, hierarchicaldecentralized, hierarchical, peer-to-peer centralized Main Stream Grid Information Services

15 of 29 Limitations in Grid Information Services  Lack of support for session related dynamic metadata MDS4 adopts WSRF approach which does not scale managing activities of multiple services sharing same state  Lack of support for advanced query capabilities ex: “Give me list of WFS services participating “fault displacement calculations” workflow session where the service connected by a network path over 2MB/sec of bandwidth with max 100 msec of latency.”

16 of 29 WS-CAF WS-Context - Key Concepts  WS Composite Application Framework (WS-CAF) WS-Context, WS-Coordination, WS-Transaction Mngmt.  WS Context defines context, context service and mapping on SOAP shared data to correlate service activities context information dependent on the type of the activity  transactional activity: the URI of the coordinator in a session context service maintains associated context participants of an activity register with context service for lifecycle of that activity

17 of 29 Web Service Resource Framework Key Concepts  defines standard interfaces and behaviors for distributed system integration standard XML-based information model standard interfaces for push and pull mode access to service data  enables every service to expose state data for query, update monitoring shared state  models resource state as private to a service  supports resource oriented approach for stateful interactions requires the identity of the resource to be passed in the SOAP message

18 of 29 WS-Metadata Exchange Key Concepts  WS Metadata is key to interactions WS-Policy: capabilities, requirements, general characteristics of services WSDL: describes message operations, supported network protocols used by services  WS-Metadata Exchange provides mechanism for sharing information about the capabilities of individual Web services allows querying a WS Endpoint to retrieve metadata about what to know to interact with them defines request/response message pairs to retrieve WS metadata

19 of 29 Limitations in Specifications for Service Communication  WSRF does not actually accomplish state management by just enabling access and update rights heterogeneous service environment workflow-style applications  WSRF, WS-Metadata Exchange models service metadata private to a service does not scale in managing activities of multiple services WS-Metadata Exchange defines only how to access interaction-independent metadata  WS-Context is promising it has limitations simple framework for context management limited query capability does not address distributed management aspects of context metadata

20 of 29 TupleSpaces Paradigm  a communication paradigm space-based asynchronous communication first described in Linda project in 1982 at Yale pioneered by David Gelernter  Linda is a coordination language using primitive operations on shared data in shared space  data-centric coordination model  communication units are tuples data-structure consisting of one or more typed fields  a TupleSpace is an intermediary container

21 of 29 JavaSpaces [Sun Microsystems]  JavaSpaces is an object oriented strongly influenced by Linda model Java based, platform independent  spaces are transactionally secure mutual exclusive access to objects  spaces are persistent temporal, spatial uncoupling  spaces are associative content based search  limitations centralized inefficient reading/writing performance dependent on stack of different software layers

22 of 29 Research Issues  Recap on key design goals: scalability, performance, fault tolerance  research issues related replicating dynamic metadata deployment (dynamic vs. static replication)  Where to place replicas of given context metadata?  What are the properties of new location must meet?  How to know if replica location stable?  How can we provide tailored replication based on R/W properties?

23 of 29 Research Issues II consistency  What is the appropriate consistency model?  How do replicas exchange replica updates in what direction?  How can we utilize an ordering capability based on NTP (Network Time Protocol) to provide consistency on the replicated context metadata?  performance efficient metadata access  How to choose a replica server to best serve client request?  How to avoid performance degradation due to repetitive queries?

24 of 29 Research Issues III  scalability load balancing strategies  How to manage load balancing?  other research issues replay/playback capabilities  How to enable real-time replay/playback capabilities? session recovery  How to enable session recovery? uniform interface to context  How to provide a uniform interface to context?

25 of 29 Milestones  Implementation of TupleSpaces paradigm  Uniform Update and Query (search, discovery) Services  Sequencer Service ensures that an order is imposed on actions/events that take place in a session

26 of 29 Milestones II  Storage (Replication) Service decide # and placement of replicas enable autonomous behavior support robust behavior for replica crashes  Access (Request Distribution) Service distribute request among object replicas  Expeditor Service generalized caching mechanism reduce storage access due to repetitive queries

27 of 29 Evaluation of Hypothesis  Qualitative evaluation Does the system delivers what it promises in terms of functionality?  Example test domains: Geographical Information System Grid, Global MMCS  How does the system function incase of replica crashes?  Quantitative evaluation How well the system delivers what it promises in terms of performance? What are the performance cost and gains brought together with scalability and fault tolerance?  trade offs between fault-tolerance, scalability and performance  what limitations does the trade offs impose to the practical use of my system?  what is # of replicas needed for certain availability?  what is the cost of fault tolerance?  what is the cost of scalability?

28 of 29 Contribution of this Thesis  Identifies a novel approach for building Information Services managing session related context.  Identifies a novel approach for providing fault tolerance and scalability while providing high performance when managing dynamic metadata Identifies a dynamic replication mechanism for widely distributed dynamic and transient metadata

29 of 29 Summary  This thesis addresses following problems Lack of support in Grid Information Services for context (session-related dynamic metadata) management to correlate activities in workflow-style applications:  by providing a novel approach for management of widely distributed, shared session-related dynamic metadata Lack of support in Grid Information Services to provide distributed session management:  by providing distributed event management system enabling session failure recovery or replay/playback capabilities Lack of search capabilities in Grid Information Services:  by providing uniform search interface to both interaction independent and conversation-based context enabling service discovery through events