Federated Service Oriented Information Management Ahmet Sayar

Slides:



Advertisements
Similar presentations
IVOA, Pune India September Data Access Layer Working Group Pune Workshop Summary Doug Tody National Radio Astronomy Observatory International.
Advertisements

September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Remote Visualisation System (RVS) By: Anil Chandra.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
UCSD SAN DIEGO SUPERCOMPUTER CENTER Ilkay Altintas Scientific Workflow Automation Technologies Provenance Collection Support in the Kepler Scientific Workflow.
SWIM WEB PORTAL by Dipti Aswath SWIM Meeting ORNL Oct 15-17, 2007.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
Federated Hierarchical Filter Grids STTR-funded project with Indiana, Caltech and Deep Web Technologies A Grid infrastructure for Data Analysis Integrates.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Impromptu Data Extraction and Analysis Data Mining and Analytics Framework for VLSI Designs Sandeep P
February Semantion Privately owned, founded in 2000 First commercial implementation of OASIS ebXML Registry and Repository.
GRAPPA Part of Active Notebook Science Portal project A “notebook” like GRAPPA consists of –Set of ordinary web pages, viewable from any browser –Editable.
Virtual Observatory --Architecture and Specifications Chenzhou Cui Chinese Virtual Observatory (China-VO) National Astronomical Observatory of China.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
Managing Service Metadata as Context The 2005 Istanbul International Computational Science & Engineering Conference (ICCSE2005) Mehmet S. Aktas
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Jan Storage Resource Broker Managing Distributed Data in a Grid A discussion of a paper published by a group of researchers at the San Diego Supercomputer.
DISTRIBUTED COMPUTING
WSRF Supported Data Access Service (VO-DAS)‏ Chao Liu, Haijun Tian, Dan Gao, Yang Yang, Yong Lu China-VO National Astronomical Observatories, CAS, China.
EdSkyQuery-G Overview Brian Hills, December
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
AIXM Users’ Conference, March Implementing AIXM in Instrument Flight Procedures Automation Presenter: Iain Hammond MacDonald, Dettwiler &
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
MapServer Support for Web Coverage Services Stephen Lime - Minnesota DNR Dr. Thomas E. Burk - University of Minnesota MUM Ottawa, Canada.
GIS On The Web: An Overview of ArcIMS. *The easy flow of geographic data can offer real-life solutions in many societal sectors, including municipal government,
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
* Working Group 4. 2 AstroGrid-D Meeting, Heidelberg Tobias Scholl Astrometric Matching Prototype (D4.2) 50 RASS-BSC sources Correlation with.
Chris Kuruppu NWS Office of Science and Technology Systems Engineering Center (Skjei Telecom) 10/6/09.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
Kepler includes contributors from GEON, SEEK, SDM Center and Ptolemy II, supported by NSF ITRs (SEEK), EAR (GEON), DOE DE-FC02-01ER25486.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
RSISIPL1 SERVICE ORIENTED ARCHITECTURE (SOA) By Pavan By Pavan.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
A Data Access Framework for ESMF Model Outputs Roland Schweitzer Steve Hankin Jonathan Callahan Kevin O’Brien Ansley Manke.
1 MESSAGE EXCHANGE FOR Web Service-Based Mapping Services AHMET SAYAR INDIANA UNIVERSITY COMMUNITY GRIDS LAB. COMPUTER SCIENCE DEPARTMENT August 17, 2005.
Introduction to The Storage Resource.
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
REST By: Vishwanath Vineet.
Biomedical Informatics Research Network The Storage Resource Broker & Integration with NMI Middleware Arcot Rajasekar, BIRN-CC SDSC October 9th 2002 BIRN.
Improving User Access to Metadata for Public and Restricted Use US Federal Statistical Files William C. Block Jeremy Williams Lars Vilhuber Carl Lagoze.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
Glossary WMS – OGC Web Mapping Services WFS – OGC Web Feature Services XML- Extensible Markup Language OGC – Open GIS Consortium ADN –
Publishing Combined Image & Spectral Data Packages Introduction to MEx M. Sierra, J.-C. Malapert, B. Rino VO ESO - Garching Virtual Observatory Info-Workshop.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
VO Data Access Layer IVOA Cambridge, UK 12 May 2003 Doug Tody, NRAO.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Ideas on Opening Up GEOSS Architecture and Extending AIP-5 Wim Hugo SAEON.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
The Data Grid: Towards an architecture for Distributed Management
Middleware independent Information Service
Federated Hierarchical Filter Grids
Google Sky.
Presentation transcript:

Federated Service Oriented Information Management Ahmet Sayar

Introduction Aim is utilizing distributed heterogeneous information and knowledge provided by different repositories and vendors in an efficient and robust manner. No agreed upon –useful- architecture framework for  Federating  Obtaining  Analyzing  Interpreting the heterogeneous distributed data/information for decision makers in scientific application domains

Motivation SOA based on Web Services Information Sources are “Filters”:  A service inputs DIKW (Data-Information-Knowledge-Wisdom Hierarchy) from Grid and outputs DIKW  Web Services, easy to extend and federate.  Easy to publish, located and bind.  predictable input and output interfaces and defined by metadata Information management through ASIS (Application Specific Information System) framework in Science Domains such as GIS. Data and metadata concepts and formats A repository or sensor has or gets DIKW from "outside Grid"; it outputs DIKW

Problem Recognition Vector data Bitmap data netCDF Bar graphs Coverage data Image jpeg XML data Statistics data Plots images Binary data Interactive Tools DB Raw Data Data Information Knowledge Wisdom Decisions SS HDF5

Problem Recognition Services like discovery and notification do not need to be made application specific. BUT If the domain changes then :  choices,  Database requirements,  data format,  core service requirements,  attributes, and  metadata context CHANGES ! What are the common concepts and characteristics for  Data,  Metadata,  Query Language,  Services, and  Communication language, in order to drive information/knowledge from the heterogeneous data/information sources in Application Domains ?

Overall Structure Solution ASL : Application Specific Language. XML based hierarchical data representation format.  Cross language, platform and operating system ASVS : Application Specific Visualization System  Last filter before the decision maker.  Provides information/knowledge in human readable formats ASFS : Application Specific Feature Service.  Stores and provides common data model (ASL) Treat binary and common data (in ASL) differently. ASFS AS “Sensor” AS Tool (generic) AS Service (user defined) AS Tool (generic) ASVS Display Message Using ASL AS Repository

Overall Structure Solution -cont Common data (in ASL) is kept in ASFS. Enables interactive querying through GUI. Tentative architecture. In the DIKW world, everything is mixed as data and filters In a given domain every filter speaks in ASL ASVS both visualize information and provide a way of navigating ASFS and their underlying DB. ASVS can itself be federated and present output interface. GIS and Astronomy have some standards but not many others have

Example (1): GIS Domain (OGC) FS-2 MD Vector data FS-1 FS Raster data FS-3 FS-4 Interactive Decision Support Data capability FS-1 : Master Filter (WMS)  Providing the available data list and capabilities to the end user clients - Interactive tools FS-2 : Web Feature Server  Provides vector data such as rivers, state and city boundaries in GML FS-3 : Web Map Server  Provides image data in the form of jpeg, svg, png etc. Defined in its capabilities file FS-4 : Web Coverage Server  Provides coverage (raster) data. Grided data, pixel info Query : No Standard – Filter specification – SQL Data Encodings : GML, images Metadata : capability doc. No event notification – we use WSContext for asynchronous run Registry : WRS – MD Queryable Data in : WFS (Nasa) (CGL) (Minnesota) Data:a Data:b Data:b Data:c Data:a Data:b Data:c PORTAL Data:a Data:b Data:c

Example (2): Astronomy Domain (IVOA) FS-2 DB FS-1 DB FS-3 FS-1 : VOPlot  Integrating, Interacting visualization tools FS-2 : SkyNode  ADQL based SOAP interface returning VOTable based results FS-3 : SIA  2D sky projection, logically a grid of pixels encoded as a FITS image FS-4 : SSA  URL-based returning a dataset "document" (VOTable) Query : ADQL –extension of SQL Data Encoding: VOTable, FITS Metadata : UCD, VOResource Event notification : VOEvent Registry : VORegistry QueryableData in : SSAP and SIAP, VOStore DB FS-4 MD Interactive Decision Support Data capability PORTAL

Interactive Decision Support Tools - Interactive query, - Interactive display, movie and animation - Integration to Application Science Simulations (R. Williams et al.)

Issues To Be Discussed (1) Requirements for the domain metadata in capability  What does capabilities do and need to have to federate filters? Requirements for the ASL (such as CML, GML)  What does ASL need to have to federate the filters? Concept of data (such as feature, coverage)  Common representation? Possible? To what extend? A common information management framework which can be applied to any domain.  some instructions- any field, what needs to be done

Issues To Be Discussed (2) Application level data/information federation Integrating the system with application science simulations. Creating interactive decision support tools utilizing integrated filter services.  Tools for map animation, map movies, images  Interactive query support to get further information on the image and/or animation. Enabling binding of services into pipelines with or without human intervention through metadata. Caching and load balancing to handle large scientific data in an efficient and robust manner (application based)

Summary of SRB & Ogsa-DAI SRB  Storage Resource Broker  Uniform access to dist. heterogeneous data resources by attributes  Catalog service is MCAT (Metadata Catalog Service)  Resource and data location transparency  Remote authentication authorization – user groups  Not just for access, transferring and replicating  Sample projects using SRB: BIRN and NASA IPG Ogsa-DAI  Open Grid Service Architecture - Data Access and Integration  Access to heterogeneous data via common interfaces on the grid.  Catalog service is MCS (Metadata Catalog Service)  OGSI-compliant Grid  Components are Grid services. Resources should be registered.  Sample projects using Ogsa-DAI : LEAD, MyGrid

Discussions on SRB & Ogsa-DAI SRB  Monolithic – does too much  MCAT dependent  MCAT has limited support for application-level metadata Need diff metadata for diff domain, and extensions for applications  Not standard based – Not open source  Not handling data based on DIKW hierarchy Ogsa-DAI  At the data and Database level  MCS dependent  MCS has limited support for application-level metadata Need diff metadata for diff domain, and extensions for applications  For Grid applications - GGF standards  Data only in relational and XML database or ordinary files  Not handling data based on DIKW hierarchy

Our Work Compared to SRB & Ogsa-DAI MasterSRB Ogsa-GDSF RRR FS RRR Wisdom decisions, knowledge and information extraction by the user -Reusable components Filter Services with specific ports and interfaces -Distributed DIKW abstraction -Metadata in capability document -Metadata aggregators -New metadata for different domains -User uses just getData interface to query Ready to use information and knowledge -Central data access abstraction. Uniform access to heterogeneous data sources -Metadata : SRB/MCAT, Ogsa- DAI/MCS -Both provides extensible metadata arch for diff domains -SRB has “zone” concept address similar issues but different Wisdom decisions Information/knowledge Data access and query SRB Agents Ogsa/GDS

Why are we different ? SOA (Service Oriented Architecture)  Easy to extend  Reusable components  Cross platform and language.  XML based hierarchical data representation Easy data integration Easy querying Human readable information Easy to access data – no command line  Interactive tools  On the fly query creation. Not only accessing data but also transforming through its path to end users. Ports to integrate application simulation to application specific information system (ASIS)

Contributions Instructions how to build ASL and metadata in capability for the application sciences. Instructions how to build application specific information system (ASIS) federating multiple filters speaking ASL. Information grid (ASIS) formalization through capabilities metadata, defining all the data/information sources as interacting Web Service filters with standard metadata service ports. Optimize and enhance the distributed heterogeneous information management.

THANKS Ahmet Sayar