CDAWeb+ Java client: an Experiment in Integrating Disparate Data Systems Robert M. Candey (Code 612.4) Reine A. Chimiak (Code.

Slides:



Advertisements
Similar presentations
1 Copyright © 2002 Pearson Education, Inc.. 2 Chapter 1 Introduction to Perl and CGI.
Advertisements

The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
CIM2564 Introduction to Development Frameworks 1 Overview of a Development Framework Topic 1.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
Introduction to Web Database Processing
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
Introduction to Web Interface Technology (CSE2030)
Ch 12 Distributed Systems Architectures
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
1 The World Wide Web. 2  Web Fundamentals  Pages are defined by the Hypertext Markup Language (HTML) and contain text, graphics, audio, video and software.
Client/Server Architectures
INTRODUCTION TO WEB DATABASE PROGRAMMING
Page 1 An Overview of the Space Physics Data Facility (SPDF) for ViRBO Team Presented 8/31/2007 at GMU [ ]
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
Time Table exchange QSAS / CL / CAA / AMDA CESR, 25/26 feb
ASP.NET The.NET Framework. The.NET Framework is Microsoft’s distributed run-time environment for creating, deploying, and using applications over the.
DATA COMMUNICATION DONE BY: ALVIN SAMPATH CARLVIN SAMPATH.
The Old World Meets the New: Utilizing Java Technology to Revitalize and Enhance NASA Scientific Legacy Code Michael D. Elder Furman University Hayden.
11/16/2012ISC329 Isabelle Bichindaritz1 Web Database Application Development.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
EGY Meeting March Page 1 The Data Policy for NASA's Heliophysics Science Missions & the eGY Geoscience Information Commons D. A. Roberts.
At A Glance VOLT is a freeware, platform independent tool set that coordinates cross-mission observation planning and scheduling among one or more space.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
1 HKU CSIS DB Seminar: HKU CSIS DB Seminar: Web Services Oriented Data Processing and Integration Speaker: Eric Lo.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
National Center for Supercomputing Applications NCSA OPIE Presentation November 2000.
1 Importance of SECAA Web Services for the Integrated SEC Data Environment Robert M. Candey (Code 632) Bernard T. Harris (Code.
NOVA Networked Object-based EnVironment for Analysis P. Nevski, A. Vaniachine, T. Wenaus NOVA is a project to develop distributed object oriented physics.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Building from Where We Are: Web Services and Java-Based Clients to Enable Virtual Observatories Robert M. Candey (Code 612.4)
May 2003National Coastal Data Development Center Brief Introduction Two components Data Exchange Infrastructure (DEI) Spatial Data Model (SDM) Together,
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
March 2004 At A Glance autoProducts is an automated flight dynamics product generation system. It provides a mission flight operations team with the capability.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
HDF EOS Workshop David Han Code
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
NOVA A Networked Object-Based EnVironment for Analysis “Framework Components for Distributed Computing” Pavel Nevski, Sasha Vanyashin, Torre Wenaus US.
Information Technology: GrADS INTEGRATED USER INTERFACE Maps, Charts, Animations Expressions, Functions of Original Variables General slices of { 4D Grids.
05/22/2006 VxO Workshop 1 A Virtual Observatory for the Ionosphere-Mesosphere- Thermosphere Community D. Morrison, M. Weiss, R. Daley, L. Immer, S. Nylund,
1 Introduction to Servlets. Topics Web Applications and the Java Server. HTTP protocol. Servlets 2.
Plasma Node demonstrators 1/Registry : –Set of XML descriptors of planetary plasma data (MAPSKP, VEX, MEX ) –Compliant with the SPASE data model
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
WEB SERVER SOFTWARE FEATURE SETS
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
Website Design, Development and Maintenance ONLY TAKE DOWN NOTES ON INDICATED SLIDES.
The Virtual Heliospheric Observatory and Distributed Data Processing T.W. Narock 1,2, A. Szabo 2, A. Davis 3 1. L3 Communications,
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Virtual Space Physics Observatory (VSPO) (vspo in Google) 22 May 2006 Overview D. Aaron Roberts NASA GSFC.
.NET Mobile Application Development XML Web Services.
March 2004 At A Glance ITPS is a flexible and complete trending and plotting solution which provides user access to an entire mission full-resolution spacecraft.
THEMIS Orbits and Data at SPDF Dieter Bilitza 1, Robert McGuire 2, Robert Candey 2, Reine Chimiak 3, Bernard Harris 3, David.
SPDF Science Advisory Group - September 29-30, 2005 Page 12/24/2016 9:09:48 PM Services of the Space Physics Data Facility (SPDF) / Sun-Earth Connection.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
STEREO and the Virtual Heliospheric Observatory Tom Narock 1,2 Adam Szabo 1 Jan Merka 2 (1) NASA/Goddard Space Flight Center (2) L3 Communications, GSI.
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
By Jeremy Burdette & Daniel Gottlieb. It is an architecture It is not a technology May not fit all businesses “Service” doesn’t mean Web Service It is.
A Presentation Presentation On JSP On JSP & Online Shopping Cart Online Shopping Cart.
HELIO: Discovery and Analysis of Data in Heliophysics Robert Bentley, John Brooke, André Csillaghy, Donal Fellows, Anja Le Blanc, Mauro Messerotti, David.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
z/Ware 2.0 Technical Overview
MERRA Data Access and Services
improve the efficiency, collaborative potential, and
Processes The most important processes used in Web-based systems and their internal organization.
The Web Service based approach for data distribution at the IRIS DMC
Presentation transcript:

CDAWeb+ Java client: an Experiment in Integrating Disparate Data Systems Robert M. Candey (Code 612.4) Reine A. Chimiak (Code 583) John F. Cooper (Code 612.4) David B. Han (Code 586) Bernard T. Harris (Code 583) Rita C. Johnson (QSS, Code 612.4) Colin A. Klipsch (QSS, Code 612.4) Tamara J. Kovalick (QSS, Code 612.4) Howard A. Leckner (QSS, Code 612.4) Michael H. Liu (Raytheon ITSS, Code 612.4) Robert E. McGuire (Code 612.4) NASA Goddard Space Flight Center, Greenbelt MD Spring AGU SH51B May 27

Abstract The Space Physics Data Facility (SPDF) at NASA GSFC has developed a strong foundation in space science mission services and data for enhancing the scientific return of space physics research and enabling integration of these services into the emerging NASA Virtual Observatory paradigm. Our vision is that distributed components of a space physics virtual observatory work together via standard interfaces and metadata agreements to form a globally unified system, comparable to a single super-instrument from a multi-mission ensemble of many data sources. Such a unitary (but not monolithic) view presents geophysical measurements and models across time and space, enabling researchers to easily and seamlessly analyze data from many more sources than possible before. We are providing a critical set of foundation components, leveraging our data format expertise and our existing and very popular science and orbit data web-based services, such as Coordinated Data Analysis Web [CDAWeb] and Satellite Situation Center Web [SSCweb]. We have added web services for orbit location, data finding across FTP sites and in CDAWeb, data file format translation, and display. These services can now tie together existing data holdings, standardize and simplify their use, and enable much enhanced interoperability and data analysis.

Coordinated Data Analysis Web Java client (CDAWeb+) CDAWeb+ is our new Java prototype client built as an experiment in integrating disparate science data services Client enables selecting data by combinations of 5 keys (region, mission, instrument type, time span, keyword search) CDAWeb+ and its underlying web services and CDAWeb database provide: –simultaneous multi-mission, multi-instrument selection –comparison of science data via graphics, digital listings, file retrieval and merged/subsetted CDF creation CDAWeb comprises >300 datasets of current and past space missions and ground-based facilities (>1M files of science data) CDAWeb is very popular: –165k user sessions, 94k plots, 622k FTP requests, 67k ASCII listings, 586 CDF create requests, 2882 file retrieval requests in FY2004 CDAWeb data and services are available through: –FTP file access, including software and documents –Web browser (adds data listings, plots, original data files and combined files) –Web services (same as above but via SOAP API) –CDAWeb+ client for providing additional functionality of underlying CDAWlib IDL library and tying together many other services, including services external to SPDF Takes advantage of underlying standards (CDF, ISTP Guidelines, SPASE)

Why build CDAWeb+? Integrates disparate services in-house and some external services Displays results from detailed queries across many services at once Generalizes very popular CDAWeb service to call or point to many services: –CDAWeb –SSCweb (spacecraft orbit locations) –OMNIweb (solar wind, magnetic field and plasma data, energetic proton fluxes, and geomagnetic and solar activity indices) –COHOweb/Helioweb (deep space magnetic field, plasma, and spacecraft/planet ephemerides data) –ATMOweb (ionospheric and atmospheric data) –Modelweb (space physics models) –MSQS (Magnetospheric State Query System) –FTP Browser (display of subsets of NSSDC’s ftp-accessible ASCII datasets) –NSSDC Master Catalog (Oracle catalog of information about most spacecraft and instruments) –Anonymous FTP data sites (PWGdata, NSSDCftp, etc.) –APL Timed GUVI and other PI sites –Offline holdings at NSSDC Ties together variety of service protocols: FTP, CGI and SOAP web services, Oracle queries, and links Makes our existing services more visible and standardizes and simplifies their uses

CDAWeb+ Approach Table of manually entered metadata for each dataset or service XML version of metadata table, plus other information (such as CDF masters metadata and FTP filename-driven time ranges) FTP File Finder concept –Using only a few pieces of metadata, we return URLs to files matching a range of time in many FTP and some HTTP data sites –Filenaming format based on strptime % strings –Example XML (required tags in bold) ftp://cdaweb.gsfc.nasa.gov/pub/istp/ace/mfi_h0 </datasetftp://cdaweb.gsfc.nasa.gov/pub/istp/ace/mfi_h0 Web Services and SOAP for each service (see below) Java and Java 3D for clients Java WebStart for easy client install IDL on server for data compilation, listing and plotting Caveats: –No subsetting inventory of given dataset –Not attempting to uniformize identification or naming of variables

Web services are key to CDAWeb+ and VOs Services Oriented Architecture (vs. client-server) Distributed software to software communication, analogous to older technologies such as RPC, DCOM, CORBA, RMI No HTML or human interaction required Cross-platform and language-independent Enables others to develop tools and services leveraging core logic and science data and orbit information Everyone can use their own clients/tools Interoperable web services as basic components of VOs Strung together in many combinations Form an integrated system much greater than the sum of its parts Easily extendable Open to other systems and external applications by using standard distributed Application Program Interfaces (APIs) Based on XML and Simple Object Access Protocol [SOAP] standards and/or HTTP calling interface Tie together existing data holdings Standardize and simplify their use Enable much enhanced interoperability and data analysis WS Client Apache HTTPD IDL our WS Java code CDAWeb metadata database non-CDAWeb FTP/HTTP site metadata (XML) Apache Tomcat CDAWeb Server Remote Computer FTP Server Remote Computer HTTP Server Remote Computer SOAP request/ response Get dir Get dir Create execute HTTP GET data/response CDAWeb data file/ plot/listing Read create Get data file read get file

CDAWeb+ Experiment/Questions Try it soon! -- We need your feedback The experiment: How to integrate services of varying functionality? –This is an important challenge to VxOs –The current range of SPDF services is an interesting testbed Possible approaches (for instance, going directly to one spacecraft in the called service and not seeing you can select multiple spacecraft) –More comprehensive service calls simpler (subset of features): okay –Simpler service calls more powerful (superset of features): perhaps also have pointer to more powerful service main interface to get to additional functionality –Partially overlapping set of features between services: how to merge functionality? Does CDAWeb+ sufficiently enhance interoperability and data analysis capabilities? For example: –Is the dataset-centered paradigm effective? –Is it too difficult to use the large lists of results? How else can we shorten it? –Are the disclosure triangles effective? –Will IT-challenged scientists understand how to use WebStart to start the Java client? –Is keeping the navigation window open with other windows popping up helpful? How does CDAWeb+ compare to other interfaces (CDAWeb, VSPO, new SPDF Data&Orbits page, etc.)? What other search keys should we add? How else to identify data and services? Inventory level or variable level?

CDAWeb+ Concerns Metadata population (much is inherently manual and tedious) [However, separate metadata provides powerful middle layer for integrated user view] Performance and potential load on our servers Useful statistics characterizing usage (while preserving privacy) Social issues: –How to get effective credit (and usage statistics) for services when called by other services –How to give credit to other services that you are calling –How to assign responsibility to other services being called How to call services requiring logins and database queries To what extent should we point to external services? How to handle incomplete capture of other services and datasets (appear more comprehensive than really are)? How do you get a complete list of services in a given domain and maintain it? (distributed domains exponentially harder) CDAWeb+ Future Add parent/child display for grouping related variables (in progress) Display of data availability per datasets Allow multiple time spans –Event lists server (accept/send XML lists, combine lists (and/or), allow annotation, add user-defined or service-defined fields) –Bow shock crossings, Magnetopause crossings, etc. Add SSC Query (complex multi-spacecraft queries) and OMNIweb extended query (search activity indices) functions Allow user more control of plot displays (font, sizes, etc.) Add time shifting between datasets to correlate distant spacecraft Add plotting to the client (need good Java plot library) Sonification (for accessibility and as alternative mode for discovery) More pointers to external services (see above concern)??

CDAWeb+ Java client

CDAWeb+ files