Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Chapters 14 & 15 Internet Databases. E-Commerce  Bringing new products, services, or ideas to market, supporting and enhancing business operations 
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Information Retrieval in Practice
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Information Integration. Modes of Information Integration Applications involved more than one database source Three different modes –Federated Databases.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Overview of Search Engines
Client-Server Processing and Distributed Databases
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Inter-American Workshop on Environmental Data Access Panel discussion on scientific and technical issues Merilyn Gentry, LBA-ECO Data Coordinator NASA.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Metadata Tools and Methods Chris Nelson Metanet Conference 2 April 2001.
Introducing Dreamweaver MX 2004
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
AERONET Web Data Access and Relational Database David Giles Science Systems and Applications, Inc. NASA Goddard Space Flight Center.
THE GITB TESTING FRAMEWORK Jacques Durand, Fujitsu America | December 1, 2011 GITB |
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
Introduction to MDA (Model Driven Architecture) CYT.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Interfacing Registry Systems December 2000.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Information System Development Courses Figure: ISD Course Structure.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Federation and Fusion of astronomical information Daniel Egret & Françoise Genova, CDS, Strasbourg Standards and tools for the Virtual Observatories.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
N NESSTAR: A Semantic Web Application for Statistical Data and Metadata Pasqualino “Titto” Assini Nesstar Ltd - UK.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
SDMX IT Tools Introduction
AstroGrid NAM 2001 Andy Lawrence Cambridge NAM 2001 Andy Lawrence Cambridge Belfast Cambridge Edinburgh Jodrell Leicester MSSL.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
XMC Cat: An Adaptive Catalog for Scientific Metadata Scott Jensen and Beth Plale School of Informatics and Computing Indiana University-Bloomington Current.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
5/19/05 New Geoscience Applications 1 A DISTRIBUTED WORKFLOW DATABASE DESIGNED FOR COREWALL APPLICATIONS Bill KampBill Kamp, Lumnilogical Research Center,
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
1 The XMSF Profile Overlay to the FEDEP Dr. Katherine L. Morse, SAIC Mr. Robert Lutz, JHU APL
Information Retrieval in Practice
Search Engine Architecture
Applying Domain-Specific Modeling Languages to Develop DRE Systems
Chapter 1 Database Systems
Intermountain West Data Warehouse
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Web Mining Department of Computer Science and Engg.
Metadata The metadata contains
Chapter 1 Database Systems
Presentation transcript:

Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000

Current Status Global Astrophysics Data Resources Loosely Connected By the Internet – Observational data archives or repositories – Derived data products (astronomical catalogs, browse images, video) – Data analysis packages – Visualization/presentation packages – Special services (bibliography, discipline-specific knowledge bases, directories) Distributed Storage, Processing, and Management – Multispectral surveys (Data volume ~ terabytes) – Islands of Information? Requires both Vertical and Horizontal Integration

Path to the Future Current (connections via hyperlinks): one to one Near Future (connections to multiple DBs all at once, via middleware): one to many Long Term (multiple inter- connectivity, federated databases): many to many - Distributed Autonomous data centers - Intelligent Agents - User-defined Profiles and Preferences - Access via Multiple Interfaces User Middleware Resources

Components of Interoperability Integrated search and discovery - URL Registry (e.g., Yellow Pages, GLU, AstroBrowse) - Query processor (e.g., AMASE, ISAIA) - Browsing/visualization to support selection (ADC Data Viewer, AEQ) - Batch queries (Feed output stream of one data service to another) - Tools to support integration of results Data and software exchange - FTP of data and software updates (pull) - Download of Browser Plug-in (pull) - Automated Updates (HST DB replication) (push) - Hybrid Techniques (with data cache or aircache) (push & pull) - Packaging of software with data (XDF)

Technical Issues and Challenges Example: Positional correlation of objects in a region of the sky across multiple wavelengths (Radio, IR, Optical, UV, X-rays, Gamma Rays) Data volume and network bandwidth – Cache of pre-computed results (e.g., astronomical catalogs) – Data filtering at data site, ship results only – Deployment of user code (platform independent S/W) – Data visualization for exploration and selection Registration, Sensitivity, Positional Accuracy – Coordinate transformation on a large scale – Calibration and normalization Query Optimization across Multiple Sites – Query execution plan for efficient cross-correlation – Indexing for fast access

Semantic Interoperability Content-based Searches – Science goal driven queries instead of SQL Data Understanding (Domain Context) – Human Interface —> S/W Mapping —> Object-oriented Mapping Data Annotation for Correct Interpretation – Measured parameters, units, quality, range of validity – Algorithm and calibration used, pedigree – Theoretical models applied Data Organization – File directory structure – Database schema Need Information in both Machine-understandable and Human-understandable form

Metadata Standards Syntax – Directory Structure – Size, Format, Location, URL Semantics – Usage Convention (e.g., FITS) –Extensible Standards to Encompass Different Disciplines (DTD, XML) – Astronomical Nomenclature and Designation Conceptual Data Model Metadata Language or Representation – FITS, ASCII, IEEE Binary –Astronomical XML

Aspects of Metadata Usage [Ref: Bretherton & Singley 1994 Proc of 7th SSDBM, p. 166] Search, browse, retrieval (Human) –Data extraction and interpretation –Navigate among services Ingest, quality assurance, (re-)processing –Science product generation pipeline –Content analysis Storage, archive (Data Management) –Information relevant for effective system design and operation Application to application transfer (Machine) –Enable “ context ” interchange (distributed queries and transformations) Need transfer language with mappings from conceptual level to different logical representation

Other Supporting Tools Interface Standards for Software Tools Tools for Schema Mapping – Document logical structure of database (key elements and relationship) – Mapping of local definitions into common terminology – Track changes and updates at other sites Tools for Data Integration and Fusion – Dynamic Interface with user preferences – Intelligent Software Agents to mediate interaction Goal: Global query to many distributed autonomous evolving data resources