XMC Cat: An Adaptive Catalog for Scientific Metadata Scott Jensen and Beth Plale School of Informatics and Computing Indiana University-Bloomington Current.

Slides:



Advertisements
Similar presentations
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
Advertisements

A PPARC funded project AstroGrid Framework Consortium meeting, Dec 14-15, 2004 Edinburgh Tony Linde Programme Manager.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Building an Operational Enterprise Architecture and Service Oriented Architecture Best Practices Presented by: Ajay Budhraja Copyright 2006 Ajay Budhraja,
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
Principles of Personalisation of Service Discovery Electronics and Computer Science, University of Southampton myGrid UK e-Science Project Juri Papay,
As computer network experiments increase in complexity and size, it becomes increasingly difficult to fully understand the circumstances under which a.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Apache Axis: A Set of Java Tools for SOAP Web Services.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
The Earth System Grid Discovery and Semantic Web Technologies Line Pouchard Oak Ridge National Laboratory Luca Cinquini, Gary Strand National Center for.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Magdi Latif Regional Knowledge and Information Management Officer FAO Partnership, Advocacy and Capacity Development Division FAORNE Jordan Plant Genetic.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
The Design Discipline.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. THE NEON APPROACH TO DATA INGEST, CURATION, AND SHARING Christine Laney (Data.
OFC304 Excel 2003 Overview: XML Support Joseph Chirilov Program Manager.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
1 If I Could Start All Over Again: Lessons To be Learnt From The HE Community Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
® IBM Software Group © 2007 IBM Corporation J2EE Web Component Introduction
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Web Services Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Scott Jensen Data to Insight Center Indiana University University of Chicago – March 2, 2012.
Smart Qualitative Data: Methods and Community Tools for Data Mark-Up SQUAD Libby Bishop Language and Computation Day University of Essex 4 October 2005.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
ANDS and its Services Phenomics Data & Informatics Workshop 2010, Friday, 23rd April 2010.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
Large Scale Nuclear Physics Calculations in a Workflow Environment and Data Provenance Capturing Fang Liu and Masha Sosonkina Scalable Computing Lab, USDOE.
DOC ID © Chevron 2005 Knowledge Management and Open Standards Chevron Perspective John Hanten Chevron Technology Ventures Energistics 2007 Annual Meeting.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Technology behind using Taverna in caGrid caGrid user meeting Stian Soiland-Reyes, myGrid University of Manchester, UK
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
1 Chapter 1 Introduction to Databases Transparencies.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
1 Service Creation, Advertisement and Discovery Including caCORE SDK and ISO21090 William Stephens Operations Manager caGrid Knowledge Center February.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
IT Enablement Approaches Large Business may have hundreds of processes to be enabled by IT. Several Types of Application may be deployed –Departmental.
Introduction to Active Directory
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Application of RDF-OWL in the ESG Ontology Sylvia Murphy: Julien Chastang: Luca Cinquini:
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000.
XMC CAT Among the Stars Yiming Sun Beth Plale Scott Jensen SC11.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
Hydroinformatics Lecture 15: HydroServer and HydroServer Lite The CUAHSI HIS is Supported by NSF Grant# EAR CUAHSI HIS Sharing hydrologic data.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Presentation transcript:

XMC Cat: An Adaptive Catalog for Scientific Metadata Scott Jensen and Beth Plale School of Informatics and Computing Indiana University-Bloomington Current Projects & Future Directions Researchers have noted the deluge of scientific data being generated and the need for detailed discovery metadata to find relevant data. Additionally, as noted in a study by the UK e-Science Core Programme, “metadata is key to being able to share results”. This need for detailed metadata has lead scientific communities to develop XML schemas to describe their data products. One focus of our research is on the characteristics of scientific metadata schemata – with an important characteristic being that they are composed of complex concepts that describe a data object. Exploiting this characteristic allows the XMC Cat metadata catalog to communicate using the schema of the community in which it is implemented, while storing metadata using a generalized relational schema that can be applied across domains and is loosely coupled to the domain schema. This approach is possible due to a partitioning of the schema based on its metadata concepts, which allows for a static global ordering of the concept elements. Paired with a hybrid XML-Relational approach to shredding and storing metadata, XMC Cat provides a scalable solution with fast query response rates. The XMC Cat metadata catalog was initially developed in the context of the Linked Environments for Atmospheric Discovery (LEAD) project and is deployed as a web service using a lightweight software stack based on Apache Tomcat, Axis2, and MySQL. Features & Benefits of XMC Cat Indentify metadata concepts and create configuration files using a point-and-click web interface. XMC Cat adapts to the scientific community instead of scientists adapting to it. Searching is performed using a point-and-click web interface that walks the scientist through creating a query. Search definitions can be saved and shared – allowing scientist to share searches for experiment configurations or data know to create model instability or anomalous results – before they publish! Metadata capture can be automated to harvest additional metadata from data objects as they are added to the metadata catalog by registering post-processing plugins. A scientist’s data remains in their private workspace until they decide to publish it to the community. Metadata is easily and efficiently added incrementally – allowing scientists to monitor long-running experiments. Initially developed for atmospheric metadata, we are now implementing XMC Cat in other scientific communities. We are exploring using XMC Cat to capture astronomical metadata as well as metadata from other scientific domains. Science gateways often use a profile or extension of an existing metadata standard, so we are working on providing configurations based on publicly available domain metadata schemata. Although the focus of XMC Cat has been on managing metadata describing scientific data captured as files – particularly in binary formats, the Data to Insight Center is exploring using XMC Cat to assist scientists with managing legacy research data. In support of a long-term collaborative research network with decades of data on forest management and ecological data, we are working with researchers to implement XMC Cat as a metadata search tool over legacy databases containing decades of measurements and survey results. This research was funded in part by NSF CNS , ATM and EIA Photo credit: NOAO/AURA/NSF. Copyright WIYN Consortium, Inc., all rights reserved. Partitioning scientific metadata schemata based on the concepts they contain enables a loose coupling of a scientific community’s XML schema and XMC Cat’s relational schema - allowing XMC Cat to provide: Adaptability to a range of scientific domains. Detailed queries over a domain’s metadata concepts while using a domain-agnostic relational schema for storage. Fast query responses that return results using the domain’s metadata schema. Efficient incremental inserts of metadata. Enabling the Discovery and Reuse of Scientific Data Metadata Concepts