Semantic Metadata for Scientific Data Access and Management Richard M. Keller, Ph.D. Group Lead for Information Sharing & Integration Intelligent Systems.

Presentation transcript:

Semantic Metadata for Scientific Data Access and Management Richard M. Keller, Ph.D. Group Lead for Information Sharing & Integration Intelligent Systems Division NASA Ames Research Center February 17, 2005 ROSES Workshop

Focus of Work
Scientific data management, not data analysis
Computational infrastructure related to storing, locating, searching, integrating, and sharing scientific data

Specific Problems
Integrating Heterogeneous Scientific Data from Multiple Sources
Searching/finding Relevant Scientific Data
Organizing/indexing Data for Rapid, Intuitive Access

Culprit: Inadequate Metadata
Metadata is typically limited to essentials only (e.g., data format, instrument, date)
– inadequate for extensive indexing, precise searching
Each data repository defines its own metadata, using its own terminology and data dictionary
– difficult to search across repositories
– difficult to integrate and combine datasets
No common frame of reference for cross-repository comparison
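To make the cross-repository problem concrete, here is a minimal sketch, assuming two hypothetical repositories that describe the same kind of data with different field names and terms. The records, the field crosswalk (`FIELD_MAP`), and the synonym table (`TERM_MAP`) are all invented for illustration and are not taken from the presentation; the sketch only shows why a search fails without a common frame of reference and succeeds once one is supplied.

```python
# Hypothetical records from two repositories (all values are invented).
REPO_A = [{"instr": "O2 microsensor", "fmt": "csv", "acq_date": "2004-06-01"}]
REPO_B = [{"instrument_name": "oxygen microsensor", "file_type": "CSV", "date": "2004-06-02"}]

# Assumed crosswalk: local field name -> shared field name.
FIELD_MAP = {
    "repo_a": {"instr": "instrument", "fmt": "format", "acq_date": "date"},
    "repo_b": {"instrument_name": "instrument", "file_type": "format", "date": "date"},
}

# Assumed synonym table: local term -> shared term.
TERM_MAP = {"oxygen microsensor": "o2 microsensor"}


def normalize(record, repo):
    """Rewrite one record into the shared field names and terms."""
    out = {}
    for key, value in record.items():
        shared_key = FIELD_MAP[repo][key]
        term = value.strip().lower()
        out[shared_key] = TERM_MAP.get(term, term)
    return out


def search(records, **criteria):
    """Return the records whose fields equal every given criterion."""
    return [r for r in records if all(r.get(k) == v for k, v in criteria.items())]


# Searching the raw records fails: neither repository uses the shared vocabulary.
raw_hits = search(REPO_A + REPO_B, instrument="o2 microsensor")

# After normalization, the same query finds the records from both repositories.
merged = [normalize(r, "repo_a") for r in REPO_A] + [normalize(r, "repo_b") for r in REPO_B]
hits = search(merged, instrument="o2 microsensor")
```

A hand-maintained crosswalk like this is only a stand-in for a shared vocabulary; it does not capture relationships among data products.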

Common Approach
To facilitate storage, retrieval, integration, and comprehension of scientific data: capture the semantic metadata that provides a rich context for each data product

What is “semantic metadata”?
Semantic Metadata: information relating to the context in which the scientific data are generated and used
– how?
– when?
– where?
– why?
– who?

Early Microbial Ecosystems Investigation
Trace gas production and consumption under “Early Earth” conditions
Detailed studies of mat biogeochemistry: monitoring, analysis, experimentation
Geographically dispersed team of collaborators: B. Bebout, D. Des Marais, T. Hoehler, et al. (Code SSX)
[Figure captions: Collection of microbial mats in the field; Greenhouse Incubator; Microbial mat (algae)]

Semantic Context Surrounding Mat “4b” (“Semantic Network”)
[Figure: semantic network surrounding mat sample “4b”. Link labels: collected-at, collected-by, stored-in, has-measurement, measured-with, has-culture, cultivated-by, has-recipe, imaged-with, has-image. Node labels: Spring Beach, Brad Bebout, Greenhouse, O2 Microsensor, O2 Concentration, HBC-2 Microbial culture, Culture prep B notes for Lee, Culture recipe, Mary Hogan, Electron Microscope.]

Semantic Network Structure
Nodes: key info resources or organizational structures (describing people, places, measurements, hypotheses)
Links: relationships among resources (e.g., “measured by”, “supports hypothesis”)
Attributes: properties of resources (metadata)
Attached files: electronic products associated with resources (e.g., datasets, images, documents)
Ontology: specifies the types of nodes, attributes and links defined for scientific investigation
Rules: add/modify nodes, links & attributes in the network
[Figure labels: example nodes culture, photo, measurement, site, instrument, sample, hypothesis; example attributes date, size, format]
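The structure on this slide (nodes, links, attributes, attached files, an ontology, and rules) can be sketched as a small data structure. This is a minimal illustration under stated assumptions, not the SemanticOrganizer implementation: the class names, the `measurement-of` link, the rule, and all identifiers and values are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    node_id: str
    node_type: str                                   # must be a type named in the ontology
    attributes: dict = field(default_factory=dict)   # metadata, e.g. date, size, format
    files: list = field(default_factory=list)        # names of attached files


class SemanticNetwork:
    def __init__(self, node_types, link_types):
        # Ontology: the allowed node types and, for each link name,
        # the (source type, target type) pair it may connect.
        self.node_types = set(node_types)
        self.link_types = dict(link_types)
        self.nodes = {}
        self.links = set()                           # (source_id, link, target_id)

    def add_node(self, node):
        if node.node_type not in self.node_types:
            raise ValueError(f"unknown node type: {node.node_type}")
        self.nodes[node.node_id] = node

    def add_link(self, source_id, link, target_id):
        pair = (self.nodes[source_id].node_type, self.nodes[target_id].node_type)
        if self.link_types.get(link) != pair:
            raise ValueError(f"ontology does not allow {link} between {pair}")
        self.links.add((source_id, link, target_id))

    def targets(self, source_id, link):
        """Return the ids reachable from source_id over the named link."""
        return sorted(t for s, l, t in self.links if s == source_id and l == link)

    def apply_rule(self, rule):
        # A rule inspects the current links and proposes new ones,
        # which are still checked against the ontology.
        for source_id, link, target_id in rule(set(self.links)):
            self.add_link(source_id, link, target_id)


def inverse_measurement_rule(links):
    """Hypothetical rule: add 'measurement-of' for every 'has-measurement' link."""
    return {(t, "measurement-of", s) for s, l, t in links if l == "has-measurement"}


# Invented example data.
net = SemanticNetwork(
    node_types={"sample", "measurement", "instrument"},
    link_types={
        "has-measurement": ("sample", "measurement"),
        "measured-with": ("measurement", "instrument"),
        "measurement-of": ("measurement", "sample"),
    },
)
net.add_node(Node("mat-1", "sample", {"date": "2004-06-01"}))
net.add_node(Node("o2-run-1", "measurement", {"format": "csv"}, ["o2_run_1.csv"]))
net.add_node(Node("sensor-1", "instrument"))
net.add_link("mat-1", "has-measurement", "o2-run-1")
net.add_link("o2-run-1", "measured-with", "sensor-1")
net.apply_rule(inverse_measurement_rule)
```

Checking every link against the ontology, including links produced by rules, is one way to keep the network consistent; a production system would likely also validate attribute names per node type.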

Scientific Data Collection Ontology (partial)
[Figure: partial ontology diagram. Node type labels: Scientific Information Nodes, project, experiment, measurement, site, equipment, sample, culture, person, document, image, DNA sequence, other. Further labels: researcher, lab tech; photographic image, SEM image, micrograph; camera, gas chromatograph, stub, O2 microsensor, N2 microsensor, SEM, spectrometer; O2 concentration, N2 concentration, spectrograph, chromatogram. Link labels: cultivated-from, cultivated-by, has-genetic-sequence, pictured-in.]

Benefits of Semantic Metadata Approach
Semantic context provides a unifying framework for integrating data across data collections
Sophisticated “semantic search” methods allow retrieval based on semantic relationships among data
Intuitive data indexing, access, and organization schemes derive from semantic data models
Formal semantic representation enables automated inference about the data
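As a rough illustration of retrieval based on relationships and of simple automated inference, the sketch below stores a network as (source, link, target) triples and answers a query by following links. It is not the search method used in SemanticOrganizer; the triples, identifiers, and the `image-of` inverse link are hypothetical.

```python
# Hypothetical triples (source, link, target); all identifiers are invented.
TRIPLES = {
    ("sample-1", "collected-at", "site-A"),
    ("sample-2", "collected-at", "site-B"),
    ("sample-1", "has-image", "image-1"),
    ("sample-2", "has-image", "image-2"),
    ("sample-1", "has-culture", "culture-1"),
    ("culture-1", "has-image", "image-3"),
}


def sources(triples, link, target):
    """Nodes that point at `target` over `link`."""
    return {s for s, l, t in triples if l == link and t == target}


def targets(triples, starts, link):
    """Nodes reached from any node in `starts` over `link`."""
    return {t for s, l, t in triples if s in starts and l == link}


def images_from_site(triples, site):
    """Images of samples collected at `site`, or of cultures grown from them."""
    samples = sources(triples, "collected-at", site)
    cultures = targets(triples, samples, "has-culture")
    return sorted(targets(triples, samples | cultures, "has-image"))


def infer_inverse(triples, link, inverse_link):
    """Simple inference: add an inverse triple for every `link` triple."""
    return set(triples) | {(t, inverse_link, s) for s, l, t in triples if l == link}


print(images_from_site(TRIPLES, "site-A"))  # ['image-1', 'image-3']
```

A keyword search for "site-A" would not find `image-3`, because the image is connected to the site only through the sample and its culture; following the links is what surfaces it.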

Challenge
Semantic metadata approach has been applied to small, PI-maintained data repositories
Tremendous volume of earth and space science data is stored in huge, curated data repositories maintained by NASA, USGS, ESA, universities, and others
How to translate semantic metadata ideas to operate on the scale of large data repositories?
Seeking Collaborators!

SemanticOrganizer System (Mat Sample: Spring-M4-b)

Photo: SprM4b excised

What is ScienceOrganizer?
A Web-based collaborative knowledge management tool for distributed teams of scientific investigators
Facilitates information sharing, integration, correlation
A project information repository / digital library: users upload/download heterogeneous project information products (images, datasets, documents, and various types of scientific records describing samples, field sites, measurements, instruments, etc.)
Features cross-linkage: enables rapid access to interrelated information; permits linking data and observations to scientific hypotheses
Supports inference capabilities: permits formal reasoning about the repository contents
A “project archive” system: tracks history of project team’s fieldwork, labwork, and associated data collection activities

ScienceOrganizer Users
ARC Microbial Ecosystems Group: field & lab science, experiments, data analysis.
NAI Ecogenomics Focus Group: cross-discipline collaboration, data analysis.
ARC Electron Microscopy Lab: electron microscopy image archiving, sample cataloging.
MARTE Mission: analog Mars drilling mission, support for remote science data acquisition, storage, and access.
JSC Astrobiology Institute for the Study of Biomarkers: electron microscopy image archive, sample collection, cataloging, and storage; support for education & outreach.
NIH/NASA Malaria Control Study: African malaria study, data collection and archiving.
ASU/NSF Desert Microbial Survey (NSF): microbial survey; provides publicly accessible repository.
Mobile Agents Demonstration Project: analog Mars surface exploration, support for remote science data acquisition, storage, and access.
Astrobionics Technology Integration: technology infusion program.