Electronic Notebooks: An Interface Component for Semantic Records Systems James D. Myers, Michael Peterson, K Prasad Saripalli, Tara Talbott Mathematics.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Technical and design issues in implementation Dr. Mohamed Ally Director and Professor Centre for Distance Education Athabasca University Canada New Zealand.
Grid Content Management Jim Myers PNNL. GFS-WG Aims to –describe and manage the namespace of federated data sets, access control mechanisms, and meta-
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
Ontology Notes are from:
Capturing and Supporting Contexts for Scientific Data Sharing via the Biological Sciences Collaboratory George Chin Jr. and Carina S. Lansing (PNNL) Appeared.
An Architecture for Creating Collaborative Semantically Capable Scientific Data Sharing Infrastructures Anuj R. Jaiswal, C. Lee Giles, Prasenjit Mitra,
1 Workshop on Metadata Interoperability for Electronic Records Management November 15, 2001 Archives II, College Park, MD.
eGovernance Under guidance of Dr. P.V. Kamesam IBM Research Lab New Delhi Ashish Gupta 3 rd Year B.Tech, Computer Science and Engg. IIT Delhi.
1 Component Description CMU Note-Taker Tools Human Computer Interaction Institute Carnegie Mellon University Prepared by: Bill Scherlis March 26, 1999.
Mapping Physical Formats to Logical Models to Extract Data and Metadata Tara Talbott IPAW ‘06.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
IBM User Technology March 2004 | Dynamic Navigation in DITA © 2004 IBM Corporation Dynamic Navigation in DITA Erik Hennum and Robert Anderson.
Data Integration in Service Oriented Architectures Rahul Patel Sr. Director R & D, BEA Systems Liquid Data – XML-based data access and integration for.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Research sponsored by Mathematics, Information and Computational Sciences Office U.S. Department of Energy Al Geist Jens Schwidder David Jung Computer.
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Towards a Provenance Architecture Karen Schuchardt PNNL.
Fundamentals of XML Management Greg Alexopoulos Systems Engineer Documentum.
Peer-to-Peer Data Integration Using Distributed Bridges Neal Arthorne B. Eng. Computer Systems (2002) Supervisor: Babak Esfandiari April 12, 2005 Candidate.
1st Workshop on Intelligent and Knowledge oriented Technologies Universal Semantic Knowledge Middleware Marek Paralič,
Managed by UT-Battelle for the Department of Energy 1 Integrated Catalogue (ICAT) Auto Update System Presented by Jessica Feng Research Alliance in Math.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Environmental Molecular Sciences LaboratoryDOE Security Workshop Electronic Notebooks (Collaboratories) James D. Myers EMSL Collaboratory Project Pacific.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Informational Objects TypeExamples 1. Structured Items Vouchers, Travel Orders, Invoices, Purchase Orders 2. Semi-Structured Items Letters, Memoranda,
Exploitation of Dynamic Information Relations in the Service-Oriented AFRL Information Management Systems Andrzej Uszok, Larry Bunch, Jeffrey M. Bradshaw.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Data Provenance and Annotation Dec. 2, 2003 Collaboratory for Multi-scale Chemical Science (CMCS): A Knowledge Grid/ Adaptive Informatics Infrastructure.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
N NESSTAR: A Semantic Web Application for Statistical Data and Metadata Pasqualino “Titto” Assini Nesstar Ltd - UK.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Create Content Capture Content Review Content Edit Content Version Content Version Content Translate Content Translate Content Format Content Transform.
XML stands for Extensible Mark-up Language XML is a mark-up language much like HTML XML was designed to carry data, not to display data XML tags are not.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Scientific Annotation Middleware (SAM) Jim Myers, Elena Mendoza PNNL Al Geist, Jens Schwidder ORNL.
Adapting the Electronic Laboratory Notebook for the Semantic Era Tara Talbott, Michael Peterson, Jens Schwidder, James D. Myers 2005 International Symposium.
DSpace - Digital Library Software
SEEK Science Environment for Ecological Knowledge l EcoGrid l Ecological, biodiversity and environmental data l Computational access l Standardized, open.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
Application of RDF-OWL in the ESG Ontology Sylvia Murphy: Julien Chastang: Luca Cinquini:
Scientific Annotation Middleware: Data/Metadata Access/Transformation Services On/Off the Grid James D. Myers Pacific Northwest National Laboratory.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Overview: Fedora Architecture and Software Features
Analyzing and Securing Social Networks
Presentation transcript:

Electronic Notebooks: An Interface Component for Semantic Records Systems James D. Myers, Michael Peterson, K Prasad Saripalli, Tara Talbott Mathematics and Computational Science Directorate Pacific Northwest National Laboratory

2 OutlineOutline Why have an electronic notebook? The changing science/IT landscape Semantic repositories Scientific Annotation Middleware ENs on semantic repositories The ELN on SAM

3 Secure shared WWW based space Hierarchical Chapters/Pages/Notes Add/View/Search Notes File upload, sketch, text, equations, forms, image capture, … Interactive views of data Editor/Viewer APIs Cross-out capability Digital Signatures/Timestamps Java Client, Perl and Java (2001+) servers … PNNL Electronic Laboratory Notebook (ELN) ~1995+

4 What distinguishes ENs from other tools? Emphasis on multimedia human-entered information Chronological, page-oriented display Master/personal project record Records functionality: Non-repudiation - digital signatures and timestamps Persistence/completeness - write-once/no deletions/audit trail Standardized lifecycle – signing/witnessing policies, archiving, retention schedules, …

5 The Systems Science Revolution Community Resources Bi-directional flow/feedback of information Partial results being combined to produce new knowledge Experiment/Theory/Model comparisons Multiscale optimizations Rapid Evolution High Complexity Shifting/Emerging disciplinary boundaries Resources will be distributed With multiple curators

6 Advances in Problem Solving Environments/Grids/Semantic Technologies Multiple Applications recording data Pedigree/Provenance Experiment Metadata Project Organization Workflow Categorization Detected Features Instrument logs … Replica Locations Endorsements Community Annotations … How do we provide EN capabilities in this larger context?

7 Semantic Repositories Use self-describing metadata/relationships Triple-stores RDF OWL Aggregate information generated by multiple applications Allows browsing, searching, reasoning across integrated information

8 Scientific Annotation Middleware (SAM) - 5 yr DOE funded research project Develop middleware to create semantic repositories Enable the sharing of this information among portals and problem solving environments, software agents, scientific applications, and electronic notebooks With different levels of sophistication Without global schema Improve the completeness, accuracy, and availability of the scientific record.

9 SAM Architecture Notebook Services Semantic Services Metadata Services DataGrid Database Web DAV, DASL, JMS, SAM Extensions DAV, JDBC, GridFTP

10 Web Distributed Authoring and Versioning (WebDAV) An early web service Put/Get data with arbitrary properties (dynamic) Properties can be discovered and accessed independently DASL, Versioning, Transactions, … Widely supported (MS Office, databases, file system drivers,…)

11 Binary Format Description (BFD) Language XML Language to describe ASCII, Binary, and XML data formats Generic Parser to extract and semantically tag data in files/streams The meaning of data can be captured, regardless of format, for future use Data Format Description Language Standard XSL Stylesheet (reformat) XSLT Processor XML Format 2 BFD Parser BFD Description 1 XML Format 1 File Format 1 meters <XBFDvalue-of select="/XSIL/Param 4 <Stream Type=“remote” XBFDStreamnumber=“0” Encoding = “biinary”/>

12 SAM Metadata Services Layer Jakarta Slide DAV server plus configurable: Mapping to Data Store(s) Property Generation from binary/ASCII/XML files Dynamic Virtual Translations Server generated Properties and Relationships Timestamp, size, CopyOf Fortran Application ‘Local Disk’ DAV DAV+ Content ELNProp1 Prop2 Prop1 hastranslation BFD Web Service XSLT … Translated Content RDF Export

13 SAM Semantic Services Layer SAM Metadata Layer plus configurable: Relation-scoped Queries Translation of DAV Properties to RDF Triples RDF/GXL Pedigree Generation …

14 Back to ENs… What is needed to be able to provide Unstructured human entry of information? Chronological, page-oriented display? A master/personal project record? Records functionality?

15 Creating Notes A ‘standard’ ELN client can create notes Stored as content with a hasNote relationship with pages, notes Plus…any app can store notes the same way Page generation – works as before

16 ENs as a Primary View? Instruments, PSEs, etc. may organize parts of the experiment that an EN should not duplicate  define other relationships as part of the EN chapter/page/note hierarchy: Project Experiment1 Experiment2 Data1 Data2  Notebook1 Chapter1 Chapter2 Page1 Page2 Defined by PSEInterpreted by EN

17 Records?Records? Digital Signatures, Timestamps, etc. are services that can be exposed as repository services and associated metadata But What do we sign (content/metadata)? Where is the edge of the record? How deep do we travel through the web of relations? How do we stop other applications from changing/deleting signed content?

18 Multiple Options Simple: Sign content plus defined subset of metadata Stop at edge of server Treat relationship cycles as links Lock content and metadata subset when signed Advanced: Multiple self-describing signatures (e.g. XMLSignature) Allow records across servers via trust, cached metadata/data Define fine-grained retention schedules

19 SAM Notebook Services Layer SAM Metadata and Semantic Layers plus: Notebook Management, Page Display, … Digital Signatures Canonicalization Notarized Timestamps Data/Signature Migration Capabilities Notebook API, Notebook Components Supports ELN 5.1, Annotation Applet, new portal-based EN client EN Portlets

20 Collaboratory for Multiscale Chemical Science (CMCS) SAM as primary data system, pedigree, notebook NEESgrid/CHEF Portal/NMI Grid User Computing Environments ELN, SAM as a metadata/pedigree store? Genomes-To-Life SAM as annotation/metadata repository, notebook Internal PNNL Projects Concept Map Repository, Interface to Lustre, Biological Data Annotation DOE2000 Notebook Community ( addresses) Upgrades to DOE2K Notebooks E.g. Columbia University Environmental Science Lab Notebooks CollaborationsCollaborations

21 A Scientific Content Repository Vision Notebooks become just one view of the scientific information Applications contribute data, metadata, and relationships directly Records functionality provided by middleware, available to multiple applications Content is stored in multiple repositories managed independently The scientific record becomes richer and re-integrated

22 AcknowledgmentsAcknowledgments Carina Lansing, PNNL Al Geist, Jens Schwidder, David Jung, ORNL U.S. Department of Energy Pacific Northwest National Laboratory Pacific Northwest National Laboratory is a multiprogram national laboratory operated by Battelle Memorial Institute for the U.S. Department of Energy under Contract DE-AC06-76RL Oak Ridge National Laboratory Oak Ridge National Laboratory is a multiprogram national laboratory operated by UT-Battelle, LLC for the U.S. Department of Energy under Contract DE-AC05-00OR22725 Mathematical, Information and Computational Sciences Division of the Office of Science