The Planets Interoperability Framework Rainer Schmidt AIT Austrian Institute of Technology 1st DPIF Symposium, April 21-23, 2010,

Slides:



Advertisements
Similar presentations
웹 서비스 개요.
Advertisements

TSpaces Services Suite: Automating the Development and Management of Web Services Presenter: Kevin McCurley IBM Almaden Research Center Contact: Marcus.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 12 Slide 1 Distributed Systems Design 2.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
1 Persistent identifiers, long-term access and the DiVA preservation strategy Eva Müller Electronic Publishing Centre Uppsala University Library, Sweden.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
SOAPI: a flexible toolkit for implementing ingest and preservation workflows Mark Hedges Centre for e-Research, King’s College London Arts and Humanities.
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
May Archiving PAWN: A Policy-Driven Software Environment for Implementing Producer- Archive Interactions in Support of Long Term Digital.
1 MPEG-21 : Goals and Achievements Ian Burnett, Rik Van de Walle, Keith Hill, Jan Bormans and Fernando Pereira IEEE Multimedia, October-November 2003.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
2006 IEEE International Conference on Web Services ICWS 2006 Overview.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Software Engineering Module 1 -Components Teaching unit 3 – Advanced development Ernesto Damiani Free University of Bozen - Bolzano Lesson 2 – Components.
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
A Service for Data-Intensive Computations on Virtual Clusters Rainer Schmidt, Christian Sadilek, and Ross King Intensive 2009,
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Web Services Michael Smith Alex Feldman. What is a Web Service? A Web service is a message-oriented software system designed to support inter-operable.
Getting Started with WCF Windows Communication Foundation 4.0 Development Chapter 1.
UNIT-V The MVC architecture and Struts Framework.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Adapting Legacy Computational Software for XMSF 1 © 2003 White & Pullen, GMU03F-SIW-112 Adapting Legacy Computational Software for XMSF Elizabeth L. White.
Digital Object Architecture
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Web Services Description Language (WSDL) Jason Glenn CDA 5937 Process Coordination in Service and Computational Grids September 30, 2002.
Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Web Services based e-Commerce System Sandy Liu Jodrey School of Computer Science Acadia University July, 2002.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
Web Services. Abstract  Web Services is a technology applicable for computationally distributed problems, including access to large databases What other.
07/09/04 Johan Muskens ( TU/e Computer Science, System Architecture and Networking.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Technical Support to SOA Governance E-Government Conference May 1-2, 2008 John Salasin, Ph.D. DARPA
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Slide 1 Service-centric Software Engineering. Slide 2 Objectives To explain the notion of a reusable service, based on web service standards, that provides.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
EMEA Beat Schwegler Architect Microsoft EMEA HQ Ingo Rammer Principal Consultant thinktecture
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
De Rigueur - Adding Process to Your Business Analytics Environment Diane Hatcher, SAS Institute Inc, Cary, NC Falko Schulz, SAS Institute Australia., Brisbane,
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Service Oriented Architecture (SOA) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
An Introduction to Tessella and The Safety Deposit Box Platform
Joseph JaJa, Mike Smorul, and Sangchul Song
Web Ontology Language for Service (OWL-S)
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
Service-centric Software Engineering
Service-centric Software Engineering 1
Implementing an Institutional Repository: Part II
Service Oriented Architecture (SOA)
Malte Dreyer – Matthias Razum
Implementing an Institutional Repository: Part II
SDMX IT Tools SDMX Registry
Presentation transcript:

The Planets Interoperability Framework Rainer Schmidt AIT Austrian Institute of Technology 1st DPIF Symposium, April 21-23, 2010, Dresden, Germany. Integrated Access to Preservation Tools

DPIF Symposium, April 21-23, 2010, Dresden Outline  Overview of the Integrated Environment  Main Objectives and Architecture  Planets Preservation Services  Digital Objects and Metadata  Integrating Repositories  The Workflow Execution Engine (WEE)  Conclusions & Lessons Learned

DPIF Symposium, April 21-23, 2010, Dresden Planets Project  “Permanent Long-term Access through NETworked Services”  Addresses the problem of digital preservation  driven by National Libraries and Archives  Project instrument: FP6 Integrated Project  5. IST Call  Consortium: 16 organisations from 7 countries  Duration: 48 months, June 2006 – May 2010  Budget: 14 Million Euro 

DPIF Symposium, April 21-23, 2010, Dresden The Planets Interoperability Framework  An integrated System for the development and evaluation of preservation strategies.  Uniform access mechanisms to a broad range of “commodity” tools, e.g. for characterization, migration, emulation.  Integration of existing repositories, data/metadata formats.  Specification, execution, recording of preservation workflows.  Integration with end-user applications for preservation planning and the evaluation of tools/strategies.  PLANETS Preservation Planning Tool and Testbed

DPIF Symposium, April 21-23, 2010, Dresden Agents and Activities Preservation Expert IF Gateway Server > Digital Library/Repository > Preservation Services > Application Provisioning Provenance Experiment Repository Data Model Mapping Service Orchestration Access Pres. Applications Service Registration Data Transfer Deposit Result > User Management Export Digital Objects

DPIF Symposium, April 21-23, 2010, Dresden Service-Orientated Architecture  XML Web Services (SOAP, WSDL, WS-*)  Platform, Language, and Location Independence  Homogeneous interfaces for preservation activities, data management, workflow execution.  Remotely access repositories and data.  Discover and dynamically utilize tools in a workflow.  Supports distributed and cross-organizational deployments  Shared hardware, software, maintenance  Browser-based access to large number of resources

DPIF Symposium, April 21-23, 2010, Dresden Service Gateway Architecture Preservation Planning Tool Experimentation Testbed Application Notification and Logging System Workflow Execution UI Workflow Execution and Monitoring Experiment Data and Metadata Repository Service and Tool Registry Application Services Execution Services Data Access Services Administration UI Authentication and Authorization User Applications Portal Services Application Execution and Data Services Physical Resources, Computers, Networks

DPIF Symposium, April 21-23, 2010, Dresden Preservation Interfaces  Define atomic preservation activities (level-one)  Concentrates on low-level concepts and actions Bit-stream operations, no data management  Designed to be light-weight and easy to implement  Independent from a specific tool, language, or content type  E.g. Characterize, Migrate, Compare, CreateView  >50 Tools wrapped/provided as Planets Services  Provides the basic abstractions for assembling workflows.

DPIF Symposium, April 21-23, 2010, Dresden Preservation Interfaces (the Verbs)  Define atomic preservation activities (level-one)  Concentrates on low-level concepts and actions Bit-stream operations, no data management  Designed to be light-weight and easy to implement  Independent from a specific tool, language, or content type  E.g. Characterize, Migrate, Compare, CreateView  >50 Tools wrapped/provided as Planets Services  Provides the basic abstractions for assembling workflows.

DPIF Symposium, April 21-23, 2010, Dresden Digital Objects  Generic data abstraction for modeling digital entities.  Encapsulates content and metadata  Consumed and/or produced by Planets preservation services  Provides minimal and generic model for data management  Stored in Object Repository  Does not prescribe serialization schema  May be created from DC/ORE RDF record and be  serialized using METS/PREMIS schemas.

DPIF Symposium, April 21-23, 2010, Dresden Digital Objects Content Digital Object Properties Events Metadata contains_object fragment Type, Time, Agent, Service, Result, … Creator, Title, Description, Format, … Embedded Data or Repository URL Relationships (possibly associated with event) Tagged Uninterpreted Metadata Chunks

DPIF Symposium, April 21-23, 2010, Dresden Digital Object Managers  Individual adapters for retrieving (& storing) Planets DOs  Provide access to existing repositories.  Map metadata records to Planets DOs  Ingest digital objects to Planets data repositories  Current implementation for  retrieving OAI-PMH records, BL digitized newspaper, Web resources, Amazon S3 buckets, …  Planets Data Registry services (ingesting DOs) based on Apache Jackrabbit and Fedora Commons.

DPIF Symposium, April 21-23, 2010, Dresden

Data Registry  A service to deposit, access, and organize Planets digital objects based on bi-directional Digital Object Manager.  Accessible to Workflow Execution Engine  Records Experiment and Preservation Metadata  Supports Export of Experiment Results  A Repository that implements Planets Digital Object Model and naming schema (Planets URIs).  Supports asynchronous pass-by-reference and direct access to binary Content (Content Resolver)

DPIF Symposium, April 21-23, 2010, Dresden Data Registry  A service to deposit, access, and organize Planets digital objects based on bi-directional Digital Object Manager.  Accessible to Workflow Execution Engine  Records Experiment and Preservation Metadata  Supports Export of Experiment Results  A Repository that implements Planets Digital Object Model and naming schema (Planets URIs).  Supports asynchronous pass-by-reference and direct access to binary Content (Content Resolver)

DPIF Symposium, April 21-23, 2010, Dresden

Workflow Orchestration  Separation of concerns:  Fragments of complex workflow logic (templates) are implemented by >  > selected from predefined templates, configure them, and execute individual processes.  Templates implement abstract and reusable processes definitions based on level-on operations (API) and decision logic.  Execute in trusted environment (level-two)  handle digital objects in metadata repository and  basis for recording provenance and preservation information

DPIF Symposium, April 21-23, 2010, Dresden Workflow Execution Engine (WEE) Service Template WEE Template Rep. Service Workflow Client Application Cmp. WEE Execution Service Cmp. > XML > Experimenter Workflow Developer

DPIF Symposium, April 21-23, 2010, Dresden

Summary  Research infrastructure for  integrating variety of tools and repositories  executing defined preservation operations  recording provenance and preservation metadata  Not necessary an “out-of-the-box” solution  Extensible network of services,  Public deployment,  Allows sharing of resources and results.  Downloadable package available for local installation of selected preservation tools/services.

DPIF Symposium, April 21-23, 2010, Dresden Conclusions (1) - Preservation Actions  Defined interfaces for Preservation Actions required  Prerequisite for QA and other complex pres. strategies (workflows)  Preservation strategy often trivial (complexity within the tool)  Automation and Quality Control are key issues  Verifiability of technical interoperability is crucial  Depends much on communication method (native, DSL) keep as simple as possible  Semantic interop. requires well defined properties and metrics often domain dependent defined tests and benchmarks required

DPIF Symposium, April 21-23, 2010, Dresden Conclusions (2) - Component Framework  The Planets IF provides an environment for preservation components to run and interact  Distributed system required for extensibility and integration  Service interfaces specified at exchange language level (HTTP, SOAP, WS* Specs.)  Interoperability often not a problem of specification but of inconsistencies in different implementations  3rd party tools impose multiple levels of indirection  OS calls, different languages, different middleware stacks  Supporting (proprietary) tools may impact hosting environment and factors like performance, robustness, and fault tolerance.

DPIF Symposium, April 21-23, 2010, Dresden Conclusions (3) - Repository Integration  Planets provide a flexible approach for bridging access to heterogeneous repository systems.  Diverse APIs, metadata representation, data access  Stds. exist (OAI-ORE, RDF) but not yet adopted  Missing standards for integration of digital preservation actions with digital repository systems  (a) Defined Methods for Access, Re-Ingest, Versioning  (b) Entirely integrated with repository  Considerable efforts required to adapt data management systems in place