Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.

Slides:



Advertisements
Similar presentations
DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
Advertisements

An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Goals for RUcore o Flexible, extensible cyberinfrastructure for Rutgers University o Integrating platform for legacy information systems o Support preservation.
A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science.
The Hierarchy of Data Bit (a binary digit): a circuit that is either on or off Byte: 8 bits Character: each byte represents a character; the basic building.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil Sandy Payette Cornell Information Science.
1 Introduction The Database Environment. 2 Web Links Google General Database Search Database News Access Forums Google Database Books O’Reilly Books Oracle.
1 CS 502: Computing Methods for Digital Libraries Lecture 22 Repositories.
All Things to All People Combining Resources to Build an Integrated Digital Repository Preservation and Access for Electronic College and University Records.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
Basic features ● Document database ● Paid deployment ● JSON ● C#, HTTP REST, Java ● version 3.0.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Demonstration of repositories Fedora (Flexible Extensible Digital Object Repository Architecture) Marie Lagerwall MIDESS Partners Meeting February 9, 2007.
Knowledge Portals and Knowledge Management Tools
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Databases & Data Warehouses Chapter 3 Database Processing.
PHASE 3: SYSTEMS DESIGN Chapter 7 Data Design.
Architecting an Extensible Digital Repository Anoop Kumar, Ranjani Saigal,Rob Chavez, Nikolai Schwertner Tufts University, Medford, MA.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
Presenting Statistical Data Using XML Office for National Statistics, United Kingdom Rob Hawkins, Application Development.
1st Workshop on Intelligent and Knowledge oriented Technologies Universal Semantic Knowledge Middleware Marek Paralič,
Architecture for a Database System
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Sponsored by the National Science Foundation 1 March 15, 2011 GENI I&M Update: MD Objects and Descriptors Goals Architecture Overview –Process –Functional.
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
Fedora Content Modeling for Improved Services for Research Databases Open Repositories 2009 Mikael Karstensen Elbæk Alfred Heller Gert Schmeltz Pedersen.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Object storage and object interoperability
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Research Data Management At the Smithsonian PASIG, Washington, DC May 24, 2013.
Improving User Access to Metadata for Public and Restricted Use US Federal Statistical Files William C. Block Jeremy Williams Lars Vilhuber Carl Lagoze.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
The Mellon-Funded Fedora Project A Presentation to the European Digital Library Conference September 17, 2002 Sandy Payette and Thornton Staples.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
An Introduction to Data Modeling with Fedora Thorny Staples Fedora Commons, Inc.
Eliot Wilczek University Records Manager Digital Collections and Archives Tufts University Repositories: How are They Evolving? A NERCOMP Workshop September.
Building Foundations: Fedora, Fez, and the ADR prepared by Jessica Branco Colati ADR Project Director, Colorado Alliance of Research Libraries
Fedora, Fez, and the ADR an ePoster presented at Institutional Repositories: Disseminating, Promoting, and Preserving Scholarship Utah State University.
Managing Data Resources File Organization and databases for business information systems.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
What is a database? (a supplement, not a substitute for Chapter 1…) some slides copied/modified from text Collection of Data? Data vs. information Example:
3/24/051 Value-Added Surrogates for Distributed Content Establishing a Virtual Control Zone By: Sandra Payette & Carl Lagoze Presented By: Mohamed Elmiligui.
Introduction to DBMS Purpose of Database Systems View of Data
Intro to MIS – MGS351 Databases and Data Warehouses
The Fedora Project March 10, 2003
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Overview: Fedora Architecture and Software Features
Flexible Extensible Digital Object Repository Architecture
GSAF Grid Storage Access Framework
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
The Re3gistry software and the INSPIRE Registry
DIGITAL LIBRARY.
Introduction to DBMS Purpose of Database Systems View of Data
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Malte Dreyer – Matthias Razum
Tutorial 7 – Integrating Access With the Web and With Other Programs
Presentation transcript:

Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li

Digital Contents: not just documents… Some conventional objects Some conventional objects Complex, compound, dynamic objects Complex, compound, dynamic objects

Traditional Online Access Method The web server & application must be aware of the types and components of the managed digital contents. The web server & application must be aware of the types and components of the managed digital contents. HTML documents… HTML documents… JPG/GIF photos… JPG/GIF photos… Video clips… Video clips… Stream media… Stream media… System developer and interface designer must provide appropriate handler for these file types. System developer and interface designer must provide appropriate handler for these file types.

Traditional Data Storage Access For tuples stored in a relational database, the system developer must have knowledge of the table schema and encode it explicitly in a query: For tuples stored in a relational database, the system developer must have knowledge of the table schema and encode it explicitly in a query: Select name from authors where affiliation=“Penn State”; Select name from authors where affiliation=“Penn State”; For on-disk file access, a file path needs to be specified. For on-disk file access, a file path needs to be specified.

Problems Lack of flexibility Lack of flexibility Lack of extensibility Lack of extensibility Lack of support for complex data structures Lack of support for complex data structures Lack of security control at the data level Lack of security control at the data level Unnecessary duplicate work needs to be performed by different applications / parties Unnecessary duplicate work needs to be performed by different applications / parties

Key Research Questions How can clients interact with heterogeneous collections of complex objects in a simple and interoperable manner? How can clients interact with heterogeneous collections of complex objects in a simple and interoperable manner? How can complex objects be designed to be both generic and genre-specific at the same time? How can complex objects be designed to be both generic and genre-specific at the same time? How can we associate services and tools with objects to provide different presentations or transformations of the object content? How can we associate services and tools with objects to provide different presentations or transformations of the object content? How can we associate specialized, fine-grained access control policies with specific objects, or with groups of objects? How can we associate specialized, fine-grained access control policies with specific objects, or with groups of objects? How can we facilitate the long-term management and preservation of objects? How can we facilitate the long-term management and preservation of objects?

Look into the Nature, We Have the Hints… Data is no longer an isolate existence in most current applications. Data is no longer an isolate existence in most current applications. Metadata Metadata Structural information Structural information Legal methods that can be applied to the data Legal methods that can be applied to the data Access control policies Access control policies Links to other digital contents Links to other digital contents These features can be grouped into an integral unit, which in return simplify the applications. These features can be grouped into an integral unit, which in return simplify the applications. Sort of similar with a Java class Sort of similar with a Java class

Solution We propose a middleware which virtually represents each digital content in a generic model. This middleware separates front-end applications from the back-end storages, gives abstraction to both sides. We propose a middleware which virtually represents each digital content in a generic model. This middleware separates front-end applications from the back-end storages, gives abstraction to both sides.

Persistent ID (PID) Default Disseminator SystemMetadata Datastream (item) Digital object identifier Service Perspective: methods for disseminating “views” of content Internal: key metadata necessary to manage the object Item Perspective: Set of content or metadata items Digital Object Model Architectural View Datastream (item) Your Extension

A Well-Known Digital Object Management System: Fedora

Fedora Service Framework

My Project Work Bring new features into the previous framework. Bring new features into the previous framework. Fedora does not provide powerful indexing and query capabilities. Fedora does not provide powerful indexing and query capabilities. Full text indexing based on Lucene. Full text indexing based on Lucene. Dynamic field indexing. Dynamic field indexing. Dynamically build in-memory indexing to improve query performance. Dynamically build in-memory indexing to improve query performance.