An Introduction to Data Modeling with Fedora Thorny Staples Fedora Commons, Inc.

Slides:



Advertisements
Similar presentations
Fedora TM and Repository Implementation at UVa Leslie Johnston, UVa Library DASER Summit November 22, 2003.
Advertisements

EThOSnet Repositories and Web Services Workshop 2 nd June2009 Richard Green
DuraSpace, Fedora and DuraCloud Triangle Research Libraries Network September, 2009.
DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
XML: Extensible Markup Language
Goals for RUcore o Flexible, extensible cyberinfrastructure for Rutgers University o Integrating platform for legacy information systems o Support preservation.
A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
Introduction to Databases
Depositing e-material to The National Library of Sweden.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil Sandy Payette Cornell Information Science.
Properties of Text CS336 Lecture 3:. 2 Information Retrieval Searching unstructured documents Typically text –Newspaper articles –Web pages Other documents.
Policy-Carrying, Policy-Enforcing Digital Objects Sandra Payette Project Prism - Cornell University DLI2 All-Projects Meeting June 14, 2000.
The Mellon-Funded Fedora Project A Briefing for the Los Alamos National Laboratory August 26, 2002 Sandy Payette Cornell Information Science.
Web Service Architecture Part I- Overview and Models (based on W3C Working Group Note Frank.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
Modernizing the Data Documentation Initiative (DDI-4) Dan Gillman, Bureau of Labor Statistics Arofan Gregory, Open Data Foundation WICS, 5-7 May 2015.
Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.
DIGITIZATION OF RARE LIBRARY MATERIALS Metadata Format Access to Digital Documents © Adolf Knoll, National Library of the Czech Republic.
DuraSpace, Fedora and DuraCloud Thorny Staples Director, Community Strategy and Alliances ESIP Meeting, July 8, 2009.
FEDORA at Northwestern University Bill Parod Academic Technologies Northwestern University
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
Research Data Management At the Smithsonian Using SIdora Nano Tech Working Group May 15, 2014.
Fedora Commons Overview and Future Plans Sandy Payette, Executive Director Cornell University Library Metadata Working Group June 13, 2008.
The DigiTool to FDA Program Lydia Motyka Florida Center for Library Automation.
Completing the Model Common Problems in Database Design.
Information Network Overlay Architecture Adding Value to Digital Content Carl Lagoze CS 431 – May 4, 2005 Cornell University.
Next-generation databases Active databases: when a particular event occurs and given conditions are satisfied then some actions are executed. An active.
Library Repositories and the Documentation of Rights Leslie Johnston, University of Virginia Library NISO Workshop on Rights Expression May 19, 2005.
The Sustaining Digital Scholarship Project ETD 2005 Thornton Staples Digital Library Research and Development University of Virginia Library.
The Mellon-Funded Fedora Project A Briefing for the Cornell University Library January 24, 2002 Sandy Payette Thorny Staples Ross Wayland.
Andrew S. Budarevsky Adaptive Application Data Management Overview.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Introduction to the Semantic Web and Linked Data
Modul 4 Struktur Informasi Mata Kuliah Preservasi Informasi Digital.
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
MyGrid/Taverna Provenance Daniele Turi University of Manchester OMII f2f Meeting, London, 19-20/4/06.
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation.
BI Practice March-2006 COGNOS 8BI TOOLS COGNOS 8 Framework Manager TATA CONSULTANCY SERVICES SEEPZ, Mumbai.
 Metadata’s Next Top Model RDF and Its Impact on Digital Library Technology Juliet L. Hardesty, Metadata Analyst Digital Library Brown Bag February 18,
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Research Data Management At the Smithsonian PASIG, Washington, DC May 24, 2013.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
The Mellon-Funded Fedora Project A Presentation to the European Digital Library Conference September 17, 2002 Sandy Payette and Thornton Staples.
What is Fedora Commons, and Why Should You Care? Cole Hudson and Graham Hukill.
Data Models. 2 The Importance of Data Models Data models –Relatively simple representations, usually graphical, of complex real-world data structures.
XML 1. Chapter 8 © 2013 Pearson Education, Inc. Publishing as Prentice Hall SAMPLE XML SCHEMA (XSD) 2 Schema is a record definition, analogous to the.
Fedora Digital Object in a Nutshell Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Working Group: Data Foundations and Terminology (Practical Policy Considerations) Reagan Moore.
Building Foundations: Fedora, Fez, and the ADR prepared by Jessica Branco Colati ADR Project Director, Colorado Alliance of Research Libraries
Fedora, Fez, and the ADR an ePoster presented at Institutional Repositories: Disseminating, Promoting, and Preserving Scholarship Utah State University.
The Fedora Project March 10, 2003
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Overview: Fedora Architecture and Software Features
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
9/22/2018.
eSciDoc – Content model requirements
Data Model.
Metadata Framework as the basis for Metadata-driven Architecture
Metadata The metadata contains
Presentation transcript:

An Introduction to Data Modeling with Fedora Thorny Staples Fedora Commons, Inc.

Fedora Abstractions Data objects Content models Behaviors of objects Policies about objects Relationships among objects

DC Persistent ID RELS-EXT AUDIT n n Reserved Datastreams Custom Datastreams (any type, any number) A data object is one unit of content POLICY

Content Models Create classes of data objects Expressed as Cmodel objects A Cmodel object defines the number and types of data streams for objects of that class A Cmodel object binds to service objects to enable appropriate behaviors to be inherited by data objects

Optional Object Behaviors Data objects can have different views or transformations Sets of abstract behaviors that different kinds of objects can subscribe to Corresponding sets of services that specific objects can execute The business logic is hidden behind an abstraction

Persistent ID (PID) Service Definition Metadata SystemMetadata Datastreams Cmodel Object Persistent ID (PID) Service Binding Metadata (WSDL) SystemMetadata Datastreams Web Service service contract service subscription data contract Persistent ID (PID) RDF data Datastreams System Metadata Service Mechanism Object Service Definition Object Persistent ID (PID ) System Metadata Datastreams Data Objects

For example: Can also add parameter and a date-time stamp to access earlier versions of the object A behavior call is a URL that contains : Object PID + SDef Name + Method Name

Policies Machine enforceable expressions of rules, what they are applied to and who they affect. Who is affected can be defined in different authorization sources, such as LDAP services Rules can be as simple as “allow” or “deny”. Rules are applied to objects as a whole, any datastream, or a dissemination, as well as each API call and more.

Historical Census Object 1870 Aggregate Census file of the US A character data representation of the dataset is the master It has a datastream that is used to access an SQL database that is only accessible through the object The SQL data can always be rebuilt from the character data It has a DDI codebook which has descriptive and structural metadata about the object

Relationships Among Objects Describes adjacency relationships among objects RDF data of the form: PID – typeOfRelationship – relatedObjectPID Can used to assemble aggregations of objects

Content Modeling Styles Atomistic objects have a small number of datastreams that are each expressions of the whole, with relationships to other objects Many objects Much more flexible Compound objects usually include many datastreams, including information about the whole and its parts Forces a mixture semantics of whole and parts More difficult to take advantage of the Fedora abstractions cleanly

Book Objects XML file is the main datastream that represents the book as a whole Using the atomistic approach, a book with 400 pages would be 401 objects Using the compound approach, 1 object with as many as 1201 datastreams for image files and the book file Example

Objects Representing Aggregations Creating parent objects for complex resources Representing explicit collections Representing implicit collections Creating digital surrogates for physical entities

Explicit Aggregations The parent aggregation object has explicit references to the PIDs of its children These references can be relationships listed in the objects Rels-Ext datastream Or they can be PIDs embedded in an XML datastream that gives a more descriptive context and can explicitly order them Example

Implicit Aggregations One object that represents the aggregation as a whole –information about its meaning –Rules for how to find the members Any number of objects can assert an “isMemberOf” relationship to the PID of the aggregation

Collections can be expressed as implicit aggregations PID5 PID3 PID2 isMemberOfCollection Query PID1 isMemberOfCollection Collection Object Resource Index