Unified Digital Format Registry (UDFR) Stakeholder Meeting Library of Congress Washington, DC April 13, 14, 2011.

Slides:



Advertisements
Similar presentations
Capacity Building for Repositories Dr. Helena Asamoah-Hassan University Librarian, KNUST, Kumasi, Ghana at BioMed Open Access Africa Conference held at.
Advertisements

Do we need a GN of NGOs? Yes! (as far as participation in the GN does not reduce/affect involvement in the GP/DRR) The GN should build on existing networks.
ENTITIES FOR A UN SYSTEM EVALUATION FRAMEWORK 17th MEETING OF SENIOR FELLOWSHIP OFFICERS OF THE UNITED NATIONS SYSTEM AND HOST COUNTRY AGENCIES BY DAVIDE.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LSA Symposium: The Open Language Archives Community.
LAO PDR Summary Findings from NOSPA Mission and Possible Next Steps.
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
Cooperative Print Archiving by Domain Developing an Infrastructure to Sustain Scholarly Resources in Law & Agriculture Amy Wood Center for Research Libraries.
Post-Implementation Organization & Support Loren Blinde Director, Administrative Systems Group.
European Clearing-House Mechanism Portal Toolkit Expert Group Meeting
Data modeling Goal: Agree on data modeling process and ontology.
A centre of expertise in data curation and preservation MIS Seminar :: University of Edinburgh :: 2 October 2006 Funded by: This work is licensed under.
H a r v a r d U n i v e r s i t y L i b r a r y Global Digital Format Registry An Update July 2006.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
Building Publishing Services in the Academic Library Brian Rosenblum University of Kansas Colorado Academic Library Summit Denver, Colorado June 1, 2007.
© HATII, University of Glasgow Introduction to the UK ’ s Digital Curation Centre Prof Seamus Ross Visiting Fellow at Oxford Internet Institute ,
Navigating the Maze How to sell to the public sector Adrian Farley Chief Deputy CIO State of California
David Giaretta Associate Director (Development) Funders: DCC Development Digital Curation Centre a centre of expertise in data curation and preservation.
GEF Project Cycle Sub-Regional Workshop for GEF Focal Points in the Pacific SIDS Auckland, New Zealand, September 2008.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
Overview.  Accreditation is both a status and a process  Status:  Status: Accreditation provides public notification that standards of quality are.
Isabel Silver and Laurie Taylor IMLS Library Publishing Services Workshop May 5, 2011 UF Smathers Libraries Publishing Services.
Social Science Data and ETDs: Issues and Challenges Joan Cheverie Georgetown University Myron Gutmann ICPSR – University of Michigan Austin McLean ProQuest.
California Integrated Waste Management Board 1 Agenda Item 14 Consideration of Streamlining Grant Processes to Enhance Program Efficiency January 17, 2006.
Digital Preservation through Cooperation: LOCKSS Gail McMillan Digital Library and Archives, University Libraries Virginia Polytechnic Institute and State.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Update on UDFR (Unified Digital Format Registry) NDIIPP Meeting June 25, 2009 Andrea Goethals.
24 March 2010Atlanta, Georgia Passing it on: Notes on digital initiative sustainability Marty Kurth HBCU Library Alliance – Cornell University Library.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Professor Norah Jones Dr. Esyin Chew Social Software for Learning – The Institutional Policy of the University of Glamorgan ICHL 2012, China
California Statewide Prevention and Early Intervention (PEI) Projects Overview May 20, 2010.
ESRI User Conference, August 8, 2006 Long-term archiving of geospatial data: the NGDA project Julie Sweetkind-Singer John Banning Stanford University.
Report on the Evaluation Function Evaluation Office.
HATHITRUST A Shared Digital Repository HathiTrust and TRAC DigitalPreservation 2012 July 25, 2012 Jeremy York, Project Librarian, HathiTrust.
& Collaborating to Build an Open Access Archive of Public Policy Research Coalition for Networked Information Task Force Meeting.
ExOB Discussions on Development Test Center WRF ExOB Meeting U.S. Naval Observatory, Washington, D.C. 28 April 2006.
Family Service System Reform Grant Application Training Video FY Donna Bostick-Knox, Pennsylvania Department of Public Welfare, Office of Children.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
What Agencies Should Know About PDF/A-1 April 6, 2006 Mark Giguere
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
1 Strategic Plan for Digital Archives Programme DAP PROJECT SCOPE OVERVIEW STATUS.
eSciDoc Community Model Draft eSciDoc Community Model Overview 1.Introduction 2.Requirements on the Community Model 3.Organizational.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
UK LOCKSS Alliance: Investigation into Private LOCKSS Networks Adam Rusbridge EDINA, University of Edinburgh.
Global Digital Format Registry Progress Andrea Goethals, Harvard University Library NDIIPP Digital Preservation Partners’ Meeting Arlington, VA July 9,
GPO POLICIES AND PLANS FOR SPATIAL INFORMATION DISTRIBUTION GPO POLICIES AND PLANS FOR SPATIAL INFORMATION DISTRIBUTION Judy Russell Superintendent of.
Staffing and training. Objectives To understand approaches to the development of strategies and policies for staffing of a Regulatory Authority including.
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
Katherine Skinner, Educopia Institute Emily Gore, Clemson University U.S. Workshop on Roadmap for Digital Preservation Interoperability Framework NIST,
FY 2010 Coordinated Family and Community Engagement Grant Application.
Building Strong Library Associations | Sustaining Your Library Association BSLA Stakeholders Workshop Yaounde, Cameroon, April 2012 Managing Relationships.
Cooperative Print Archiving by Discipline Developing an Infrastructure to Sustain Scholarly Resources in Agriculture Amy Wood Center for Research Libraries.
Aligning Digital Preservation Policies with Community Standards Nancy McGovern Digital Preservation Officer.
1 JRNL: Journal Retention and Needs Listing A Software Tool for Print Journal Archives Judith C. Russell Dean of University Libraries Benjamin Walker Assistant.
Connect2Complete Theory of Change Development for Colleges and State Offices November 10, 2011 OMG Center for Collaborative Learning.
The International Coastal Atlas Network (ICAN) Overview and Recent Activities Ned Dwyer Dawn Wright.
Achieve OER State Policy Recommendations July 30, 2015 CC BYCC BY Achieve 2015.
A Shared Commitment to Digital Preservation and Access.
Creation of the Archiving Component of a Memorandum of Understanding (MOU) Template for International Missions IPDA MOU project members.
+ The Learning Registry: A How To Primer for Digital Content Publishers and Aggregators December 20, 2011.
The National Archives Washington DC July 10, 2008
ESMF Governance Cecelia DeLuca NOAA CIRES / NESII April 7, 2017
Marie Waltz Center for Research Libraries
CNI Spring 2010 Membership Meeting
Stakeholder Consultations
Nancy Y. McGovern Digital Preservation Officer, ICPSR IASSIST 2007
Digital Library and Plan for Institutional Repository
Digital Library and Plan for Institutional Repository
Presentation transcript:

Unified Digital Format Registry (UDFR) Stakeholder Meeting Library of Congress Washington, DC April 13, 14, 2011

Welcome! Stephen Abrams, Associate director Lisa Colvin, UDFR project manager Alex Genadinik, UDFR project developer University of California Curation Center Bibliothèque nationale de FranceLibrary of Congress Data Conservancy / Johns Hopkins ULos Alamos National Laboratory DataONE / UC Santa BarbaraNational Archives [UK] Deutsche NationalbibliothekNational Archives [US] Ex LibrisNational Library of New Zealand Family SearchNew York University Florida Center for Library AutomationOpen Planets F / Nationaal Archief GDFR / Harvard UniversityTessella Georgia Institute of Technology University of Pennsylvania Government Printing Office [US] Virginia Institute of Technology Koniklijke Bibliotheek

Objectives The desired outcomes of this stakeholder meeting are: Agreement on the scoping of functional and non-functional requirements Agreement on the data modeling process and ontology Agreement on key technology decisions Agreement on project plan and schedule Groundwork for the administrative and technical continuity of UDFR as an ongoing service

Key questions What subset (or superset) of PRONOM and GDFR functionality and data modeling should be supported? Is there a useful distinction between format “facts” and “policies”? What are the criteria for contributor eligibility? To what level of technical review should/will contributed information be subject, and by whom? Are new contributions immediately visible in an unreviewed state? What is the appropriate granularity of provenance and review? Should UDFR identifiers be transparent or opaque? Should UDFR support static or dynamic inheritance of properties? Must there be an explicit grant of license by content contributors? What is the proper replication model: master/slave(s) or peer-to-peer? Should UDFR support classes of information that is not replicated? What are the criteria for node eligibility? What is the ongoing relationship between PRONOM and UDFR?

Agenda TimeTopic 09:00 – 09:20Welcome and introductions 09:20 – 09:30Review of objectives and agenda 09:30 – 10:00Project background 10:00 – 10:30Use cases and functional requirements 10:30 – 11:00Break 11:00 – 11:30Function requirements (continued) 11:30 – 12:30Data modeling and ontology 12:30 – 13:30Lunch 13:30 – 14:30Data modeling and ontology (continued) 14:30 – 15:00Technical architecture 15:00 – 15:30Break 15:30 – 16:30Technical platform decisions 16:30 – 17:00Questions and discussion 17:00Adjourn

Agenda TimeTopic 09:00 – 09:30Project schedule 09:30 – 10:15Initial population of UDFR 10:15 – 10:45Community building 10:45 – 11:15Break 11:15 – 12:30Community building (continued) 12:30 – 13:00Follow-up planning 17:00Adjourn

Project background Why worry about formats? Information preservation Bit preservation Since formatted digital assets are inherently mediated by technology, they are particularly susceptible to disruptive technological change Format a set of syntactic and semantic rules for mapping between an information model and a serialized bit stream

Project background PRONOM Global Digital Format Registry (GDFR) Unified Digital Format Registry (UDFR) – “The Unified Digital Format Registry (UDFR) will provide a reliable, sustainable and publicly accessible knowledge base of file format information” – Fully open source implementation that “unifies” the function and data holdings of PRONOM and GDFR

UDFR project 1 year, 2+ FTE, funded by the Library of Congress Features – Use cases and functional requirements developed by the stakeholder community over the past two years – Support for linked data and semantic web – Support for a distributed network of independent but interoperable UDFR nodes Deliverables – Working, documented, single-node registry system, initially populated with an export from PRONOM, GDFR, and other appropriate sources – BSD license

Community building How can we ensure the administrative and technical continuity of the UDFR once the LC-funded work is completed? Policy and strategic planning Operation of the initial registry node Recruitment of additional nodes Technical maintenance and enhancement of the code base Content contribution Review of contributed information

Policy and strategic planning What is the lightest weight governance structure that is effective? Continue as an ad hoc group or develop a more formal organization? Operate as loose consortium under an MOU Look for an administrative umbrella under an existing organization

Operational considerations CDL is prepared to provide an operational home for the initial production node on an interim basis Any long-term commitment may require some (minimal) level of cost recovery Additional replication nodes Eligibility requirements? Minimal/maximal number desired?

Technical maintenance and enhancement Manage source code in a public code repository Enhancement planning and prioritization – Call for community-wide evaluation at 6/12 months of production operation Eligibility for contributors? Committers?

Content contribution Contributor eligibility – Are contributors recruited or self-selected ? What can we do to encourage contribution? – Engagement by institution and discipline

Technical review Reviewer eligibility – Are reviewers recruited or self-nominated? Single or multiple levels of scrutiny? Standard criteria for evaluation – What is the appropriate level of due diligence?

Follow-up planning Next steps Ongoing project work with early prototype releases Production release (single node) in January 2012 Governance, policy, and planning structure Solicitation of replication nodes Solicitation of content contribution 6/12 month evaluation

Key questions What subset (or superset) of PRONOM and GDFR functionality and data modeling should be supported? Is there a useful distinction between format “facts” and “policies”? – Priority for “facts”; support for “policies” as time permits. What are the criteria for contributor eligibility? – No criteria, but user account required (i.e. no anonymous contribution). To what level of technical review should/will contributed information be subject, and by whom? Are new contributions immediately visible in an unreviewed state? – Opportunity (but not a requirement) for review. Strong provenance will be maintained, as well as explicit tagging indicating the level of review. What is the appropriate granularity of provenance and review? – Individual assertion. … answered?!

Key questions Should UDFR identifiers be transparent or opaque? – Opaque, and without a node identifier component (to avoid the co- reference problem). Should UDFR support static or dynamic inheritance of properties? – Not clear if inheritance is a feature of the model, the query system, or the UI. Must there be an explicit grant of license by content contributors? – Yes, ideally using CC0. What is the proper replication model: master/slave(s) or peer-to- peer? – Master/slave(s), but replication is not the highest immediate priority. However, nothing in the design or implementation of the registry should preclude adding support for replication in the future. … answered?!

Key questions Should UDFR support classes of information that is not replicated? – Need to deal gracefully with legally encumbered information. In a master/slave configuration, data entered at a slave node would remain local. What are the criteria for node eligibility? – With no consensus on the immediate need for replication, this question does not require an immediate answer. Some identified criteria include: geographic dispersion and high-availability operation. What is the ongoing relationship between PRONOM and UDFR? – Continued close consultation and collaboration. … answered?!

Thank you! Safe travels!