MAINTAINING QUALITY METADATA: TOWARD EFFECTIVE DIGITAL RESOURCE LIFECYCLE MANAGEMENT Daniel Gelaw Alemneh University of North Texas.

Slides:



Advertisements
Similar presentations
Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital.
Advertisements

Digital Initiatives at the University of North Texas Libraries Cathy Nelson Hartman University of North Texas Libraries Texas Conference on Digital Libraries.
METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
UKOLN is supported by: Bridget Robinson and Ann Chapman From analytical model to implementation and beyond CD Focus Schema Forum, CBI Conference Centre.
Information Professionals and Learning Object Repositories … more than just metadata quality … Sarah Currier Stòr Cùram Project Librarian JISC X4L Repository.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
1 Uppsala University Library Eva Müller Peter Hansson Stefan Andersson Uwe Klosa Electronic Publishing Centre Krister Östlund Waller project.
Perspectives from The Alberta Library Learn, think, CHANGE 2004 Online Learning Symposium November 3, 2004 Zahina Iqbal.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
The NSDL Registry Diane Hillmann  Jon Phipps. What We’re Doing Received an NSF grant in Oct. 2006, to: Register metadata schemas, vocabularies, application.
THE RUTGERS WORKFLOW MANAGEMENT SYSTEM Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries August 3, 2007.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Introducing Symposia : “ The digital repository that thinks like a librarian”
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
E-journal Publishing Strategies at Pitt Timothy S. Deliyannides Director, Office of Scholarly Communication and Publishing and Head, Information Technology.
DUBLIN CORE: BEYOND THE LIBRARY David Hirsch LIS Knowledge Organization Dr. Selenay Aytac Spring 2013.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
SobekCM’s Community Ecosystems & Socio-Technical Practices Presented by Mark V. Sullivan June 10 th, 2014 Sobek image created by Jeff Dahl and is shared.
Integrating Dublin Core/RDF records with MARC21 via the OCLC Connexion service at the Centre for Digital Library Research Gordon Dunsire Presented at the.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
Emerging Trends and Evolving Issues in Open Access and Scholarly Communications Daniel Gelaw Alemneh Digital Curation Coordinator University of North Texas.
Diving In: Testing the Archivists’ Toolkit. 21 Oct. 2006Archivists' Toolkit at NEA2 Bradley D. Westbrook, UC San Diego Katherine Stefko, Bates College.
“Mapping the Southwest”: UNT-UTA Collaborative Project Daniel Gelaw Alemneh, Jerrell Jones, University of North Texas (UNT), and Ann Hodges University.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
Descriptive metadata in the Finnish National digital library and the role of CIDOC CRM in the standards portfolio of NDL Juha Hakala The National Library.
Enhancing Content Visibility in Institutional Repositories: Maintaining Metadata Consistency Across Digital Collections Ahmet Meti Tmava and Daniel Gelaw.
University of North Texas Enhancing the Quality of Metadata: Modular Approach to Digital Resource Lifecycle Management Daniel Gelaw Alemneh & Mark E. Phillips.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
The Portal to Texas History: Harnessing Technology to Enable Collaboration with Small Museums and Libraries CNI, December 6, 2005 Cathy Nelson Hartman.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
Digital preservation activities at the NLW Sally McInnes 18 September 2009.
1 A Very Large Digital Library Technology Demonstration William Y. Arms Cornell University.
Introduction to metadata
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
METS Application Profiles Morgan Cundiff Network Development and MARC Standards Office Library of Congress.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
A Whirlwind Tour Through Part of the Metadata Landscape Jenn Riley Metadata Librarian IU Digital Library Program.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
Ensuring Long-term Access to Electronic Theses and Dissertations: Local vs. Global Lifecycle Management Daniel Gelaw Alemneh, University of North Texas.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
From Theses and Dissertations to ETD: Retrospective Digitization and New Forms of Scholarship Kathryn Loafman, Daniel Alemneh, and Jeremy Berg University.
Jenn Riley Metadata Librarian IU Digital Library Program
Metadata (and cataloging?) Jenn Riley Metadata Librarian IU Digital Library Program.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Collaborative Approach to Address Scholarly Communications and Digital Curation Challenges Kris Helge, Laura Waugh, Daniel Alemneh SCDC Affinity Group.
A Training Program for Shareable Metadata Metadata for You & Me is a collaboration between the University of Illinois Library and Indiana University. This.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Shades of Grey: Integrating Metadata for Discovery in a Mixed-Content Single-Subject Library GENEVIEVE PODLESKI FEDERAL RESERVE BANK OF ST. LOUIS.
Building Digital Archives Mark Phillips Cathy Hartman June 6, 2008.
Digitization Workflows From the Digital Projects Unit University of North Texas Libraries Mark E. Phillips Jeremy D. Moore February 12, 2009.
Meta/Data As If Research Depends On It
The Use of EAD in Archival Based Repositories
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Introduction to Metadata
Metadata to fit your needs... How much is too much?
Preserving Our Collective Digital History
PREMIS Tools and Services
Introduction to Metadata
Attributes and Values Describing Entities.
Presentation transcript:

MAINTAINING QUALITY METADATA: TOWARD EFFECTIVE DIGITAL RESOURCE LIFECYCLE MANAGEMENT Daniel Gelaw Alemneh University of North Texas

ICKM 2008 University of North Texas (UNT) Libraries Digital Initiatives Collaborative Initiatives CyberCemetery GPO NARA – Affiliated Archive Texas Register Archive Secretary of State’s Office Texas Laws and Resolutions Archive Secretary of State’s Office The Portal to Texas History 45 Libraries & Museums Web-at-Risk Project California Digital Library New York University National Digital Newspaper Program (NDNP) Between 1836 and 1922.

ICKM 2008 University of North Texas (UNT) Libraries Digital Initiatives Library Digital Collections: Congressional Research Service Archive 10,000+ CRS Reports World War Poster Collection 500 WWI and WWII Posters Advisory Commission on Intergovernmental Relations 408 reports = 47,874 pages Federal Communications Commission (FCC) Record 136 issues = 43,115 pages (6 of 21 volumes completed) Electronic Theses and Dissertations (ETDs) more in queue Jean-Baptiste Lully (Music) Collection 27 scores = 10,000 pages Other digitization projects

ICKM 2008

Metadata Environment Metadata-based digital resource management activities UNT Libraries metadata locally qualified Dublin Core based descriptive metadata. Detailed technical and preservation metadata elements Web based metadata creation and editing Interoperability Metadata Crosswalks Mods Marc oai_dc PREMIS

ICKM 2008 Metadata Quality The two aspects of digital library data quality: The quality of the data in the objects themselves The quality of the metadata associated with the objects Poor metadata quality: Ambiguities Poor recall Poor precision Inconsistency of search results

ICKM 2008 Metadata Quality … Most Common errors: Incorrect Data: Letter transposition Letter omission Letter insertion Letter substitution or misstrokes Missing Data Elements and values not present at all (null) Insufficient or incomplete data Ambiguous Data Confusing or inconsistent data e.g. multiple spellings, multiple possible meanings, mixed cases, initials, etc.

ICKM 2008 Factors Influencing Metadata Quality Local Requirements: Objects Heterogeneity What type of objects will the repository contain? Granularity How will they be described? Functionality What functionality is required? How will it be interfaced?

ICKM 2008 Factors Influencing Metadata Quality … Collaborative Requirements: Diversity of Users How best diverse information-seeking behaviors can be met? Interoperability Will metadata be meaningful within aggregations of various kinds? What is required for interoperability? (Structure, semantics, & syntax) Digital rights issues Will access restrictions be imposed? Are requirements formal or informal? Are there other access and associated digital rights issues?

ICKM 2008 Factors Influencing Metadata Quality… Training Issues Necessary expertise to create and manage rigorous metadata Metadata quality can be determined to a great extent by: knowledge of the source, and knowledge of the methodology used to create the statement Cost Rigorous metadata is resource intensive and too costly

ICKM 2008 UNT Metadata Quality Assurance Mechanisms & Tools The two main stages of metadata qualities assurances: Pre-injust 1. Metadata Creation tools (Templates) Post-injust 2. Metadata Analysis tools (Web-based tools)

ICKM 2008 Quality Assurance Mechanisms and Tools: Templates 1. Metadata Creation Tools (Templates) Validates Mandatory elements Metadata Template Creator Template Reader Controlled vocabularies (UNTLBS)

ICKM 2008

UNT Metadata Quality Assurance Mechanisms & Tools… 2. Metadata Analysis Tools NULL Values List/Browse All Values (by each qualifiers and elements) List Authorities Values Graphical reports and other fun stuff Clickable Maps by Institution and Collection Word Clouds by elements Records added overtime and other graphical reports

ICKM 2008

Summary Determine level of quality required Partners may have much in common, but they have diverse and sometimes conflicting metadata requirements. Determine nature of gap and how to close it effectiveness, efficiency, practicability, scalability Machine verses human error handling How much of the process can be automated? Human review of results is still essential (e.g. highlighted items) Compromise One size does not fit all! Prioritize Resources very unlikely to be available to meet all requirements Test the workflow Test, retest, and evaluate the quality cycle continuously

ICKM 2008

Questions? Thank You!