Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital.

Slides:



Advertisements
Similar presentations
2008 EPA and Partners Metadata Training Program: 2008 CAP Project Geospatial Metadata: Intermediate Course Module 3: Metadata Catalogs and Geospatial One.
Advertisements

At first the Web was just reading. 250,000 sites visited for 45 million users.
Ali Alshowaish. Collective realization that machine-processability requires a coherent data model A casual discussion at WWW-2 in Chicago, October of.
Harvesting and archiving the Web Nordunet2000, Juha Hakala Helsinki University Library.
Description of the Tools Training Infrastructure 2know2 This is an online survey that helps WP9 obtaining the information needed to report all training.
Digital Initiatives at the University of North Texas Libraries Cathy Nelson Hartman University of North Texas Libraries Texas Conference on Digital Libraries.
Beyond the Google Book: the Future of the Digital Library Cory Snavely Library IT Core Services manager University of Michigan April 20, 2010.
WDL Technical Architecture Working Group (TAWG) June 2010 Achievements and Recommendations Co-chaired by Noha Adly, Bibliotheca Alexandrina Babak Hamidzadeh,
California Digital Library NISO Standardized Usage Statistics Harvesting Initiative (SUSHI): Z39.93 Chan Li California Digital Library ALA Midwinter 2009.
Introduction to UIS and RICYT data collection tools and guidelines CARIBBEAN REGIONAL WORKSHOP ON SCIENCE, TECHNOLOGY AND INNOVATION.
IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Sam Hastings University of North Texas School of Library and Information Sciences User Input into Image Retrieval Design.
DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Possibility in Digital Collection Management Introduction to CONTENTdm TM Hitoshi Kamada University of Arizona Presentation for OCLC-CJK Users Group Annual.
Metadata for Digital Content at the Library of Congress Jane Mandelbaum Information Technology Services Library of Congress May 2009.
Alabama Mosaic continues an IMLS funded project, ( ) called the Cornerstone Project, which established the infrastructure.
A centre of expertise in digital information management UKOLN: providing support to the RSCs. Dr Liz Lyon, Director RSC Managers Meeting.
The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Libraries for Future Generations Martha Anderson Director National Digital Information Infrastructure and Preservation Program The Library of Congress.
1 Advances In Web Technologies Brian Kelly UK Web Focus UKOLN University of Bath
Copyright © 2006 Quest Software Quest SharePoint Management.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Creating Page Layouts using SharePoint Designer or Visual Studio Becky Bertram MCSD, MCAD MCTS WSS Development MCTS MOSS Development
E-business infrastructure
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Web indexing ICE0534 – Web-based Software Development July Seonah Lee.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
1 Adaptive Management Portal April
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Retrieval Evaluation: Precision and Recall. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity.
Overview of Search Engines
Building and Analyzing Social Networks Case Studies of Semantic Social Network Analysis Dr. Bhavani Thuraisingham February 22, 2013.
Mark Phillips Digital Projects Department University of North Texas Annexation of Texas Project.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Folksonomy: TechTalk Daniel Gelaw Alemneh University of North Texas, Information Technology Services, Digital Projects Unit July 2nd, 2008.
Enhancing Content Visibility in Institutional Repositories: Maintaining Metadata Consistency Across Digital Collections Ahmet Meti Tmava and Daniel Gelaw.
University of North Texas Enhancing the Quality of Metadata: Modular Approach to Digital Resource Lifecycle Management Daniel Gelaw Alemneh & Mark E. Phillips.
MAINTAINING QUALITY METADATA: TOWARD EFFECTIVE DIGITAL RESOURCE LIFECYCLE MANAGEMENT Daniel Gelaw Alemneh University of North Texas.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Personal Information Management Vitor R. Carvalho : Personalized Information Retrieval Carnegie Mellon University February 8 th 2005.
The Portal to Texas History: Harnessing Technology to Enable Collaboration with Small Museums and Libraries CNI, December 6, 2005 Cathy Nelson Hartman.
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Definitions Collaboration – working together on team projects and sharing information,
Digital curation activities enhance access and retrieval, maintain quality, add value, and facilitate use and re-use over time. This poster demonstrates.
A Semantic-Web based Framework for Developing Applications to Improve Accessibility in the WWW Michail Salampasis Dept. of Informatics TEI of Thessaloniki.
Curating the Southwest Region’s Maps: UNT-UTA Collaborative Project Daniel G. Alemneh, Mark E. Phillips, and Cathy Hartman University of North Texas (UNT)
Solutions using Microsoft Content Management Server 2002 Connector for SharePoint Technologies Sue Corke Mark Harrison Microsoft UK.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
Overviews of the Library of Texas & ZLOT Project Dr. William E. Moen Principal Investigator.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
O PEN A CCESS TO O UR H ERITAGE The Gateway to Oklahoma History Cross Timbers Library Conference – August 16, 2013 Sarah Lynn Fisher University of North.
Collaborative Approach to Address Scholarly Communications and Digital Curation Challenges Kris Helge, Laura Waugh, Daniel Alemneh SCDC Affinity Group.
Integrating Laserfiche and SharePoint PO108 Alex Wilson and Jessica Huang.
Archon: Facilitating Access to Special Collections Prepared for PACSCL Conference Something New for Something Old: Innovative Approaches.
Digital Research tools – Endnote, RefWorks, Zotero, Mendeley, Papers.
A Peer-to-Peer Approach to Review Compliance with Trustworthy Repository Audit and Certification (TRAC) Introduction A trusted digital repository has been.
Digitization Workflows From the Digital Projects Unit University of North Texas Libraries Mark E. Phillips Jeremy D. Moore February 12, 2009.
Folksonomy: TechTalk Daniel Gelaw Alemneh University of North Texas,
Programming by a Sample: Rapidly Creating Web Applications with d.mix
Building Search Systems for Digital Library Collections
VI-SEEM Data Repository
DIGITAL LIBRARY.
Metadata to fit your needs... How much is too much?
Preserving Our Collective Digital History
Productivity Loop PowerWriter A systematic approach to world-class
Presentation transcript:

Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital Libraries (TCDL) May 27-28, 2009

Information Retrieval. Match Document Document Representation Query Information Need Bates, M. J. (1989). The design of browsing and berrypicking techniques for the online search interface. Online Review, 13(5),

Trends Information creation, organization, retrieval, use, and preservation is becoming more complex User as creator, annotator, indexer, searcher, and eventual user of his/her content Visualization of the information space instead of a ranked list of search results

Total Sites Across All Domains August 1995-April Jan 1996Jan 2000Jan 2005 Apr 2009 Hostnames Active

Digital Projects UNT Digital Collections Portal to Texas History Collaborators Congressional Research Service Archives Other Statewide and National Projects

Factors Influencing Metadata Quality Local Requirements: Objects Granularity Functionality Collaborative Requirements: Diversity of Users Interoperability Digital Rights Issues

Ambiguities Poor recall Poor precision Inconsistency of search results Poor Metadata Quality

Common Errors The data is: Incorrect Missing Ambiguous

Metadata Quality Assurance Mechanisms & Tools at UNT Pre-Ingest Post-Ingest Training Creation Tools Proofing & Editing Tools Analysis Tools

Training Face-to-Face Instruction Metadata Schema & Documentation Internal Project Wikis Staff Support

Metadata Creation Template

Template Validation

Template Reader

jEdit Text Editor

UNT Metadata Analysis Tools: Post-ingest

Enhanced by Highlighter – On/Off Enhanced by Qualifier – Use/Ignore

Null Value Analysis Tools

Controlled Vocabularies (UNTL-BS)

Better Metadata More Functionality

Better Metadata More Functionality

Summary Determine level of quality required Determine nature of gap and how to close it Machine verses human error handling Compromise Prioritize Test the workflow

UNTL Metadata UNTL Metadata Generation User System Precision Recall Browsing Searching Data Entry Evaluation Understanding ChangeChange Measure Quality and Usefulness of UNT Metadata

References & Web Sites Consulted Bates, M. J. (1989). The design of browsing and berrypicking techniques for the online search interface. Online Review, 13(5), Netcraft (2009). April 2009 Web Server Survey. Retrieved May 19 th, 2009 from OCLC (2007). Sharing, privacy and Trust in our Networked World. Retrieved May 19 th, 2009 from TechSmith, Co. (2008). UX 2.0: Any User, Any Time, Any Channel. Retrieved May 19 th, 2009 from: UNT Libraries Metadata Initiative page. Retrieved May 19 th, 2009 from:

Questions?