Data Wrangling: Developing Local Best Practice for Born Digital Metadata Tracy Popp, Digital Preservation Coordinator Ayla Stein, Metadata Librarian University.

Slides:



Advertisements
Similar presentations
Pulling it all together… with thanks to Sheila Anderson.
Advertisements

Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
GETTING BITS OFF DISKS Using Open Source Tools to Prepare Born-Digital Materials for Long-Term Preservation and Access To connect to the audio portion.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
April 2011 Registry Prototype demonstration. Relationships between records.
PREMIS: To Be or Not To Be in My METS The Preservation Journey at the University of Connecticut Libraries ALA Annual 2013 ALCTS PARS Intellectual Access.
The world’s libraries. Connected. Demystifying Born Digital ARLIS/NA, Pasadena, 27 April 2013 Jackie Dooley Program Officer OCLC Research.
Interoperability and Preservation with the Hub and Spoke (HandS) Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign.
Interoperability and Preservation with the Hub and Spoke (HandS) Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign
Applying Theoretical Archival Principles and Policies to Actual Born Digital Collections LEIGH ROSIN | Digital Archivist | National Library of New Zealand.
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
1 Extending the Implementation of PREMIS to Geospatial Resources in the Stanford Digital Repository: An Exploration By Nancy J. Hoebelheinrich Metadata.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
THE RUTGERS WORKFLOW MANAGEMENT SYSTEM Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries August 3, 2007.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Maximizing Resources: EAD and MARC in the small repository Valerie Gillispie Assistant University Archivist Wesleyan University Middletown, Connecticut.
Metadata standards, tools and processes for audio preservation at the British Library: An overview of new systems for audio description, preservation and.
Untitled (Hidden Track): Born Digital Content Preservation Service at UIUC Tracy Popp, MS LIS, CAS Digital Preservation Coordinator University Library.
Plan for the preservation of digital content and archives in THUL Jiang Airong, Dong Li Tsinghua University Library EMANI Meeting GRENOBLE – 16 Oct, 2006.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
1 A journey of a thousand miles begins with a single step. Chinese Proverb.
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
VIDEO ARCHIVING Models and opportunities Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library Executive Director,
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
DIGITAL FORENSICS Forensic Toolkit: a tool to process born digital records Emma Jolley Curator of Digital Archives.
1 Digital Archives - Past, Present & Future Issues Anne Van Camp Manager, Member Initiatives The Research Libraries Group Digital Archives Directions (DADs)
Metadata Handling in the North Carolina Geospatial Data Project (NCGDAP) NCSU Libraries Steve Morris Head of Digital Library Initiatives Rob Farrell Geospatial.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
BUILDING ON COMMON GROUND: EXPLORING THE INTERSECTION OF ARCHIVES AND DATA CURATION Lizzy Rolando & Wendy Hagenmaier 6/3/2015IASSIST 2015.
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Habing1 Integrating PREMIS and METS PREMIS Tutorial Implementers’ Panel June 21, 2007, 9:00-5:30 Library of Congress, Jefferson Building, Whittall.
Cataloging Compound Digital Objects: Using METS for Digitized Sanborn Maps Christopher Cronin Head of Digital Resources Cataloging University of Colorado.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
The Roger Conatser Aerial Photographs Collection Bethany C. Fiechter, Archivist for Manuscript and Digital Collections Amanda A. Hurford, Metadata and.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
HATHI TRUST A Shared Digital Repository Use of PREMIS for Internet Archive AIPs September 22, 2010.
Interoperability and Collection of Preservation Metadata for Digital Repository Content Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University.
PREMIS at the British Library Markus Enders, The British Library PREMIS Implementation Fair, San Fransisco, CA 07 October 2009.
April 25, 2012 Making the Most of Library Collaboration and Cooperative Projects Partnering for Discovery: Jennifer LissErika Dowell Metadata/Cataloging.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation.
Digitization & Digital Preservation
The OAIS Reference Model and Trustworthy Repositories Josh Lubell Manufacturing Engineering Laboratory NIST
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
Image Discovery & Access ACRL Image Resources Interest Group ALA Annual, Saturday, June 26, 2010 Nicole Finzer, Visual Resources Librarian, Digital Collections,
Digital Library Development Kyle Rimkus and Bill Ingram February 10, 2016.
Automating the Audit: Updates from the Metadata Upgrade Project at the University of Houston Libraries Andrew Weidner, Metadata Librarian Santi Thompson,
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
CENTRAL/WESTERN MASSACHUSETTS AUTOMATED RESOURCE SHARING Digitization GOALS & THEIR LOGISTICS Michael J. Bennett Digital Initiatives Librarian C/WMARS,
Kyle Rimkus, Thomas Padilla, Tracy Popp, Greer Martin full paper to appear in March/April 2013 edition of d-Lib Magazine Preservation Unit, University.
Digital Asset Management Part 15: Summary
Digital Collections Update
Introduction to Metadata
Committee on Technical Processing Council on East Asian Libraries
Introduction to Implementing an Institutional Repository
Integrating PREMIS and METS
Metadata to fit your needs... How much is too much?
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
A Brief Introduction to Digital Forensics
Robin Dale RLG OAIS Functionality Robin Dale RLG
Local Rules Apply: Creating and Sustaining a Cost Effective Digital Preservation System on a Limited Budget Matt Ransom, Digital Assets Manager Belk Library.
Managing the Institutional Repository for OA Khawulile Radebe: Librarian: Repository Administrator & Metadata.
ArchivesSpace – Archivematica – DSpace Workflow Integration
Presentation transcript:

Data Wrangling: Developing Local Best Practice for Born Digital Metadata Tracy Popp, Digital Preservation Coordinator Ayla Stein, Metadata Librarian University Library University of Illinois Urbana-Champaign

Intro What will be addressed: Institutional context Project needs Challenges Current progress Future work

Institutional Context University Library –Campus-wide network of libraries –Largest public university research library in U.S. thirteen million volumes 24 million items and materials Over 12,000,000 digital files Main Library building, East Entrance

Institutional Context Collaborative effort: –Content Access Management (Cataloging and Metadata) Ayla – Metadata Librarian –Preservation Unit Tracy – Born Digital Content Reformatting –Special Collections University Archives RBML, Sousa, etc. –Back to Preservation Kyle Rimkus – Preservation Librarian –Digital Content Long-term Preservation (Medusa)

Project Needs Ayla (Metadata) and Tracy (Born Digital Content Reformatting) Identify –Metadata currently captured Make –Schema Recommendations Technical Administrative Descriptive –Controlled Vocabulary

Overview of Challenges Behemoth spreadsheet Various reports not in a schema No controlled vocabulary Redundant data entry Ideally aligns with Medusa data

Born Digital Reformatting Behemoth spreadsheet –Project tracking and data entry Reports –Structured but not to a schema From FTK Imager: »Directory list of media structure (created at time of disk imaging); item level information »Hash list of exported files From TreeSize Pro »Media group level reports

Challenges - Schema No one schema appropriate –Many layers of transformation –varying types of metadata Born Digital Reformatting Collecting Unit Digital Preservation Repository Recover from obsolete media Arrangement Description Access Medusa: Long term Preservation

Challenges – Controlled Vocabulary Reformatting request form is paper –Project tracking system in works No Controlled Vocabulary Reviewed: MANY Chose: –PBCore instantiationMediaType –PBCore instantiationPhysical

Schema Choices METS, MODS, and PREMIS Why? –MODS and PREMIS align with Medusa terms

Schema Choices PREMIS –Record technical info of item pre- reformatting –Encode actions and digital forensics reports as ‘events’ –Can have full provenance of a digital object in a cohesive piece

Schema Choices The Catch: –Medusa supports limited metadata Collection & file group level Event info does not pre-date ingest into repository –Metadata file as content METS wraps up MODS & PREMIS info Deposit METS record with content

Good Practice Interoperability Various levels that will assist in the digital preservation life cycle

Summary: Work In Progress Schema Choice: –METS, MODS, and PREMIS Controlled Vocabulary Choices: –Data Type: instantiationMediaType –Media Type: PBCore instantiationPhysical

Future Work Creating centralized, web-based tracking tool –Allow curating units to add descriptive information –Avoid data duplication Import metadata and reports –Structured in schema More controlled vocabulary –Rights

Thank You! Tracy Popp Digital Preservation Coordinator Ayla Stein Metadata Librarian @TheStacksCat