Download presentation
Presentation is loading. Please wait.
Published byAnthony Willis Modified over 9 years ago
1
National Library of Finland Metadata in the Digitisation Process Cultural unity and diversity of the Baltic Sea Region – common history, different languages, mixed culture Helsinki, 21st–22 nd October 2010 Tiina Ison, Senior Analyst, National Library of Finland
2
Outline 1.Front End - National Digital Library and Long Term Preservation (KDK/PAS) 2.Back End - Digitisation Production Process, METS Profiles 3.Descriptive Metadata 4.Administrative/Technical Metadata 5.Structural Metadata 6.Wrapping things together: METS Profile 7.Processes towards distrubed work, crowd soucing, annotaiton and ontologies
3
1. Frond End: National Digital Library and Long- Term Preservation Infrastructure Ministry of Education www.kdk2011.fi/fi/tietoa-hankkeesta www.minedu.fiwww.kdk2011.fi/fi/tietoa-hankkeestawww.minedu.fi Libraries / Archives / Museums BACK END SYSTEMS In their digitisation production memory institutions produce authentic, trustworthy digitised content and collections OPM-KD Project 2007-2009, digitisation production revewed http://www.kansalliskirjasto.fi/extra/vanhat_bulletinit/b ulletin09/article6.html Infrastructure Intiatives: National Digital LibraryNational Long-Term Prservation http://www.kdk2011.fi Rights Management... METS profiles Kansallisen Digitaalisen Kirjaston Arkkitehtuuri http:// www.kdk2011.fi/images/stories/Kokonaisarkkitehtuuri-yleiskuva-fi_iso.jpg
4
2. Back End: Digitisation Production Processes, METS Profiles SOURCE MATERIAL PHYSICAL COLLECTIONS Structural metadata METS, ALTO METS EXPORT Packesges include: JPEG2000 OCR TXT as ALTO XML PDF JPEG(150) METSXML MARCXML DIGITAL RESOURCE COMPREHENSIVE DIGITIAL COLLECTIONS Standards & OAI-PMH complient METS SIP packages Two Bibliographic Records CATALOUGING SCANNING POST PROCESSING LEVEL OF MARK UP Articles Illustrations Poems Descriptive metadata MARC21/MODS Administrative/technical metadata MIX/PREMIS Newspapers Serials Books Parchments Notes Maps Audio
5
3. Descriptive Metadata CATALOUGING Catalogued Items Un-catalogued Items – Minimal bibligraphic record Bar Code ID’s – Unique ID’s for Physical Items Ingest of bibliographic metadata into digitisation produciton MARC21 conversion into MARCXML (MODS) Two bibliographic recrods – physical and digital (link 776) Post cataloguing for minimal records Enrichmnent of catalogue
6
4. Administrative/Technical Metadata An XML Schema designed for expressing technical metadata for digital still images Technical Metadata for Digital Still Images - (NISO Z39.87 Data Dictionary) MIX: Image width, Color space, color profile, Scanner metadata, Digital camera settings Preservation Metadata/Premis (information about actions on object, on even, on technical environment) Rights Metadata (access restriction) Persistent ID’s SCANNING
7
5. Structural Metadata Navigation, use and access ? Logical Structure Physical Structure METS structMap – relatinships between parts POST PROCESSING
8
6. Level of Structural Mark Up Material types books, serials, newspaoers, audio, projects Granularity - different level of structural mark up - i.e. article, illustration, poem Granularity - all material types: pages, footnotes, running title, tables, advertisemnts, image (captions and categories) Labour intensive Phased approach in production Crowd sourcing LEVEL OF MARK UP
9
7. Wrapping things together; METS Profiles METS profiles for different material types monographs, serials, newspapers, audio… Export files : JPEG2000, lossless, PDF, OCR TXT as ALTO XML, JPEG (150dpi), METSXML and MARCXML METS container or wrapper provides a SIP package for delivery and exchange of digital objects accross systems that is OAI-PMH compliant. Wraps descriptive, administrative and structural metadata + PREMIS. MODS and MARCXML for descriptive and bibliographical metadata (http://www.loc.gov/standards/mods/) (http://www.loc.gov/standards/marcxml/)http://www.loc.gov/standards/marcxml/ MIX for image technical metadata (http://www.loc.gov/standards/mix/)http://www.loc.gov/standards/mix/ PREMIS for preservation metadata (http://www.loc.gov/standards/premis/)http://www.loc.gov/standards/premis/ (standardi salkku)
10
8. Processes towards distributed work, crowd sourcing, annotation and ontolgies OCR Correction Content and context as part of digitisation processes… Automatic and semiautomatic proccess for data extraction … Distributed work processes i.e. for: Mark up level OCR correction Controlled annotation Social tagging
11
THANK YOU
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.