National Library of Finland Sustainable Access Library digitisation production workflows and processes adding value to print collections Kuopio 30.10.2009.

Slides:



Advertisements
Similar presentations
0 DIGITIZING GREY LITERATURE FROM THE ANTARCTIC BIBLIOGRAPHY COLLECTION Tina Gheen and Sue Olmsted National Science Foundation Arlington, Virginia USA.
Advertisements

Programs and Research Public Private Agreements for Mass Digitisation Ricky Erway JISC Digitisation Conference July 2007.
1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton.
The metadata challenge for libraries: a view from Europe Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
MacKenzie Smith Associate Director for Technology MIT Libraries.
Cultural Content and Digital Heritage Bernard Smith European Commission INFSO/D2.
Services Digitisation & Content Management. 600 People – India.
1 Large-scale collaborative digitisation 19 th Century Pamphlets Online Mar-2007 – Feb-2009 Grant Young Project Manager, 19 th Century.
Digital Collections: Use, Value and Impact Lorna Hughes University of Wales Chair in Digital Collections, National Library of Wales Aberystwth University.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
1 Uppsala University Library Eva Müller Peter Hansson Stefan Andersson Uwe Klosa Electronic Publishing Centre Krister Östlund Waller project.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
Archiving the Web: the PANDORA archive at the National Library of Australia Preserving the Present for the Future Copenhagen, June 2001 Warwick Cathro,
Preserving Digital Collections Andrea Goethals Florida Center for Library Automation (FCLA)
Metadata standards, tools and processes for audio preservation at the British Library: An overview of new systems for audio description, preservation and.
The National Digital Newspaper Program (NDNP) An NEH/LC Collaborative Program Enhancing access to historical newspapers Release: September 2006.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Dspace 1 Introduction to DSpace Mukesh Pund Scientist NISCAIR, New Delhi.
Digitisation of Cultural Heritage at the National Library of Latvia: Past and Future Uldis Zariņš Head of Strategic Development National Library of Latvia.
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
1 Guidelines For The Future Sharing Best Practice For National Bibliographies In The Digital Era Neil Wilson Information Coordinator IFLA Bibliography.
Learning the lessons … David Dawson. MLA Museums, libraries and archives building a successful and creative nation by connecting people to knowledge and.
‘The Universal Catalogue’ a cultural sector viewpoint David Dawson Senior Policy Adviser (Digital Futures) Museums, Libraries and archives Council.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
© January/2008 CCS Content Conversion Specialists GmbH Weidestr. 134, Hamburg, Germany consulting technology digitization services.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
The DigiTool to FDA Program Lydia Motyka Florida Center for Library Automation.
Planning Digitisation Projects Aly Conteh The British Library 30/11/2012 CERL Annual Seminar.
Publisher’s Perspective: Digitization of print resources, and archiving of digital resources Judy Best, June 13, 2006.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
GEO: a special collection for Earth Science community *Stefania Biagioni, *Silvia Giannini, **Cecilia Giussani *CNR-ISTI, **CNR-IGG Pisa, Italy GL13 Conference,
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
The Knowledge Exchange Presentation to CNI April 2005 Bas Cordewener, SURF Sigrun Eckelmann, DFG Norman Wiseman, JISC.
Europeana Libraries: building a pan-European aggregator Wouter Schallier, LIBER Executive Director Eva/Minerva 15/11/2011.
Establishing a National Strategy for the Provision and Use of e-Books in UK Academic Libraries Ray Lonsdale Department of Information Studies, University.
Common challenges, common issues Lorcan Dempsey School for scanning The Hague, 16 October 2002.
Library Repositories and the Documentation of Rights Leslie Johnston, University of Virginia Library NISO Workshop on Rights Expression May 19, 2005.
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
Shruthi(s) II M.Sc(CS) msccomputerscience.com. Introduction Digital Libraries have become the source of information sharing across the globe for education,
Cataloging Compound Digital Objects: Using METS for Digitized Sanborn Maps Christopher Cronin Head of Digital Resources Cataloging University of Colorado.
Digitization of library material in Europe. Problems, obstacles, and perspectives anno 2007 by Erland Kolding Nielsen, Director General of the Royal Library,
VIVO and Scholarly Repositories: Synergistic Opportunities.
Research libraries in a European e-science infrastructure Wouter Schallier Executive Director LIBER (Association of European Research Libraries)
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
National Library of Finland Metadata in the Digitisation Process Cultural unity and diversity of the Baltic Sea Region – common history, different languages,
National Library of Finland Strategic, Systematic and Holistic Approach in Digitisation Cultural unity and diversity of the Baltic Sea Region – common.
Warwick Cathro Assistant Director-General Resource Sharing and Innovation National Library of Australia Trove – a service built on collaboration OCLC Asia.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Portico’s “d-collections” preservation service Stephanie Orphan Positive trends in sustainability? Emerging approaches to archiving commercial databases.
EDLproject WP3 “Developing the European Digital Library” LIBER – EBLIDA workshop Digitisation of Library Material in Europe Copenhagen, October.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
National Library of Finland Digitisation Policy, Production Processes and Access (...and beyond Access) Tiina Ison, Senior Analyst Present-Day Library:
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
Ktisis: Building an Open Access Institutional and Cultural Repository Alexia Kounoudes, Petros Artemi, Marios Zervas Library and Information Services,
Digital Collection Development Policy
Introduction to DSpace
Experiences of the Digital Repository of Ireland
Metadata for research outputs management
Metadata to fit your needs... How much is too much?
Presentation transcript:

National Library of Finland Sustainable Access Library digitisation production workflows and processes adding value to print collections Kuopio Tiina Ison, Senior Analyst

Outline 1.Paradigm Shifts Sustainable Strategies for Print Digitisation (PPP) ? 3.User Community Value Drivers 4.Library Value Drivers- Market Positioning 5.National Digital Library (KDK/PAS) 6.(Mass) Digitisation Project ( ) 7.In-House Digitisation Production and Workflows 8.Interoperable Metadata and METS Profiles 9.Adding Value in Production to Print Collections 10.Sustainability....

1. Paradigm Shifts.... Sustainable access to printed collections  One book, one article – is publishing/licensing concept sustainable.... ?  Open access – including use and reuse of content, repurposing Print versus data intensive science paradigm  Granularity and aggregation of content, repurposing, repackaging...  Text and data mining of corpus of collections accross disciplines interlinking datasets by computational tools  Text mining and automated concept extraction methods used to add semantic metadata to text Eeva Ahonen and Eero Hyvönen: Publishing Historical Texts on the Semantic Web - A Case Study. Proceedings of the Third IEEE International Conference on Semantic Computing (ICSC2009) (forthcoming), Berkeley, CA, USA, September, 2009, Historical Texts on the Semantic Web - A Case Study Community generated contribution – crowd sourcing  Volunteer contribution economy, Two Way Engagment (JISC study being commissioned)  Galaxy Zoo  Australian Historial Newspapers

example of open access to corpus of print collection Open Access for Research, Education and Citizen use and contribution, non commercial use in a free knowledge economy (sounds like public domain and cc license citizen contribution ? non commercial use ?) National Library of Australia, Historical Newspapers

2. Emerging Print Digitisation Strategies (PPP) European and National Digitisation Strategies aim for critical mass (mass) digitization of print EU Comission warning members states of SILOs... Mr. Javier Hernandez_Ros on behalf of Vivienne Redding 1. Books (Danish Library, Director General Erland Kolding Nielsen categorizing market at Liber digitisation seminar2009 with some additions … ) Early Printed books up to 18th Century – ProQuest Market Penetration Printed Books Google Market Penetration Books under copyright – Nordic Extended Licensing/Rights Registry Google/Arrow Project EU Black Market and Priacy Economy (Global Information Divide, Social Generation Divide) 2. Scientific Journals A publisher’s market? Not covered by Google settelment 3. Newspapers Likelyhood of ongoing government funding ? high value corpus ? 4. Ephemera Did i remember to mention no ongoing funding for sustainable digitisation strategies ?

example of toll gated/silo access to corpus of print collection Toll gated access to digitised content limited by membership, limited to country boundaries. Toll gated citizen access. Limited reuse and repurposing. British Library, Newspaper Collection (sounds like closed access )

3. Community Value Drivers Sustainability builds on user and community needs open access digital format minimum restrictions use and resuse at will data intensive research free content for semantic web and data linking free community contribution fun and play (creativity) are user community value drivers informing descisions about building infrastructure, workflows and processes for sustainable access and long term preservation ? non users of today are user market of tomorrow

4. Library Value Drivers – Market Positioning Sustainable access invests in quality infrastructure and workflows for: Trustworthy, authoritative sources Citation with trust Physical and digital provenance Rights management Links between physical items and digitial surrogates and manifestations Complex objects, granularity Level and perisistend ID’s Links in catalogue/union catalogues Interoperable metadata and use of standards Long term preservation Analogue work practices are well established in libraries..... Do Libraries have the know how to move from analogue to digitial world – no mention has yet been made about scanning !

5. National Digitial Library KDK/PAS RAKE Structural Change in Higher Education, Finland National Infrastructure Development Projects funded by the Education Ministry 1.National Digital Library Initiative 2008– Long Term Preservation Initiative 2012, 2016 ? 3.(Mass) Digitisation Project, 2007–2009 METS profiles Rights Management... Ministry of Education Did i remember to mention copyright, orphan woks, data protection issues ?

1.One Production Line - one production line between back end digitisation production and National Digital Library Infrastructure. 2.Process modeling – library wide logistics, processes and workflows are modeled and renewed where needed 3.Interoperable Metadata - quality of metadata used, captured and packaged throughout the digitisation production line is adequate for access and long term preservation needs 4. Tools - Ensure appropriate tools are put in place for tracking and managing workflows between National Library at Helsinki and Digitisation Centre at Mikkeli 6. (Mass) Digitisation Project,

7. In-House Digitisation Production affects Workflows Process Modelling Sustainable in-house digitisation production affects Library wide workflows and requires workflow re-design in a distributed environment. Processes, tools, work practices and standards are required for controlling: 1.Physical printed Item (preservation and logistics of transport) 2.Management of digital objects from production to access/preservation 3.Control of metadata Metadata for Printed and Digital Traditional concept of metadata for a printed object extended with lconcept of metadata for digital resouces for provision of sustainble access and preservation. 1.Bibliographic Metadata (MARC21) 2.Administrative Metadata 1.Technical Metadata 2.Rights Metadata 3.Long Term Preservation Metada 3.Structural Metadata 1.Physical Structure 2.Logical Structure

8. Interoperable Metadata Standards and METS Profiles METS profiles for monographs, newspapers, journals, audio… EXPORT FILES : JPEG2000, lossless, PDF, OCR TXT as ALTO XML, JPEG (150dpi), METSXML and MARCXML METS container or wrapper provides a SIP package for delivery and exchange of digital objects accross systems that is OAI-PMH compliant. Wraps descriptive, administrative and structural metadata + PREMIS. MODS and MARCXML for descriptive and bibliographical metadata ( ( MIX for image technical metadata ( PREMIS for preservation metadata ( PREMIS for rights management metadata. Metadata standards and METS incorporated into National Digital Library (KDK) recommendations by Technical Working Group for DL metadata portflio (standardi salkku)

9. Adding Value in Production to Print Collections...  Unique ID for Physical Items at Collections (Barcodes)  Minimal Bibliographic Record for non catalogued items  Status in catalogue  Two bibliographic records will be created into Fennica catalogue  Unique and persistent ID’s for digital objects, pages, segments  URN:NBN resolver  Metadata re-use  Catalogue enrichment  Complex objects, granularity level and structural mark-up  Technical metadata and provenance

10. Sustainability... OPiNiONS ? Should google or publihsers do it ?