| IFLA2010. Newspaper section | 2010-02-26 Changing preservations tasks for the German National Library: Some insights and preliminary remarks IFLA International.

Presentation transcript:

Changing preservations tasks for the German National Library: Some insights and preliminary remarks
IFLA International Newspaper Conference 2010 at IGNCA, New Delhi, India, 26 to 28 February 2010
"Digital Preservation and Access to news and views"
Reinhard Altenhöner

ToC
1. Starting situation / setting
2. Digital Preservation in DNB
3. Practical Example: E-Papers

DNB: Our task: Collecting and archiving, providing permanent access
• Publications issued in Germany since 1913
• Since June 22, 2006: Online / net publications are covered by the new law
• Newspapers as well: Ca. 450 newspapers (this means selection!) are microfilmed every day
• About datasets in the central database
• Some years ago we started some brainstorming on alternatives for this MF approach
  - Collecting e-papers from the web
  - Archiving of print-files
  - Cooperation with media / clipping agencies

Characteristics of online newspapers
• Frequent update processes
• Dedicated publication workflow: database, Content-Management-System, presentation on the fly
• Web 2.0 facilities for comments, blogging & tagging
• Multiple ways of embedded advertisement
• Complex navigation and search functions
• Harvesting extremely difficult: some experiments (e.g. on newsletters), but no running workflow

„kopal“
• Co-operative development of a long-term digital information archive
• Start in 2004
• Task: Development of a standardized long-term preservation solution to facilitate corresponding solutions for other libraries / industries
• Basis: DIAS (Digital Information and Archiving System) of the Royal Dutch Library, condensed and extended with peripheral open-source
• Enhancement for cooperative usage
• Development of a universal object scheme
• Hosting outside the library (remote access)

kopal: cooperation
• GWDG: Hosting
• IBM: Archiving SW
• DNB: Ingest/Access SW
• SUB: Ingest/Access SW
• Common task: Preservation Planning

kopal: Structure & concept
[Diagram labels: GWDG (Göttingen); DIAS by IBM; Account 1; Account 2; SUB Göttingen; DNB (Frankfurt); Partners nn; Local software (shown four times)]

Packaging
[Diagram labels: Submission Information Package; Object; Mets.xml; Universal Object Format (METS 1.4; LMER 1.2, Long-term preservation Metadata for Electronic Resources); METS sections: Header, dmdSec, amdSec, File Section, Structural Map]
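The five METS sections named on the packaging slide can be laid out as a skeleton file. The following is only a minimal sketch, not the kopal Universal Object Format itself: the element names come from the METS schema, while the function name, the IDs and the example file names are hypothetical, and the descriptive and administrative sections (where LMER records would be wrapped) are left empty.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def q(tag: str) -> str:
    """Return the namespace-qualified METS tag name."""
    return f"{{{METS_NS}}}{tag}"


def build_mets_skeleton(object_id: str, file_names: list[str]) -> ET.Element:
    """Build an empty METS frame: header, dmdSec, amdSec, fileSec, structMap.

    Illustrative only: IDs such as DMD1 and FILE1 are assumptions.
    """
    root = ET.Element(q("mets"), {"OBJID": object_id})
    ET.SubElement(root, q("metsHdr"))                # Header
    ET.SubElement(root, q("dmdSec"), {"ID": "DMD1"})  # descriptive metadata
    ET.SubElement(root, q("amdSec"), {"ID": "AMD1"})  # administrative metadata
    file_grp = ET.SubElement(ET.SubElement(root, q("fileSec")), q("fileGrp"))
    top_div = ET.SubElement(ET.SubElement(root, q("structMap")), q("div"))
    for number, name in enumerate(file_names, start=1):
        file_id = f"FILE{number}"
        file_el = ET.SubElement(file_grp, q("file"), {"ID": file_id})
        ET.SubElement(
            file_el,
            q("FLocat"),
            {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": name},
        )
        # The structural map points back at every file in the file section.
        ET.SubElement(top_div, q("fptr"), {"FILEID": file_id})
    return root


if __name__ == "__main__":
    mets = build_mets_skeleton("urn:example:object-1", ["issue.pdf", "issue.xml"])
    print(ET.tostring(mets, encoding="unicode"))
```

A real submission package would fill the header and both metadata sections before ingest; the skeleton only shows how the file section and the structural map refer to each other.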

[Diagram labels: Administration Interface; koLibRI; Online-Archivist; Machine Interface]

Kopal preservation strategy
• Migrate object with urn xxx into new format yyy
• Migrate all objects
  - of format xxx and/or
  - that have been ingested before a certain date and/or
  - that are larger than zzz MB
  into new format xyz (e.g. from TIFF to PNG)
• Implementation of emulation view paths
• No restriction as of file size or file format / type: all known and unknown file formats are being accepted (text, pictures, video, audio, executables, etc.)
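The "and/or" selection criteria for a migration run can be expressed as a small rule object. This is a sketch under assumptions, not the kopal or DIAS interface: the class names, field names and sample values are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class ArchivedObject:
    """Hypothetical minimal description of one archived object."""
    urn: str
    file_format: str
    ingested: date
    size_mb: float


@dataclass(frozen=True)
class MigrationRule:
    """Selection criteria; unset criteria are ignored."""
    target_format: str
    source_format: Optional[str] = None
    ingested_before: Optional[date] = None
    larger_than_mb: Optional[float] = None
    match_all: bool = True  # True combines criteria with "and", False with "or"

    def matches(self, obj: ArchivedObject) -> bool:
        checks = []
        if self.source_format is not None:
            checks.append(obj.file_format == self.source_format)
        if self.ingested_before is not None:
            checks.append(obj.ingested < self.ingested_before)
        if self.larger_than_mb is not None:
            checks.append(obj.size_mb > self.larger_than_mb)
        if not checks:
            return False  # a rule without criteria selects nothing
        return all(checks) if self.match_all else any(checks)


def select_for_migration(objects, rule):
    """Return the URNs of all objects the rule selects."""
    return [obj.urn for obj in objects if rule.matches(obj)]


if __name__ == "__main__":
    holdings = [
        ArchivedObject("urn:example:1", "TIFF", date(2005, 3, 1), 40.0),
        ArchivedObject("urn:example:2", "TIFF", date(2009, 6, 1), 5.0),
        ArchivedObject("urn:example:3", "PDF", date(2005, 1, 1), 80.0),
    ]
    rule = MigrationRule("PNG", source_format="TIFF",
                         ingested_before=date(2008, 1, 1))
    print(select_for_migration(holdings, rule))
```

Keeping the selection separate from the conversion step means the same rule can be reviewed, or run as a dry run, before any object is touched.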

Digital newspapers in DNB
• Some results (collections) from digitisation projects
  - Simple graphics-data
  - Access in a dedicated system
  - Including full text OCR & access
• Online-Newspapers: Some pre-studies on objects like „Spiegel“, but no running workflow
• Concentration on e-papers

Digitisation results in DNB 1

Digitisation results in DNB 2

E-papers in DNB
Preliminary thoughts: Requirements
• Structured normalised metadata-set: Article/photo, issue, newspaper
• Persistent identification of each unique object, linkage between them, citable
• Added information for author / title on the article level is useful but not necessarily needed
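The three-level metadata set with persistent identifiers and links between the levels could be modelled roughly as follows. This is a sketch for illustration only: the class names, the identifier strings and the `citation_path` helper are assumptions, not the DNB data model.

```python
from dataclasses import dataclass, field


@dataclass
class Newspaper:
    pid: str   # persistent identifier of the title
    name: str
    issue_pids: list = field(default_factory=list)


@dataclass
class Issue:
    pid: str
    newspaper_pid: str  # link up to the newspaper
    date: str
    article_pids: list = field(default_factory=list)


@dataclass
class Article:
    pid: str
    issue_pid: str      # link up to the issue
    title: str = ""     # useful, but not required
    author: str = ""    # useful, but not required


def citation_path(article, issues, newspapers):
    """Resolve the chain newspaper, issue, article for a citable reference."""
    issue = issues[article.issue_pid]
    paper = newspapers[issue.newspaper_pid]
    return [paper.pid, issue.pid, article.pid]


if __name__ == "__main__":
    paper = Newspaper("urn:example:paper", "Example Daily")
    issue = Issue("urn:example:paper:2010-02-26", paper.pid, "2010-02-26")
    article = Article("urn:example:paper:2010-02-26:a1", issue.pid)
    paper.issue_pids.append(issue.pid)
    issue.article_pids.append(article.pid)
    print(citation_path(article, {issue.pid: issue}, {paper.pid: paper}))
```

Because every level carries its own identifier and a link to its parent, an article can be cited on its own and still be traced back to its issue and newspaper.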

E-paper requirements
• Quantity:
  - One newspaper: ca. 150 articles per day / 900 a week / per year per year
• Start modestly
• Retrodigitisation (collection started with 1913) will extend this to more than 1 billion articles
• Challenge in terms of resources and technical capacities

E-paper project (recently started)
• In cooperation with a vendor after a tender procedure
• Ca. 20 important newspapers, starting with two
• Metadata should be delivered in ONIX
• Harvesting interface: OAI-PMH
• All data delivered in an XML file
• Integrated Digital Preservation in the kopal environment
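Harvesting over OAI-PMH comes down to repeated `ListRecords` requests that follow a resumption token. The sketch below only builds request URLs and reads the token from a response. The verb and argument names are those of the OAI-PMH protocol, while the base URL, the `onix` metadata prefix and the function names are assumptions rather than details of the DNB interface.

```python
import xml.etree.ElementTree as ET
from typing import Optional
from urllib.parse import urlencode

OAI_NS = "http://www.openarchives.org/OAI/2.0/"


def list_records_url(base_url: str,
                     metadata_prefix: Optional[str] = None,
                     from_date: Optional[str] = None,
                     resumption_token: Optional[str] = None) -> str:
    """Build a ListRecords request URL.

    In OAI-PMH the resumptionToken is an exclusive argument, so it
    replaces metadataPrefix and from on follow-up requests.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    elif metadata_prefix:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if from_date:
            params["from"] = from_date
    else:
        raise ValueError("metadata_prefix or resumption_token is required")
    return f"{base_url}?{urlencode(params)}"


def next_token(response_xml: str) -> Optional[str]:
    """Return the resumption token of a response, or None on the last page."""
    root = ET.fromstring(response_xml)
    token = root.find(f".//{{{OAI_NS}}}resumptionToken")
    if token is None or not (token.text or "").strip():
        return None
    return token.text.strip()


if __name__ == "__main__":
    # Hypothetical endpoint and prefix, for illustration only.
    print(list_records_url("https://example.org/oai",
                           metadata_prefix="onix",
                           from_date="2010-02-26"))
```

A harvester would fetch the first URL, store the delivered records, and keep requesting with the returned token until `next_token` yields `None`.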

XML record for e-Papers

E-Paper & Access
• Principal question for access: Integration in Portal environment or dedicated (independent) search-area
• Advanced requirements for segmentation of text
• Direct link between portal (metadata) and text
• Navigation / Browsing within the object, direct access to single chapters / pages
• Zooming, scroll
• Integrated Full text search
• Print and Store facilities
• DRM, IDM

Reuse of results from CONTENTUS-project
[Diagram: Semantic Multimedia-Search, built on five numbered layers]
1 Digitisation
2 Automated optimisation
3 Content-analysis: Face, Logo, Text, Person, Speaker 1, Speaker 2, Image Text, Title
4 Knowledge base, Semantic relation
5 Open Knowledge Networks: CORE Professionals (Media archives…), MANTLE Automated (Learning), SHELL End-User (Wikipedia)
[Example result] Film: Information about actors, director, producers, music, sequence, year of production. Short description of the picture, video sequence… What is in the film, rights. Any other relevant information as short summary of content for fast access…
[Linked panels] Related books, Related internet links, Related music score, Related films, Related songs, Related news (each captioned: Year of printing, editions, authors, summary of the book….)

Data processing
• Automated page segmentation (headlines, images, tables)
• OCR + entity recognition
• Full text search
• Semantic search interface
Based on:
• Intellectually approved authority files
• Statistical data analysis

Our solution currently: Integrated search and retrieval

Next step: Integrated E-papers

Integrated E-paper „ZEIT“ 1

Integrated E-paper „ZEIT“ 2: Provision of free texts

Integrated E-paper „ZEIT“ 3

Reinhard Altenhöner