The European(a) Newspapers Project A Gateway to European Newspapers Online Paris, 12.04.2012 Thorsten Siegmann, Staatsbibliothek zu Berlin, Germany.

Slides:



Advertisements
Similar presentations
A Gateway to European Newspapers Online Building Common History and Identity Around Digital Materials INFORUM, Prague, May 21-24, 2012 Vesna Vuksan, University.
Advertisements

THE PROJECT LIFECYCLE National Contact Point
Ute Schwens, Die Deutsche Bibliothek, IFLA Sattelite Meeting Information Technology and DCMI, Goettingen 12/08/03, 1/19 Ute Schwens, Die Deutsche Bibliothek.
EU-Bureau of the Federal Ministry of Education and Research PT-DLR Joseph-Schumpeter-Allee Bonn Tel: 0228 / Fax: 0228 /
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
06/02/2014 NORDUnet 2000 : Renardus – The Clever Route to Information Renardus – The Clever Route to Information Janne Kanner CSC – Scientific Computing.
Michael contribution to national and European strategies EVA Florence - 18 April 2008 MINERVA and MICHAEL fostering access to digital cultural and scientific.
Israel, 10th and 11th of December 2003 Italy Israel Bi-national Seminar on Digital Access to Scientific and Cultural Heritage Antonella Fresa MINERVA Technical.
Online Access to Cultural Heritage through Digital Collections: the MICHAEL Project Giuliana De Francesco Ministero per i beni e le attività culturali,
Digital libraries and culture portals Rossella Caffo - MiBAC Coordinator of the MICHAEL and MINERVA eC projects.
Antonella Fresa Vilnius, 4th October 2007 Antonella Fresa Technical Coordinator MinervaEC MInisterial NEtwoRk for Valorising Activities in digitisation,
WDL Technical Architecture Working Group (TAWG) June 2010 Achievements and Recommendations Co-chaired by Noha Adly, Bibliotheca Alexandrina Babak Hamidzadeh,
CrossAsia at the Staatsbibliothek zu Berlin an approach to organise access to research material in the field of Asian studies.
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
WP7 Internal Evaluation & Quality Assurance Green Employability Project StudioCentroVeneto - Toni Brunello and Paolo Zaramella Vienna, 10th January 2012.
The Europeana Newspapers Project A Gateway to European Newspapers Online.
LIBER, Europeana and the Europeana Newspapers Project Dresden, Aleš Pekárek, Association of European Research Libraries, Den Haag, NL.
Europeana Newspapers Project A Gateway to European Newspapers Online.
2 Cataloguing and retroconversion: what do we need to know as a community ? Patricia Methven.
PwC SCHEMAS Forum for metadata schema implementers The SCHEMAS project and metadata ETB Workshop, London, 9-10 January 2001 Michael Day,
Collection-level description & collection management: tool for the trade or information trade-off? Collection Description Focus Workshop 4 Newcastle, 8.
NRG, Bristol, November 17, 2005 Hans Petschar The European Library Vision for European Digital Library.
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
1 Welcome to KTH! KTH, the Royal Institute of Technology Excellence in Education, Research and Entrepreneurship Victor Kordas, KTH Grants Office.
WP3. Evaluation, Monitoring and Quality Plan Dr. Luis Sobrado 27 th May 2011.
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
The Business Support Professional Career Pathway Leonardo Partnership Management Meeting CECA´s headquarter Seville, Spain March 2010.
1 Dissemination to Policy and Decision Makers and a Wider Audience Peter J. Bates pjb Associates
The European Activities of BR Communication e-CODEX e-Justice Communication via Online Data Exchange Bucharest, June 14 th 2013.
WP2 – communication and dissemination
Communicating the Rhine release Jon Purday Senior Communications Advisor, Europeana.
ETIS+: European Transport Policy Information System - Development and Implementation of Data Collection Methodology for EU Transport Modelling Funded by.
Testing and Evaluation in Digital Preservation Projects: the case of KEEP Milena Dobreva Janet Delve, David Anderson, Leo Konstantelos.
Europeana Collections Remembering the First World War Rotterdam, Thorsten Siegmann, Staatsbibliothek zu Berlin – Preußischer Kulturbesitz.
Europeana Collections National Libraries and further partners digitising materials from World War One Thorsten Siegmann, Staatsbibliothek zu.
On the Two Sides of the Pond By Hans-Jörg Lieder, Head of the Department of Bibliographic Services – Union Catalogue of Serials Staatsbibliothek zu Berlin.
The view from Europe Paola Gargiulo – CASPUR (and Valentina Comba University of Bologna – Italy) Fiesole Collection Development Retreats Fiesole 2004 March.
Converging parallel universes Library services as building blocks of digital humanities research 42nd LIBER Annual Conference Munich June 2013 Gregor Horstkemper.
OpenUp! A New Project on Opening up the European Natural History Heritage for EUROPEANA W. G. Berendsohn, A. K. Michel, A. Güntsch, W.-H. Kusber (2011)
Creating Access to Europe’s Television Heritage FIAT/IFTA World Conference Madrid :: October 2006 Prof. Dr. Sonja de Leeuw (Project coordinator - Utrecht.
The German Union Catalogue of Serials and its interlibrary services Hans-Jörg Lieder Head of the Department of Bibliographic Services Staatsbibliothek.
DISSEMINATION / VALORISATION PLAN AND ACTIVITIES PRESENTED BY DR SHYAM PATIAR.
Mary Rowlatt AccessIT Project Coordinator MDR Partners
Exploring Europe's Television Heritage in Changing Contexts Connected to: Funded by the European Commission within the eContentplus programme
Europeana Libraries: what is the value of a library domain aggregator? Susan Reilly (LIBER) LIBER 2012, Tartu.
Europeana Sounds – Uniting the sounds of Europe Richard Ranft (British Library) Zane Grosa (National Library of Latvia) IASA Nordic conference, 26 May.
Enhancing the Culture of Reading and Books in the Digital Age - ARROW Olav Stokkmo, Chief Executive, IFRRO 13 October 2009IFLA-IFRRO-WIPO-IPA-EWC Conference;
ICT PSP Infoday Brussels Call 2011 – Theme 2 Digital Content ICT-PSP Call Theme 2: Digital Content Federico Milani, Marc Röder Infso E6/eContent.
E-SENS Electronic Simple European Networked Services WP2 kick off Berlin, Germany Apr 10th 2013.
Consolidating the European Library Space Luxembourg November 1999.
Exploring Europe's Television Heritage in Changing Contexts Connected to: Funded by the European Commission within the eContentplus programme
EUscreen: Examining An Aggregator ’ s Role in Digital Preservation Samantha Losben Digital Preservation - Final Project December 15, 2010.
Mary Rowlatt AccessIT Project Coordinator MDR Partners
Andreas Juffinger 14 June, 2012, Washington DC Europeana Research Opening Up Europeana for Research.
Europeana Libraries: building a pan-European aggregator Wouter Schallier, LIBER Executive Director Eva/Minerva 15/11/2011.
1 Women Entrepreneurs in Rural Tourism Evaluation Indicators Bristol, November 2010 RG EVANS ASSOCIATES November 2010.
National Library of Estonia in the TEL-ME-MOR project IST4Balt workshop in Estonia June 2006 Baltic ICT Community.
Cultural Heritage Projects Bundesamtsgebäude Wien, June 30, 2000 Cultural Heritage Projects: Renardus Dr. Heike Neuroth Lower Saxony State and University.
Participation in 7FP Anna Pikalova National Research University “Higher School of Economics” National Contact Points “Mobility” & “INCO”
The MICHAEL Project is funded under the European Commission eTEN Programme The multilingual catalogue of digital cultural heritage in Europe.
Unesco / WSIS+1026 February 2013 ENUMERATE: Measuring the progress of digital heritage in Europe Marco de Niet (DEN Foundation, NL) Unesco WSIS+10 Review.
Benchmarking tool for Quality Assurance in VET.
National Library of Finland Strategic, Systematic and Holistic Approach in Digitisation Cultural unity and diversity of the Baltic Sea Region – common.
Collection of Pan-European Terminology Resources through Cooperation of Terminology Institutions EUROTERMBANK Andrejs Vasiļjevs, Tilde, Latvia.
EDLproject WP3 “Developing the European Digital Library” LIBER – EBLIDA workshop Digitisation of Library Material in Europe Copenhagen, October.
Creating Access to Europe’s Television Heritage Vienna, EDL Workshop November Dr. Alexander Hecht (Austrian Broadcasting Corporation ORF) Johan.
VIDEO ACTIVE Creating Access to European Television History Project Update FIAT World Conference, Lisbon October 15th, 2007 Alexander Hecht (ORF, A) –
eContentplus 2008 Work Programme
DRIVER Digital Repository Infrastructure Vision for European Research
APENet and EUROPEANA: Digitization Issues in the European Context
Presentation transcript:

The European(a) Newspapers Project A Gateway to European Newspapers Online Paris, Thorsten Siegmann, Staatsbibliothek zu Berlin, Germany

2 Content Project Profile Aims Consortium Framework Areas of activity

3 Europeana Newspapers – why newspapers? "Die Zeitungen sind die Sekundenzeiger der Geschichte." (Newspapers are the sweep hands of history) Arthur Schopenhauer Why newspapers? Relevant to all citizens Highly relevant to European policies incl. Europeana Newspapers in libraries – between Heaven = solid and complete originals, excellent microfilm copies and Hell = frail and crumbly originals, missing editions, incomplete supplements, poor microfilm copies, legal uncertainties with contemporary material

4 Europeana Newspapers: Aims and Objectives Europeana Newspapers aims at the aggregation and refinement of newspapers for The European Library and Europeana. will use refinement methods for OCR, OLR (article segmentation), and named entity (NER) and class recognition the libraries participating in the project will provide around 18 million digitised newspaper pages to Europeana Further libraries will be encouraged to contribute newspapers to Europeana and TEL by the project

5 Project Profile: Consortium & stakeholders 17 partners from 12 countries within the consortium National libraries University libraries SME External partners and stakeholders: Involvement of libraries outside the project consortium Framework: Funded as a Best Practise Network in the ICTPSP programme of the European Commission Project Duration: February 2012 – January 2015

Europeana Newspapers Consortium NLF SBB ONB NLP BnF NLE SUB HH USAL NLL KB LIBER CCS NLT UB UBIK LFT BL

Consortium Partners 9. University of Salford 10. CCS Content Conversion Specialists GmbH 11. Stichting LIBER 12. National Library of Latvia 13. National Library of Turkey 14. University Library of Belgrade 15. University of Innsbruck 16. Landesbibliothek Dr. Friedrich Tessmann 17. The British Library 1. Staatsbibliothek zu Berlin (project co-ordinator) 2. National Library of the Netherlands 3. National Library of Estonia 4. Österreichische Nationalbibliothek 5. National Library of Finland 6. Staats- und Universitätsbibliothek Hamburg 7. Bibliothèque nationale de France 8. National Library of Poland

8 Project Profile: Objectives 1) Selection, Refinement & Aggregation of content Make Europeana the largest provider of pan-European newspaper collections Provision of more than 18 million newspaper pages to Europeana, many of those with full-texts Support move from images to texts in Europeana 2) Analysis of existing newspaper collections Survey of newspaper holdings in Europe 3) Quality Assurance & Best practise recommendations Contribute to optimised workflows and data aggregation infrastructures Provide best practice recommendations for digitization, refinement, workflows, metadata etc. and evaluation tools 4) Presentation and full-text search Improve access to newspaper collections within Europeana

9 1) Selection, Refinement & Aggregation of content Aggregation of 18 million pages of digitised newspapers to Europeana and to The European Library 8 million pages as is (content providers) 10 million refined pages: OCR (UIBK, Austria) 2 million refined pages: OCR/OLR (article segmentation) (CCS, Germany) Analysis of available digital newspaper collections and selection of subsets suitable for refinement

1) Refinement – OCR and OLR 10 million refined pages: OCR (UIBK, Austria) 2 million refined pages: OCR/OLR (article segmentation) (CCS, Germany) UIBK enriches the OCR with structural information from their Document Understanding Platform CCS produces OCR and verification of column recognition, zoning, article segmentation, and page class recognition CCS provides libraries with a client technology for manual correction of recognition and segmentation results CCS: Column recognition, article segmentation UIBK: Detection of headings, footnotes, etc. Table of contents extraction

1) Refinement - Named Entity Recognition KB provides named entities recognition (NER) for material from up to three languages (Dutch, English, and German)

2) Analysis of existing digitised newspaper collections Project partners and others will be contacted until summer 2012 to analyse the extent of digitised newspapers collections at their institutions Results will be embedded in Zeitschriftendatenbank of Staatsbibliothek zu Berlin (Union Catalogue of Serials) Potential new partners for the extension of the network will be suggested by survey May also be useful to judge technical status of digitised data and as part of gathering descriptive metadata If you hold digital newspaper collection and like to participate in the survey please contact:

3) Analysis of work & Best Practise Recommendations Analysis of metadata formats in use by libraries in digitisation projects Align metadata models with the METS/ALTO standard and release best practise recommendation on how to apply these formats in newspaper digitisation and refinement Usability of the recommendation will be tested through an evaluation cycle Provide recommendations on best practices for refinement of digitized newspaper collections for Europeana

4) Presentation & Access to full-texts Within the lifetime of the project, a content browser will be built within TEL portal so that users can … Search full text, e.g. by search term, by named entities by collections of newspapers by date …. See newspaper images Be linked to relevant library sources This browser will be built in TEL during project; and exported to Europeana after the project

15 5) Dissemination Objectives: Establishment of publicity Increasing usage of Europeana Awareness raising among target groups Tasks: 1. Media Communication 2. Workshops and conferences Three main dissemination workshops National information days Network extension 3. Exploitation

… and more will come soon Detailed information will be available soon: (Launch: End of April) Participating in the survey on European Newspapers:

Thank you for your attention! Thorsten Siegmann, Staatsbibliothek zu Berlin (Launch: End of April)