1 Helping communities access and explore their newspaper heritage. Rose Holley – Manager Newspaper Digitisation Program

Presentation transcript:

1 Helping communities access and explore their newspaper heritage. Rose Holley – Manager Newspaper Digitisation Program Australian Media Traditions Conference 23 November 2007, Charles Sturt University, Bathurst

2 Status of the Program November 2006 Minister for Arts and Sports approval. Budget approval: $8 million for 3 million pages over 4 years. Contracts signed with digitisation suppliers. April 2007 program pilot phase commences.

3 Content and Coverage National content: initially a title from each state; focus on major titles from each state first; anticipated that ‘regional’ titles may be contributed later. Coverage: published between 1803 – 1954 (out of copyright). Titles: West Australian, Northern Territory Times, Courier Mail, Advertiser, Sydney Gazette, Argus, Mercury, Canberra Times.

4 First Newspaper First page of first Australian newspaper ever published The Sydney Gazette and New South Wales Advertiser Saturday March

5 Through 150 years Up to 1954 (after which copyright applies), and later if agreement is reached with publishers. The Argus 22 August 1945

6 Relationship - ANPLAN Website:

7 Keep Up to Date with Progress Website:

8 National Help NLA working with State and Territory Libraries as part of ANPLAN. Libraries suggest titles and dates and provide microfilm for digitising. ANPLAN members and other stakeholders will provide feedback on the search and delivery prototype. Developing model for national contribution of regional newspapers.

9 Process in brief National sourcing of selected newspaper microfilm masters. Masters scanned to TIFF files by contractor in Sydney. NLA performs quality assurance and adds metadata. Contractor in India processes TIFF files: OCR, zoning, XML markup. NLA QAs the files, ingests them to the system, and creates derivatives for delivery.
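
The five steps on this slide can be read as a fixed hand-off sequence between the NLA and its contractors. The sketch below is illustrative only: the stage identifiers, the `Batch` class and the batch ID are hypothetical names, not part of the NLA's actual workflow system.

```python
from dataclasses import dataclass, field

# Stage order and responsible parties follow the slide; identifiers are hypothetical.
STAGES = [
    ("source_microfilm_masters", "national sourcing"),
    ("scan_to_tiff", "contractor, Sydney"),
    ("qa_and_metadata", "NLA"),
    ("ocr_zoning_xml_markup", "contractor, India"),
    ("qa_ingest_derivatives", "NLA"),
]


@dataclass
class Batch:
    """A batch of pages moving through the stages in order."""
    batch_id: str
    completed: list = field(default_factory=list)

    def next_stage(self):
        """Return (stage, responsible party) for the next step, or None when finished."""
        if len(self.completed) < len(STAGES):
            return STAGES[len(self.completed)]
        return None

    def advance(self):
        """Mark the next stage as done and return it."""
        stage = self.next_stage()
        if stage is None:
            raise ValueError(f"batch {self.batch_id} has already finished")
        self.completed.append(stage[0])
        return stage


batch = Batch("pilot-001")  # hypothetical batch ID
while batch.next_stage() is not None:
    name, owner = batch.advance()
    print(f"{name:26s} -> {owner}")
```

Keeping the stages in one ordered list makes it impossible for a batch to skip a quality assurance step, which is the point of the two NLA checkpoints on the slide.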

10 Logistics Australia (State Capitals – Sydney/Canberra) USA (Virginia) - India (Hyderabad, Chennai)

11 6 Month Progress IT infrastructure and storage implemented at NLA. Content management and ingest software developed by NLA to support workflow. Quality assurance and production software developed by US/India contractor. Pilot data sent to contractors to test workflows, systems and software against agreed project spec.

12 Next 6 months Acceptance of pilot data, then commence production phase (3 million pages). Development of search and delivery prototype. Public launch of service with a good body of content in 2008. Progressive addition of content – national program ongoing.

13 Technology – internal NLA Old newspapers being processed and delivered using latest digital technology. NLA developing in house: –Ingest and storage system –Workflow and content management system including quality assurance module –Search and delivery system. NLA providing: –System infrastructure (storage, backup, disaster recovery)

14 Infrastructure and Storage
Online storage – 70 TB:
–Working space for images in processing: 40 TB for 1 million pages
–Search and delivery derivatives: 30 TB for 3 million pages
–XML files, database systems and indexes: 1 TB
Offline storage – unlimited, for master images on tape.
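
As a rough check on these figures, the per-page storage implied by the slide can be worked out directly. This sketch assumes decimal units (1 TB = 1,000,000 MB), which the slide does not state. The three listed components sum to 71 TB, so the 70 TB headline figure is presumably rounded.

```python
MB_PER_TB = 1_000_000  # assumption: decimal terabytes


def mb_per_page(terabytes, pages):
    """Average storage per page in megabytes."""
    return terabytes * MB_PER_TB / pages


# Figures taken from the slide.
working_mb = mb_per_page(40, 1_000_000)   # working space for images in processing
delivery_mb = mb_per_page(30, 3_000_000)  # search and delivery derivatives
component_total_tb = 40 + 30 + 1          # includes 1 TB for XML, databases, indexes

print(f"working space: {working_mb:.0f} MB per page")
print(f"derivatives:   {delivery_mb:.0f} MB per page")
print(f"components:    {component_total_tb} TB")
```

On these assumptions the working images average 40 MB per page and the delivery derivatives 10 MB per page.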

15 Establishing Workflows

16 Technology - external Scanning microfilm using FlexScan/Eclipse scanner and latest software (NextStar) from NextScan. ,000 pages a week.

17 Scanning Contractor

18 Digital Images returned to NLA

19 Quality Assurance at NLA Use 2 widescreen monitors placed vertically. Can view complete page within context of issue. Add metadata, sort out missing and duplicate pages within an issue. Prepare batches to send for OCR.
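
The "missing and duplicate pages" check lends itself to a simple automated pre-screen before an operator reviews the issue. The function below is a minimal sketch that assumes each scanned image carries a page number and that the expected page count of the issue is known. `check_issue` and its inputs are hypothetical, not the NLA's quality assurance module.

```python
from collections import Counter


def check_issue(page_numbers, expected_count):
    """Return (missing, duplicates) for one issue, given the page number of each scan."""
    counts = Counter(page_numbers)
    missing = [p for p in range(1, expected_count + 1) if p not in counts]
    duplicates = sorted(p for p, n in counts.items() if n > 1)
    return missing, duplicates


# Hypothetical six-page issue: page 3 was never scanned, page 2 was scanned twice.
missing, duplicates = check_issue([1, 2, 2, 4, 5, 6], expected_count=6)
print("missing:", missing)
print("duplicates:", duplicates)
```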

20 Metadata

21 Page verification

22

23 Technology - external Software developed to: zone areas and articles on a page; flag continuing articles across multiple pages; categorise articles on a page; OCR text on a page; re-key headings and first 4 lines of text; deliver XML files (ALTO) and METS/MODS files.
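
ALTO stores OCR output as `String` elements, each with a `CONTENT` attribute, grouped into `TextLine` elements. The sketch below shows one way to read the text back out with Python's standard library. The sample XML is invented for illustration, and the code ignores namespaces so that it does not depend on a particular ALTO schema version. It does not represent the contractor's or the NLA's software.

```python
import xml.etree.ElementTree as ET

# Invented sample; real ALTO files carry coordinates, styles and more structure.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v2#">
  <Layout><Page ID="P1"><PrintSpace>
    <TextBlock ID="TB1">
      <TextLine>
        <String CONTENT="SYDNEY"/><SP/><String CONTENT="GAZETTE"/>
      </TextLine>
      <TextLine>
        <String CONTENT="Shipping"/><SP/><String CONTENT="News"/>
      </TextLine>
    </TextBlock>
  </PrintSpace></Page></Layout>
</alto>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def text_lines(alto_xml):
    """Return the OCR text of each TextLine as a single string."""
    root = ET.fromstring(alto_xml)
    lines = []
    for element in root.iter():
        if local_name(element.tag) == "TextLine":
            words = [child.get("CONTENT", "")
                     for child in element
                     if local_name(child.tag) == "String"]
            lines.append(" ".join(words))
    return lines


for line in text_lines(SAMPLE_ALTO):
    print(line)
```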

24 India Facility - Hyderabad

25

26 Quality Assurance

27 OCR Accuracy

28 Batch reporting

29 Acceptance Criteria

30 Prototype Development Under discussion: derivative sizes and zoom technology testing; search and browse features; results and refinement of results; user interaction with source (web 2.0); interface design.

31 Digital Newspaper Searching Newspapers full text searchable. Image captions searchable. Search across multiple papers, e.g. by person's name. Refine searching by: –Date –Newspaper title –State published
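
The refinement options listed here (date, newspaper title, state published) amount to optional filters applied on top of a full-text match. The following is a minimal in-memory sketch of that idea, not the search system under development. The `Article` class, the `search` function and the sample article texts are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Article:
    newspaper: str
    state: str
    published: date
    text: str


def search(articles, query, newspaper=None, state=None, date_from=None, date_to=None):
    """Case-insensitive full-text match, narrowed by any refinements supplied."""
    needle = query.lower()
    results = []
    for article in articles:
        if needle not in article.text.lower():
            continue
        if newspaper is not None and article.newspaper != newspaper:
            continue
        if state is not None and article.state != state:
            continue
        if date_from is not None and article.published < date_from:
            continue
        if date_to is not None and article.published > date_to:
            continue
        results.append(article)
    return results


# Invented article texts; titles and two of the dates echo the slides.
ARTICLES = [
    Article("Canberra Times", "ACT", date(1928, 7, 26), "Mr. Smith opened the exhibition."),
    Article("Argus", "Vic", date(1945, 8, 22), "Mr. Smith returned from service."),
    Article("Mercury", "Tas", date(1901, 1, 1), "Shipping news for the port."),
]

all_hits = search(ARTICLES, "smith")
later_hits = search(ARTICLES, "smith", date_from=date(1940, 1, 1))
print(len(all_hits), [a.newspaper for a in later_hits])
```

Searching for a person's name across all papers returns hits from more than one title, and each refinement only narrows that list.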

32 Refine search by categories News; Advertising; Birth, Death, Marriage notices; Obituaries; Editorial commentary and letters; Shipping News; Arts and leisure; Detailed lists, results, guides

33 Search Illustrations Categorised as: Photo, Cartoon, Map, Graph, Illustration. Captions searchable. Canberra Times 26 July 1928 page 6

34 Browsing and Viewing Browse papers page by page. Zoom in and out of image: –to read small text –to view context of article within page layout. Print article or entire page or issue.

35 Zoom technology

36 Testing derivative sizes and zooming

37 Prototype wireframe

38 Other features Under discussion: OCR correction by users; personal annotation of articles by users; tagging results; creating public sets (for historical events); clustering results; searching across other relevant resources (paid subscription services, international resources, other digital resources).

39 Prototype release To be released to stakeholders who have given microfilm content. Stakeholders able to view their data. Feedback on data quality and search functionality. Amendments made and then ‘search and delivery version 1’ released to a wider group for testing and feedback before public launch in 2008.

40 Pilot Data Canberra Times; Sydney Gazette; Northern Territory Times; South Australia Advertiser; Hobart Town Gazette, Courier, Colonial, Mercury; Melbourne Argus; Perth Gazette; West Australian; Brisbane Courier Mail (12 titles, 8000 issues = 50,000 pages = 500,000 articles)
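
The pilot totals imply some useful averages. The short calculation below derives them from the slide's figures. The final projection to the 3 million page production phase assumes the pilot's articles-per-page ratio holds, which the slides do not claim.

```python
# Pilot totals from the slide.
titles, issues, pages, articles = 12, 8_000, 50_000, 500_000

pages_per_issue = pages / issues       # average pages in an issue
articles_per_page = articles / pages   # average zoned articles on a page

# Assumption: the same article density applies to the full 3 million pages.
production_pages = 3_000_000
projected_articles = int(production_pages * articles_per_page)

print(f"{pages_per_issue:.2f} pages per issue")
print(f"{articles_per_page:.0f} articles per page")
print(f"about {projected_articles:,} articles across {production_pages:,} pages")
```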

41