Options for digital delivery Record Society Conference, April 19 th 2007 Bruce Tate Project Manager British History Online.

Slides:



Advertisements
Similar presentations
Don’t Type it! OCR it! How to use an online OCR..
Advertisements

Collecting data Chapter 6. What is data? Data is raw facts and figures. In order to process data it has to be collected. The method of collecting data.
1 of 18 Information Dissemination New Digital Opportunities IMARK Investing in Information for Development Information Dissemination New Digital Opportunities.
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Delivering textual resources. Overview Getting the text ready – decisions & costs Structures for delivery Full text Marked-up Image and text Indexed How.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Developed with material from W3C Web Accessibility Initiative (WAI) IMPORTANT: Instructions Please read carefully the Instructions for.
CAPTURE SOFTWARE Please take a few moments to review the following slides. Please take a few moments to review the following slides. The filing of documents.
Importing Transfer Equivalencies: How to Maximize Efficiency How Columbia College Office of Registrar improved productivity through third party solutions.
Dana Marlowe Accessibility Partners Accessibility Partners © Not to be reproduced without permission. 1 Giving a Picture 1000 Words: Accessibility.
© 2010 Adobe Systems Incorporated. All Rights Reserved. Adobe Confidential. Adobe Acrobat XI Accessibility Features Matt May | Accessibility Evangelist.
Make your choice from more than 70 templates to get a quick start online!70 templates.
© 2011 Adobe Systems Incorporated. All Rights Reserved. Adobe Confidential. Kiran Kaja | Accessibility Engineer Ensuring Accessibility in Document Conversion.
Advanced Accessible PDF Document Training Adobe Acrobat 11.
Intelligent Online Marketing Local Digital Advertising Specific To Your Business Presented by: Dexter Nelson Live Minder Trend Analysis & Marketing.
Beyond the Digital Incunabular Period: Toward Web 2.0 Gideon Burton Asst. Prof. of English Assoc. Editor, BYU Studies Presentation to the Harold B. Lee.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
High Volume Production of Alternative Text: Supporting a Statewide System The Alternative Media Access Center.
How to Create Accessible PowerPoint Presentations Elizabeth Tu and Thayer Watkins April, 2010.
Creating an HTML page Skills: edit and debug HTML pages IT concepts: text editor This work is licensed under a Creative Commons Attribution-Noncommercial-
HTML Elements. HTML documents are defined by HTML elements.
UNIVERSITY OF MACEDONIA ECONOMIC AND SOCIAL SCIENCES Support and Inclusion of students with disabilities at higher education institutions in Montenegroz.
George Irwin Syracuse University.  Definitions  Creating PDF  Retrofitting PDF documents  Assistive technology and PDF  Resources.
Processing PDF: How to Go from PDF to E-text to Audio Gaeir Dietrich Director High Tech Center Training Unit of the California Community Colleges Foothill.
Advanced OCR with OmniPage and FineReader. Overview Optical character recognition Optical character recognition Structural recognition Structural recognition.
Creating Accessible PDF’s in Adobe Acrobat Professional 7.0.
Digital Text Primer Prepared for: AIEA Roundtable on Digitization of Armenian Documents Saturday 7 October 2006, University of Geneva, Switzerland Roland.
October 29, Marla Roll Director Shannon Lavey Service Coordinator and Provider Allison Kidd Assistive Technology IT Coordinator Accessibility Specialist.
1 Newspaper Digitisation Workflows Rose Holley- Manager ANDP Presentation to Cultural Heritage Digitisation professionals 26 November 2008.
1 Australian Newspapers Digitisation Program Development of the Newspapers Content Management System Rose Holley – ANDP Manager ANPlan/ANDP Workshop, 28.
Document Delivery Formats for the Web and Legal Digital Collections Kevin Reiss June 18 th, 2004 Law Library Rutgers-Newark School of Law.
Introduction to Desktop Publishing Using Adobe InDesign ®
Supporting Literacy Skills with Alternative Formats. EA Draffan. Highlighted text.
Luc Audrain Hachette Livre Head of digitalization
Adobe Dreamweaver CS5 Introduction Web Site Development and Adobe Dreamweaver CS5.
Lesson 4: Using HTML5 Markup.  The distinguishing characteristics of HTML5 syntax  The new HTML5 sectioning elements  Adding support for HTML5 elements.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Peoplesoft XML Publisher Integration with PeopleTools -Jayalakshmi S.
PICASA Erceg Aleksandra II4. About -Picasa is used for organizing, viewing and editing digital photos. It also has an integrated photo-sharing website.
Christine Laham, Fahed Abdu, David Dezano,Shelly Kim.
Copyright ©: SAMSUNG & Samsung Hope for Youth. All rights reserved Tutorials The internet: Blogging Suitable for: Advanced.
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Participatory Internet-based Mapping Map.
ReScript – collaborative online editing of historical texts Bruce Tate British History Online Institute of Historical Research University of London © Bruce.
1 Using Digital Technologies to unlock history for researchers. Rose Holley – Manager Newspaper Digitisation Program Australian Academy of the Humanities.
10/18/2015 NORTEL NETWORKS CONFIDENTIAL – FOR TRAINING PURPOSES ONLY Global Documentation Evolution System Overview and End-to-End Process Training.
Introduction to web development and HTML MGMT 230 LAB.
E-Books Presentation. Hard Copy (Book) Scanning OCR Text Document HTML Conversion Text Formatting Linking Image Insertion Final QC Soft Copy (JPG/TIFF)
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
Practical Experiences With the Adoption of XML in Commercial Publishing Richard Kidd Neil Hunter
INTELLECTUAL RIGHTS AND HISTORIC CORPORA Mark Sandler University of Michigan ICOLC, March, 2003.
An exercise in preservation and applied technology Making an Electronic Text.
EMu Interface and the Web Clear identification of web fields for users and administrators Visual identifier of the web presentations in EMu, ie Collection.
Review of Data Capture. Input Devices What input devices are suitable for data entry? Keyboard Voice Bar Code MICR OMR Smart Cards / Magnetic Stripe cards.
How to Create Accessible Online Course Content Shivan Mahabir Athanasia (Tania) Kalaitzidis Kevin Korber Danny Villaroel.
Document Computing Technologies for Managing Electronic Document Collections Ross Wilkinson... [et al.] Circulation Counter [RES3H] ZA4080.D
Claro Software Dr. Alasdair King Claro Software. Assistive Software Development at Claro “Assistive software development and publishing is what we do.
Accessible PDF’s using Adobe Acrobat Standard or Professional Jarilyn Weber 06/11/2014 “Leading for educational excellence and equity. Every day for every.
Understanding Web-Based Digital Media Production Methods, Software, and Hardware Objective
DIGITIZATION IN THEORY AND PRACTICE WEBSITE: Helen Nneka Okpala Presentation done at University of.
ITL conference 2003 Putting Your Content on a Diet Using rich online media without download woes.
Services Provided by Josoft Technologies  Data Entry Services  Data processing  Data Conversion  Outsourcing Services  Inbound Call Center  Outbound.
DATA COLLECTION Data Collection Data Verification and Validation.
Lecture 4 Web Design. Part 1.
LIS1510 Library and Archives Automation Issues Basics of XHTML
Infty Software - Assistive Tools to Access STEM -
TAKING THE BIG LEAP-FROM NECESSARY EVIL TO BUSINESS ASSET
MODULE 8: PRODUCTION.
My Program Session Title
PRODUCTION PHASES CHANGES
Quick and Dirty: the art of OCR
Presentation transcript:

Options for digital delivery Record Society Conference, April 19 th 2007 Bruce Tate Project Manager British History Online

April 19th 2007Bruce Tate2 The current situation  Widening audience  Maximising usage  Need for revenue generation  Current technology evolving  New emerging technology  Changing / rising user expectations

April 19th 2007Bruce Tate3 Making sense of the situation  Consider all 3 elements and their inter-relationships  Constantly updated intellectual model Users Technology Organisation

April 19th 2007Bruce Tate4 Option 1/3 – simple HTML  The ‘webmaster’ phase  Could be born digital, or converted from typescript version used for print  Simple transcription can be done by non- technical volunteers  Little control over the data, no possibility for adding extra semantic meanings

April 19th 2007Bruce Tate5 Option 2/3 – double keying into XML  Requires scanned page images  Text is transcribed by two operators separately who also add bespoke XML tags at the same time  Third operator runs comparison checks, correcting any errors  Highly accurate (99.9% and above) – minimum benchmark for academic use  More expensive - £2 to £4 per page  Examples: –British History Online –Old Bailey Online

April 19th 2007Bruce Tate6 Option 3/3 – page image + OCR  Scanned page image is ‘read’ by Optical Character Recognition software, and a text transcript produced  Either, –output text for conversion to HTML, or –Save as image with ‘hidden’ text transcription to enable searching  Very fast, can be trained to learn from mistakes - processing can be refined many times  Requires proof reading and error checking; non-standard layouts, older typefaces (esp. oblique), and poor quality originals  Adobe Capture - £350, Omnipage Pro - £320, Abbyy Finereader £60

April 19th 2007Bruce Tate7 Comparing the options TimeCostQuality Option 1: HTML HighLowHigh Option 2: Double key MediumHighHigh Option 3: OCR Low- Medium

April 19th 2007Bruce Tate8 Some possible trends  More ‘co-operative’ content creation  More and more silver surfers  Growth of print on demand  ‘Mature’ Web 2.0: –Blog with useful information –Wiki with specialised terminology –Overlay Google Maps with data sets / photos

April 19th 2007Bruce Tate9 Round-up  Period of slow and intense change  Distinctions breaking down between: –User and creator –Society and contract publisher –Society and bookseller  Keep thinking about your users, organisation and technology plus the relationships between them.