TPEN: Transcription for Paleographic and Editorial Notation Funded by the Andrew W. Mellon Foundation and The National Endowment for the Humanities Initial.

Slides:



Advertisements
Similar presentations
NATIONAL LIBRARY OF MEDICINE PubMed Central Edwin Sequeira National Library of Medicine May 26, 2004.
Advertisements

The Advanced, Enterprise Publishing Environment for Cross-media Output to Print & Web.
SharePoint Forms All you ever wanted to know about forms but were afraid to ask.
Sharpdesk Overview Desktop Composer Search Imaging      
EXtensible Catalog David Lindahl University of Rochester.
MAE Training for User July 8, Agenda Wiki FishEye Crucible Stash.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Book of the Dead Project: A new approach to Digital Editions of Ancient Manuscripts using CIDOC-CRM, FRBRoo and RDFa Dr. Barry Norton, Development Manager,
Open Annotation Overview Frankfurt Germany, 10 th of October Open Annotation: Social Bookmarking and Annotation of eBooks Robert Sanderson
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
Introducing new web content management tools for Priority...
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Sakai Overview ITS Teaching and Learning Interactive Aurora Collado January 10, 2008.
Resource Discovery Module DigiTool Version 3.0. Resource Discovery 2 Deposit Approval Search & Index Dispatcher & Viewers Single & Bulk Web Services DigiTool.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Personal Bibliographic Software Roger Mills. PBS A replacement for the card index Originally intended to manage references downloaded from abstracting/indexing.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
SQL Reporting Services Overview SSRS includes all the development and management pieces necessary to publish end user reports in  HTML  PDF 
DIGITAL MANUSCRIPT INTEROPERABILITY SharedCanvas and IIIF in Practice Benjamin Albritton Digital Manuscript Product
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 14 Sharing Documents 1 Morrison / Wells / Ruffolo.
1 of 5 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Drupal Workshop Introduction to Drupal Part 1: Web Content Management, Advantages/Disadvantages of Drupal, Drupal terminology, Drupal technology, directories.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall.
Adagio4 Web Content Management EP Information Offices.
Proprietary & Confidential The Thread That Ties it All Together Voicethread and Discovery Education Jennifer Dorman denblogs.com/jendorman.
Malaysian Grid for Learning October DC 2004, Shanghai, China. © 2004 MIMOS Berhad. All Rights Reserved Metadata Management System DC2004: International.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
ORGANIZING AND STRUCTURING DATA FOR DIGITAL PROJECTS Suzanne Huffman Digital Resources Librarian Simpson Library.
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Web based METS creation Ralf Stockmann case study.
Options for digital delivery Record Society Conference, April 19 th 2007 Bruce Tate Project Manager British History Online.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Seattle Drupal Clinic Introduction to Drupal Part 1: Web Content Management, Advantages/Disadvantages of Drupal, Drupal terminology.
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
Adobe Dreamweaver CS3 Revealed CHAPTER SIX: MANAGING A WEB SERVER AND FILES.
January 2005MERLOT Reusable Learning Design Guidelines OVERVIEW FOR MERLOT Copyright 2005 Reusable Learning This work is licensed under a Attribution-NoDerivs-NonCommercial.
Solutions using Microsoft Content Management Server 2002 Connector for SharePoint Technologies Sue Corke Mark Harrison Microsoft UK.
Advanced Technical Writing 2006 Session #4. Today in Class… ► Meet with your editorial team, refine/post deliverables ► Send URL for deliverables to Bill.
Introduction to EBSCOhost Tutorial support.ebsco.com.
Nicklas Dagersten What’s new and upcoming Configura.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Archaeology of infrastructure Jaap Geraerts Sayeed Choudhury CNI 2015 Fall Membership Meeting.
+ Publishing Your First Post USING WORDPRESS. + A CMS (content management system) is an application that allows you to publish, edit, modify, organize,
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
C. Candace Chou University of St.Thomas EndNote for Researchers.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Timothy W. Cole, Jacob Jett, Thomas G. Habing The University Library & the Center for Informatics Research in Science and Scholarship, GSLIS Open Annotation.
Integrating and Extending Workflow 8 AA301 Carl Sykes Ed Heaney.
Excel Services Displays all or parts of interactive Excel worksheets in the browser –Excel “publish” feature with optional parameters defined in worksheet.
Digitally Enabled Scholarship with Medieval Manuscripts Michael Appleby Yale Digital Collections Center (YDC2) March 1, 2013.
17 Copyright © 2006, Oracle. All rights reserved. Information Publisher.
Quality Education for a Healthier Scotland New Features of the Clinical Knowledge Publisher May 2016.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Web Content And Customer Relationship Management Solution. Transforming web sites into a customer-focused, revenue generating channel with less stress.
AEM Digital Asset Management - DAM Author : Nagavardhan
Dr. Barry Norton, Development Manager, ResearchSpace*
LMEvents SharePoint Portal How-to Guide
Metadata Editor Introduction
Jenn Riley Metadata Librarian Digital Library Program
Tutorial Introduction to support.ebsco.com.
ICEweb 2 a new way of compiling high-quality web-based components for ICE corpora Martin Weisser Center for Linguistics & Applied Linguistics, Guangdong.
Manuscript Transcription Assistant Initiative
Using CuCMS: a workshop
Reference Management and Knowledge Organization
Jenn Riley Metadata Librarian Digital Library Program
USING CONFLUENCE AS YOUR CMS
Tutorial Introduction to help.ebsco.com.
SDMX IT Tools SDMX Registry
Presentation transcript:

TPEN: Transcription for Paleographic and Editorial Notation Funded by the Andrew W. Mellon Foundation and The National Endowment for the Humanities Initial beta release October Publishing transcriptions as annotations of manuscript images Jonathan Deering Saint Louis University

A bit of history CCCC 415 the Norman Anonymous Latin text written around 1106 in Normandy Repositories providing digital images of manuscripts provide viewing environments that are fine for inspecting images, but not for transcribing them Connecting the text with the image at the line level has a number of benefits for transcribing and viewing Automatic line segmentation can handle identifying the lines quite well

Connect a line of transcribed text with a line from the image

Adding a repository TPEN runs discovery process on a new repository, noting all MSS available and which image URLs make up that MSS using a customized spider or parsing a manifest Metadata about MSS is stored as is image metadata That is all! Currently have CEEC, e-codices, Houghton Library (Havard Univesity), La biblioteca del Sacro Convento di Assisi, and Parker on the Web.

Choosing a manuscript

The transcription Environment User requests to transcribe a manuscript. They may forgo modifying the list of images included and the image order, and being transcribing the first page. TPEN downloads the first image, parses the lines, and uses the information to draw the transcription environment, which includes a request to the repository for the image. The UI drawn for the user includes a request for the image from the repository, not from TPEN.

The transcription UI

Anatomy of a transcription Transcribed text Optional additional comment as annotation on the transcription Image url + xyhw Creator - useful when choosing among multiples Date

The life of a transcription The user creates and saves their transcription. It is not made public unless they have given permission. Exporting the transcription allows you to transform any xml tagging you may have included, and output the transcription as PDF, RTF, and XML. You may also make it available as a set of OAC annotations which TPEN will host.

Common editing processes 1. Transcribe (months) 2. Edit (years) 3. Publish (???)

Why transcriptions as annotations? Created content is based on original content, but separation is maintained Creation requires some editorial decision making Multiple annotations and transcriptions can exists for the same original content

Publishing the transcription as an OAC annotation OAC annotations: 3 parts Body- The content of the annotation Target- The item that is being annotated Relationship-The fact that the relationship is annotation RDF transcriptionAnnotation hasTarget hasBody Image

An actual annotation oac:hasBody oac:hasTarget tationis exercere conveniat.

Why oac? Semantic web approach fits well with the model of connecting text we have with images someone else has Fits with sharedcanvas mechanism we use to publish image order of virtual manuscripts Allows the transcription to be used in mashups without rehosting the transcription Goal is machine readability!

Transcriptions can have great value aside from being an intermediate step in the production of an edition. Here are a few of the ways they can be used... Treating transcriptions as a separate outcome

Search can use the full text of the manuscript

Text and image can be displayed side by side or overlayed to enable easier inspection

Complex edited editions Allows easy reference back to source transcriptions from within the edition Allows reference to the source images without any additional effort by the editor Allows interoperability of tools

Stanford University Libraries Houghton Library, Harvard University John Hopkins University University of Kentucky University of Freibourg University of Cologne Bibblioteca del Sacro Convento di Assisi (Italy) Acknowledgements