Manuscript Markup with TEI

Slides:



Advertisements
Similar presentations
METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
Advertisements

Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Microsoft ® Office Word 2007 Training Header and footer basics Sweetwater ISD presents:
1. Content – Collective term for all text, images, videos, etc. that you want to deliver to your audience. 2. Structure – How the content is placed on.
METS at UC Berkeley Part I: Generating METS Objects.
XHTML1 Building Document Structure. XHTML2 Objectives In this chapter, you will: Learn how to create Extensible Hypertext Markup Language (XHTML) documents.
METS Metadata Encoding and Transmission Standard Metadata Working Group Forum April 19, 2002.
Uncovering the TEI and ODD A pedagogical strip-tease Laurent Romary - Max Planck Digital Library.
Unit 2, cont. September 14 HTML,Validating your pages, Publishing your site.
METS: An Introduction Part III METS and MOA2. MOA2: A Brief History Digital Library Federation project started in 1997 Main goal was to create a digital.
Introduction to XML: Yong Choi CSU Bakersfield.
Introduction to XML This material is based heavily on the tutorial by the same name at
Incompatible or Interoperable? A METS bridge for a small gap between two digital preservation software packages Lucas Mak Metadata & CatalogLibrarian
Electronic Thesis And Dissertation Database Errors Luke Schmader Ryan Mestre Client: Zhiwu Xie CS4624 5/6/2014.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
DIGITIZATION OF RARE LIBRARY MATERIALS Metadata Format Access to Digital Documents © Adolf Knoll, National Library of the Czech Republic.
An Introduction to Content Management. By the end of the session you will be able to... Explain what a content management system is Apply the principles.
Pemrograman Berbasis WEB XML part 2 -Aurelio Rahmadian- Sumber: w3cschools.com.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
XML introduction to Ahmed I. Deeb Dr. Anwar Mousa  presenter  instructor University Of Palestine-2009.
_ HTML, XHTML & CSS Sami Niemelä | Module 1: Introduction to digital media: Day 02.
CS 299 – Web Programming and Design Introduction to HTML.
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
Wittgenstein's Nachlass in TEI P5 Ann Arbor, 13. November 2009 Tone Merete Bruvik, Alois Pichler and Vemund Olstad.
WEB DESIGN USING DREAMWEAVER. The World Wide Web –A Web site is a group of related files organized around a common topic –A Web page is a single file.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
XHTML1 Building Document Structure Chapter 2. XHTML2 Objectives In this chapter, you will: Learn how to create Extensible Hypertext Markup Language (XHTML)
XP Dreamweaver 8.0 Tutorial 3 1 Adding Text and Formatting Text with CSS Styles.
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
Session 1 SESSION 1 Working with Dreamweaver 8.0.
XP Tutorial 9 1 Working with XHTML. XP SGML 2 Standard Generalized Markup Language (SGML) A standard for specifying markup languages. Large, complex standard.
METS at UC Berkeley Generating METS Objects. Background Kinds of materials: –primarily imaged content & tei encoded content archival materials: manuscripts.
Construction and Pedagogical Use of Digital Archives Washington University 30 May 2006 Four: The DTD
audio video object Options: controls autoplay Need to define height and width Options: controls autoplay.
What it is and how it works
XML Introduction. Markup Language A markup language must specify What markup is allowed What markup is required How markup is to be distinguished from.
Tutorial 3 Adding and Formatting Text with CSS Styles.
ACCESSIBILITY An Introduction. Accessibility Accessibility is the degree to which a product, device, service, or environment is available to as many people.
Web Technologies Lecture 4 XML and XHTML. XML Extensible Markup Language Set of rules for encoding a document in a format readable – By humans, and –
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
WP 3: Standardisation of shared metadata Mode of operation –All partners are involved –Building on practice outside the project Achievements of Year 1.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
HTML Basics. HTML Coding HTML Hypertext markup language The code used to create web pages.
Using DSDL plus annotations for Netconf (+) data modeling Rohan Mahy draft-mahy-canmod-dsdl-01.
The “Quick Change” Method of Web Design. Create Your Design Create and cut up the graphics for your web site. Create a masterstyle sheet. Name it “plainmasterstylesheet.html.
XML Notes taken from w3schools. What is XML? XML stands for EXtensible Markup Language. XML was designed to store and transport data. XML was designed.
TEI 工作坊 TEI and Images October The Concept.
Introduction to HTML.
Chapter 18 Maintaining Information Systems
Coding, Testing and Valdating a Web Page
Consistent URIs For Compliance Checking (1)
TEI Workshop 10. ROMA Summer 2010.
Web Engineering.
Content & the Supply Chain
Lisa Ruff Business Productivity/Accessibility TS Microsoft Federal
Ch 1 Second Half What is a Language?
Open Access to your Research Papers and Data
Part of the Multilingual Web-LT Program
Attributes and Values Describing Entities.
Updating GML datasets S-100 WG TSM September 2017
Microsoft PowerPoint This is the introduction to PowerPoint.
PREMIS Tools and Services
Introduction to Metadata
Document Design Justine Nielsen April 28, 2003
Text image linking.
Regression testing Tor Stållhane.
Overview of Contract Association Batch Upload
Attributes and Values Describing Entities.
YANG Instance Data for Documenting Server Capabilities
Japan CS/OTA 15th session, Geneva 27-28, August 2019
Presentation transcript:

Manuscript Markup with TEI Buddhist Manuscripts as Digital Text M.Bingenheimer, 2009 “Manuscripts don’t burn” M.Bulgakov, The Master and Margarita 1940

scholarly markup project Scholarly Acumen: - research questions - quality of the data - publications, posters - grant proposal writing Technical Acumen: - technical standards - interface design - longterm usability - cross platform design Project Management: - Budget control - Personnel - Scheduling - Training - Communication with stake holders

Manuscript digitization - Stages Transcription Textual Markup Schema design Linking to Digital Images Metadata Design Interface Design

1.Transcription Is a digital version of the |text| already available? NO → create a digital text. YES → Is it less work to change the existing version than to re-type the text from the manuscript? Closeness of versions Error rate What constitutes difference? (Gaiji, etc.) What to do with previous markup?

2. Textual Markup What phenomena do I want to mark? Often relevant for manuscripts Textstructure <div> <p> <lg> <l> Page-, linebreaks <pb> <lb> Substitutions, deletions, additions <subst> <del> <add> Corrections <choice> <corr> <sic>

2. Textual Markup Relevant for manuscripts Gaps or illegible parts <gap> Damage <damage> Text supplied by the encoder into the transcription <supplied> Text partly illegible <unclear> Scribal comments <note> Images in the text <figure>

2. Textual Markup Critical apparatus: <app>, <lem>, <rdg wit=>... Content markup?: Person & place names (needs authorities) <persName>, <placeName>, <roleName>, <name>... Dates <date> Citations <cit> Pointers and links <ptr>

2. Textual Markup - Punctuation Add punctuation Enclose in <c> (algorithmically) (Chinese full-space punctuation marks help with the automatic replace) → Switch the punctuation on and of as needed.

3. Schema Design Get a TEI schema from ROMA or VESTA Add these modules: transcr msdescription gaiji textcrit verse figures

3. Schema Design Keep the ODD file, otherwise you won’t be able to develop the schema. Keep on validating while you work Trim your schema until it contains (almost) only necessary elements, it will be easier to manage that way Restrict attribute values

4. Linking to digital facsimiles 摹本1 <facsimile> between <teiHeader> and <text> Simplest solution: Step 1 (between header and text) <facsimile> <graphic>....</facsimile> <facsimile> <graphic url="BD6776a.jpg"/> <graphic url="BD6776b.jpg"/> </facsimile>

4. Linking to digital facsimiles 摹本2 Step 2: Link text passages to facsimile IDs via @facs (@facs (facsimile) points to all or part of an image which corresponds with the content of the element.) <div facs="BD6776a.jpg">...</div> or <pb facs="BD6776a.jpg"/> or...

5. Metadata design see presentation on msDesc you might need more than TEI: MIX: Metadata for still Images in XML METS: Metadata Encoding and Transmission Standard

6. Presentation interface design ...ad libitum, but General rules for interface design: Accessibility (no red/green differences etc.) Low server footprint Easy to maintain (PHP vs. Java, CSS vs. JS) Documented Simplicity

6. Presentation interface design Basic question: relationship between digital facsimiles and digital text in the interface Solution A: The interface mainly shows the text → cut the image Solution B: The interface mainly shows the facsimile? → cut the text Solution C: Equal rights. Both text and images are present. → integrate both

A: Cut the image Link images/image eras to the text (TEI: @facs)

B: Cut the text Using the Image Markup Tool by M. Holmes you still keep the text in TEI This links the image to the text

C: “Equal rights” (here in a EXT JS library)

Evaluation In general, I tend to C “equal rights” solutions: align larger passages of digital text and facsimiles +: no need for cutting, minimize “interesting- phenomenon-at-the-border-problem”, simple -: programming a JS library means slightly more IT overhead There are scenarios where A or B is preferable instead.