Standards for digital encoding. Tomaž Erjavec, Karl-Franzens-Universität Graz. Lecture 2: TEI



What can the TEI do for you?
The TEI provides a framework for the definition of multiple schemas:
- it defines and names several hundred useful textual distinctions
- it provides a set of modules that can be used to define schemas making those distinctions
- it provides a customization mechanism for modifying and combining those definitions with new ones using the same conceptual model

Where did the TEI come from?
- Originally, a research project within the humanities
  - Sponsored by three professional associations
  - Funded by US, EU.
- Major influences
  - digital libraries and text collections
  - language corpora
  - scholarly datasets
- International consortium established June 1999 (see

Goals of the TEI
- better interchange and integration of scholarly data
- support for all texts, in all languages, from all periods
- guidance for the perplexed: what to encode; hence, a user-driven codification of existing best practice
- assistance for the specialist: how to encode; hence, a loose framework into which unpredictable extensions can be fitted
These apparently incompatible goals result in a flexible and modular environment.

TEI Guidelines
- a set of recommendations for text encoding, covering both generic text structures and some highly specific areas, based on (but not limited by) existing practice
- a very large collection of element definitions with associated declarations for various schema languages
- a modular system for creating personalized schemas or DTDs from the foregoing
- for the full picture see

Legacy of the TEI
- a way of looking at what ‘text’ really is
- a codification of current scholarly practice
- (crucially) a set of shared assumptions and priorities about the digital agenda:
  - focus on content and function (rather than presentation)
  - identify generic solutions (rather than application-specific ones)

Users of TEI
- Over 100 projects listed on the TEI project page
- Main areas:
  - digital libraries
  - text-critical editions
  - computer corpora
  - dictionaries

Versions of the Guidelines
- TEI P3 (1994), first public version:
  - SGML + book (1200 pp.) and soon also on the Web.
- TEI P4 (2002):
  - provides equal support for XML and SGML applications using the TEI scheme;
  - error correction, while maintaining backward compatibility: documents conforming to TEI P3 will not become illegal when processed with TEI P4.
- TEI P5 (2007):
  - implements more fundamental changes to the schemas, in line with current practice and identified problems, e.g. uses namespaces
  - no longer backward compatible (but an XSLT migration from P4 to P5 exists)
  - Relax NG becomes the main schema language
  - continuous improvement...
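
One practical consequence of P5 using namespaces is that queries written for un-namespaced P4 documents stop matching. The following is a minimal Python sketch of that effect, using only the standard library. The two document fragments are made up for illustration; the namespace URI is the TEI P5 one.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"  # TEI P5 namespace

# Illustrative P5-style fragment: all elements live in the TEI namespace.
p5_doc = f'<TEI xmlns="{TEI_NS}"><text><body><p>Hello</p></body></text></TEI>'
# Illustrative P4-style fragment: no namespace, root element TEI.2.
p4_doc = "<TEI.2><text><body><p>Hello</p></body></text></TEI.2>"

p5_root = ET.fromstring(p5_doc)
p4_root = ET.fromstring(p4_doc)

ns = {"tei": TEI_NS}
p5_plain = p5_root.findall(".//p")              # unqualified name: no match in P5
p5_qualified = p5_root.findall(".//tei:p", ns)  # namespace-qualified name: matches
p4_plain = p4_root.findall(".//p")              # P4 has no namespace, so this matches

print(len(p5_plain), len(p5_qualified), len(p4_plain))
```

The same applies to XSLT stylesheets: match patterns written for P4 need a namespace prefix bound to the TEI namespace before they apply to P5 documents.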

The general structure of TEI documents
- Burnard, Driscoll, Rahtz, TEI Training Course, Sofia 2005: Slides for TEI overview
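
The slide only points to external training material. As a rough orientation, a TEI P5 document consists of a teiHeader (metadata) followed by a text (the content). The sketch below builds such a skeleton in Python with the standard library; the helper names `tei` and `minimal_tei` and the placeholder strings are assumptions made for this example, not part of the lecture.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)  # serialize TEI as the default namespace


def tei(tag):
    """Return the namespace-qualified name of a TEI element (illustrative helper)."""
    return f"{{{TEI_NS}}}{tag}"


def minimal_tei(title, paragraph):
    """Build a skeleton document: teiHeader with a fileDesc, then text/body."""
    root = ET.Element(tei("TEI"))

    # Header: the file description with its three required parts.
    header = ET.SubElement(root, tei("teiHeader"))
    file_desc = ET.SubElement(header, tei("fileDesc"))
    title_stmt = ET.SubElement(file_desc, tei("titleStmt"))
    ET.SubElement(title_stmt, tei("title")).text = title
    pub_stmt = ET.SubElement(file_desc, tei("publicationStmt"))
    ET.SubElement(pub_stmt, tei("p")).text = "Unpublished sketch"  # placeholder
    source_desc = ET.SubElement(file_desc, tei("sourceDesc"))
    ET.SubElement(source_desc, tei("p")).text = "Born digital"  # placeholder

    # Text: the encoded content itself.
    text = ET.SubElement(root, tei("text"))
    body = ET.SubElement(text, tei("body"))
    ET.SubElement(body, tei("p")).text = paragraph
    return root


doc = minimal_tei("Sample title", "Sample paragraph.")
print(ET.tostring(doc, encoding="unicode"))
```

This only shows the nesting of elements; it does not validate the result against a TEI schema.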

TEI Lite
- TEI Lite is a particular parametrisation of TEI that “provides 90% of the elements needed for 90% of users”
- but better to make your own schema...

Lab session
- take personlist data and write XSLT to transform it into TEI Lite
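
The lab asks for an XSLT stylesheet. As a way of thinking through the mapping first, here is the same kind of transformation sketched in Python instead. Everything about the input is an assumption: the personlist format is not given in these slides, so the `personlist`, `person` and `name` elements and the sample names below are hypothetical. The output uses the TEI elements list, item and name, which is one possible target, not necessarily the one intended in the lab.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)

# Hypothetical input; the real lab data may be structured differently.
PERSONLIST = """
<personlist>
  <person id="p1"><name>Ana Novak</name></person>
  <person id="p2"><name>Ivan Horvat</name></person>
</personlist>
"""


def tei(tag):
    """Return the namespace-qualified name of a TEI element (illustrative helper)."""
    return f"{{{TEI_NS}}}{tag}"


def personlist_to_tei_list(xml_text):
    """Map each assumed <person> to a TEI <item> holding a <name type="person">."""
    source = ET.fromstring(xml_text)
    tei_list = ET.Element(tei("list"))
    for person in source.findall("person"):
        item = ET.SubElement(tei_list, tei("item"))
        name = ET.SubElement(item, tei("name"), {"type": "person"})
        name.text = person.findtext("name", default="").strip()
    return tei_list


result = personlist_to_tei_list(PERSONLIST)
print(ET.tostring(result, encoding="unicode"))
```

In XSLT the loop would typically become a template matching the person element, with the surrounding list produced by a template matching the root.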

XSLT
- Erjavec: Course at ESSLLI 2005, Annotation of Language Resources, Lecture II. XML-Related Recommendations: Formatting and Transforming XML
- ZVON tutorial
- W3schools
- …