Uncovering the TEI and ODD A pedagogical strip-tease Laurent Romary - Max Planck Digital Library.

Slides:



Advertisements
Similar presentations
SharePoint Forms All you ever wanted to know about forms but were afraid to ask.
Advertisements

W3C XML Schema: what you might not know (and might or might not like!) Noah Mendelsohn Distinguished Engineer IBM Corp. October 10, 2002.
XHTML Basics.
Representing dictionaries with the TEI Proposal for basic guidelines Laurent Romary - Max Planck Digital Library With the help of Susanne Alt - CNRS.
SRDC Ltd. 1. Problem  Solutions  Various standardization efforts ◦ Document models addressing a broad range of requirements vs Industry Specific Document.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Sunday, June 28, 2015 Abdelali ZAHI : FALL 2003 : XML Schemas XML Schemas Presented By : Abdelali ZAHI Instructor : Dr H.Haddouti.
EPOCH 1000 File Management Data Logging and Reporting
Unit 4 – XML Schema XML - Level I Basic.
Introduction to XML This material is based heavily on the tutorial by the same name at
Tutorial 3: Adding and Formatting Text. 2 Objectives Session 3.1 Type text into a page Copy text from a document and paste it into a page Check for spelling.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
Visualization By: Simon Luangsisombath. Canonical Visualization  Architectural modeling notations are ways to organize information  Canonical notation.
1 ADVANCED MICROSOFT WORD Lesson 15 – Creating Forms and Working with Web Documents Microsoft Office 2003: Advanced.
Lecture 15 XML Validation. a simple element containing text attribute; attributes provide additional information about an element and consist of a name.
EAD: A Technical Introduction Julie Hardesty, Metadata Analyst June 3, 2014.
NHS CFH Approach to HL7 CDA Rik Smithies Chair HL7 UK NProgram Ltd.
Why XML ? Problems with HTML HTML design - HTML is intended for presentation of information as Web pages. - HTML contains a fixed set of markup tags. This.
CREATED BY ChanoknanChinnanon PanissaraUsanachote
Copyright © 2012 Accenture All Rights Reserved.Copyright © 2012 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are.
Pervasive e-commerce with XML Babak Esfandiari Carleton University Ottawa, Canada.
Neminath Simmachandran
Chapter 1 Understanding the Web Design Environment Principles of Web Design, 4 th Edition.
Experiments with ODD outside the TEI framework Laurent Romary & Piotr Banski The ISO-TEI connection.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
XHTML1 Building Document Structure Chapter 2. XHTML2 Objectives In this chapter, you will: Learn how to create Extensible Hypertext Markup Language (XHTML)
Software Design 13.1 XML: Another TLA or the Future? XML is eXtensible Markup Language  It's a w3c standard.
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
What is XML?  XML stands for EXtensible Markup Language  XML is a markup language much like HTML  XML was designed to carry data, not to display data.
FIGIS’ML Hands-on training - © FAO/FIGIS An introduction to XML Objectives : –what is XML? –XML and HTML –XML documents structure well-formedness.
HTML and XML Behind Web Authoring Tools. 2 Objectives Introduce HTML Learn HTML Step by step Introduce XML.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Session 1 SESSION 1 Working with Dreamweaver 8.0.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
ARCHIVISTS’ TOOLKIT WORKSHOP March 13, 2008 Christine de Catanzaro Jody Thompson.
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
SaveUML System design. System overview Possible...
TUTORIAL Dolphy A. Fernandes Computer Science & Engg. IIT Bombay.
New Perspectives on XML, 2nd Edition
An OO schema language for XML SOX W3C Note 30 July 1999.
Lecture 16 Introduction to XML Boriana Koleva Room: C54
Microsoft ® Office Excel 2003 Training Using XML in Excel SynAppSys Educational Services presents:
The eXtensible Markup Language (XML). Presentation Outline Part 1: The basics of creating an XML document Part 2: Developing constraints for a well formed.
XML 2nd EDITION Tutorial 4 Working With Schemas. XP Schemas A schema is an XML document that defines the content and structure of one or more XML documents.
1 Tutorial 14 Validating Documents with Schemas Exploring the XML Schema Vocabulary.
Tutorial 13 Validating Documents with Schemas
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 1COMP9321, 15s2, Week.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Standards for digital encoding Tomaž Erjavec Karl-Franzens-Universität Graz Tomaž Erjavec Lecture 2: TEI.
WP 3: Standardisation of shared metadata Mode of operation –All partners are involved –Building on practice outside the project Achievements of Year 1.
David Orchard W3C Lead BEA Systems Web service and XML Extensibility and Versioning.
Basics of Web Based Computing. The Architecture The user’s system A Web Server What’s inside? Server software Apache or other Resources to be accessible.
Optimising XML Schema for IODEF Data model INCH WG, IETF57 July 16, 2003 Yuri Demchenko.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Class Diagrams. Terms and Concepts A class diagram is a diagram that shows a set of classes, interfaces, and collaborations and their relationships.
Using DSDL plus annotations for Netconf (+) data modeling Rohan Mahy draft-mahy-canmod-dsdl-01.
Object-Oriented Software Engineering Practical Software Development using UML and Java Modelling with Classes.
XML Notes taken from w3schools. What is XML? XML stands for EXtensible Markup Language. XML was designed to store and transport data. XML was designed.
Generating XML Data from a Database Eugenia Fernandez IUPUI.
XML Namespaces In this first lesson XML Namespaces, you will learn to:
Themes CIS 136 Building Mobile Apps 1. Themes 2  jQuery Mobile has a robust theme framework that supports up to 26 sets of toolbar, content and button.
XML BASICS and more…. What is XML? In common:  XML is a standard, simple, self-describing way of encoding both text and data so that content can be processed.
Autonomy Paradigm Warning: This document is a part of my “Responsible Programming” theme. All docs related to that theme just gather some of my ideas.
Setting Defaults in Microsoft Word for Accessibility
Component-based Software Engineering: XML
Manuscript Markup with TEI
TEI Workshop 10. ROMA Summer 2010.
Eugenia Fernandez IUPUI
Presentation transcript:

Uncovering the TEI and ODD A pedagogical strip-tease Laurent Romary - Max Planck Digital Library

Objectives Using Roma as a pedagogical tool –A user oriented approach –A coherent picture of the TEI Understanding the core principles of the TEI –Modules, classes, and main elements Getting acquainted with ODD –Showing why ODD is an essential part in the TEI architecture Providing an intuitive view about conformance

Prerequisites What the learner knows –Principles of tagging (cf. HTML) –XML (may have already drafted his own DTD for encoding his stuff) –What he wants to do Some ideas about the data he would like to encode …and the kind of information he would like to see in his “schema” What he may not necessarily know –Principles of data interchange and/or long-term archiving –Hence, what standardization may mean…

The TEI spirit Conformance? –Sharing a common text encoding culture –Sharing the same vocabulary (when applicable) –Allowing user autonomy in defining modifications (extensions, customization), but sharing the mechanisms to do so

Main concepts Literate programming –One Document Does it all Schema specification User oriented documentation The founding principles of ODD –Modularity: all specifications pertaining to a coherent sub-domain of the TEI –Classes: identifying shared behaviours or semantics Extensibility: a consequence of the above mechanisms

Specifying a schema What is a schema? –Elements, attributes, and constraints between these Which schema language should be used? –Agnosticism of the TEI DTD, Relaxng, W3C –Still, RelaxNG fragments in ODD

ODD and the TEI ODD for the TEI –The TEI is defined as an ODD specification Main application: Roma ODD is part of the TEI –The ODD language contains a set of additional elements dedicated to schema specification TD - Documentation elements ODD beyond the TEI –ODD can be used by any community to specify their own schema

A journey in five steps 1.Easy TEI Simple access to the TEI through Roma Subsetting the TEI Making the TEI even easier to use Enlarging the application profile Using modules Modifying the TEI objects First insights into extensibility Behind the scenes - ODD Starting to use the actual specification language

Step 1 Easy TEI

The scenario Quick and simple access to the TEI environment –I have seen my colleague next door doing some encoding with the TEI and I want to do the same at once I have just downloaded Oxygen…

Main action points Go to Roma – Toy with user profile [Customize] Overview of interface tabs –Without going into them properly Generate Schema [Schema] Try it out with Oxygen –Create a simple document –Get back to Roma to have a basic documentation E.g. the never ending

Discussion points But that’s already too much! –The TEI as a market place (LB=“framework”) See Step 2 The things I actually want are missing! –Are you sure? –Identify what’s missing Already defined components in the TEI structure - See Step 3 (“Modules”) Missing objects - See Step 4 (“Subsetting”) Uncovered textual genres (Cuneiform tablets encoding) - See Step 5 (“Odd as a specification language”)

Discussion points (cont.) Doing this does not necessarily means conformance –Schemas (a fortiori DTDs) only provide a syntactic validation of your document –Understanding the meaning of tags Cf. tag misuse or abuse a priori –Several solutions for the same problem Numbered vs. unnumbered s

Lessons to be learnt The TEI is XML-based Getting started with the TEI is easy Full documentation comes with it

Step 2 Subsetting the TEI

The scenario Defining a project oriented TEI Lite –I just want a set of useful tags to toy with my text Understanding the basics of the TEI architecture –Modules –Main organisation of a TEI text –Getting things documented

Main action points Go to Roma… –Look at [Modules] Default modules + additional modules Explore default modules by pointing to main elements (in order of pedagogic interest) –textstructure: TEI - text - body - div –core: p - q - list - pb - head –header –Start checking out elements Preserving the skeleton (TEI/text/body) Making editorial choices (numbered vs. unnumbered divs)

Main action points (cont.) Check the result in XML –Look at the resulting schema –Start editing a document Back to Roma –Generate documentation Customizing everything at once!

Discussion points Introducing the concept of production vs. exploration schema –Limit the elements to those that the encoder may use –But how can I reuse what I have just done on Roma? Conformance (cont.): First risk of divergence across applications –Two application profiles are not necessarily compatible with one another –Still, they share the same semantics for the same element

Lessons to be learnt The TEI is not a monolithic environment –Very few things are really mandatory The TEI is more than just a market place –Basic document structure must be preserved The TEI is based on a powerful environment for working with elements and producing documentation

Step 3 Enlarging the application profile

The scenario Completing the default module selection –It was nice to encode my text, but I would like to create a dictionary… Understanding the magic behind the scene –Classes: to connect modules to one another –ODD: to save a user’s profile

Main action points Go to Roma… –[Modules] Add dictionary modules Test the result on Oxygen –Create a first dictionary entry Back to Roma –[Save customization] –Look at resulting file First ODD declarations –Reload the file under Roma Carry on work…

Discussion points But the structure I get is not exactly what I want –See next Step… Conformance (cont.) –Not all TEI application use the same set of modules –Extra subsetting is still possible (¬ )

Lessons to be learnt The TEI offers a very easy way to build a complex application The TEI is based on a specification language - ODD

Step 4 Modifying the TEI objects

The scenario Tuning a TEI profile –I want to describe my own view on what a dictionary entry is Adding an element Restricting the content model of (m + f) Getting acquainted with classes

Main action points Go to Roma –Start with a default TEI + dictionary structure –Explain what classes are (see next slides) –observe Classes and content model –[Add elements] create Test result on Oxygen [Save customization] Look at resulting file

Main action points (cont.) Go to Roma –Change content model of with Check result on Oxygen [Save customization] Look at resulting file

Classes in the TEI Groups together all elements with the same role in the TEI architecture –Same syntactic behaviour The elements in the class will appear in the same content models –Semantic similarity The class defines a group of elements belonging to the same family of concepts Principle: –elements declare themselves as belonging to a class

A simple example: model.gramPart Grammatical information in a dictionary entry –E.g.: luire verb intransitive –Rather homogeneous set of elements,,,, etc. –May also appear in

The decoupling principle model.gramPart Global behavior Local declaration

Discussion points How far can I go in making change to the TEI –See next step Conformance –How can I be conformant to the TEI when I actually change the specifications? Adding objects and documenting them Subsetting values

Lessons to be learnt The TEI allows you to tune an application profile and generate its documentation

Step 5 Behind the scene — ODD

The scenario Going beyond Roma –Should I really go through the interface to create my modifications Editing ODD files

Main action points Take a previous configuration file (step 4) Toy with the various possibilities under Oxygen –Delete elements manually –Restrict the value of another element Go back to Roma –Present the documentation elements module –Create an application profile with TD Validate (!) the configuration file

Discussion points What is a schema –RelagNG vs. ODD specification Can I do more than that –Describing a schema from scratch Conformance –Am I still conformant I just do not care about existing TEI elements?

Lesson to be learnt Feeling the urge to know and do more…

Conclusion Roma contains all the necessary entry points into the TEI principles What remains to be done –More homogeneous documentation with P5 –Graphical presentation of element dependences –Better integration of RelaxNg