Creating an Electronic Edition of an Original 18th Century Manuscript -- Mémoires de la comtesse de L… Shaoping Moss Monday, Oct. 3, 2005 Research and.

Presentation transcript:

Creating an Electronic Edition of an Original 18th Century Manuscript -- Mémoires de la comtesse de L… Shaoping Moss Monday, Oct. 3, 2005 Research and Instructional Support, LITS Mount Holyoke College French 331, Fall 2005

Today’s Topics: an electronic edition (what it is and why create one); significance of manuscripts; technologies used behind the scenes (markup languages: SGML, XML and HTML; stylesheets: XSLT; TEI guidelines and DTDs); group project; encoding project objectives; text interpretation and markup of the manuscript.

What and Why an Electronic Edition? An electronic edition is a transcription of a text that can: be encoded as an object of study for literary, linguistic, historical, or related purposes; be searched and manipulated by computer programs in many different ways; facilitate and expand access; and facilitate the long-term preservation of the original form of the materials.

Significance of Manuscripts The term 'manuscript' simply means “written by hand.” These works, written by authors, artists, scientists, and others, contain invaluable information not only for studying the genesis, meaning and reception of their work, but also for reconstructing and better understanding the society and mentality of the time in which they lived. In addition, manuscripts throw light on economics, psychology, politics, and the social sciences, as well as the history and philosophy of science.

What Does Encoding a Text Mean? The purpose of encoding a document is to embed intelligence in the text in such a way that a computer program can derive information from it. The information embedded in the text is variously called encoding, markup, or tagging.
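To make "deriving information from markup" concrete, here is a minimal sketch using Python's standard library. The sentence is invented for illustration and is not taken from the manuscript. The element names persName and placeName exist in TEI, but whether this project's DTD uses them is an assumption.

```python
import xml.etree.ElementTree as ET

# Hypothetical encoded sentence (invented, not from the manuscript).
encoded = (
    "<p>In 1785 <persName>Madame de L</persName> left "
    "<placeName>Paris</placeName> for <placeName>Lyon</placeName>.</p>"
)

paragraph = ET.fromstring(encoded)

# Because names are tagged, a program can list them without guessing.
people = [el.text for el in paragraph.iter("persName")]
places = [el.text for el in paragraph.iter("placeName")]

# The plain reading text is still recoverable by dropping the tags.
plain_text = "".join(paragraph.itertext())

print(people)
print(places)
print(plain_text)
```

The same tagged text serves both a reader (the plain sentence) and a program (the lists of people and places), which is the sense in which markup "embeds intelligence" in the text.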

What’s a Document? A document is a set of information presented to the reader in different forms and media: books, web pages, magazines, articles, advertisements. It is also a collection of small elements, such as headings, paragraphs, and quotations. Structure versus format: structure concerns the content of a document; format concerns the way a document looks.

Sample Digital Collections: The Newton Papers Project (the Newton Project aims to create a printed edition of Newton's theological, alchemical and administrative writings and an electronic edition of all his writings, including his correspondence); The Adams Family Papers: An Electronic Archive; Five College Archives & Manuscript Collections, which use XML (EAD) to improve the searching capabilities of archival finding aids.

Markup Languages A markup language addresses the structure of a document and identifies its different components. It is a set of symbols that can be placed in the text of a document to define and label the parts of the document. Markup conveys information to software that allows it to: determine the functions and boundaries of document parts; index the data for searching; render the data (e.g. for screen display or print); and transform the data for some output device (e.g. a voice synthesizer).

Development of Markup Languages SGML -- Standard Generalized Markup Language (‘86): initiated by Charles Goldfarb at IBM in the 1960s and adopted as a standard of the International Organization for Standardization (ISO 8879) in 1986. HTML -- Hypertext Markup Language (‘91): developed by Tim Berners-Lee at CERN, a physics lab near Geneva, Switzerland, and first publicly described in 1991. XML -- eXtensible Markup Language (‘98): a Web standard developed by the World Wide Web Consortium (W3C) and published as a W3C Recommendation in 1998.

SGML and Its Subdivisions SGML is a toolkit for developing specialized markup languages; it is composed of rules for building tag sets. SGML has given birth to a family of markup languages: HTML (an SGML application) and XML (a simplified subset of SGML); CALS for the U.S. Department of Defense; BOEING for commercial airlines; C-H for publishing; OED for the Oxford English Dictionary; TEI guidelines for the Text Encoding Initiative; EAD for Encoded Archival Description.

HTML: Good v. Bad Good: its simplicity contributed to the rapid growth of the World Wide Web in the 1990s; XHTML 1.0 is the latest HTML standard. Bad: HTML is easy to write loosely, and loosely written pages are harder for browsers to handle; tags are predefined in HTML; format and content are mixed, so content is hard to reuse, e.g. My First XML Introduction to XML What is HTML?….

What is XML? XML stands for eXtensible Markup Language. XML was designed to describe data. XML tags are not predefined; you define your own tags when using XML. XML keeps format separate from content and semantic structure, e.g. What is XML? Introduction to XML. Data encoded in XML can function much like a traditional database. XML content can be output in many formats, such as XHTML, text, Word documents, PDF, etc.
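As a rough illustration of user-defined tags, database-like querying, and multiple output formats, here is a minimal sketch using Python's standard library. The element names (library, book, title, author, year) and the authors and years are made up for this example. Only the two titles echo the slide.

```python
import xml.etree.ElementTree as ET
from html import escape

# Hypothetical catalog. The tag names are invented here, which is the
# point: XML lets you define the tags that fit your data.
catalog = """
<library>
  <book><title>Introduction to XML</title><author>A. Writer</author><year>2001</year></book>
  <book><title>What is HTML?</title><author>B. Author</author><year>1997</year></book>
</library>
"""
root = ET.fromstring(catalog)
books = list(root.iter("book"))

# Database-like query: titles of books published after 2000.
recent = [b.findtext("title") for b in books if int(b.findtext("year")) > 2000]

# The same content rendered in two different output formats.
as_text = "\n".join(f"{b.findtext('title')} ({b.findtext('year')})" for b in books)
as_html = "<ul>" + "".join(
    f"<li><em>{escape(b.findtext('title'))}</em></li>" for b in books
) + "</ul>"

print(recent)
print(as_text)
print(as_html)
```

Nothing in the catalog says how a title should look. The italic styling appears only in the HTML output step, so the content can be reused for other formats without change.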

A Sample XML Document Project Cool Guide to XML for Web Designers Teresa A. Martin USA John Wiley and Sons …

Transformation of the XML Document [Figure: XSLT file; Word file]

XSLT XSLT (eXtensible Stylesheet Language Transformations) is a markup language and programming syntax for processing XML data. A stylesheet contains a set of template rules that define what information is taken out of the XML document and how it is structured in the result. XSLT is most often used to: transform XML to HTML for delivery to standard Web clients or wireless devices; transform XML from one structure to another; and convert XML data into other output formats, such as text, Word documents, or PDF.
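The idea of template rules can be sketched in plain Python. This is not XSLT itself: Python's standard library has no XSLT processor, so the dictionary below only imitates how rules matched on element names drive a transformation from XML to HTML. The source document and the rule set are invented for illustration.

```python
import xml.etree.ElementTree as ET
from html import escape

source = "<book><title>Introduction to XML</title><para>XML describes data.</para></book>"

# Each "template rule" maps a source element name to the HTML that replaces it.
# In real XSLT these would be <xsl:template match="..."> elements.
RULES = {
    "book": lambda inner: f"<html><body>{inner}</body></html>",
    "title": lambda inner: f"<h1>{inner}</h1>",
    "para": lambda inner: f"<p>{inner}</p>",
}

def apply_templates(element):
    """Transform one element and, recursively, everything inside it."""
    parts = [escape(element.text or "")]
    for child in element:
        parts.append(apply_templates(child))
        parts.append(escape(child.tail or ""))
    inner = "".join(parts)
    # Elements without a rule pass their content through unchanged,
    # loosely mirroring XSLT's built-in rules.
    rule = RULES.get(element.tag, lambda content: content)
    return rule(inner)

result = apply_templates(ET.fromstring(source))
print(result)
```

Swapping in a different rule set would produce a different result document from the same source, which is the separation of content and presentation that a stylesheet provides.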

XSLT Transformation [Figure: a source document and an XSLT stylesheet feed the XSLT transformation, which produces a result document]

Markup Languages in Libraries: EAD (Encoded Archival Description); Dublin Core metadata; MARC XML (MARC 21 XML Schema); MODS XML (Metadata Object Description Schema).

Markup Languages in Academics: TEI (guidelines and DTDs); Bioinformatic Sequence Markup Language (BSML); Mathematical Markup Language (MathML).

What is TEI? Launched in 1987, the Text Encoding Initiative (TEI) maintains an international and interdisciplinary standard for encoding, storing and analyzing the textual content and structure of digital texts. The standard is designed for use with a broad range of text types. It is now widely used by libraries, archives, publishers and researchers for online research and teaching, and for the storage and exchange of large and small text collections.

TEI Guidelines The TEI encoding system was built upon Standard Generalized Markup Language (SGML) and later shifted to XML. The system is described in the TEI guidelines. It is modular and flexible, with basic modules for prose, poetry, drama, speech, lexicography and terminology. These modules can be combined in various ways to suit a great number of text-encoding purposes.

TEI Lite TEI Lite is a simplified ‘starter set’ of TEI elements, defined in a simple DTD. It includes most of the core tags, the basic structural components, and an adequate set of header elements. It is a good starting point for simple encoding projects and has proved very popular, serving about 85% of its users’ needs.

DTD -- Document Type Definition A DTD is a computer-readable text file that defines a markup language for a particular type of document, such as a poem, a novel, or an archival finding aid. Its purpose is to define the document structure with a list of legal elements: a root element, parent and child elements, and where data can be placed. It lays out the logical structure of the data and establishes rules about which elements a document may have, which are required, which can repeat, and so on. A DTD can be declared inline in your XML document or as an external reference.
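The sketch below shows what an inline DTD looks like and what one of its rules means. The poem DTD is hypothetical. Python's standard XML parser reads the DTD but does not validate against it, so the check function here is a hand-written stand-in for what a validating parser would enforce.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical document with an inline DTD. The rule for <poem> says:
# exactly one <title>, followed by one or more <line> elements.
document = """<!DOCTYPE poem [
  <!ELEMENT poem (title, line+)>
  <!ELEMENT title (#PCDATA)>
  <!ELEMENT line (#PCDATA)>
]>
<poem>
  <title>Untitled</title>
  <line>First line of verse</line>
  <line>Second line of verse</line>
</poem>
"""

def follows_poem_rule(poem):
    """Check the content model (title, line+) by hand."""
    child_names = " ".join(child.tag for child in poem)
    return re.fullmatch(r"title( line)+", child_names) is not None

# The standard-library parser accepts the DTD but does not enforce it.
valid = follows_poem_rule(ET.fromstring(document))

# A poem with no lines breaks the "line+" (one or more) rule.
invalid = follows_poem_rule(ET.fromstring("<poem><title>Untitled</title></poem>"))

print(valid, invalid)
```

In practice the DTD would usually live in an external file referenced from the DOCTYPE declaration, and a validating XML editor or parser would report rule violations automatically.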

TEI Document Format All TEI documents follow the same essential format: a TEI header, which documents the bibliographic information about the electronic edition being created, and a TEI body, which contains the content of the edition.
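A minimal skeleton can illustrate the header-plus-body layout. It assumes the TEI P4 / TEI Lite conventions current in 2005 (root element TEI.2, a teiHeader with a fileDesc, and a body inside text). The statements inside the header are hypothetical placeholders, not the project's actual header.

```python
import xml.etree.ElementTree as ET

# Skeleton in the style of TEI P4 / TEI Lite. The placeholder
# paragraphs are invented; only the title comes from the project.
tei = """
<TEI.2>
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Mémoires de la comtesse de L…</title></titleStmt>
      <publicationStmt><p>Hypothetical publication statement.</p></publicationStmt>
      <sourceDesc><p>Hypothetical description of the manuscript source.</p></sourceDesc>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <p>Transcribed text of the manuscript goes here.</p>
    </body>
  </text>
</TEI.2>
"""
root = ET.fromstring(tei)

# The two top-level parts: bibliographic header and the text itself.
top_level = [child.tag for child in root]
header_title = root.findtext("teiHeader/fileDesc/titleStmt/title")
body_paragraphs = [p.text for p in root.findall("text/body/p")]

print(top_level)
print(header_title)
print(body_paragraphs)
```

Keeping the bibliographic description in the header means catalog and search tools can read it without touching the transcription in the body.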

Relationships in a TEI Document [Figure: TEI element tree annotated with parent, sibling, and ancestor relationships]
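The parent, sibling, and ancestor relationships can be explored with a short Python sketch. The fragment below is a hypothetical example using common TEI element names (text, body, div, head, p). It is not a reconstruction of the diagram on the slide.

```python
import xml.etree.ElementTree as ET

doc = ET.fromstring(
    "<text><body><div><head>Chapter</head><p>First.</p><p>Second.</p></div></body></text>"
)

# ElementTree elements do not store their parent, so build a child -> parent map.
parent_of = {child: parent for parent in doc.iter() for child in parent}

def ancestors(element):
    """Return the tags of all ancestors, nearest first."""
    tags = []
    while element in parent_of:
        element = parent_of[element]
        tags.append(element.tag)
    return tags

head = doc.find("body/div/head")
parent_tag = parent_of[head].tag                                  # direct parent
sibling_tags = [el.tag for el in parent_of[head] if el is not head]  # same parent
ancestor_tags = ancestors(head)                                   # parent, grandparent, ...

print(parent_tag, sibling_tags, ancestor_tags)
```

Here div is the parent of head, the two p elements are its siblings, and body and text are ancestors further up the tree.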

The Encoding Example A sample TEI markup for Mémoires de la comtesse de L…

TEI Encoding Examples: a sample markup for Mémoires de la comtesse de L…; Letter: Centre Harbor, N.H., from John Greenleaf Whittier to Lucy Larcom, 11 Aug. [between 1884 and 1892]; Letter from Emma P. Carr to Professor Victor Henri, [July 5, 2004]

Encoding Project Objectives Encoding Mémoires de la comtesse de L… is an act of analysis and interpretation. It presents intellectual challenges that bring us closer to the text and thus help us better understand the author's work, life, and social environment.

Group Project: Let’s have Fun!! October 3: overview of XML/TEI technology; hands-on encoding exercises (themes, personal and place names). November 7: demo of encoding images by Shaoping in class. November 28: demo of encoding translations of selected words by Shaoping in class. December 5 and 14: class demo of each group project. (Note: students will need to make an appointment with Alexandra for help with encoding problems.)

Contact Info Shaoping Moss Information Technology Consultant Research and Instructional Support Mount Holyoke College Phone: (413) Alexandra Balan Tech Mentor