j.mp/RU_ABOUT WORKSHOPS SCHEDULE CONTACT US

Slides:



Advertisements
Similar presentations
XML-XSL Introduction SHIJU RAJAN SHIJU RAJAN Outline Brief Overview Brief Overview What is XML? What is XML? Well Formed XML Well Formed XML Tag Name.
Advertisements

DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
What is XML? a meta language that allows you to create and format your own document markups a method for putting structured data into a text file; these.
XML Craig Stewart Dr. Alexandra I. Cristea
XHTML Basics.
SPECIAL TOPIC XML. Introducing XML XML (eXtensible Markup Language) ◦A language used to create structured documents XML vs HTML ◦XML is designed to transport.
Developing Interfaces and Interactivity for DSpace with Manakin Part 5: Introduction to Manakin’s Theme Tier Eric Luhrs Digital Initiatives Librarian,
Extensible Markup Language Natawut Nupairoj, Ph.D. Department of Computer Engineering Chulalongkorn University.
IS 373—Web Standards Todd Will
F DIGITAL MEDIA: COMMUNICATION AND DESIGN INTRODUCTION TO XML AND XHTML.
XML Introduction What is XML –XML is the eXtensible Markup Language –Became a W3C Recommendation in 1998 –Tag-based syntax, like HTML –You get to make.
XML: What, Why, When & How? Hope Greenberg Center for Teaching & Learning June 11 & 18.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
XML introduction to Ahmed I. Deeb Dr. Anwar Mousa  presenter  instructor University Of Palestine-2009.
Introduction to XML cs3505. References –I got most of this presentation from this site –O’reilly tutorials.
Why XML ? Problems with HTML HTML design - HTML is intended for presentation of information as Web pages. - HTML contains a fixed set of markup tags. This.
CREATED BY ChanoknanChinnanon PanissaraUsanachote
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
1 © Netskills Quality Internet Training, University of Newcastle Introducing XML © Netskills, Quality Internet Training University.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
CS134 Web Design & Development Creating a Basic Web Page Mehmud Abliz.
XML 1 Enterprise Applications CE00465-M XML. 2 Enterprise Applications CE00465-M XML Overview Extensible Mark-up Language (XML) is a meta-language that.
 XML is designed to describe data and to focus on what data is. HTML is designed to display data and to focus on how data looks.  XML is created to structure,
Session IV Chapter 9 – XML Schemas
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
JSTL, XML and XSLT An introduction to JSP Standard Tag Library and XML/XSLT transformation for Web layout.
E0262 – MIS – Multimedia Storage Techniques XML (Extensible Markup Language  XML is a markup language for creating documents containing structured information.
XML Documents Chao-Hsien Chu, Ph.D. School of Information Sciences and Technology The Pennsylvania State University Elements Attributes Comments PI Document.
1 Introduction  Extensible Markup Language (XML) –Uses tags to describe the structure of a document –Simplifies the process of sharing information –Extensible.
Digital Media Technology Week 5: XML and Presentation Peter Verhaar.
1 Introduction to XML XML stands for Extensible Markup Language. Because it is extensible, XML has been used to create a wide variety of different markup.
1 Credits Prepared by: Rajendra P. Srivastava Ernst & Young Professor University of Kansas Sponsored by: Ernst & Young, LLP (August 2005) XBRL Module Part.
The eXtensible Markup Language (XML). Presentation Outline Part 1: The basics of creating an XML document Part 2: Developing constraints for a well formed.
What it is and how it works
XML for Text Markup An introduction to XML markup.
An Introduction to XML Paul Donohue May 8th 2002 Hotel Senator Zürich.
XML Introduction. Markup Language A markup language must specify What markup is allowed What markup is required How markup is to be distinguished from.
XML Steve Fisher/RAL. 20 October 2000XML - Steve Fisher/RAL2 Warning Information may not be all completely up to date.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Web Technologies Lecture 4 XML and XHTML. XML Extensible Markup Language Set of rules for encoding a document in a format readable – By humans, and –
XML CSC1310 Fall HTML (TIM BERNERS-LEE) HyperText Markup Language  HTML (HyperText Markup Language): December  Markup  Markup is a symbol.
HTML A brief introduction HTML1. HTML, what is? HTML is a markup language for describing web documents (web pages). HTML stands for Hyper Text Markup.
Basic HTML Document Structure. Slide 2 Goals (XHTML HTML5) XHTML Separate document structure and content from document formatting HTML 5 Create a formal.
C Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Introduction to XML Standards.
VCE IT Theory Slideshows by Mark Kelly study design By Mark Kelly, vceit.com, Begin.
Jackson, Web Technologies: A Computer Science Perspective, © 2007 Prentice-Hall, Inc. All rights reserved Chapter 7 Representing Web Data:
XML Introduction to XML Extensible Markup Language.
XML Notes taken from w3schools. What is XML? XML stands for EXtensible Markup Language. XML was designed to store and transport data. XML was designed.
Leiden University. The university to discover. Feedback 1: General You are part of a research project: Follow Project Guidelines minutely: that means 7.
1 Extensible Stylesheet Language (XSL) Extensible Stylesheet Language (XSL)
Blended HTML and CSS Fundamentals 3 rd EDITION Tutorial 1 Using HTML to Create Web Pages.
Spring 2013 Markup – Validate – Transform Introduction to Digital Text and XML Rice University, April.
1. Intro and XML Rules Spring Summer 2010 Marcus Bingenheimer TEI Workshop.
XML BASICS and more…. What is XML? In common:  XML is a standard, simple, self-describing way of encoding both text and data so that content can be processed.
XML Introduction Bill Jerome.
Unit 4 Representing Web Data: XML
Do More with Digital Scholarship: XML-Based Metadata
Extensible Markup Language XML
Do More with Digital Scholarship: XML-Based Metadata
Demystifying Digital Scholarship 10: TEI
Manuscript Markup with TEI
XML in Web Technologies
Extensible Markup Language XML
Basic HTML Document Structure
Chapter 7 Representing Web Data: XML
Roy Tennant California Digital Library
Introducing HTML & XHTML:
XML Data Introduction, Well-formed XML.
Review of XML IST 421 Spring 2004 Lecture 5.
Unit 6 - XML Transformations
Presentation transcript:

j.mp/RU_ABOUT patrick.pu@nus.edu.sg WORKSHOPS SCHEDULE CONTACT US

William Chong D’Linkup Pte. Ltd. Data, for everyone Workshop: Encoding text for visualization and analyses using the TEI standard William Chong D’Linkup Pte. Ltd. Data, for everyone

Principles WORK SHOP D’Linkup Pte. Ltd. Data, for everyone

D’Linkup Pte. Ltd. Data, for everyone

Attack plan Hands-on Overview XML TEI Content Expertise

Attack plan Hands-on XML TEI Overview Content Expertise

Annotation Meaning Encoding Markup

Metadata Content Textual Structure and Layout What can be encoded? Metadata Content Textual Structure and Layout D’Linkup Pte. Ltd. Data, for everyone

Many elements Formal method of encoding Language eXtensible Markup Language

The need for a common language <person>Tan Quee Lan</person> is a seamstress. Lim Teck is <name>Tan Quee Lan</name>’s husband. Text Encoding Initiative – 1987 P5 Standards D’Linkup Pte. Ltd. Data, for everyone

Examples of TEI https://www.hathitrust.org/ https://books.google.com/ngrams http://buddhistinformatics.ddbc.edu.tw/biographies/ D’Linkup Pte. Ltd. Data, for everyone

HOW to apply tei http://www.tei-c.org/Guidelines/P5/ D’Linkup Pte. Ltd. Data, for everyone

Metadata: Header D’Linkup Pte. Ltd. <teiHeader>  <fileDesc>   <titleStmt>    <title>Shakespeare: the first folio (1623) in electronic form</title>    <author>Shakespeare, William (1564-1616)</author>   </titleStmt>   <publicationStmt>    <distributor>Oxford Text Archive</distributor>    <address>     <addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine>    </address>    <idno type="OTA">119</idno>    <availability>     <p>Freely available on a non-commercial basis.</p>    </availability>    <date when="1968">1968</date>   </publicationStmt>   <sourceDesc>    <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,        1968)</bibl>   </sourceDesc>  </fileDesc> D’Linkup Pte. Ltd. Data, for everyone

STructure <div> VISION & MISSION </div> <p> Vision A premier knowledge hub promoting the University’s vision as a leading global university centred in Asia </s> NUS Libraries aspires to be the NUS communities’ first stop when seeking information NUS Libraries provides for the information needs of diverse communities in NUS  in support of learning and research NUS Libraries equips the NUS communities with information seeking skills to enhance learning, research and scholarly communication  </p> Mission To actively engage and partner the NUS community in advancing scholarship and research through innovative library services </p>  </div> D’Linkup Pte. Ltd. Data, for everyone

Content <div> </div> <p> Recently, I visited the <org>Design Centric Programme</org> (DCP) in Engineering. One of the student projects I saw was called Snowstorm. As you can see, it is a personal flying machine designed for indoor spaces. Snowstorm received extensive media attention. Our student team1 , led by <persName>Shawn Sim</persName>, and their faculty mentors, were invited to the <event>Founders Forum</event> in the <country>UK</country>, where they had the chance to explain their flying machine to <persName>Prince William</persName>. </p> </div> D’Linkup Pte. Ltd. Data, for everyone

Well-formed Valid D’Linkup Pte. Ltd. Data, for everyone

Attack plan Hands-on Overview XML TEI Content Expertise

extensible mArkup language D’Linkup Pte. Ltd. Data, for everyone

Optional rule 1: Declaration <?xml version="1.0" encoding="UTF-8" standalone="yes"?> <?xml version="1.0" encoding="UTF-8" standalone="yes"?> D’Linkup Pte. Ltd. Data, for everyone

<name>...</name> <date>...</date> rule 2: End-Tags (Start of highlight) <name>...</name> <date>...</date> <place>...</place> <name>...</name> <date>...</date> <place>...</place> (End of highlight) <lb/> D’Linkup Pte. Ltd. Data, for everyone <pb/>

rule 3: root element <?xml version="1.0"?> <TEI> <div> <p> <s>.....<name>...</name>...</s> </p> </div> <div>.....<date>...</date>...</div> <div>.....<place>...</place>...</div> </TEI>

rule 4: Nesting  <p>...<name>...</name>...</p>  <p>...<name>...</p>...</name> D’Linkup Pte. Ltd. Data, for everyone

rule 5: XML Names XML is case-sensitive Element names must start with an alphabet or the “_” May contain only alphanumeric characters (letters and digits) and “_” “-” “.” The colon “:” is reserved for namespaces

Practice Open Notepad Save file as “test.xml” 3. Open Google Chrome 4. Drag “test.xml” into Google Chrome D’Linkup Pte. Ltd. Data, for everyone

Practice 5. Input xml declaration: <?xml version="1.0"?> 6. On the next line, write a sentence: My name is _______(your name), and I was born on _______(a date) at _______(a place). 7. <TEI> tag 8. <p> tag 9. <name>,<date>,<place> tags 10. case-sensitivity test 11. wrong nesting test D’Linkup Pte. Ltd. Data, for everyone

Well-formed D’Linkup Pte. Ltd. Data, for everyone

Have a break, have a Kit Kat

Attack plan Hands-on Overview XML TEI Content Expertise TEI

T ext E ncoding I nitiative D’Linkup Pte. Ltd. Data, for everyone

Metadata Content Textual Structure and Layout <teiHeader>Metadata</teiHeader> D’Linkup Pte. Ltd. Data, for everyone

Metadata: Header D’Linkup Pte. Ltd. <teiHeader>  <fileDesc>   <titleStmt>    <title>Shakespeare: the first folio (1623) in electronic form</title>    <author>Shakespeare, William (1564-1616)</author>   </titleStmt>   <publicationStmt>    <distributor>Oxford Text Archive</distributor>    <address>     <addrLine>13 Banbury Road, Oxford OX2 6NN, UK</addrLine>    </address>    <idno type="OTA">119</idno>    <availability>     <p>Freely available on a non-commercial basis.</p>    </availability>    <date when="1968">1968</date>   </publicationStmt>   <sourceDesc>    <bibl>The first folio of Shakespeare, prepared by Charlton Hinman (The Norton Facsimile,        1968)</bibl>   </sourceDesc>  </fileDesc> D’Linkup Pte. Ltd. Data, for everyone

four main components <fileDesc> <encodingDesc> <profileDesc> <revisionDesc> <fileDesc> D’Linkup Pte. Ltd. Data, for everyone

Using tei website http://www.tei-c.org/Guidelines/P5/ D’Linkup Pte. Ltd. Data, for everyone

oxygenxml “Request a Trial License” OR https://www.oxygenxml.com/download_thanks.html?url=http ://mirror.oxygenxml.com/InstData/Editor/Windows64/VM/oxy gen-64bit.exe#get_trial D’Linkup Pte. Ltd. Data, for everyone

New Document > Choose a file template > TEI P5 > All Open “One Hundred Years’ History of the Chinese in Singapore: An annotated edition” D’Linkup Pte. Ltd. Data, for everyone

<teiHeader> <fileDesc> <titleStmt> <title>Title</title> </titleStmt> <publicationStmt> <p>Publication Information</p> </publicationStmt> <sourceDesc> <p>Information about the source</p> </sourceDesc> </fileDesc> </teiHeader> <teiHeader> <fileDesc> <titleStmt> <title>Title</title> </titleStmt> <publicationStmt> <p>Publication Information</p> </publicationStmt> <sourceDesc> <p>Information about the source</p> </sourceDesc> </fileDesc> </teiHeader> Add: Author Publisher Check: Well-formedness Valid D’Linkup Pte. Ltd. Data, for everyone

Valid D’Linkup Pte. Ltd. Data, for everyone

Metadata Content <div><p>Textual Structure and Layout</div></p> D’Linkup Pte. Ltd. Data, for everyone

Structural markups <div> VISION & MISSION </div> <p> A premier knowledge hub promoting the University’s vision as a leading global university centred in Asia </s> NUS Libraries aspires to be the NUS communities’ first stop when seeking information NUS Libraries provides for the information needs of diverse communities in NUS  in support of learning and research NUS Libraries equips the NUS communities with information seeking skills to enhance learning, research and scholarly communication  </p> Mission To actively engage and partner the NUS community in advancing scholarship and research through innovative library services </p>  </div> D’Linkup Pte. Ltd. Data, for everyone

WORKING at oxygen <text> <body> <p>Some text here.</p> </body> </text> Add: Front Back Page breaks (pb) Line breaks (lb) Quotations (quote) p50 Footnotes (note) D’Linkup Pte. Ltd. Data, for everyone

quick structural marking using excel and word Manually indicate paragraphs(<p>) Sentences(<s>) Excel IF & Word Replace D’Linkup Pte. Ltd. Data, for everyone

Metadata <name>Content</name> Textual Structure and Layout D’Linkup Pte. Ltd. Data, for everyone

Metadata Content Textual Structure and Layout Person Place Date Metadata Content Textual Structure and Layout D’Linkup Pte. Ltd. Data, for everyone

Persons and names <persName>Raffles</persName> “Major Farquhar” or “Farquhar”?  “encodingDesc” xml:id listPerson D’Linkup Pte. Ltd. Data, for everyone

Collaborative Work https://docs.google.com/document/d/1Pd1a7eWBx49K72wgdn66p3f2 hseb4AOW4jwjsk3R4jQ/edit?usp=sharing Share work using “03” file (listPerson) D’Linkup Pte. Ltd. Data, for everyone

Place <country>Singapore</country> <region>Malacca</region> <district>…</district> <settlement>…</settlement> D’Linkup Pte. Ltd. Data, for everyone

Date <date>1999</date> <date when=“1999”>1999</date> <date when-iso="1819-06-11">11th June 1819</date> D’Linkup Pte. Ltd. Data, for everyone

Checking for errors Copy from google docs into OxygenXML Editor Do validation Make corrections See “04” file for completed markups D’Linkup Pte. Ltd. Data, for everyone

Basic visualization See “05” file D’Linkup Pte. Ltd. Data, for everyone

Questions? D’Linkup Pte. Ltd. Data, for everyone

How can TEI/XML be applied to your field? How would you execute it? BRAINSTORMING How can TEI/XML be applied to your field? How would you execute it? D’Linkup Pte. Ltd. Data, for everyone

D’Linkup mission  D’Linkup Pte. Ltd. Data, for everyone

D’Linkup Pte. Ltd. Data, for everyone William Chong business@dlinkup.com

Thank you! D’Linkup Pte. Ltd. Data, for everyone

YOUR FEEDBACK IS GREATLY APPRECIATED j.mp/RU_Feedback

PARSING valid  well-formed not well-formed not valid  (DTD/ Schema) e.g. TEI Parser step 1 Parser step 2 well-formed valid  XML not well-formed not valid 

PDF JScript HTML ePub any XML OUTPUT 輸出 CSS XSLT XQuery XSL-FO XPath TRANSFORM 轉換 <XML Document> ENCODING 標記 Document Model 文件模型: DTD, Relax NG, XML Schema