Beyond HTML: Extensible Markup Language

Slides:



Advertisements
Similar presentations
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Advertisements

XHTML Basics.
1 © Netskills Quality Internet Training, University of Newcastle XML.
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML
Tutorial 1 Getting Started with HTML5
Extensible Markup Language XML MIS 520 – Database Theory Fall 2001 (Day) Lecture 14.
XML A brief introduction ---by Yongzhu Li. XML --- a brief introduction 2 CSI668 Topics in System Architecture SUNY Albany Computer Science Department.
XML Introduction What is XML –XML is the eXtensible Markup Language –Became a W3C Recommendation in 1998 –Tag-based syntax, like HTML –You get to make.
Developing a Basic Web Page with HTML
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
XML – Extensible Markup Language Sivakumar Kuttuva & Janusz Zalewski.
Pemrograman Berbasis WEB XML part 2 -Aurelio Rahmadian- Sumber: w3cschools.com.
Creating a Basic Web Page
Chapter 6 Text and Multimedia Languages and Properties
University of Illinois at Urbana-Champaign OAI Alpha Experiences Timothy W. Cole Thomas G. Habing Grainger Engineering.
Scientific Markup Languages Birds of a Feather A 10-Minute Introduction to XML Timothy W. Cole Mathematics Librarian & Professor of.
CREATED BY ChanoknanChinnanon PanissaraUsanachote
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
Introduction to XML Eugenia Fernandez IUPUI. What is XML? From the World Wide Web Consortium (W3C) The Extensible Markup Language (XML) is the universal.
Chapter 1 Understanding the Web Design Environment Principles of Web Design, 4 th Edition.
Week 1 Understanding the Web Design Environment. 1-2 HTML: Then and Now HTML is an application of the Standard Generalized Markup Language Intended to.
1 © Netskills Quality Internet Training, University of Newcastle Introducing XML © Netskills, Quality Internet Training University.
XML - Why: The HTML-Dilemma HTML, SGML, XML - How: Syntax, Concept, Language Elements Basics Well-formed XML-Documents (without DTD) Valid XML-Documents.
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
Intro. to XML & XML DB Bun Yue Professor, CS/CIS UHCL.
A Basic Web Page. Chapter 2 Objectives HTML tags and elements Create a simple Web Page XHTML Line breaks and Paragraph divisions Basic HTML elements.
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
1 Chapter 10: XML What is XML What is XML Basic Components of XML Basic Components of XML XPath XPath XQuery XQuery.
XP Tutorial 9 1 Working with XHTML. XP SGML 2 Standard Generalized Markup Language (SGML) A standard for specifying markup languages. Large, complex standard.
XML 2nd EDITION Tutorial 1 Creating An Xml Document.
CP3024 Lecture 9 XML: Extensible Markup Language.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation An Introduction to XML.
Introduction to XML This presentation covers introductory features of XML. What XML is and what it is not? What does it do? Put different related technologies.
XP 1 Creating an XML Document Developing an XML Document for the Jazz Warehouse XML Tutorial.
1 Introduction to XML XML stands for Extensible Markup Language. Because it is extensible, XML has been used to create a wide variety of different markup.
ISO/TC 211 WG4 WI 18 Encoding Foil no. 1 Annex C XML and XMI David Skogan SINTEF Telecom and Informatics
1 Tutorial 11 Creating an XML Document Developing a Document for a Cooking Web Site.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 1COMP9321, 15s2, Week.
Web Technologies Lecture 4 XML and XHTML. XML Extensible Markup Language Set of rules for encoding a document in a format readable – By humans, and –
Basic HTML Document Structure. Slide 2 Goals (XHTML HTML5) XHTML Separate document structure and content from document formatting HTML 5 Create a formal.
Module Road Map Assignment Road Map Notice we have linked the conduit directly to the presentation layer. This is normally a bad idea!
Tutorial 9 Working with XHTML. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Describe the history and theory of XHTML.
Introduction to XML Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
VCE IT Theory Slideshows by Mark Kelly study design By Mark Kelly, vceit.com, Begin.
XML Extensible Markup Language
Web Design Principles 5 th Edition Chapter 3 Writing HTML for the Modern Web.
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML.
1 Introduction to XML Babak Esfandiari. 2 What is XML? introduced by W3C in 98 Stands for eXtensible Markup Language it is more general than HTML, but.
Beyond HTML: Extensible Markup Language (XML)
Kynn Bartlett 11 April 2001 STC San Diego The HTML Writers Guild Copyright © 2001 XML, XHTML, XSLT, and other X-named specifications.
Unit 4 Representing Web Data: XML
XML Related Technologies
XML QUESTIONS AND ANSWERS
XHTML Basics.
Markup Languages Gilok Choi 9/17/2018
Web Programming– UFCFB Lecture 9
Chapter 7 Representing Web Data: XML
XHTML Basics.
Roy Tennant California Digital Library
XHTML Basics.
Creating an XML Document
Introducing HTML & XHTML:
XML Introduction By Hongming Yu Feb 6th, 2002.
XHTML Basics.
XHTML Basics.
Web Programming– UFCFB Lecture 9
CSE591: Data Mining by H. Liu
Allyson Falkner Spokane County ISD
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML
Presentation transcript:

Beyond HTML: Extensible Markup Language Timothy W. Cole Grainger Engineering Library Information Center University of Illinois at Urbana-Champaign American Association of Law Libraries 19 July 2000 t-cole3@uiuc.edu http://dli.grainger.uiuc.edu/Publications/TWCole/AALL_2000/

Ordered Hierarchy of Content Objects A Definition of Text in Computer Terms Premise: A Text is the Sum of its Components So a <BOOK> Could Be Defined as Containing: <FRONT_MATTER> <CHAPTER>s <BACK_MATTER> <FRONT_MATTER> Could Contain: <BOOK_TITLE> <AUTHOR>s <PUBLISHER> While Each <CHAPTER> Could Contain: <CHAPTER_TITLE> <SECTION>s And Each <SECTION> Could Contain: <SECTION_TITLE> <PARAGRAPH>s Components Chosen Reflect Anticipated Use

Ordered Hierarchy of Content Objects (continued) OHCO is a Useful, Albeit Imperfect Model More Powerful Than Model of Text as a Stream of Characters & Formatting Instructions Does Not Allow for Overlapping Content Objects OHCO Model is Inherent in XML, HTML XML Designed for Descriptive Content Objects, Not Presentational Content Objects XML Syntax is Fixed, But Semantics is Extensible

XML Basics: Markup & Content Consider: Would Display As: <?xml version='1.0' ?> Colè, Tim <!-- This is an Example --> <author sequence='first'> <LName> Colè </LName>, <FName> Tim </FName> </author> This example illustrates: XML Processing Instructions XML Comments (Ignored by XML Applications) XML Element Markup, Including an Attribute XML Content, Including an Entity

XML Basics (continued) “Well-Formed” XML Rules: XML Element Markup is Case-Sensitive All XML Tags Must Be Closed Hierarchical Nesting; No Overlapping Elements All XML Attribute Values Must Be Quoted Enforces Stricter Syntax than HTML Facilitates Fast, Efficient Parsing Extensible Semantics Provide Flexibility “Well-Formed” More Lightweight Than SGML

Is It Valid Or Well-Formed? When Does It Matter? All Web Browsers Need Is Well-Formed XML Authoring Tools Need To Validate Otherwise Tower of Babel Ensues Indexing Agents & Schema-Specific Rendering Agents May Need To Validate Illustrations: Malformed XML Well-Formed But Invalid XML Valid XML

Library Uses of XML: Using XML for Primary Sources Facilitates Searching Full-Text Searching & Field-Specific Searching More Meaningful Proximity Searching Better Retrieval / Browsing Selective Views / Suppression of Personal Data Re-Ordered & Piecemeal Views Illustration -- Illinois Agronomy Handbook Search Browsing

Library Uses of XML: XML for Metadata & Wrapping Facilitates Interchange, Normalization, ... Simpler than Fixed Fields, Record Headers, Etc. XML Implementations of Metadata Standards, e.g.: RDF, EAD, DC, FGDC, US-MARC Easier Routing / Handling of Specialized Content In Combination with Primary Source XML Automatic Extraction of Metadata From Source Facilitates Authority Control

Library Uses of XML: XML for Document Management Smarter Documents XML Namespaces -- Integrating Multiple XML Schemas (Including XHTML) Rights Management, Technical Requirements,… Facilitates Enhanced Linking Between Docs. Creation of Links From Marked Up Content Easy to Add or Modify Links Over Time XLink & XPointer Promise More Robust Linking Metadata File from Illinois DLIB Testbed Schema Integrates RDF, DC, & Project Design

Components of XML Implementations DTDs & XML Schemas Use Either to: Define Content Models Declare Attributes & Entities DTDs Inherited from SGML DTDs Themselves Not Well-Formed XML Limits on Detail of Content Model Definitions Minimal Data Typing XML Schemas Are Well-Formed XML Data Typing & Better Content Models Supported Not Yet in Widespread Use

Components of XML Implementations Encoding & Entities (Using Characters Not on Your Keyboard) Computers Use 1s and 0s, but Characters form the Basis of Human-Readable Texts Coded Character Sets (CCS) Assign Integer Values to Characters -- ASCII, ISO 8859, Unicode Character Encoding Schemes (CES) Map Those Integers to Bytes -- 7-bit, 8-bit, UTF-8 Bytes Are Then Rendered as Glyphs by Your Computer, Using Font Appropriate to CCS/ CES Font Unavailable Or CCS/CES Misunderstood Results in Incorrect Character(s) on Screen

Components of XML Implementations Encoding & Entities (continued) Common Ways to Deal With This Problem: Select CCS/CES Appropriate to Language Use Default CCS/CES, but Override Default Font Use XML/HTML Named or Numeric Entity HTML Understands Non-Extensible Set of Named Entities XML Understands Numeric Entities Corresponding to Unicode CCS, All Named Entities Must Be Declared in DTD Use Unicode for CCS, UTF-8 for CES - XML Defaults An Illustration in HTML

Components of XML Implementations Presentation - CSS Style Sheets XML Content Objects Have No Style Use Cascading Style Sheets (CSS) Work Like CSS for HTML, Except: Must Be Explicit About Everything No Special Treatment of Class & ID Attributes Attach CCS to XML Using Special XML PI CSS Does Define Formatting CSS DOES NOT Reorganize or Add Content Simple XML-CSS Example; The CSS Used

Components of XML Implementations Transformations - XSLT Style Sheets Some Characteristics of XSLT Style Sheets XSLT Files Are Well-Formed XML XSLT Transform to Another Schema, Or to XHTML XSLT Objects Have Implicit Functionality Attach XSLT To Document Using XML PI XSLT Can Reorganize & Add Content Still Need CSS for Presentation -- CSS Style Sheets Work on the Output of XSLT Processing Supplement XSLT With Script To Manipulate & Modify Actual Content Simple XSLT Example; The XSLT Style Sheet

The State-of-the-Art in XML Tools XML Authoring Add-Ons to Established Word Processors, e.g.: WordPerfect 9 / WordPerfect 2000 Tools With SGML Roots, e.g.: ArborText’s Epic (was Adept) Editor SoftQuad’s XMetaL Editor New XML Tools, e.g.: Vervet Logic’s XMLPro Extensibility’s XML Authority / XML Turbo So Far, There Are Fewer Authoring Tools Customized for Specialized XML Schemas

The State-of-the-Art in XML Tools (continued) XML Presentation Tools: Latest Releases of Netscape Navigator/Mozilla, and Microsoft’s Internet Explorer Support XML -- But Support is Generic, Partial, & Uneven Plug-Ins, Standalones Available / In Work for Advanced XML Schemas (CML, MML, VML,…) XML Database Integration Tools: Add-Ons to Established DBMS Available/In Work Microsoft SQL Server-XML Technology Preview Illustration; With Query & CSS; XML Source File; XML Query Language Specification In Work

Developing XML Applications: The Politics of XML Evolution of XML XML Formalized as W3C Recommendation 2/98 Numerous Ancillary Specs Released & In Work Namespaces, XSLT, XLink/XPointer, XML Signature Numerous Early Implementors (Chemistry, Biology, Multimedia, Metadata) Prerequisites for Community Implementations Identify Target(s) of Opportunity Define Horizontal & Vertical Content Objects Consensus Building & Community Buy-In Test Implementations & Tool Building

Developing XML Applications: The Politics of XML (continued) Status of XML In Legal Community LegalXML Has Identified Targets Begun Process of Defining Content Objects & Building Consensus Progress in Some Areas, e.g.: Court Filing (see also XML Court Interface) Less Visible Progress in Other Workgroups, e.g.: Reference, Public Law, Users Presence (& Vested Interests) of Extensive Non-XML Legal Automation Systems In Place Lessens Motivation

Developing XML Applications: The Politics of XML (continued) Status of XML In Publishing & Libraries Extensive XML Work in Metadata Unfortunately Has Led to Competing Stds. Many Publishers Have Been Using SGML for a Decade or More -- But Only Internally Perceived Tradeoff (probably overrated): Publicly Releasing Primary Sources in XML vs. Control of Product & Marketplace Problems with Early SGML Web Experiments No One Wants to be First But No One Wants to be Last Either

Future Directions Continued Evolution of Standards, Tools Continued Development of Community Implementations -- Selected Disciplines Increased Use of XML Behind the Scenes Carryover from SGML Trends Integration of XML with Databases XML Unlikely to Replace HTML, Other Document Formats, But Will Co-Exist Magnitude of Role in Law Libraries Uncertain, but Likely to Have At Least Some Role