Tutorial on Standoff Markup as used in: HCRC Map Task Corpus, MATE/NITE Workbench
Amy Isard, HCRC Language Technology Group, University of Edinburgh



Standoff Annotation
- Don’t keep all your data in one big document
- One document for each annotation level (with its own DTD)
- Links between documents

LTG link syntax (1)
- An element can point to one or more contiguous elements in the same or a different document
- Each element is identified by a unique ID
- A link is shown as an attribute on an element
- Default attributes in the DTD tell a program that this is a link

LTG link syntax (2)
Attributes describing a link whose target will be embedded in the original element in the output document:
  href      CDATA  #IMPLIED
  xml:link  CDATA  #FIXED  "simple"
  show      CDATA  #FIXED  "embed"
  actuate   CDATA  #FIXED  "auto"

Standoff Example (1): Words XML
turn right for three centimetres okay

Standoff Example (2): Moves XML
<move type="instruct" speaker="spk1" id="m1" href="words.xml#id(w1)..id(w5)"/>
<move type="align" speaker="spk1" id="m2" href="words.xml#id(w6)"/>
…
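The link syntax in the moves example can be sketched in code. The following is a minimal illustration, not the LTXML implementation: it assumes a words document whose root holds `word` elements with `id` attributes `w1` to `w6` (the element name and the ids of the individual words are assumptions, since the slide's words markup is not preserved), and it resolves each `href` of the form `doc#id(a)..id(b)` to the contiguous run of elements from `a` to `b`.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical words document: element name "word" and the id values are assumptions.
WORDS_XML = """<words>
  <word id="w1">turn</word>
  <word id="w2">right</word>
  <word id="w3">for</word>
  <word id="w4">three</word>
  <word id="w5">centimetres</word>
  <word id="w6">okay</word>
</words>"""

# Moves as shown on the slide, wrapped in an assumed <moves> root element.
MOVES_XML = """<moves>
  <move type="instruct" speaker="spk1" id="m1" href="words.xml#id(w1)..id(w5)"/>
  <move type="align" speaker="spk1" id="m2" href="words.xml#id(w6)"/>
</moves>"""

# doc#id(start) optionally followed by ..id(end)
HREF_RE = re.compile(
    r"^(?P<doc>[^#]*)#id\((?P<start>[^)]+)\)(?:\.\.id\((?P<end>[^)]+)\))?$"
)


def parse_href(href):
    """Split a link into (document name, first id, last id)."""
    match = HREF_RE.match(href)
    if match is None:
        raise ValueError(f"unsupported link syntax: {href!r}")
    start = match.group("start")
    end = match.group("end") or start  # a single id is a range of one
    return match.group("doc"), start, end


def resolve(href, documents):
    """Return the contiguous sibling elements that a link points to."""
    doc_name, start, end = parse_href(href)
    siblings = list(documents[doc_name])  # children of the target root
    ids = [el.get("id") for el in siblings]
    return siblings[ids.index(start): ids.index(end) + 1]


documents = {"words.xml": ET.fromstring(WORDS_XML)}
moves = ET.fromstring(MOVES_XML)
move_text = {
    move.get("id"): [w.text for w in resolve(move.get("href"), documents)]
    for move in moves
}
print(move_text)
```

The sketch only handles targets that are direct children of the root; a real corpus may nest words more deeply.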

Standoff Example (3): Moves and Words XML
turn right for three centimetres okay
<move type="align" speaker="spk1" id="m2" href="words.xml#id(w6)"/>
…

Advantages of Standoff Annotation
- It is possible to have levels of annotation which have crossing branches (not normally possible in XML)
- New levels of annotation can be added without disturbing existing ones
- Editing one level of annotation has minimal knock-on effects on others
- People can work on different levels at the same time without worrying about creating different versions

Example Map Task Annotation Structure
[Figure: annotation levels over a dialogue excerpt between speakers S1 and S2. Levels shown: Words, Disfluencies (reparandum, repair), Dialogue Moves (instruct, ack, align), Dialogue Games (instruct).]

HCRC Map Task XML Corpus Architecture
- Gaze
- Timed Units
- Tokens
- Tagged Words
- Automatic Syntax
- Moves
- Games
- Transactions
- Disfluencies
- Landmark References
- Other Speaker’s Words

Tools and Software
- LTXML tools
- MATE workbench (NITE): mate.nis.sdu.dk (nite.nis.sdu.dk)
- Map Task XML

knit
- Part of the LTXML toolkit
- Allows you to “expand” links according to how they have been defined in the DTD (e.g. replace or embed)
- Command line program, can be used in pipelines

Standoff Example (3): Moves and Words XML
turn right for three centimetres okay
<move type="align" speaker="spk1" id="m2" href="words.xml#id(w6)"/>
…

Standoff Example (4): Moves XML with embed links
turn right for three centimetres okay
…

Standoff Example (4): Moves XML with replace links
turn right for three centimetres okay
…
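The difference between embed and replace links can be illustrated with a small sketch. This is not knit itself and does not read a DTD: it assumes that "embed" places copies of the target elements inside the linking element, and that "replace" substitutes the targets for the linking element. Both readings, the `word` element name, the `<moves>` root and the `expand` helper are assumptions made for illustration.

```python
import copy
import re
import xml.etree.ElementTree as ET

# Hypothetical words document (element name and ids are assumptions).
WORDS = ET.fromstring(
    '<words><word id="w1">turn</word><word id="w2">right</word>'
    '<word id="w3">for</word><word id="w4">three</word>'
    '<word id="w5">centimetres</word><word id="w6">okay</word></words>'
)

MOVES_XML = (
    '<moves>'
    '<move type="instruct" speaker="spk1" id="m1" href="words.xml#id(w1)..id(w5)"/>'
    '<move type="align" speaker="spk1" id="m2" href="words.xml#id(w6)"/>'
    '</moves>'
)

LINK_RE = re.compile(r"^[^#]*#id\(([^)]+)\)(?:\.\.id\(([^)]+)\))?$")


def targets(href, root):
    """Return the contiguous children of root that the link addresses."""
    match = LINK_RE.match(href)
    if match is None:
        raise ValueError(f"unsupported link syntax: {href!r}")
    start, end = match.group(1), match.group(2) or match.group(1)
    kids = list(root)
    ids = [k.get("id") for k in kids]
    return kids[ids.index(start): ids.index(end) + 1]


def expand(moves_xml, words_root, show):
    """Build a new tree with every link expanded in the given style."""
    source = ET.fromstring(moves_xml)
    out = ET.Element(source.tag)
    for link in source:
        found = [copy.deepcopy(t) for t in targets(link.get("href"), words_root)]
        if show == "embed":
            # Keep the linking element and put the targets inside it.
            attrs = {k: v for k, v in link.attrib.items() if k != "href"}
            holder = ET.SubElement(out, link.tag, attrs)
            holder.extend(found)
        elif show == "replace":
            # Drop the linking element and keep only the targets.
            out.extend(found)
        else:
            raise ValueError(f"unknown show value: {show!r}")
    return out


embedded = expand(MOVES_XML, WORDS, "embed")
replaced = expand(MOVES_XML, WORDS, "replace")
print(ET.tostring(embedded, encoding="unicode"))
print(ET.tostring(replaced, encoding="unicode"))
```

Under these assumptions the embedded view keeps the move hierarchy over the words, while the replaced view flattens back to a sequence of words.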

Working with knit
- Use knit on one XML document to work with one hierarchical view of the data
- To work across hierarchies, knit several views and navigate using the structures plus the unique ids of elements

Stylesheets
- Style sheet: template rules
  - pattern which specifies which tree it applies to
  - pattern which specifies which tree it should output
- Stylesheet processor
  - reads XML document and stylesheet
  - carries out the instructions in the stylesheet
  - outputs a new XML document or

Template Matching
- XPath is a language for addressing parts of an XML document, and is used by XSLT in the match attribute of a template
- e.g. matches any sentence element
- A stylesheet processor goes through the XML document matching elements to templates and carries out the instructions in the template
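The match-and-apply cycle described here can be mimicked in a few lines. The following is a toy analogy in Python, not an XSLT processor: templates are plain functions keyed by element name instead of XPath patterns, and the document, the `sentence` and `note` element names, and the `<p>` output are all made up for illustration.

```python
import xml.etree.ElementTree as ET

# Made-up input document for illustration.
DOC = ET.fromstring(
    "<text><sentence>turn right</sentence>"
    "<note>aside</note>"
    "<sentence>okay</sentence></text>"
)


def render_sentence(element, apply):
    """Template for sentence elements: wrap the text in a paragraph."""
    return "<p>" + (element.text or "") + "</p>"


def render_default(element, apply):
    """Fallback rule: produce nothing for the element itself, but keep
    processing its children. Unlike XSLT's built-in rules, text is dropped."""
    return "".join(apply(child) for child in element)


# "match" table: element name -> template function.
TEMPLATES = {"sentence": render_sentence}


def apply_templates(element):
    """Find the template matching this element and carry it out."""
    rule = TEMPLATES.get(element.tag, render_default)
    return rule(element, apply_templates)


result = apply_templates(DOC)
print(result)
```

Adding a rule for another element type only requires a new entry in `TEMPLATES`, which mirrors how a stylesheet grows one template at a time.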

Standard Stylesheet Example

The MATE Workbench
- For display, querying, and especially annotation of XML corpora
- Flexible user-defined user interfaces
- Uses stylesheets to create Java display objects which have defined user interface behaviours
- In MATE internal data representation, elements with link pointers are viewed as parent elements

MATE query language
- Easy to write queries over more than one hierarchy
- In MATE query language you define variables by element type and then relationships between them
- ($a ^ $b) means that element $a is a parent of element $b, either in the same document, or via a link

MATE example query
Find all words which are in a move whose label is “instruct” and which are part of a disfluency:
($w word)($m move)($d disfluency); ($m ^ $w) and ($m label ~ instruct) and ($d ^ $w)
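To make the semantics of the example query concrete, here is a rough sketch that evaluates the same three conditions over toy data. It is not the MATE query engine: the data is invented (in particular the disfluency `d1` and its extent), `~` is treated as simple equality, and `^` is treated as direct parenthood via resolved links.

```python
from itertools import product

# Toy corpus; all ids and the disfluency extent are invented for illustration.
WORDS = ["w1", "w2", "w3", "w4", "w5", "w6"]
MOVES = {
    "m1": {"label": "instruct", "children": ["w1", "w2", "w3", "w4", "w5"]},
    "m2": {"label": "align", "children": ["w6"]},
}
DISFLUENCIES = {
    "d1": {"children": ["w3", "w4"]},
}


def is_parent(parent, child_id):
    """Stand-in for ($a ^ $b): the child is reached directly or via a link."""
    return child_id in parent["children"]


# ($w word)($m move)($d disfluency);
# ($m ^ $w) and ($m label ~ instruct) and ($d ^ $w)
matches = [
    (w, m_id, d_id)
    for w, (m_id, move), (d_id, disfluency) in product(
        WORDS, MOVES.items(), DISFLUENCIES.items()
    )
    if is_parent(move, w)
    and move["label"] == "instruct"
    and is_parent(disfluency, w)
]
print(matches)
```

Each result is one binding of the three variables, so a word that sits in several matching moves or disfluencies would appear once per combination.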

Conclusions
- Standoff markup is not just theoretically a good idea
- Map Task standoff annotations in place for 5 years, used regularly
- Accessible to linguists with modest technical backgrounds