Object Oriented Programming III1 XSLT Processing XML using XSLT Using XPath
Object Oriented Programming III2 Processing XML using XSLT XSLT is available on a number of platforms. Next week – how C# interacts with XSLT
Object Oriented Programming III3 The Catcher in the Rye J. D. Salinger Little, Brown and Company Input
Object Oriented Programming III4 <xsl:stylesheet xmlns:xsl=" version="1.0"> Processing
Object Oriented Programming III5 The Catcher in the Rye J. D. Salinger Little, Brown and Company Output
Object Oriented Programming III6 The Catcher in the Rye J. D. Salinger Little, Brown and Company Input
Object Oriented Programming III7 <xsl:stylesheet xmlns:xsl=" version="1.0"> The default rules matches the root, library and block elements.
Object Oriented Programming III8 The Catcher in the Rye J. D. Salinger Little, Brown and Company The output is the same.
Object Oriented Programming III9 The Catcher in the Rye J. D. Salinger Little, Brown and Company Cliff Notes on The Catcher in the Rye Two books in the input
Object Oriented Programming III10 <xsl:stylesheet xmlns:xsl=" version="1.0"> What’s the output?
Object Oriented Programming III11 The Catcher in the Rye J. D. Salinger Little, Brown and Company Cliff Notes on The Catcher in the Rye Illegal HTML
Object Oriented Programming III12 The Catcher in the Rye J. D. Salinger Little, Brown and Company Input
Object Oriented Programming III13 <xsl:stylesheet xmlns:xsl=" version="1.0"> <!-- --> We are not matching on publisher.
Object Oriented Programming III14 The Catcher in the Rye J. D. Salinger Little, Brown and Company We get the default rule matching the publisher and then printing its child.
Object Oriented Programming III15 The Catcher in the Rye J. D. Salinger Little, Brown and Company Input
Object Oriented Programming III16 <xsl:stylesheet xmlns:xsl=" version="1.0"> We can skip the publisher by matching and stopping the recursion.
Object Oriented Programming III17 The Catcher in the Rye J. D. Salinger
Object Oriented Programming III18 The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company A shelf has many books.
Object Oriented Programming III19 <xsl:stylesheet xmlns:xsl=" version="1.0"> Will this do the job?
Object Oriented Programming III20 The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company This is not what we want.
Object Oriented Programming III21 The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company Same input.
Object Oriented Programming III22 <xsl:stylesheet xmlns:xsl=" version="1.0"> Found a shelf Checks for a shelf and quits.
Object Oriented Programming III23 Found a shelf Output
Object Oriented Programming III24 The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company The Catcher in the Rye J. D. Salinger Little, Brown and Company Same input.
Object Oriented Programming III25 <xsl:stylesheet xmlns:xsl=" version="1.0"> These are a few of my favorite books Produce a table of books.
Object Oriented Programming III26 These are a few of my favorite books 1 The Catcher in the Rye J. D. Salinger Little, Brown and Company 2 The XSLT Programmer's Reference Michael Kay Wrox Press 3 Computer Organization and Design Patterson and Henessey Morgan Kaufmann
Object Oriented Programming III27
Object Oriented Programming III28 XPATH Non-xml language used to identify particular parts of an xml document Used by XSLT for matching and selecting particular elements to be copied into the result tree. Used by Xpointer to identify a particular point in or part of an xml document that an Xlink links to. Slides adapted from “XML in a Nutshell” by Harold
Object Oriented Programming III29 XPATH First, we’ll look at three commonly used XSLT instructions: xsl:value-of xsl:template xsl:apply-templates
Object Oriented Programming III30 XPATH The xsl:value-of element computes the string value of an Xpath expression and inserts it into the result tree. XPath allows us to select nodes in the tree and different node types produce different values.
Object Oriented Programming III31 XPATH element => the text content of the element after all tags are stripped text => the text of the node attribute => the value of the attribute root => the value of the root processing-instruction => the processing instruction data (, and the target are not included comment => the text of the comment (no comment symbols) namespace => the namespace URI node set => the value of the first node in the set
Object Oriented Programming III32 XPATH The xsl:template top-level element is the key to all of xslt. The match attribute contains a pattern (location path) against which nodes are compared as they’re processed. If the pattern matches a node, then the contents are instantiated
Object Oriented Programming III33 XPATH Find and apply the highest priority template that matches the node set expression. If the select attribute is not present then all children of the context node are processed.
Object Oriented Programming III34 The Tree Structure of an XML Document Alan Turing computer scientist mathematician cryptographer See Harold Pg. 147
Object Oriented Programming III35 Richard M Feynman physicist Playing the bongoes Unicode ‘M’
Object Oriented Programming III36 / person born = “1914” died = “1952” id=“p342” person name first_name Alan <!– Did the word “computer scientist” exist in Turing’s day?”-- > profession
Object Oriented Programming III37 The root Element Nodes Text Nodes Attribute Nodes Comment Nodes Processing Instructions Namespace Nodes Nodes seen by XPath Constructs not seen by XPath CDATA sections Entity references Document Type Declarations
Object Oriented Programming III38 Note The following appears in each example below so it has been removed from the slides. <xsl:stylesheet xmlns:xsl=" version="1.0" > :
Object Oriented Programming III39 Location Paths The root matched the root matched the root
Object Oriented Programming III40 Location Paths Child element location paths (relative to context node) computer scientist
Object Oriented Programming III41 Location Paths Attribute location paths (relative to context node) 1912
Object Oriented Programming III42 Location Paths Attribute location paths (relative to context node)
Object Oriented Programming III43 Location Paths Comment Location Step (comments don’t have names) Did the word "computer scientist" exist in Turing's day?
Object Oriented Programming III44 Location Paths Comment Location Step comment deleted Document content with comments replaced as shown. Default – no comments output
Object Oriented Programming III45 Location Paths Text Location Step (Text nodes don’t have names) computer scientist
Object Oriented Programming III46 Location Paths Processing Instruction Location Step type="text/xsl" href = "pi.xsl"
Object Oriented Programming III47 Location Paths Wild cards There are three wild cards: *, The * matches any element node. It will not match attributes, text nodes, comments or processing instructions nodes.
Object Oriented Programming III48 Location Paths Matching with * Matches all elements and requests calls on sub-elements only. Nothing is displayed. The text nodes are never reached.
Object Oriented Programming III49 Location Paths Matching with node() The node() wild card matches all nodes: element nodes, text nodes, attribute nodes, processing instruction nodes, namespace nodes and comment nodes.
Object Oriented Programming III50 Matching with Node What is the output?
Object Oriented Programming III51 Matching with Node -Output
Object Oriented Programming III52 Location Paths Matching wild card matches all attribute nodes.
Object Oriented Programming III53 Matching Found an attribute What is the output?
Object Oriented Programming III54 Matching - Output Found an attribute 1912 Found an attribute 1954 Found an attribute p342 Found an attribute 1918 Found an attribute 1988 Found an attribute p4567
Object Oriented Programming III55 Matching p p4567
Object Oriented Programming III56 Location Paths Multiple matches with | Matches all the elements. Skips the text nodes unless they describe a profession or hobby.
Object Oriented Programming III57 Location Paths Selecting from all descendants with // // selects from all descendants of the context node as well as the context node itself. At the beginning of an Xpath expression, it selects from all descendants of the root node.
Object Oriented Programming III58 Location Paths Selecting from all descendants with // TuringFeynman
Object Oriented Programming III59 Location Paths Selecting from all descendants with // Alan
Object Oriented Programming III60 Location Paths Selecting from all descendants with // AlanRichard
Object Oriented Programming III61 Location Paths Selecting from all descendants with // Richard
Object Oriented Programming III62 Specifying the Child Axis Consider the following path: /Envelope/Header/Signature The above is an abbreviation for /child::Envelope/child::Header/child::Signature
Object Oriented Programming III63 Using an Axis
Object Oriented Programming III64 What is the output?
Object Oriented Programming III65 Richard M Feynman Alan Turing Axis Example - Output
Object Oriented Programming III66 Writing Output to an Attribute
Object Oriented Programming III67 Writing Output to an Attribute
Object Oriented Programming III68 Predicates In general, an Xpath expression may refer to more than one node. Predicates allow us to reduce the number of nodes we are interested in. Each step in a location path may have a predicate that selects from the node list that is current at that step in the expression. The boolean expression in the predicate is tested against each node in the context node list. If the expression is false then that node is deleted from the list.
Object Oriented Programming III69 Predicates Richard M Feynman
Object Oriented Programming III70 Predicates Richard M Feynman physicist Playing the bongoes
Object Oriented Programming III71 Predicates Alan Turing computer scientist mathematician cryptographer
Object Oriented Programming III72 Predicates Richard M Feynman physicist Playing the bongoes
Object Oriented Programming III73 Predicates <xsl:apply-templates select = < 1950]/ name[first_name='Alan']" /> Alan Turing
Object Oriented Programming III74 General XPath Expressions Xpath expressions that are not node sets can’t be used in the match attribute of an xsl:template element. They can be used for the values for the select attribute of xsl:value-of elements and in location path predicates.
Object Oriented Programming III75 General XPath Expressions
Object Oriented Programming III76 General XPath Expressions Xpath Functions Person Person 1 Person 2
Object Oriented Programming III77 General XPath Expressions Xpath Functions Mr. T. Mr. T. Alan Turing Node set converted to string