1 Query Languages for XML XPath XQuery XSLT. 2 The XPath/XQuery Data Model uCorresponding to the fundamental “relation” of the relational model is: sequence.

Slides:



Advertisements
Similar presentations
Advanced XSLT. Branching in XSLT XSLT is functional programming –The program evaluates a function –The function transforms one structure into another.
Advertisements

2/10/05Salman Azhar: Database Systems1 XML Query Languages Salman Azhar XPATH XQUERY These slides use some figures, definitions, and explanations from.
1 Introduction to SQL Select-From-Where Statements Multirelation Queries Subqueries.
SQL Queries Principal form: SELECT desired attributes FROM tuple variables –– range over relations WHERE condition about tuple variables; Running example.
1 XSLT – eXtensible Stylesheet Language Transformations Modified Slides from Dr. Sagiv.
1 XQuery Web and Database Management System. 2 XQuery XQuery is to XML what SQL is to database tables XQuery is designed to query XML data What is XQuery?
Friday, September 4 th, 2009 The Systems Group at ETH Zurich XML and Databases Exercise Session 6 courtesy of Ghislain Fourny/ETH © Department of Computer.
1 XPath Path Expressions Conditions. 2 Paths in XML Documents uXPath is a language for describing paths in XML documents. uReally think of the semistructured.
CMPT 354, Simon Fraser University, Fall 2008, Martin Ester 357 Database Systems I Query Languages for XML.
1 COS 425: Database and Information Management Systems XML and information exchange.
XML Language Family Detailed Examples Most information contained in these slide comes from: These slides are intended.
Winter 2002Arthur Keller – CS 18018–1 Schedule Today: Mar. 12 (T) u Semistructured Data, XML, XQuery. u Read Sections Assignment 8 due. Mar. 14.
1 XML Document Type Definitions XML Schema. 2 Well-Formed and Valid XML uWell-Formed XML allows you to invent your own tags. uValid XML conforms to a.
1 SQL/PSM Procedures Stored in the Database General-Purpose Programming.
1 XML Semistructured Data Extensible Markup Language Document Type Definitions.
XML –Query Languages, Extracting from Relational Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria.
1 XML Query Languages XPATH XQUERY. 2 XPATH and XQUERY uXPATH is a language for describing paths in XML documents. wReally think of the semistructured.
XML Technologies and Applications Rajshekhar Sunderraman Department of Computer Science Georgia State University Atlanta, GA 30302
1 PL/SQL Oracle’s Version of Triggers and PSM. 2 PL/SQL uOracle uses a variant of SQL/PSM which it calls PL/SQL. uPL/SQL not only allows you to create.
1 XQuery Values FLWR Expressions Other Expressions.
1 XML Semistructured Data Extensible Markup Language Document Type Definitions.
Introduction to XPath Bun Yue Professor, CS/CIS UHCL.
Using XSLT and XPath to Enhance HTML Documents Reference: Roger L. Costello
XML Document Type Definitions XML Schema. Motivation for Semistructured data Serves as a model suitable for integration of databases Notations such as.
ECA 228 Internet/Intranet Design I Intro to XSL. ECA 228 Internet/Intranet Design I XSL basics W3C standards for stylesheets – CSS – XSL: Extensible Markup.
Lecture 6 of Advanced Databases XML Schema, Querying & Transformation Instructor: Mr.Ahmed Al Astal.
XML Semi-structured data XML Document Type Definitions (DTD)
1 Lecture 5: XML and XQuery. 2 Semistructured Data uAnother data model, based on trees. uMotivation: flexible representation of data. wOften, data comes.
TDDD43 XML and RDF Slides based on slides by Lena Strömbäck and Fang Wei-Kleiner 1.
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
CSCE 520- Relational Data Model Lecture 2. Relational Data Model The following slides are reused by the permission of the author, J. Ullman, from the.
Document Type Definitions XML Schema
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 6 XSLT (Based on Møller and Schwartzbach,
Advance Database S Week-7 Dr.Kwanchai Eurviriyanukul
Processing of structured documents Spring 2002, Part 2 Helena Ahonen-Myka.
Lecture 22 XML querying. 2 Example 31.5 – XQuery FLWOR Expressions ‘=’ operator is a general comparison operator. XQuery also defines value comparison.
Processing of structured documents Spring 2003, Part 7 Helena Ahonen-Myka.
XPath. Why XPath? Common syntax, semantics for [XSLT] [XPointer][XSLT] [XPointer] Used to address parts of an XML document Provides basic facilities for.
Winter 2006Keller, Ullman, Cushing18–1 Plan 1.Information integration: important new application that motivates what follows. 2.Semistructured data: a.
ECA 228 Internet/Intranet Design I XSLT Example. ECA 228 Internet/Intranet Design I 2 CSS Limitations cannot modify content cannot insert additional text.
CITA 330 Section 6 XSLT. Transforming XML Documents to XHTML Documents XSLT is an XML dialect which is declared under namespace "
XSLT part of XSL (Extensible Stylesheet Language) –includes also XPath and XSL Formatting Objects used to transform an XML document into: –another XML.
XSLT Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
Unit 04 : XQuery COMP 5323 Web Database Technologies and Applications 2014.
1 XQuery uXQuery extends XPath to a query language that has power similar to SQL. uUses the same sequence-of-items data model. uXQuery is an expression.
August Chapter 6 - XPath & XPointer Learning XML by Erik T. Ray Slides were developed by Jack Davis College of Information Science and Technology.
Database Systems Part VII: XML Querying Software School of Hunan University
1 Real SQL Programming Persistent Stored Modules (PSM) PL/SQL Embedded SQL.
Jeff Ullman: Introduction to XML 1 XML Semistructured Data Extensible Markup Language Document Type Definitions.
More XML XPATH, XSLT CS 431 – February 23, 2005 Carl Lagoze – Cornell University.
XML Query Languages XPATH XQUERY Zaki Malik November 11, 2008.
XPath --XML Path Language Motivation of XPath Data Model and Data Types Node Types Location Steps Functions XPath 2.0 Additional Functionality and its.
XSLT: How Do We Use It? Nancy Hallberg Nikki Massaro Kauffman.
CSE 6331 © Leonidas Fegaras XQuery 1 XQuery Leonidas Fegaras.
XSLT I Robin Burke ECT 360. Outline History / Terminology XSLT processing XSLT syntax XPath XSLT basics Lab.
1 The XPath Language. 2 XPath Expressions Flexible notation for navigating around trees A basic technology that is widely used uniqueness and scope in.
XML Schema – XSLT Week 8 Web site:
XQUERY The contents of this slide are obtained from various sources including, Wikipedia, W3School, Stanford website etc. January 2011 Dr.Kwanchai Eurviriyanukul.
1 XSL Transformations (XSLT). 2 XSLT XSLT is a language for transforming XML documents into XHTML documents or to other XML documents. XSLT uses XPath.
CH 15 XSL Transformations 1. Objective What is XSL? Overview of XSL transformations Understanding XSL templates Computing the value of a node with xsl:value-of.
1 XSLT XSLT (extensible stylesheet language – transforms ) is another language to process XML documents. Originally intended as a presentation language:
Query Languages for XML
XML: Extensible Markup Language
Schedule Today: Jan. 28 (Mon) Jan. 30 (Wed) Next Week Assignments !!
Slides are reused by the approval of Jeffrey Ullman’s
Querying and Transforming XML Data
Introduction to Database Systems, CS420
XQuery Leonidas Fegaras.
Query Languages for XML
Select-From-Where Statements Multirelation Queries Subqueries
Presentation transcript:

1 Query Languages for XML XPath XQuery XSLT

2 The XPath/XQuery Data Model uCorresponding to the fundamental “relation” of the relational model is: sequence of items. uAn item is either: 1.A primitive value, e.g., integer or string. 2.A node (defined next).

3 Principal Kinds of Nodes 1.Document nodes represent entire documents. 2.Elements are pieces of a document consisting of some opening tag, its matching closing tag (if any), and everything in between. 3.Attributes names that are given values inside opening tags.

4 Document Nodes uFormed by doc(URL) or document(URL). uExample: doc(/usr/class/cs145/bars.xml) uAll XPath (and XQuery) queries refer to a doc node, either explicitly or implicitly. wExample: key definitions in XML Schema have Xpath expressions that refer to the document described by the schema.

5 DTD for Running Example <!DOCTYPE BARS [ ]>

6 Example Document … <BEER name = ”Bud” soldBy = ”JoesBar SuesBar … ”/> … An element node An attribute node Document node is all of this, plus the header ( <? xml version… ).

7 Nodes as Semistructured Data BARS PRICE BEERBAR name = ”JoesBar” theBeer = ”Miller” theBeer = ”Bud” SoldBy = ”…” name = ”Bud” Rose =document Green = element Gold = attribute Purple = primitive value bars.xml

8 Paths in XML Documents uXPath is a language for describing paths in XML documents. uThe result of the described path is a sequence of items.

9 Path Expressions uSimple path expressions are sequences of slashes (/) and tags, starting with /. wExample: /BARS/BAR/PRICE uConstruct the result by starting with just the doc node and processing each tag from the left.

10 Evaluating a Path Expression uAssume the first tag is the root. wProcessing the doc node by this tag results in a sequence consisting of only the root element. uSuppose we have a sequence of items, and the next tag is X. wFor each item that is an element node, replace the element by the subelements with tag X.

11 Example: /BARS … <BEER name = ”Bud” soldBy = ”JoesBar SuesBar … ”/> … One item, the BARS element

12 Example: /BARS/BAR … <BEER name = ”Bud” soldBy = ”JoesBar SuesBar …”/> … This BAR element followed by all the other BAR elements

13 Example: /BARS/BAR/PRICE … <BEER name = ”Bud” soldBy = ”JoesBar SuesBar …”/> … These PRICE elements followed by the PRICE elements of all the other bars.

14 Attributes in Paths uInstead of going to subelements with a given tag, you can go to an attribute of the elements you already have. uAn attribute is indicated by in front of its name.

15 Example: … <BEER name = ”Bud” soldBy = ”JoesBar SuesBar …”/> … These attributes contribute ”Bud” ”Miller” to the result, followed by other theBeer values.

16 Remember: Item Sequences uUntil now, all item sequences have been sequences of elements. uWhen a path expression ends in an attribute, the result is typically a sequence of values of primitive type, such as strings in the previous example.

17 Paths that Begin Anywhere uIf the path starts from the document node and begins with //X, then the first step can begin at the root or any subelement of the root, as long as the tag is X.

18 Example: //PRICE … <BEER name = ”Bud” soldBy = ”JoesBar SuesBar …”/> … These PRICE elements and any other PRICE elements in the entire document

19 Wild-Card * uA star (*) in place of a tag represents any one tag. uExample: /*/*/PRICE represents all price objects at the third level of nesting.

20 Example: /BARS/* … <BEER name = ”Bud” soldBy = ”JoesBar SuesBar … ”/> … This BAR element, all other BAR elements, the BEER element, all other BEER elements

21 Selection Conditions uA condition inside […] may follow a tag. uIf so, then only paths that have that tag and also satisfy the condition are included in the result of a path expression.

22 Example: Selection Condition u/BARS/BAR/PRICE[. < 2.75] … The condition that the PRICE be < $2.75 makes this price but not the Miller price part of the result. The current element.

23 Example: Attribute in Selection = ”Miller”] … Now, this PRICE element is selected, along with any other prices for Miller.

24 Axes uIn general, path expressions allow us to start at the root and execute steps to find a sequence of nodes at each step. uAt each step, we may follow any one of several axes. uThe default axis is child:: --- go to all the children of the current set of nodes.

25 Example: Axes u/BARS/BEER is really shorthand for /BARS/child::BEER. is really shorthand for the attribute:: axis. wThus, = ”Bud” ] is shorthand for /BARS/BEER[attribute::name = ”Bud”]

26 More Axes uSome other useful axes are: 1.parent:: = parent(s) of the current node(s). 2.descendant-or-self:: = the current node(s) and all descendants. wNote: // is really shorthand for this axis. 3.ancestor::, ancestor-or-self, etc. 4.self (the dot).

27 XQuery uXQuery extends XPath to a query language that has power similar to SQL. uUses the same sequence-of-items data model. uXQuery is an expression language. wLike relational algebra --- any XQuery expression can be an argument of any other XQuery expression.

28 More About Item Sequences uXQuery will sometimes form sequences of sequences. uAll sequences are flattened. uExample: (1 2 () (3 4)) = ( ). Empty sequence

29 FLWR Expressions 1.One or more for and/or let clauses. 2.Then an optional where clause. 3.A return clause.

30 Semantics of FLWR Expressions uEach for creates a loop. wlet produces only a local definition. uAt each iteration of the nested loops, if any, evaluate the where clause. uIf the where clause returns TRUE, invoke the return clause, and append its value to the output.

31 FOR Clauses for in,... uVariables begin with $. uA for-variable takes on each item in the sequence denoted by the expression, in turn. uWhatever follows this for is executed once for each value of the variable.

32 Example: FOR for $beer in return {$beer} u$beer ranges over the name attributes of all beers in our example document. uResult is a sequence of BEERNAME elements: Bud Miller... “Expand the en- closed string by replacing variables and path exps. by their values.” Our example BARS document

33 Use of Braces uWhen a variable name like $x, or an expression, could be text, we need to surround it by braces to avoid having it interpreted literally. wExample: $x is an A-element with value ”$x”, just like foo is an A-element with ”foo” as value.

34 Use of Braces --- (2)  But return $x is unambiguous.  You cannot return an untagged string without quoting it, as return ”$x”.

35 LET Clauses let :=,... uValue of the variable becomes the sequence of items defined by the expression. uNote let does not cause iteration; for does.

36 Example: LET let $d := document(”bars.xml”) let $beers := return {$beers} uReturns one element with all the names of the beers, like: Bud Miller …

37 Order-By Clauses uFLWR is really FLWOR: an order-by clause can precede the return. uForm: order by wWith optional ascending or descending. uThe expression is evaluated for each assignment to variables. uDetermines placement in output sequence.

38 Example: Order-By uList all prices for Bud, lowest first. let $d := document(”bars.xml”) for $p in order by $p return $p Generates bindings for $p to PRICE elements. Order those bindings by the values inside the elements (auto- matic coersion). Each binding is evaluated for the output. The result is a sequence of PRICE elements.

39 Aside: SQL ORDER BY uSQL works the same way; it’s the result of the FROM and WHERE that get ordered, not the output. uExample: Using R(a,b), SELECT b FROM R WHERE b > 10 ORDER BY a; R tuples with b>10 are ordered by their a-values. Then, the b-values are extracted from these tuples and printed in the same order.

40 Predicates uNormally, conditions imply existential quantification. uExample: means “all the bars that have a name.” uExample: = ”JoesBar”] gives the set of beers that are sold at Joe’s Bar.

41 Example: Comparisons uLet us produce the PRICE elements (from all bars) for all the beers that are sold by Joe’s Bar. uThe output will be BBP elements with the names of the bar and beer as attributes and the price element as a subelement.

42 Strategy 1.Create a triple for-loop, with variables ranging over all BEER elements, all BAR elements, and all PRICE elements within those BAR elements. 2.Check that the beer is sold at Joe’s Bar and that the name of the beer and theBeer in the PRICE element match. 3.Construct the output element.

43 The Query let $bars = doc(”bars.xml”)/BARS for $beer in $bars/BEER for $bar in $bars/BAR for $price in $bar/PRICE where = ”JoesBar” and = return {$price} True if ”JoesBar” appears anywhere in the sequence

44 Strict Comparisons uTo require that the things being compared are sequences of only one element, use the Fortran comparison operators: weq, ne, lt, le, gt, ge. uExample: eq ”JoesBar” is true only if Joe’s is the only bar selling the beer.

45 Comparison of Elements and Values uWhen an element is compared to a primitive value, the element is treated as its value, if that value is atomic.  Example: eq ”2.50” is true if Joe charges $2.50 for Bud.

46 Comparison of Two Elements uIt is insufficient that two elements look alike. uExample: eq is false, even if Joe and Sue charge the same for Bud.

47 Comparison of Elements – (2) uFor elements to be equal, they must be the same, physically, in the implied document. uSubtlety: elements are really pointers to sections of particular documents, not the text strings appearing in the section.

48 Getting Data From Elements uSuppose we want to compare the values of elements, rather than their location in documents. uTo extract just the value (e.g., the price itself) from an element E, use data(E ).

49 Example: data() uSuppose we want to modify the return for “find the prices of beers at bars that sell a beer Joe sells” to produce an empty BBP element with price as one of its attributes. return

50 Eliminating Duplicates  Use function distinct-values applied to a sequence. uSubtlety: this function strips tags away from elements and compares the string values. wBut it doesn’t restore the tags in the result.

51 Example: All the Distinct Prices return distinct-values( let $bars = doc(”bars.xml”) return $bars/BARS/BAR/PRICE ) Remember: XQuery is an expression language. A query can appear any place a value can.

52 Effective Boolean Values uThe effective boolean value (EBV) of an expression is: 1.The actual value if the expression is of type boolean. 2.FALSE if the expression evaluates to 0, ”” [the empty string], or () [the empty sequence]. 3.TRUE otherwise.

53 EBV Examples has EBV TRUE or FALSE, depending on whether the name attribute is ”JoesBar”. has EBV TRUE if some bar is named the Golden Rail, and FALSE if there is no such bar.

54 Boolean Operators uE 1 and E 2, E 1 or E 2, not(E ), apply to any expressions. uTake EBV’s of the expressions first. uExample: not(3 eq 5 or 0) has value TRUE. uAlso: true() and false() are functions that return values TRUE and FALSE.

55 Branching Expressions uif (E 1 ) then E 2 else E 3 is evaluated by: w Compute the EBV of E 1. wIf true, the result is E 2 ; else the result is E 3. uExample: the PRICE subelements of $bar, provided that bar is Joe’s. eq ”JoesBar”) then $bar/PRICE else () Empty sequence. Note there is no if-then expression.

56 Quantifier Expressions some $x in E 1 satisfies E 2 1.Evaluate the sequence E 1. 2.Let $x (any variable) be each item in the sequence, and evaluate E 2. 3.Return TRUE if E 2 has EBV TRUE for at least one $x. uAnalogously: every $x in E 1 satisfies E 2

57 Example: Some uThe bars that sell at least one beer for less than $2. for $bar in doc(”bars.xml”)/BARS/BAR where some $p in $bar/PRICE satisfies $p < 2.00 return Notice: where $bar/PRICE < 2.00 would work as well.

58 Example: Every uThe bars that sell no beer for more than $5. for $bar in doc(”bars.xml”)/BARS/BAR where every $p in $bar/PRICE satisfies $p <= 5.00 return

59 Document Order uComparison by document order: >. uExample: << is true iff the Bud element appears before the Miller element in the document $d.

60 Set Operators uunion, intersect, except operate on sequences of nodes. wMeanings analogous to SQL. wResult eliminates duplicates. wResult appears in document order.

61 XSLT uXSLT (extensible stylesheet language – transforms ) is another language to process XML documents. uOriginally intended as a presentation language: transform XML into an HTML page that could be displayed. uIt can also transform XML -> XML, thus serving as a query language.

62 XSLT Programs uLike XML Schema, an XSLT program is itself an XML document. uXSLT has a special namespace of tags, usually indicated by xsl:.

63 Templates uThe xsl:template element describes a set of elements (of the document being processed) and what should be done with them. uThe form: … Attribute match gives an XPath expression describing how to find the nodes to which the template applies.

64 Example: BARS Document -> Table uIn a running example, we’ll convert the bars.xml document into an HTML document that looks like the Sells(bar, beer, price) relation. uThe first template will match the root of the document and produce the table without any rows.

65 The Template for the Root bar beer price Template matches only the root. Output of the template is a table with the attributes in the header row, no other rows. Needs to be fixed. As is, there is no way to insert rows.

66 Outline of Strategy 1.Inside the HTML for the table is xsl:apply- templates to extract data from the document. 2.From each BAR, use an xsl:variable b to remember the bar name. 3.xsl:for-each PRICE subelement, generate a row, using b, and xsl:value-of to extract the beer name and price.

67 Recursive Use of Templates uAn XSLT document usually contains many templates. uStart by finding the first one that applies to the root. uAny template can have within it, which causes the template-matching to apply recursively from the current node.

68 Apply-Templates uAttribute select gives an XPath expression describing the subelements to which we apply templates. uExample: says to follow all paths tagged BARS, BAR from the current node and apply all templates there.

69 Example: Apply-Templates bar beer price

70 Extracting Values u produces a value to be placed in the output. uExample: suppose we are applying a template at a BAR element and want to put the bar name into a table.

71 Variables uWe can declare x to be a variable with. uExample: within a template that applies to BAR elements will set variable bar to the name of that bar.

72 Using Variables uPut a $ in front of the variable name. uExample: $bar

73 Completing the Table 1.We’ll apply a template at each BAR element. 2.This template will assign a variable b the value of the bar, and iterate over each PRICE child. 3.For each PRICE child, we print a row, using b, the theBeer attribute, and the PRICE itself.

74 Iteration u … executes the body of the for-each at each child of the current node that is reached by the path.

75 The Template for BARS $b Constructs a bar- beer-price row. Iterates over all PRICE subelements of the bar. This element A variable for each bar