1 New Ways of Querying the Web by Eliahu Brodsky and Alina Blizhovsky.

2 Simple Querying A search engine looks for Web documents that contain the given word (or words).
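A minimal sketch of this kind of keyword lookup, written in Python for illustration only. The document names and texts are made up, and real search engines add ranking, stemming and much more.

```python
from collections import defaultdict

def build_index(docs):
    """Map each lowercase word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index

def search(index, query):
    """Return the ids of documents that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result

# Hypothetical documents, not taken from the slides.
docs = {
    "a.html": "database systems by date",
    "b.html": "matlab from the math works",
}
index = build_index(docs)
hits = search(index, "database systems")
```

The lookup only tests whether words occur; it knows nothing about which word is a title and which is a publisher, which is the limitation the following slides address.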

3 Querying structured data Data on the Web may be structured (e.g. a book catalog). “Structure” means a schema. The schema need not be rigid (semi-structured data). Structured data allows more complex queries to be executed.

4 CGI Advantage –Uses an existing DBMS (e.g. relational). Disadvantage –Integrating data from different Web sources is difficult.

5 XML (Extensible Markup Language) A subset of SGML. Benefits –Arbitrary extension of a document’s tags and attributes. –Support for documents with complex structure. –Validation of document structure (with respect to an optional Document Type Definition).

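6 Example of XML data Two book elements: the first (year=“1995”) with the title Database Systems, the author Date and the publisher Addison-Wesley; the second (year=“1998”) with the publisher The Math Works and the title MATLAB.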

7 Example of a Document Type Definition (DTD) <!ELEMENT book (title, author?, publisher)>
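Python's standard library does not validate against a DTD, so the following is only a sketch of what the content model (title, author?, publisher) means: the sequence of child tags is checked with a regular expression. The sample book elements and the plain-text author are assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Content model from the slide: title, then an optional author, then publisher.
BOOK_MODEL = re.compile(r"title,(author,)?publisher,")

def matches_book_model(book):
    """Check the order of a book's child elements against the content model."""
    child_tags = "".join(child.tag + "," for child in book)
    return BOOK_MODEL.fullmatch(child_tags) is not None

# Hypothetical sample elements.
with_author = ET.fromstring(
    "<book><title>Database Systems</title><author>Date</author>"
    "<publisher>Addison-Wesley</publisher></book>"
)
without_author = ET.fromstring(
    "<book><title>MATLAB</title><publisher>The Math Works</publisher></book>"
)
missing_title = ET.fromstring(
    "<book><publisher>The Math Works</publisher></book>"
)
```

Here the first two elements satisfy the model (the author is optional) and the third does not, because the required title is absent.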

8 Semi-structured Data Model Non-rigid schema. Object Exchange Model (OEM). Data is represented by a graph.

9 Example of XML data Two book elements: Database Systems (author Date, publisher Addison-Wesley) and MATLAB (publisher The Math Works).

10 [Figure: graph representation of the XML data. Two book nodes (year=“1995” and year=“1998”) with edges labeled title, author, name and publisher leading to the leaves Database Systems, Date, Addison-Wesley, MATLAB and The Math Works.]
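The graph view of the data can be sketched in Python as a list of labeled edges plus a table of leaf values. The XML below is a hypothetical reconstruction: the tag names come from the slide, but the bib wrapper and the exact nesting (for example name under author) are assumptions, since the original markup did not survive.

```python
import itertools
import xml.etree.ElementTree as ET

# Hypothetical reconstruction of the slide's data; the nesting is assumed.
XML_DATA = """
<bib>
  <book year="1995">
    <title>Database Systems</title>
    <author><name>Date</name></author>
    <publisher>Addison-Wesley</publisher>
  </book>
  <book year="1998">
    <publisher>The Math Works</publisher>
    <title>MATLAB</title>
  </book>
</bib>
"""

def to_graph(root):
    """Return (edges, values, attrs) for an edge-labeled graph of the tree.

    edges  -- list of (parent_id, label, child_id)
    values -- leaf node id -> text
    attrs  -- node id -> attribute dict (only for nodes that have attributes)
    """
    ids = itertools.count()
    edges, values, attrs = [], {}, {}

    def visit(elem):
        node = next(ids)
        text = (elem.text or "").strip()
        if len(elem) == 0 and text:
            values[node] = text
        if elem.attrib:
            attrs[node] = dict(elem.attrib)
        for child in elem:
            edges.append((node, child.tag, visit(child)))
        return node

    visit(root)
    return edges, values, attrs

edges, values, attrs = to_graph(ET.fromstring(XML_DATA))
```

Node ids are assigned in document order, so the root is 0 and the first book is 1. Labels sit on the edges rather than on the nodes, which is the convention of OEM-style graphs.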

11 Example of XML data Two book elements: Database Systems (author Date, publisher Addison-Wesley) and MATLAB (publisher The Math Works).

12 [Figure: graph of the two book nodes (year=“1995” and year=“1998”) as on slide 10, with an additional edge labeled related.]

13 XML-QL Extracts data from large XML documents. Integrates XML data from multiple sources. Translates XML data between different DTDs. Processes a request by –sending queries to XML sources, or by –transporting large amounts of XML data to clients.

14 Example of XML-QL WHERE Addison-Wesley $t IN “ CONSTRUCT $t

15 Example of XML data Two book elements: Database Systems (author Date, publisher Addison-Wesley) and MATLAB (publisher The Math Works).

16 Result of the query: Database Systems
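Judging from the result, the query appears to bind $t to the title of every book whose publisher is Addison-Wesley and to construct output from those bindings. The sketch below expresses that reading in Python with ElementTree; it is not XML-QL, and the sample data, the bib wrapper and the result element are assumptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical reconstruction of the slide's data; the nesting is assumed.
XML_DATA = """
<bib>
  <book year="1995">
    <title>Database Systems</title>
    <author><name>Date</name></author>
    <publisher>Addison-Wesley</publisher>
  </book>
  <book year="1998">
    <publisher>The Math Works</publisher>
    <title>MATLAB</title>
  </book>
</bib>
"""

def titles_by_publisher(root, publisher):
    """WHERE part: match books by publisher. CONSTRUCT part: emit their titles."""
    result = ET.Element("result")  # hypothetical wrapper element
    for book in root.iter("book"):
        if book.findtext("publisher") == publisher:
            title = ET.SubElement(result, "title")
            title.text = book.findtext("title")
    return result

result = titles_by_publisher(ET.fromstring(XML_DATA), "Addison-Wesley")
print(ET.tostring(result, encoding="unicode"))
```

The loop plays the role of the WHERE pattern and the SubElement call plays the role of CONSTRUCT; in XML-QL both are written declaratively as XML-like patterns.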

17 WHERE Addison-Wesley $a1 IN “ The Math Works $a2 IN “ $a1 = $a2 CONSTRUCT $a1

18 Regular Path Expressions Permitted wherever XML permits an element. Provide: –alternation ( | ) –concatenation ( . ) –Kleene star ( * )
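One way to picture the three operators is to treat the sequence of tags from the root to an element as a string and match it with an ordinary regular expression. This Python sketch does exactly that; the select helper and the nested part data are hypothetical, and XML-QL's own syntax differs.

```python
import re
import xml.etree.ElementTree as ET

def select(root, path_regex):
    """Return the elements whose dot-joined tag path from the root matches."""
    pattern = re.compile(path_regex)
    matches = []

    def walk(elem, path):
        path = path + [elem.tag]
        if pattern.fullmatch(".".join(path)):
            matches.append(elem)
        for child in elem:
            walk(child, path)

    walk(root, [])
    return matches

# Hypothetical nested data, chosen so that the Kleene star has work to do.
doc = ET.fromstring(
    "<part><name>engine</name>"
    "<part><name>piston</name><brand>Acme</brand>"
    "<part><name>ring</name></part></part></part>"
)

# Concatenation and Kleene star: a name below any number of nested parts.
names = [e.text for e in select(doc, r"part(\.part)*\.name")]
# Alternation: a name or a brand directly below a second-level part.
mixed = [e.text for e in select(doc, r"part\.part\.(name|brand)")]
```

With this data, names collects engine, piston and ring, while mixed collects piston and Acme.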

19 Example of a regular path expression WHERE $r IN “ CONSTRUCT $r

20 $r.

21 XQL Is designed specifically for XML documents. Provides a simple syntax (patterns modeled after directory notation). Expressed in strings that can be embedded in programs, scripts, and XML or HTML attributes.

22 The Result of an XQL Query Depends on the implementation. One of the following: –An XML document. –A tree that can be fed back into XQL. –A different type of structure (e.g. a set of pointers to nodes).

23 Search Context Is the set of nodes against which a query operates. The “root context” and the “current context”: / uses the “root context”. ./ uses the “current context” explicitly.

24 Example of an XQL query book[/bookstore/@specialty = @style] Find all books where the value of the style attribute of the book is equal to the value of the specialty attribute of the bookstore element at the root of the XML document.

25 Additional examples author[lastname = 'Bob'] Find all author elements whose lastname subelement is Bob. author[. = 'Bob'] Find all author elements whose value is Bob.
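Both filters have close counterparts in the limited XPath subset of Python's xml.etree.ElementTree (the [.='text'] form needs Python 3.7 or later). The sample document is hypothetical, and this is ElementTree's XPath subset rather than an XQL engine.

```python
import xml.etree.ElementTree as ET

# Hypothetical data: one author with subelements, one with a plain value.
doc = ET.fromstring(
    "<book>"
    "<author><firstname>Jim</firstname><lastname>Bob</lastname></author>"
    "<author>Bob</author>"
    "<author><lastname>Alice</lastname></author>"
    "</book>"
)

# author[lastname = 'Bob']: authors that have a lastname child equal to Bob.
by_child = doc.findall("author[lastname='Bob']")
# author[. = 'Bob']: authors whose own complete text content is Bob.
by_value = doc.findall("author[.='Bob']")
```

The first author matches only the child filter, because its complete text content is JimBob rather than Bob; the second author matches only the value filter.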

26 Regular path expressions in XQL bookstore//title Find all title elements, one or more levels deep in the bookstore. bookstore/*/title Find all title elements that are grandchildren of bookstore elements.
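The same two patterns can be tried with ElementTree's XPath subset in Python, starting from the bookstore root element. The sample document is made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical bookstore with titles at three different depths.
doc = ET.fromstring(
    "<bookstore>"
    "<title>Store catalog</title>"
    "<book><title>Database Systems</title></book>"
    "<magazine><issue><title>Spring</title></issue></magazine>"
    "</bookstore>"
)

# bookstore//title: title elements at any depth below bookstore.
all_titles = [t.text for t in doc.findall(".//title")]
# bookstore/*/title: title elements exactly two levels below bookstore.
grandchild_titles = [t.text for t in doc.findall("*/title")]
```

The descendant pattern finds all three titles, while the wildcard pattern finds only the one that is a grandchild of bookstore.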

27 Indices in XQL <x> <y>Text1</y> <y>Text2</y> </x> <x> <y>Text3</y> <y>Text4</y> </x> x/y[0] returns Text1, Text3. (x/y)[3] returns Text4. x[1]/y[0] returns Text3.
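The difference between indexing within each parent and indexing the whole result can be reproduced in Python. The sketch assumes two x elements holding Text1, Text2 and Text3, Text4 respectively, which is the only layout consistent with the three results; the root wrapper is hypothetical. Python list indexing is used because XQL indices are zero-based here, whereas ElementTree's own XPath positions start at 1.

```python
import xml.etree.ElementTree as ET

# Assumed layout: two x elements with two y children each.
doc = ET.fromstring(
    "<root>"
    "<x><y>Text1</y><y>Text2</y></x>"
    "<x><y>Text3</y><y>Text4</y></x>"
    "</root>"
)
xs = doc.findall("x")

# x/y[0]: the first y inside each x.
first_y_of_each_x = [x.findall("y")[0].text for x in xs]
# (x/y)[3]: the fourth y among all y children of all x elements.
fourth_y_overall = doc.findall("x/y")[3].text
# x[1]/y[0]: the first y of the second x.
first_y_of_second_x = xs[1].findall("y")[0].text
```

The parentheses in (x/y)[3] matter: without them the index applies to y within each x, and no x has a fourth y.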

28 XML-QL vs. XQL XQL can easily be embedded into programs, scripts, and XML and HTML tags. XQL assumes the user understands an XML document as a graph. XML-QL supports the construction of new, complex XML documents. XML-QL provides XML-like patterns.