
Presentation transcript:

1 Statistics XML: –Altavista: 800,000 pages returned. –Amazon.com: 242 books. In comparison: –God: 12,000 books, 7 Million pages –Bible: 32,000 books, 4.6 Million pages. More comparisons: –Alon Levy + XML: 132 pages (770 without Alon) –XML-QL: 509 pages. –Levy + God: 12,000, (Alon Levy + God: 1, but not me). –Levy + Bible: 10,000 (Alon Levy + bible: 3; 1 me).

2 What is XML? –Emerging format for data exchange on the web and between applications. eXtensible Markup Language:

3 Attributes and References XML distinguishes attributes from sub-elements. IDs and IDREFs are used to reference objects.
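To make the ID/IDREF idea concrete, here is a minimal sketch in Python using the standard `xml.etree.ElementTree` module. The document, the element names, and the `id` and `advisor` attribute names are invented for illustration. ElementTree does not read DTDs, so the sketch simply treats `id` as an ID and `advisor` as an IDREF by convention.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: "id" plays the role of an ID attribute,
# "advisor" the role of an IDREF pointing at another element's ID.
DOC = """
<people>
  <person id="p1"><name>Ann</name></person>
  <person id="p2" advisor="p1"><name>Bob</name></person>
</people>
"""

root = ET.fromstring(DOC)

# Index every element that carries an id attribute.
by_id = {el.get("id"): el for el in root.iter() if el.get("id") is not None}


def advisor_name(person):
    """Follow the advisor reference, if any, and return the target's name."""
    ref = person.get("advisor")
    if ref is None:
        return None
    return by_id[ref].findtext("name")


bob = by_id["p2"]
print(bob.findtext("name"), "is advised by", advisor_name(bob))
```

Note that `name` is a sub-element while `id` and `advisor` are attributes, which is the distinction the slide draws.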

4 Document Type Definitions (DTDs) Sort of like a schema but not really. Won’t stay for very long, either. First in a long series of 3-letter acronyms.

5 Origin of XML Comes from SGML (very nasty language). Principle: separate the data from the graphical presentation.

6 XML, After the roots A format for sharing data. Applications: –EDI (electronic data interchange): Transactions between banks. Producers and suppliers sharing product data (auctions). Extranets: building relationships between companies. Scientists sharing data about experiments. –Sharing data between different components of an application. –Format for storing all data in Office. –Basis for data sharing and integration.

7 Why Do People Like it so much? It’s easy to learn. It’s human readable. No need for proprietary formats anymore. It’s very flexible: –Data is self-describing –Can add attributes easily –Data can be irregular Note: without common DTDs, data sharing is not solved!

8 Why are we DB’ers interested? It’s data, stupid. That’s us. Proof by Altavista: –database+XML -- 40,000 pages. Database issues: –How are we going to model XML? (graphs). –How are we going to query XML? (XML-QL) –How are we going to store XML (in a relational database? object-oriented?) –How are we going to process XML efficiently? (uh… well..., um..., ah..., get some good grad students!)

9 3-Letter Acronyms XML, DTD, W3C. DOM (Document Object Model). XML Schema. XQL (very early query language). RDF (Resource Description Framework). Today, in New Jersey, a W3C committee is meeting to discuss a standard query language.

10 XML Data Model (Graph) Issues: distinguish between attributes and sub-elements? Should we conserve order? Think of the labels as names of binary relations.
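One way to picture the graph model is to flatten a document into labeled edges, where each label acts as the name of a binary relation between a node and its child or value. The sketch below is an illustration only. The function name `to_edges`, the `@` prefix for attributes, and the `text` label are choices made here, not part of any standard.

```python
import xml.etree.ElementTree as ET
from itertools import count


def to_edges(xml_text):
    """Return (source, label, target) triples describing an XML document."""
    root = ET.fromstring(xml_text)
    ids = count()
    edges = []

    def visit(elem, node):
        # Attributes become edges to string values; the "@" prefix keeps
        # them distinguishable from sub-elements (one possible answer to
        # the first issue on the slide).
        for name, value in elem.attrib.items():
            edges.append((node, "@" + name, value))
        text = (elem.text or "").strip()
        if text:
            edges.append((node, "text", text))
        # Children are visited in document order, so the position of an
        # edge in the list preserves sibling order.
        for child in elem:
            child_node = next(ids)
            edges.append((node, child.tag, child_node))
            visit(child, child_node)

    root_node = next(ids)
    edges.append((None, root.tag, root_node))
    visit(root, root_node)
    return edges


edges = to_edges('<book year="1995"><title>T</title><author>A</author></book>')
for edge in edges:
    print(edge)
```

Reading the triples as relations, `title(0, 1)` and `author(0, 2)` say that node 0 has a title child and an author child.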

11 Querying XML Requirements: –Query a graph, not a relation. –The result should be a graph (representing an XML document), not a relation. –No schema. –We may not know much about the data, so we need to navigate the XML.

12 Query Languages First, there was XQL (from Microsoft). Very quickly realized that it was very limited. Then, a bunch of database researchers looked at XML and invented XML-QL. –XML-QL comes from the nicer StruQL language. –Many people got excited. Formed a committee.

13 Extracting Data by Query Matching data using element patterns. WHERE Addison-Wesley $t $a IN “ CONSTRUCT $a
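The pattern-matching idea can be imitated in Python as a rough analogue; this is not XML-QL itself. The sketch assumes a bibliography whose elements are named `book`, `publisher`, `name`, `title`, and `author`. Those names and the sample data are assumptions, since the query text here does not show its tags. The variables `$t` and `$a` correspond to the `title` and `author` values bound for each match.

```python
import xml.etree.ElementTree as ET

# Hypothetical bibliography; all element names are assumptions.
BIB = """
<bib>
  <book><publisher><name>Addison-Wesley</name></publisher>
        <title>Book One</title><author>Smith</author><author>Jones</author></book>
  <book><publisher><name>Other Press</name></publisher>
        <title>Book Two</title><author>Lee</author></book>
</bib>
"""


def match_books(root, publisher):
    """Yield one (title, author) binding per matching book/author pair."""
    for book in root.iter("book"):
        if book.findtext("publisher/name") != publisher:
            continue
        title = book.findtext("title")
        for author in book.findall("author"):
            yield title, author.text


root = ET.fromstring(BIB)
bindings = list(match_books(root, "Addison-Wesley"))
authors = [author for _, author in bindings]  # the CONSTRUCT $a part
print(authors)
```

Each binding plays the role of one row of variable assignments; the final list keeps only the author variable, as the CONSTRUCT clause does.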

14 Constructing XML Data WHERE Addison-Wesley $t $a IN “ CONSTRUCT $a $t

15 Grouping with Nested Queries WHERE $t, Addison-Wesley CONTENT_AS $p IN “ CONSTRUCT $t WHERE $a IN $p CONSTRUCT $a
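Grouping can be sketched in Python in the same spirit: the outer loop binds each matching book, and the inner loop plays the role of the nested query over that book's content. The element names and the `results` and `result` wrapper tags are assumptions introduced for this illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical bibliography; all element names are assumptions.
BIB = """
<bib>
  <book><publisher><name>Addison-Wesley</name></publisher>
        <title>Book One</title><author>Smith</author><author>Jones</author></book>
  <book><publisher><name>Other Press</name></publisher>
        <title>Book Two</title><author>Lee</author></book>
</bib>
"""


def group_authors(root, publisher):
    """Build one <result> per matching book, with its authors nested inside."""
    results = ET.Element("results")
    for book in root.iter("book"):  # outer query: bind the book content
        if book.findtext("publisher/name") != publisher:
            continue
        result = ET.SubElement(results, "result")
        ET.SubElement(result, "title").text = book.findtext("title")
        for author in book.findall("author"):  # nested query over that content
            ET.SubElement(result, "author").text = author.text
    return results


grouped = group_authors(ET.fromstring(BIB), "Addison-Wesley")
print(ET.tostring(grouped, encoding="unicode"))
```

The output is itself an XML tree, in line with the requirement that a query result be a graph rather than a relation.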

16 Joining Elements by Value WHERE $f $l ELEMENT_AS $e IN “ $f $l IN “ y > 1995 CONSTRUCT $e Find all articles whose writers also published a book after 1995.
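A value join of this kind can be approximated in Python by comparing author names across two collections. The sketch below is an analogue, not XML-QL. The element names (`article`, `book`, `author`, `firstname`, `lastname`), the `year` attribute, and the sample data are all assumptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical data; element and attribute names are assumptions.
ARTICLES = ET.fromstring("""
<articles>
  <article><title>A1</title>
    <author><firstname>Ann</firstname><lastname>Smith</lastname></author></article>
  <article><title>A2</title>
    <author><firstname>Bob</firstname><lastname>Lee</lastname></author></article>
</articles>
""")
BOOKS = ET.fromstring("""
<books>
  <book year="1998"><author><firstname>Ann</firstname><lastname>Smith</lastname></author></book>
  <book year="1990"><author><firstname>Bob</firstname><lastname>Lee</lastname></author></book>
</books>
""")


def author_key(author):
    """The join value: a (first name, last name) pair."""
    return author.findtext("firstname"), author.findtext("lastname")


# Authors of books published after 1995.
recent_authors = {
    author_key(author)
    for book in BOOKS.iter("book")
    if int(book.get("year")) > 1995
    for author in book.findall("author")
}

# Articles with at least one author in that set (the join by value).
matching = [
    article
    for article in ARTICLES.iter("article")
    if any(author_key(a) in recent_authors for a in article.findall("author"))
]
print([article.findtext("title") for article in matching])
```

Using the same variables `$f` and `$l` in both patterns is what forces the two sides to agree; the set lookup plays that role here.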

17 Tag Variables WHERE $f $l ELEMENT_AS $e IN “ $f $l IN “ y > 1995 CONSTRUCT $e Find all articles whose writers have done something after 1995.

18 Regular Path Expressions WHERE $r Ford IN " CONSTRUCT $r Find all parts whose brand is Ford, no matter what level they are in the hierarchy.
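Matching at any depth can be illustrated with ElementTree's `iter`, which walks all descendants in document order. The catalog below, with its nested `part` elements and `name` and `brand` children, is a made-up example, and the code is a Python analogue rather than XML-QL.

```python
import xml.etree.ElementTree as ET

# Hypothetical parts catalog with parts nested inside parts.
CATALOG = """
<catalog>
  <part><name>engine</name><brand>Ford</brand>
    <part><name>piston</name><brand>Ford</brand></part>
    <part><name>belt</name><brand>Acme</brand></part>
  </part>
  <part><name>wheel</name><brand>Acme</brand></part>
</catalog>
"""

root = ET.fromstring(CATALOG)

# iter("part") visits every <part> regardless of nesting level;
# findtext("brand") looks only at the direct <brand> child of each part.
ford_parts = [
    part.findtext("name")
    for part in root.iter("part")
    if part.findtext("brand") == "Ford"
]
print(ford_parts)
```

The descendant walk stands in for a path expression that allows `part` to repeat any number of times along the path.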

19 Regular Path Expressions WHERE $r IN " CONSTRUCT $r

20 XML Data Integration WHERE ELEMENT_AS $n $ssn IN “ $ssn ELEMENT_AS $I IN “ CONSTRUCT $n $I Query can access more than one XML document.

21 Query Processing For XML Approach 1: store XML in a relational database. Translate an XML-QL query into a set of SQL queries. –Leverage 20 years of research & development. Approach 2: store XML in an object-oriented database system. –OO model is closest to XML, but systems do not perform well and are not well accepted. Approach 3: build an entire DBMS tailored to XML. –Still in the research phase.
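Approach 1 can be sketched with Python's built-in `sqlite3` module. The single `edge` table used here is just one simple storage scheme chosen for illustration. The table layout, the function name `shred`, and the sample document are assumptions, and real systems use more elaborate mappings. A tree pattern over the document then turns into SQL self-joins on that table.

```python
import sqlite3
import xml.etree.ElementTree as ET
from itertools import count


def shred(conn, xml_text):
    """Store each element as one row: (parent id, own id, tag, text value)."""
    conn.execute(
        "CREATE TABLE edge (parent INTEGER, child INTEGER, label TEXT, value TEXT)"
    )
    ids = count(1)

    def visit(elem, parent):
        node = next(ids)
        value = (elem.text or "").strip() or None
        conn.execute(
            "INSERT INTO edge VALUES (?, ?, ?, ?)", (parent, node, elem.tag, value)
        )
        for child in elem:
            visit(child, node)

    visit(ET.fromstring(xml_text), None)


conn = sqlite3.connect(":memory:")
shred(
    conn,
    "<bib>"
    "<book><publisher>Addison-Wesley</publisher><title>Book One</title></book>"
    "<book><publisher>Other Press</publisher><title>Book Two</title></book>"
    "</bib>",
)

# "Titles of books published by X" becomes a three-way self-join.
SQL = """
SELECT t.value
FROM edge AS b
JOIN edge AS p ON p.parent = b.child AND p.label = 'publisher'
JOIN edge AS t ON t.parent = b.child AND t.label = 'title'
WHERE b.label = 'book' AND p.value = ?
"""
titles = [row[0] for row in conn.execute(SQL, ("Addison-Wesley",))]
print(titles)
```

Each extra element in a pattern adds another join, which hints at why efficient processing is listed among the open database issues.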