XML – what is it? eXtensible Markup Language Standard for publishing and interchange on the web and over the wire simpler version of SGML adapted to internet.

Slides:



Advertisements
Similar presentations
XML: Extensible Markup Language
Advertisements

XML/EDI Overview West Chester Electronic Commerce Resource Center (ECRC)
Database Management Systems, R. Ramakrishnan1 Introduction to Semistructured Data and XML Chapter 27, Part D Based on slides by Dan Suciu University of.
XML Schemas Microsoft XML Schemas W3C XML Schemas.
1 Lecture 10 XML Wednesday, October 18, XML Outline XML (4.6, 4.7) –Syntax –Semistructured data –DTDs.
1 COS 425: Database and Information Management Systems XML and information exchange.
1 Statistics XML: –Altavista: 800,000 pages returned. –Amazon.com: 242 books. In comparison: –God: 12,000 books, 7 Million pages –Bible: 32,000 books,
1 Introduction to XML Yanlei Diao UMass Amherst April 19, 2007 Slides Courtesy of Ramakrishnan & Gehrke, Dan Suciu, Zack Ives and Gerome Miklau.
XML Introduction What is XML –XML is the eXtensible Markup Language –Became a W3C Recommendation in 1998 –Tag-based syntax, like HTML –You get to make.
XML and Databases 198:541. XML Motivation  Huge amounts of unstructured data on the web: HTML documents  No structure information  Only format instructions.
XML(EXtensible Markup Language). XML XML stands for EXtensible Markup Language. XML is a markup language much like HTML. XML was designed to describe.
Introducing XHTML: Module B: HTML to XHTML. Goals Understand how XHTML evolved as a language for Web delivery Understand the importance of DTDs Understand.
Jennifer Widom XML Data DTDs, IDs & IDREFs. Jennifer Widom DTDs, IDs & IDREFs “Well-Formed” XML Adheres to basic structural requirements Single root element.
Introduction to XML This material is based heavily on the tutorial by the same name at
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
4/20/2017.
XML, distributed databases, and OLAP/warehousing The semantic web and a lot more.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 XML Taken from Chapter 7.
Introduction to XML cs3505. References –I got most of this presentation from this site –O’reilly tutorials.
Why XML ? Problems with HTML HTML design - HTML is intended for presentation of information as Web pages. - HTML contains a fixed set of markup tags. This.
XML eXtensible Markup Language by Darrell Payne. Experience Logicon / Sterling Federal C, C++, JavaScript/Jscript, Shell Script, Perl XML Training XML.
Lecture 6 of Advanced Databases XML Schema, Querying & Transformation Instructor: Mr.Ahmed Al Astal.
XML By Dr.S.Sridhar, Ph.D.(JNUD), RACI(Paris, NICE), RMR(USA), RZFM(Germany) DIRECTOR ARUNAI ENGINEERING COLLEGE TIRUVANNAMALAI.
Chapter 10: XML.
1 XML at a neighborhood university near you Innovation 2005 September 16, 2005 Kwok-Bun Yue University of Houston-Clear Lake.
XML eXtensible Markup Language w3c standard Why? Store and transport data Easy data exchange Create more languages WSDL (Web Service Description Language)
1 © Netskills Quality Internet Training, University of Newcastle Introducing XML © Netskills, Quality Internet Training University.
XML BIS4430 – unit 10. XML Origins Extensible Markup Language (XML) 1998 Inspired by Standard Generalized Markup Language (SGML) and HTML. SGML defines.
XML 1 Enterprise Applications CE00465-M XML. 2 Enterprise Applications CE00465-M XML Overview Extensible Mark-up Language (XML) is a meta-language that.
Cornell CS 502 More XML XHTML, namespaces, DTDs CS 502 – Carl Lagoze – Cornell University.
Intro. to XML & XML DB Bun Yue Professor, CS/CIS UHCL.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Intro to XML Originally Presented by Clifford Lemoine Modified by Box.
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
XML – An Introduction Structured Data Mark-up James McCartney CSCE 590, Cluster and Grid Computing.
Softsmith Infotech XML. Softsmith Infotech XML EXtensible Markup Language XML is a markup language much like HTML Designed to carry data, not to display.
Of 33 lecture 3: xml and xml schema. of 33 XML, RDF, RDF Schema overview XML – simple introduction and XML Schema RDF – basics, language RDF Schema –
XML Overview / Schema / DOM Brent P. Christie Major USMC.
1 What Is XML? eXtensible Markup Language for data –Standard for publishing and interchange –“Cleaner” SGML for the Internet Applications: –Data exchange.
XML Name: Niki Sardjono Class: CS 157A Instructor : Prof. S. M. Lee.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
1 XML eXtensible Markup Language. 2 XML vs. HTML HTML is a HyperText Markup language HTML is a HyperText Markup language Designed for a specific application,
Lecture 20 XML. 2 Objectives What semistructured data is. Concepts of the Object Exchange Model (OEM), a model for semistructured data. Basics of Lore,
1 Introduction to Semistructured Data and XML. 2 How the Web is Today  HTML documents often generated by applications consumed by humans only easy access:
XML – A Quick Introduction Kerry Raymond (stolen from others)
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
Management of XML and Semistructured Data Lecture 10: Schemas Monday, April 30, 2001.
The Semistructured-Data Model Programming Languages for XML Spring 2011 Instructor: Hassan Khosravi.
1 Indexing The syntax for creating a index is: CREATE [UNIQUE] INDEX index_name ON table_name (column1, column2,... column_n) [ COMPUTE STATISTICS ]; Why.
XML eXtensible Markup Language. XML A method of defining a format for exchanging documents and data. –Allows one to define a dialect of XML –A library.
Internet & World Wide Web How to Program, 5/e. © by Pearson Education, Inc. All Rights Reserved.2.
1 “Universal Data-Speak”: The eXtensible Markup Language Zack Ives CSE 590DB, Winter 2000 University of Washington 3 January 2000.
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 1COMP9321, 15s2, Week.
+ 1 XML eXtensible Markup Language. + 2 XML Lecture Adapted from the work of Dr. Praveen Madiraju of Marquette University.
When we create.rtf document apart from saving the actual info the tool saves additional info like start of a paragraph, bold, size of the font.. Etc. This.
Martin Kruliš by Martin Kruliš (v1.1)1.
C Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Introduction to XML Standards.
CIS 228 The Internet 9/20/11 XHTML 1.0. “Quirks” Mode Today, all browsers support standards Compliant pages are displayed similarly There are multiple.
 XML derives its strength from a variety of supporting technologies.  Structure and data types: When using XML to exchange data among clients, partners,
SEMI-STRUCTURED DATA (XML) 1. SEMI-STRUCTURED DATA ER, Relational, ODL data models are all based on schema Structure of data is rigid and known is advance.
XML – Basic Concepts (modified version from Dr. Praveen Madiraju) 2015, Fall Pusan National University Ki-Joune Li.
1 XML eXtensible Markup Language. 2 Introduction and Motivation Dr. Praveen Madiraju Modified from Dr.Sagiv’s slides.
1 Introduction to XML Babak Esfandiari. 2 What is XML? introduced by W3C in 98 Stands for eXtensible Markup Language it is more general than HTML, but.
XML: Extensible Markup Language
XML QUESTIONS AND ANSWERS
XML in Web Technologies
XML Data DTDs, IDs & IDREFs.
Lecture 9: XML Monday, October 17, 2005.
CSE591: Data Mining by H. Liu
Semi-Structured data (XML)
Presentation transcript:

XML – what is it? eXtensible Markup Language Standard for publishing and interchange on the web and over the wire simpler version of SGML adapted to internet what’s it good for: Data exchange between businesses/enterprises common denominator regardless of source data format E-business Publishing of data Storage format for irregular data …

An Example What is the name of this book? Raymond M. Smullyan Penguin Principia Mathematica B. Russel A. Whitehead XML lingo: tags (like brackets) elements (like complex objects  can be nested. may be empty. rooted graph. well-formed = brackets properly nested.

Example revisited What is the name of this book? Raymond M. Smullyan Penguin Principia Mathematica B. Russel A. Whitehead Note different kinds of attributes -- ID, IDREF(S), others. All are just attributes nevertheless. attributes vs. elements

Some important properties widely supported self-descriptive – parsers available flexible model (can control how much structure we put in) data intra-doc and inter-doc references Presentation/publishing separated from representation (XSL) Human readable. No need for proprietary formats. Many, many tools

Some predecessors SGML (document authoring standard) [too heavyweight] HTML (almost no capabilities for representation (as opposed to presentation) EDI (electronic data interchange) – used by companies, banks (pre-XML “standard”). – not human readable.

Some database issues for XML How to model XML? (trees with some funny cross-links). How to query? (XPath, XQuery) How to store? (relational, OO/OR, native) How to process XML data efficiently? (devise new algorithms? tweak old ones on, e.g., RDBMS? RDBMS extenders?) XML – authoring is easy: just publish and not worry about schema. (“schema later!”).  isn’t always good. (why?) what is an appropriate notion of schema?

Document Type Descriptors a first-cut approx. to schema for XML. extended context-free grammar. enforces only structure. leaves typing mostly out. has many limitations. e.g.:... when is an XML document valid w.r.t. a DTD? Exercise: devise an efficient algorithm for validity checking. sequence choice quantifier optional & mandatory obligations

Why DTDs aren’t good enough? Useful for documents, but not enough for data: no support for structure sharing and reuse Object-oriented-like features not supported (recall: ID/IDREF(s) are purely syntactic.) no support for data types Can’t validate your data! no support for keys (exception – ID single attribute key!) & foreign keys IDREFs not typed (what if an IDREF from a book “points” to a nuclear reactor?) DTD does not conform to XML syntax!

XML Schema Highlights XML format?  support for basic data types? (integer, float, string, date, bool, etc.)  support for value-based constraints?  extensibility (e.g., users can define complex types)  OO-like features? e.g., Inheritance (extension or restriction)  keys & foreign keys  are references typed? 

Example XML Schema …

Example XML

(Subset of) useful XML standards Xpath/Xpointer/Xlink*: standard for linking to documents and elements within docs XSL/XSLT*: presentation and transformation RDF: resource description framework (meta- info such as ratings, categorizations, etc.) – plays a pivotal role in semantic web. Namespaces: for resolving name clashes DOM: Document Object Model for manipulating XML documents SAX: Simple API for XML parsing

Tree model for XML Data. bib book rev lang tit auth pubtit auth year... what is the name of this book? raymond m. smullyan penguin principia mathematica b. russel a. whitehead 1950 may 15, date 1.what is the semantics? 2.(when) is order important? 3. what kinds of queries would you like to pose? 4. how do you publish? english

Representing relations EmpPhone John Mary Mike employees tuple e p e p e p John Mary Mike Is order of tuples or of attributes important? (how) can the various relational integrity cons- traints (ICs) be captured?

Relations vs. XML. XML – need not be flat. schema/data distinction blurred. schema may be as large as data and indeed comes with data. typing not strict. missing and repeating elements – e.g., multiple authors, missing pub/year. These distinctions raise challenges when we try to store XML in relations and query it. (forward pointer.) XML QLs next class.