Challenges in XML It’s good… but is it good enough? Siddhesh Bhobe Persistent eBusiness Solutions.

Slides:



Advertisements
Similar presentations
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Advertisements

Native XML Database or RDBMS. Data or Document orientation If you are primarily storing documents, then a Native XML Database may be the best option.
Introduction to XSLT. What is XML? Design Goals of XML XML Format XML Declaration ElementsAttributes.
System Design and Memory Limits. Problem  If you were integrating a feed of end of day stock price information (open, high, low, and closing price) for.
Information Retrieval in Practice
XML: Extensible Markup Language BA Presented by: Tara Barnes, Lauren Jones, Ryan Kline.
XML, XSL, XSLT, XHTML and others By Sean Hunter. Why XML?  XML was created to be a quick and easy way to provide structured data over the web.  Existing.
1 Database Research at the UW  Faculty: Alon Halevy and Dan Suciu. A dozen Ph.D students  Related faculty: Oren Etzioni, Pedro Domingos, Dan Weld and.
1 XML - A Premier B2B Enabling Technology : Hype or Reality? Leslie Beckford DPS Spring 2001.
Tutorial 11 Creating XML Document
XML Primer. 2 History: SGML vs. HTML vs. XML SGML (1960) XML(1996) HTML(1990) XHTML(2000)
System Integration (Cont.) Week 7 – Lecture 2. Approaches Information transfer –Interface –Database replication –Data federation Business process integration.
XML Vikki Brandon Jesse Josh. Quick Facts Who created it? –W3C Jon Bosak and Tim Bray as leaders and major contributors When? –Started in 1996 and finished.
5-1 Facilitating Business over the Internet: The XML language CR (2004) Prentice Hall, Inc. The xml goals The main objects of xml: Diagrams: Blocks and.
Efficient XML Interchange. XML Why is XML good? A widely accepted standard for data representation Fairly simple format Flexible It’s not used by everyone,
Overview of Search Engines
CS 415 N-Tier Application Development By Umair Ashraf July 16,2013 National University of Computer and Emerging Sciences Lecture # 12 HTML/ XHTML/ HTML5.
WSRP Description and Transport Issues SC Andre Kramer, Citrix Systems Inc. 8 th WSRP F2F, Gluecode Software, LA 1.
ITR3 lecture 2: XML Thomas Krichel Structure URIs (we will come back to them in lecture 3) XML Sofix xml example.
AMS confidential & proprietary International Business and Technology Consultants 1 XML as a Document Exchange Format Tom Loukas AMS Center for Advanced.
XML and XSL Institutional Web Management 2001: Organising Chaos.
Worshipping at the Shrine: Myths and Legends from comp.text.xml Kerry “the heretic” Raymond, CiTR.
Chapter 10: XML.
XML and friends Part 1 - XML and DTD ELAG 2001 workshop 8 Jan Erik Kofoed © BIBSYS Library Automation.
XML The Overview. Three Key Questions What is XML? What Problems does it solve? Where and how is it used?
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
XML Overview. Chapter 8 © 2011 Pearson Education 2 Extensible Markup Language (XML) A text-based markup language (like HTML) A text-based markup language.
XML and Web Services November 21, 2005 Leo Putra Mardjuki Christopher William Lee Corey Fung Chan.
Session 21-2 Session 11 Common Origination and Disbursement (COD) & Commonline: Dispel the Myths.
A In-Memory Compressed XML Representation of Astronomical Data PPARC UK e-Science Postgraduate School ’05 O’Neil Delpratt – PhD Student University of Leicester.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
2005/08/31ACS WG - Data transport with SOAP message 1 Data transport with SOAP message There is a W3C note proposing the SOAP with attachment: –
Tutorial 1: XML Creating an XML Document. 2 Introducing XML XML stands for Extensible Markup Language. A markup language specifies the structure and content.
Introduction to HTML Tutorial 1 eXtensible Markup Language (XML)
XML 2nd EDITION Tutorial 1 Creating An Xml Document.
CP3024 Lecture 9 XML: Extensible Markup Language.
2005 Epocrates, Inc. All rights reserved. Integrating XML with legacy relational data for publishing on handheld devices David A. Lee Senior member of.
DB Zip Expert Portable database backup and export/import Copyright © SoftTree Technologies, Inc.
WEB APPLICATION DEVELOPMENT For More visit:
XML The “E-Lance Economy” or “Digital Economy” is a new challenge for interacting over networks. XML was developed by the World Wide Web Consortium (W3C)
XP 1 Creating an XML Document Developing an XML Document for the Jazz Warehouse XML Tutorial.
A subject line is the topic of the so the recipient of the has a preview of what the is about. It will be no longer than a couple.
XML & varieties, e.g. VoiceXML By: Shawn Ramdass, Saji Abraham & Billy Santamorena.
XML Design Goals 1.XML must be easily usable over the Internet 2.XML must support a wide variety of applications 3.XML must be compatible with SGML 4.It.
1 Tutorial 11 Creating an XML Document Developing a Document for a Cooking Web Site.
XML Engr. Faisal ur Rehman CE-105T Spring Definition XML-EXTENSIBLE MARKUP LANGUAGE: provides a format for describing data. Facilitates the Precise.
Simple Object Access Protocol. Web Services: SOAP2 Why Simple Object Access Protocol Light weight replacement for complicated distributed object technology.
Efficient XML Interchange High Performance XML Don McGregor (mcgredo (at) nps.edu) Don Brutzman (brutzman (at) nps.edu)
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
Introduction to Markup Languages January 31, 2002.
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 1COMP9321, 15s2, Week.
2005/08/25ACS WG - Data transport with SOAP message 1 Data transport with SOAP message There is a W3C note proposing the SOAP with attachment: –
DAP4, SOAP and THREDDS James Gallagher OPeNDAP Presented by Steve Hankin.
Games: XML Presented by: Idham bin Mat Desa Mohd Sharizal bin Hamzah Mohd Radzuan bin Mohd Shaari Shukor bin Nordin.
Copyright © 2002 Pearson Education, Inc. Slide 3-1 Internet II A consortium of more than 180 universities, government agencies, and private businesses.
XML Databases – do they really exist? Jan Erik Kofoed BIBSYS Library Automation ELAG 2005 at CERN, Geneva.
XML The Overview. Three Key Questions What is XML? What Problems does it solve? Where and how is it used?
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
XP 1Creating Web Pages with XML Tutorial 1 New Perspectives on XML Tutorial 1 – Creating an XML Document.
XML Extensible Markup Language
Efficient XML Aaron Braeckel Briefing to FAA Program Management 28 Dec 2008 National Center for Atmospheric Research Boulder, CO Copyright ©
Information Retrieval in Practice
Session I - Introduction
Session I - Introduction
Creating an XML Document
XML Data Introduction, Well-formed XML.
XML.
Siddhesh Bhobe Persistent eBusiness Solutions
A SEMINAR SYNOPSIS ON XML ENCRYPTION
Presentation transcript:

Challenges in XML It’s good… but is it good enough? Siddhesh Bhobe Persistent eBusiness Solutions

XML goals 1. XML shall be straightforwardly usable over the Internet. 2. XML shall support a wide variety of applications. 3. XML shall be compatible with SGML. 4. It shall be easy to write programs which process XML documents. 5. The number of optional features in XML is to be kept to the absolute minimum, ideally zero.

XML goals (Cont.) 6. XML documents should be human-legible and reasonably clear. 7. The XML design should be prepared quickly. 8. The design of XML shall be formal and concise. 9. XML documents shall be easy to create. 10. Terseness in XML markup is of minimal importance.

XML has been successful! XML is emerging as a standard for Document Exchange Significant momentum in the market place and industry consortia –Commerce One –Rosetta Net –Biztalk –PeBS

But… is it good enough?

XML is verbose Column1|Column2|………. ColumnN| becomes value1 value2 : valueN

So? More storage space More network transmission time Data exchange on the net will be very expensive!

Anything positive? Lends itself very well to compression in case of structured data (like web logs) Migrated non-XML data gives better compression than the original data. (Xmill, paper at SIGMOD 2000)

Storing XML is difficult! Store as text, but… Impossible to query… no indexing possible! Additional cost of creating blocks… limit on size of text that can be stored in databases Can be updated only by replacing the entire XML document!

Storing XML (Cont.) Store in database tables, but… XML to relational data conversion is very expensive! Current set of tools handle only regular XML document structures (Ex. XML-DBMS)

XML in Oracle 8i Store XML document as a single, intact object with its tags in a CLOB or BLOB Store the XML document as data and distribute it untagged across object- relational tables Combine XML documents and data using views

Processing XML is costly! XML needs to be parsed… and that is not efficient! Tools available today are not easy to use. Need better ones. Text processing is always a performance hit Do NOT use XML for passing parameters!

No data types in XML No data type support… all XML data is text Limited options for binary data XML Schema Part 2: Datatypes ( W3C Working Draft, 22 September 2000) proposes facilities for defining datatypes in XML

Encoding Binary Data The PIXELS element would contain the binary data encoded in some notation like Base64.

Multipart/related MIME type (RFC 2112) MIME is used for messages that are not just ASCII text, but include different "types" of information The multipart/related MIME type was developed to represent compound documents. Individual parts represent individual streams in the compound document.

Example… Content-Type: multipart/related --xxxxxxxxxx Content-Type: application/binary Content-Transfer-Encoding: Little-Endian Content-ID: Pixels Content-Length: …encoded binary data here... --xxxxxxxxxx

Conclusion XML is great as a data exchange format, but…. Need compression Need better storage techniques Need fast and easy to use parsers Need data type support