XML (with a bias towards query language issues) A boring research topic? A new frontier? A means to keep standards people busy? Prepared by S. Abiteboul.

Slides:



Advertisements
Similar presentations
The Business of Recombinant Data Eric Miller
Advertisements

Key tasks TG Leader: Danica Zendulkova danica.
XML DOCUMENTS AND DATABASES
W3C XML Schema: what you might not know (and might or might not like!) Noah Mendelsohn Distinguished Engineer IBM Corp. October 10, 2002.
Database Systems Research: Where it is (or should be) Headed? (aka looking for a “perfect” candidate) Laks V.S. Lakshmanan Dept. of Computer Science Univ.
CS 431 The Semester in Elevator Speak Carl Lagoze – Cornell University May 5, 2004.
Database Theory: Back to the Future Victor Vianu UC San Diego / INRIA.
System Design and Memory Limits. Problem  If you were integrating a feed of end of day stock price information (open, high, low, and closing price) for.
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Information Retrieval in Practice
Search Engines and Information Retrieval
Data Management for XML: Research Directions By: Jennifer Widom Stanford University Reviewer: Kristin Streilein.
12. Summary, Trends, Research. © O. Nierstrasz PS — Summary, Trends, Research Roadmap  Summary: —Trends in programming paradigms  Research:...
1 Chapter 2 Database Environment Transparencies © Pearson Education Limited 1995, 2005.
The Data Ring: Community Content Sharing Serge Abiteboul (INRIA) Alkis Polyzotis (UC Santa Cruz)
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
Chapter 1: Data Models and DBMS Architecture Title: What Goes Around Comes Around Authors: M. Stonebraker, J. Hellerstein Pages: 2-40.
12. Summary, Trends, Research. © O. Nierstrasz PS — Summary, Trends, Research Roadmap  Summary: —Trends in programming paradigms  Research:...
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
...Looking back Why use a DBMS? How to design a database? How to query a database? How does a DBMS work?
Overview of Search Engines
Implementing search with free software An introduction to Solr By Mick England.
4/20/2017.
Module 17 Storing XML Data in SQL Server® 2008 R2.
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
IS432: Semi-Structured Data Dr. Azeddine Chikh. 1. Semi Structured Data Object Exchange Model.
Search Engines and Information Retrieval Chapter 1.
1 Distributed Monitoring of Peer-to-Peer Systems By Serge Abiteboul, Bogdan Marinoiu Docflow meeting, Bordeaux.
XML and its applications: 4. Processing XML using PHP.
Database Architecture Introduction to Databases. The Nature of Data Un-structured Semi-structured Structured.
CSC271 Database Systems Lecture # 4.
XML과 Database 홍기형 성신여자대학교 성신여자대학교 홍기형.
1 Adapted from Pearson Prentice Hall Adapted form James A. Senn’s Information Technology, 3 rd Edition Chapter 7 Enterprise Databases and Data Warehouses.
Intro. to XML & XML DB Bun Yue Professor, CS/CIS UHCL.
Master Thesis Defense Jan Fiedler 04/17/98
1 CS 430 Database Theory Winter 2005 Lecture 17: Objects, XML, and DBMSs.
The Data Ring: Community Content Sharing Serge Abiteboul (INRIA) Alkis Polyzotis (UC Santa Cruz)
MULTIMEDIA DATABASES -Define data -Define databases.
2005 Epocrates, Inc. All rights reserved. Integrating XML with legacy relational data for publishing on handheld devices David A. Lee Senior member of.
XML and Database COSC643 Sungchul Hong. Is XML a Database? Yes but only in the strictest sense of the term. It is a collection of data. (some sort) XML.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
ICS (072)Database Systems: An Introduction & Review 1 ICS 424 Advanced Database Systems Dr. Muhammad Shafique.
Chapter 1 Introduction Yonsei University 1 st Semester, 2015 Sanghyun Park.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
XML Engr. Faisal ur Rehman CE-105T Spring Definition XML-EXTENSIBLE MARKUP LANGUAGE: provides a format for describing data. Facilitates the Precise.
Intro: 1 What is a Database? Collection of Dynamic Data –Large Large of yesteryear now fits on a PC (small DBs) Many applications require even more (terabytes,
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
XML and Database.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Assoc. Prof. Dr. Ahmet Turan ÖZCERİT.  The concept of Data, Information and Knowledge  The fundamental terms:  Database and database system  Database.
REST By: Vishwanath Vineet.
Martin Kruliš by Martin Kruliš (v1.1)1.
Semi-structured Data In many applications, data does not have a rigidly and predefined schema: –e.g., structured files, scientific data, XML. Managing.
The Object-Oriented Database System Manifesto Malcolm Atkinson, François Bancilhon, David deWitt, Klaus Dittrich, David Maier, Stanley Zdonik DOOD'89,
Welcome to CPSC 534B: Information Integration Laks V.S. Lakshmanan Rm. 315.
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
Forum Topics OMG Roundtable Dec. 11, Background Forum is part of the ODBMS.ORG Moderated by Dr. Roberto Zicari Started on 10/16/07 (last post 12/8/07)
Welcome: To the fifth learning sequence “ Data Models “ Recap : In the previous learning sequence, we discussed The Database concepts. Present learning:
Postgraduate Module Enterprise Database Systems Technological Educational Institution of Larisa in collaboration with Staffordshire University Larisa
IIS 645 Database Management Systems DDr. Khorsheed Today’s Topics 1. Course Overview 22. Introduction to Database management 33. Components of Database.
Chapter 1 Overview of Databases and Transaction Processing.
Information Retrieval in Practice
The Object-Oriented Database System Manifesto
XML in Web Technologies
Using Postgres/PostGIS for WFSRI
Query Optimization.
XML and its applications: 4. Processing XML using PHP
Presentation transcript:

XML (with a bias towards query language issues) A boring research topic? A new frontier? A means to keep standards people busy? Prepared by S. Abiteboul and J. Widom

XML Rapidly adopted by industry A format for exchange of small/medium pieces of data? –Yes –But when archived, grows to large volumes Also a data model? –Yes, for all kinds of data –From unstructured documents to collections of structured data –Warning: this is not the relational world! Permissive typing, full-text search, … The database community should be very involved, perhaps very concerned

Some XML Issues Storage of XML –Native vs. XML-relational Lesson from OODB: it’s not only a technical issue but a business one Situation is different: more $ involved –Efficient representation, compression –Key issue: interface DOM, SAX, query language, DB-like API, … Revisiting old topics –Database design –Integrity constraints –Concurrency control –Access control –Etc. All topics are under active investigation, sometimes reinventing the wheel

Universal Query Language for XML Problems with XQuery –Focus on complex queries; really need simple filters + IR-style search –Too complex, too ambitious, lack of underlying paradigm, too much politics! –Too broad spectrum of applications? From documents to data Undermining XQuery with something better? –Personal viewpoint: Small core OQL-like + functional plug-ins –Too late? We need a standard now! What about updates, standing queries, deltas, constraints, …? This direction has been mostly deactivated by XQuery Scientific: Is Xquery good or bad from a scientific viewpoint? Politics: Should we push for XQuery?

Back to Basics: Query Optimization For subsets of the language The tree structure is definitely a new ball game –New index structures –New cost models –New everything Depends on storage –Relational, native, others Revisit old problems –Distributed query processing –View maintenance All topics are under active investigation (but more effort on distribution wouldn’t hurt)

Back to Basics: Foundations Lots of work on semistructured data –First-order logic and relational languages: strong –OQL/functional languages: reasonable –Full-text search: messy Significant issue: typing –Much more complex than in relational world –Not settled (XML Schema, tree automata, …) –Query type-checking, type inferencing, update consistency Very active area –People from database theory, functional programming, automata theory, … All topics already active, not simple, require more work

The Real Frontier (The World is Changing) OLD data management Closed world Client/server Distributed databases Query/answer Active databases QBE interfaces NEW data management Openness P2P applications Web-scale data Subscription queries Queries over streams ADB + Web services New interfaces Research should focus on the new issues rather than on traditional processing of single-site XML data Beyond XML: semantic Web

Discussion