Data Management for XML: Research Directions By: Jennifer Widom Stanford University Reviewer: Kristin Streilein.

Slides:



Advertisements
Similar presentations
XML DOCUMENTS AND DATABASES
Advertisements

By Daniela Floresu Donald Kossmann
With Microsoft Access 2010© 2011 Pearson Education, Inc. Publishing as Prentice Hall1 PowerPoint Presentation to Accompany GO! with Microsoft ® Access.
Relational Databases for Querying XML Documents: Limitations & Opportunities VLDB`99 Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., DeWitt, D., Naughton,
Information Retrieval in Practice
From Semistructured Data to XML: Migrating The Lore Data Model and Query Language Roy Goldman, Jason McHugh, Jennifer Widom Stanford University
The Last Lecture Agenda –1:40-2:00pm Integrating XML and Search Engines—Niagara way –2:00-2:10pm My concluding remarks (if any) –2:10-2:45pm Interactive.
INFO 624 Week 3 Retrieval System Evaluation
Chapter 9 & 10 Database Planning, Design and Administration.
FACT: A Learning Based Web Query Processing System Hongjun Lu, Yanlei Diao Hong Kong U. of Science & Technology Songting Chen, Zengping Tian Fudan University.
CS155b: E-Commerce Lecture 10: Feb. 13, 2003 XML and its relationship to B2B commerce Acknowledgements: R. Glushko, A. Gregory, and V. Ramachandran.
Summary. Chapter 9 – Triggers Integrity constraints Enforcing IC with different techniques –Keys –Foreign keys –Attribute-based constraints –Schema-based.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
ICS (072)Database Systems Background Review 1 Database Systems Background Review Dr. Muhammad Shafique.
Managing XML and Semistructured Data Lecture 19: Compressing XML Data Prof. Dan Suciu Spring 2001.
Methodology Conceptual Database Design
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Jennifer Widom XML Data XML Schema. Jennifer Widom XML Schema “Valid” XML Adheres to basic structural requirements  Also adheres to content-specific.
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
Overview of Search Engines
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
Module 17 Storing XML Data in SQL Server® 2008 R2.
Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.
Welcome to CPSC 534B: Web Data Integration & Management Laks V.S. Lakshmanan Rm. CICSR Main Mall.
Overview of the Database Development Process
Dept. Computer Science, Korea Univ. Intelligent Information System Lab. XML clustering methods Sohn Jong-Soo Intelligent Information.
Practical RDF Chapter 1. RDF: An Introduction
XML Overview. Chapter 8 © 2011 Pearson Education 2 Extensible Markup Language (XML) A text-based markup language (like HTML) A text-based markup language.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
DP&NM Lab. POSTECH, Korea - 1 -Interaction Translation Methods for XML/SNMP Gateway Interaction Translation Methods for XML/SNMP Gateway Using XML Technologies.
XML과 Database 홍기형 성신여자대학교 성신여자대학교 홍기형.
2. Database System Concepts and Architecture
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
XML (with a bias towards query language issues) A boring research topic? A new frontier? A means to keep standards people busy? Prepared by S. Abiteboul.
XML & Mediators Thitima Sirikangwalkul Wai Sum Mong April 10, 2003.
Using XML for Test Case Definition, Storage and Presentation Michael Ensminger
1 CS 430 Database Theory Winter 2005 Lecture 17: Objects, XML, and DBMSs.
Information System Development Courses Figure: ISD Course Structure.
The CompleteSearch Engine: Interactive, Efficient, and Towards IR&DB Integration Holger Bast, Ingmar Weber CIDR 2007) Conference on Innovative Data Systems.
EASE: An Effective 3-in-1 Keyword Search Method for Unstructured, Semi-structured and Structured Data Cuoliang Li, Beng Chin Ooi, Jianhua Feng, Jianyong.
Ontoprise: B 3 - Semantic B2B Broker whitepaper review Bernhard Schueler CSCI 8350, Spring 2002,UGA.
Marshall Breeding Director for Innovative Technology and Research Vanderbilt University
Introduction to Web Services Eric Lease Morgan University Libraries of Notre Dame June 24, 2005.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Jennifer Widom XML Data Introduction, Well-formed XML.
Intro: 1 What is a Database? Collection of Dynamic Data –Large Large of yesteryear now fits on a PC (small DBs) Many applications require even more (terabytes,
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
Introduction to Databases
XML and Database.
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
Building a Distributed Full-Text Index for the Web by Sergey Melnik, Sriram Raghavan, Beverly Yang and Hector Garcia-Molina from Stanford University Presented.
Jennifer Widom NoSQL Systems Motivation. Jennifer Widom NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Full-Text Support in a Database Semantic File System Kristen LeFevre & Kevin Roundy Computer Sciences 736.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Welcome to CPSC 534B: Information Integration Laks V.S. Lakshmanan Rm. 315.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
SEMI-STRUCTURED DATA (XML) 1. SEMI-STRUCTURED DATA ER, Relational, ODL data models are all based on schema Structure of data is rigid and known is advance.
3 Copyright © 2006, Oracle. All rights reserved. Designing and Developing for Performance.
Information Retrieval in Practice
CS 405G: Introduction to Database Systems
XML Related Technologies
Database Systems Instructor Name: Lecture-3.
Magnet & /facet Zheng Liang
Query Optimization.
Presentation transcript:

Data Management for XML: Research Directions By: Jennifer Widom Stanford University Reviewer: Kristin Streilein

Paper Objectives: whitepaper “thoughts on the research opportunities XML brings to the general area of data management” Not a survey offers “personal opinions and thoughts on Data Management for XML” “written from a true research standpoint”

Important Commerical Perspectives not covered How will XML be used? Data exchange format? Data storage format? With or without DTDs? Application interoperation and data integration will still cause problems Proposed mechanisms for inter-document references and proposed extensions or alternatives to DTDs for richer schema definitions not covered

Current State of Query Processing of Web Information HTML Pages Needs be preprocessed for meaningful queries Simple keyword-based searches Traditional DBMS Simple & rigid forms-based interfaces

Sample XML Research Topics: Ability to map XML-encoded info into a true data model Resolve conflicts from mixing concepts of documents and databases Designing XML databases Theoretical results Practical techniques Relationship between XML DTDs and traditional database schemas

Sample XML Research Topics: Query language(s) Database updates in XML setting Efficient physical layout and indexing mechanisms Query Processing View mechanisms How to scale everything to web proportions

Lore Project at Stanford: Personal Research Agenda Storage and Indexing Clustering schemes New index types Compression DataGuides and DTDs Build validating into XML database system Encode subelement ordering Performance and functionality tradeoffs (DataGuides & DTDs) Combine DataGuides & DTDs Browse database structure Allow updates propagate database

Lore Project at Stanford: Personal Research Agenda Databases and Information Retrieval Keyword search Proximity search Similarity search Other Database Features Views Constraints Triggers Change Management

Lore Project at Stanford: Personal Research Agenda Mixing Semistructured and Structured Data Finding the structure Exploiting the structure XML in/on a Traditional DBMS Performace Evaluation Appropriate benchmark for what XML data should look like Type of queries & mix of queries and updates