Enumerating XML Data for Dynamic Updating L.Kit and V.Ng, Hong Kong Polytechnique University Sang-Ho Nah Lily Daniel Yun Hee Lee.

Slides:



Advertisements
Similar presentations
Computing Structural Similarity of Source XML Schemas against Domain XML Schema Jianxin Li 1 Chengfei Liu 1 Jeffrey Xu Yu 2 Jixue Liu 3 Guoren Wang 4 Chi.
Advertisements

Splay Tree Algorithm Mingda Zhao CSC 252 Algorithms Smith College Fall, 2000.
 Definition of B+ tree  How to create B+ tree  How to search for record  How to delete and insert a data.
DIMACS Streaming Data Working Group II On the Optimality of the Holistic Twig Join Algorithm Speaker: Byron Choi (Upenn) Joint Work with Susan Davidson.
Structural Joins: A Primitive for Efficient XML Query Pattern Matching Al Khalifa et al., ICDE 2002.
B+-Trees (PART 1) What is a B+ tree? Why B+ trees? Searching a B+ tree
1 CS 561 Presentation: Indexing and Querying XML Data for Regular Path Expressions A Paper by Quanzhong Li and Bongki Moon Presented by Ming Li.
QUANZHONG LI BONGKI MOON Indexing & Querying XML Data for../Regular Path Expressions/* SUNDAR SUPRIYA.
CS 171: Introduction to Computer Science II
1 Problem Solving Abstraction Oftentimes, different real-world problems can be modeled using the same underlying idea Examples: Runtime storage, Undo operation.
Deterministic Wavelet Thresholding for Maximum-Error Metrics Minos Garofalakis Bell Laboratories Lucent Technologies 600 Mountain Avenue Murray Hill, NJ.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Accelerating Inferencing. Assertion Efficient inferencing using taxonomies require fast computation of subsumption, disjointness, least common ancestors,
Chapter Physical Database Design Methodology Software & Hardware Mapping Logical Design to DBMS Physical Implementation Security Implementation Monitoring.
Insert A tree starts with the dummy node D D 200 D 7 Insert D
1 Indexing and Querying XML Data for Regular Path Expressions A Paper by Quanzhong Li and Bongki Moon Presented by Amnon Shochot.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part B Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Storing and Querying Ordered XML Using a Relational Database System By Khang Nguyen Based on the paper of Igor Tatarinov and Statis Viglas.
Week 10: Heap and Priority queue. Any feature here?
1 Chapter 8 Priority Queues. 2 Implementations Heaps Priority queues and heaps Vector based implementation of heaps Skew heaps Outline.
1 CS 728 Advanced Database Systems Chapter 17 Database File Indexing Techniques, B- Trees, and B + -Trees.
Binary Trees Chapter 6.
R-Trees: A Dynamic Index Structure for Spatial Data Antonin Guttman.
COSC2007 Data Structures II
AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.
XML as a Boxwood Data Structure Feng Zhou, John MacCormick, Lidong Zhou, Nick Murphy, Chandu Thekkath 8/20/04.
A Summary of XISS and Index Fabric Ho Wai Shing. Contents Definition of Terms XISS (Li and Moon, VLDB2001) Numbering Scheme Indices Stored Join Algorithms.
Chapter 6 Binary Trees. 6.1 Trees, Binary Trees, and Binary Search Trees Linked lists usually are more flexible than arrays, but it is difficult to use.
R ++ -tree: an efficient spatial access method for highly redundant point data Martin Šumák, Peter Gurský University of P. J. Šafárik in Košice.
Graph Indexing: A Frequent Structure- based Approach Alicia Cosenza November 26 th, 2007.
Expert Systems with Applications 34 (2008) 459–468 Multi-level fuzzy mining with multiple minimum supports Yeong-Chyi Lee, Tzung-Pei Hong, Tien-Chin Wang.
An optimal dynamic spanner for points residing in doubling metric spaces Lee-Ad Gottlieb NYU Weizmann Liam Roditty Weizmann.
File System Implementation
Storing XML Data in Relational Databases Shaghayegh Sahebi Nesa Asoudeh.
QED: A Novel Quaternary Encoding to Completely Avoid Re-labeling in XML Updates Changqing Li,Tok Wang Ling.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
Graph Data Management Lab, School of Computer Science Branch Code: A Labeling Scheme for Efficient Query Answering on Tree
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
Efficient Processing of Updates in Dynamic XML Data Changqing Li, Tok Wang Ling, Min Hu.
1 Review of report "LSDX: A New Labeling Scheme for Dynamically Updating XML Data"
Reuse or Never Reuse the Deleted Labels in XML Query Processing Based on Labeling Schemes Changqing Li, Tok Wang Ling, Min Hu.
Data Structure and Algorithms
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
Grid Files Multi-dimensional Index Structures. Jaruloj Chongstitvatana 2006Grid Files 2 Properties of Grid Files  Support multi-dimensional data, but.
Distributed Control and Autonomous Systems Lab. Sang-Hyuk Yun and Hyo-Sung Ahn Distributed Control and Autonomous Systems Laboratory (DCASL ) Department.
Indexing and Querying XML Data for Regular Path Expressions Quanzhong Li and Bongki Moon Dept. of Computer Science University of Arizona VLDB 2001.
An Improved Prefix Labeling Scheme: A Binary String Approach for Dynamic Ordered XML Changqing LiTok Wang Ling Department of Computer Science School of.
Subject Four Graphs Data Structures. What is a graph? A data structure that consists of a set of nodes (vertices) and a set of edges that relate the nodes.
Scalable Multi-match Packet Classification Using TCAM and SRAM Author: Yu-Chieh Cheng, Pi-Chung Wang Publisher: IEEE Transactions on Computers (2015) Presenter:
Spatial Data Management
Mehdi Kargar Department of Computer Science and Engineering
Partially Ordered Data ,Heap,Binary Heap
Balanced Search Trees 2-3 Trees AVL Trees Red-Black Trees
ADT description Implementations
Data Structures: Disjoint Sets, Segment Trees, Fenwick Trees
Indexing Structures for Files and Physical Database Design
CS522 Advanced database Systems
Multiway Search Trees Data may not fit into main memory
CS 728 Advanced Database Systems Chapter 18
RE-Tree: An Efficient Index Structure for Regular Expressions
B+ Tree.
Heap Chapter 9 Objectives Upon completion you will be able to:
Trees and Binary Trees.
Overview: File system implementation (cont)
Multiway Trees Searching and B-Trees Advanced Tree Structures
Skyline query with R*-Tree: Branch and Bound Skyline (BBS) Algorithm
MTree An implementation and An example with m=3
Dynamic rectangular intersection with priorities
Structural Joins: A Primitive for Efficient XML Query Pattern Matching
Dynamic rectangular intersection with priorities
Presentation transcript:

Enumerating XML Data for Dynamic Updating L.Kit and V.Ng, Hong Kong Polytechnique University Sang-Ho Nah Lily Daniel Yun Hee Lee

Introduction n-Inode: new model-mapping approach Multidimensional node ID for indexing and node-to-node relationship calculation Supports dynamic updating of XML data flexibly

n-INode XML document represented as n c -ary complete tree, where n c =maximum number of child node per node Multidimensional node ID: k-dimensional ID:(id 1, io 1, id 2, io 2, …, id k, io k ) id x : Node identifier assigned by numbering scheme io x : Insertion order, sequential number starting from 0 Presence of io x allows more than n c child nodes to be inserted. No re-calculation of existing nodes’ id required

n-INode cont Insertion Rules  If newly inserted node’s id 1 exists in the tree, its io 1 is incremented from maximum io 1 among existing nodes with the same id 1  If new node is inserted to the “right most position”, and maximum io 1 (of all the nodes with the same id 1 ) is less than n c, then io 1 = n c +1  A new dimension is introduced to all descendants of a node that has io 1 > 0. Parent’s first dimension is assigned to the child’s first dimension.

n-INode cont Parent-Child relationship:  Pair of nodes with the same number of dimensions  Pair of nodes with dimensional difference of one  Parent and Child MUST share the identical first dimension Ancestor-Descendant relationship:  Above 2 situations  Pair of nodes with dimensional difference of more than one

Implementation & Experiment Required storage space is not the smallest of all the models tested Other test results show that this is a reasonable trade-off Query time is reasonable and consistent – shows it does not depend heavily on the type of query

Possible flaws in n-INode Node relationship calculation/verification rule excludes a case where both nodes in the pair have 1-dimensional ID (first dimensions cannot be the same) Path sequence of each node changes by allowing more than n c child nodes to be inserted – therefore path sequence should not be used in node identifier calculation

Conclusion Identifying the insertion order removes restriction on the number of child nodes to be inserted Re-calculation of existing nodes’ ID is not required This allows for more effective and efficient node locating operation, supporting dynamic updates of XML data. However, some aspects were overlooked and this makes the proof of correctness presented in the paper somewhat deficient.