Conceptual Modeling for XML Data

Slides:



Advertisements
Similar presentations
Ch5: ER Diagrams - Part 1 Much of the material presented in these slides was developed by Dr. Ramon Lawrence at the University of Iowa.
Advertisements

Data Modeling and Relational Database Design ISYS 650.
An Introduction to XML Schema CSCI 7818 by Ming Rutar.
TU/e eindhoven university of technology / faculty of mathematics and informatics Exporting Databases in XML DTD A Conceptual and Generic Approach Philippe.
The Entity-Relationship (ER) Model CS541 Computer Science Department Rutgers University.
Semi-structured Data. Facts about the Web Growing fast Popular Semi-structured data –Data is presented for ‘human’-processing –Data is often ‘self-describing’
Modeling Your Data Chapter 2. Part II Discussion of the Model: Good Design/ Bad Design?
Class Number – CS 304 Class Name - DBMS Instructor – Sanjay Madria Instructor – Sanjay Madria Lesson Title – ER Model.
LOGICAL DATABASE DESIGN
Database Design & ER Diagrams
Powerpoint 2006 PRESENTATION The University of Auckland New Zealand Marsden Fund A PVS Approach to Verifying ORA-SS Data Models Scott Uk-Jin Lee 1, Gillian.
1 The ORA-SS Approach for Designing Semistructured Databases Xiaoying Wu, Tok Wang Ling, Mong Li Lee National University of Singapore Gillian Dobbie University.
Tok Wang Ling1 Mong Li Lee1 Gillian Dobbie2
Lecture 2: Entity-Relationship Modeling
Dr. Mohamed Osman Hegaz1 Conceptual data base design: The conceptual models: The Entity Relationship Model.
1 Designing Valid XML Views Ya Bing Chen, Tok Wang Ling, Mong Li Lee Department of Computer Science National University of Singapore.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 XML Taken from Chapter 7.
1 Data Modeling : ER Model Lecture Why We Model  We build models of complex systems because we cannot comprehend any such system in its entirety.
Entity-Relationship Model Ch. 3
1 Maintaining Semantics in the Design of Valid and Reversible SemiStructured Views Yabing Chen, Tok Wang Ling, Mong Li Lee Department of Computer Science.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
Switch off your Mobiles Phones or Change Profile to Silent Mode.
ICS 321 Spring 2011 High Level Database Models Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 2/7/20111Lipyeow.
Relational Data Model Ch. 7.1 – 7.3 John Ortiz Lecture 3Relational Data Model2 Why Study Relational Model?  Most widely used model.  Vendors: IBM,
1 5 Normalization. 2 5 Database Design Give some body of data to be represented in a database, how do we decide on a suitable logical structure for that.
April 9, 2006KDXD 2006, Singapore1 Capturing Semantics in XML Documents Tok Wang Ling Department of Computer Science National University of Singapore.
1 M ATERIALIZED V IEW M AINTENANCE FOR THE X ML D OCUMENTS Yuan Fa, Yabing Chen, Tok Wang Ling, Ting Chen Yuan Fa, Yabing Chen, Tok Wang Ling, Ting Chen.
1 Resolving Schematic Discrepancy in the Integration of Entity-Relationship Schemas Qi He Tok Wang Ling Dept. of Computer Science School of Computing National.
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
Wenyue Du, Mong Li Lee, Tok Wang Ling Department of Computer Science School of Computing National University of Singapore {duwenyue, leeml,
April 20022/CS/3X1 Database Design Design method John Wordsworth Department of Computer Science The University of Reading Room.
CS 480: Database Systems Lecture 26 March 18, 2013.
COP Introduction to Database Structures
Lecture 5 Supplement – ER Model & Mapping to Relational Model
Databases and DBMSs Todd S. Bacastow January
Data Modeling Using the ERD
Entity-Relationship Model
CS4222 Principles of Database System
ER model Ashima Wadhwa.
Faloutsos - Pavlo C. Faloutsos - A. Pavlo Lecture#2: E-R diagrams
University of Milano Bicocca, Italy Carlo Batini
Well-formed and Valid XML Documents
Chapter 7: Entity-Relationship Model
Databases and Database Management Systems Chapter 9
Entity-Relationship Modelling
Outline of the ER Model By S.Saha
Entity-Relationship Modeling
Lecture3: Data Modeling Using the Entity-Relationship Model.
بسم الله الرحمن الرحيم.
Modeling Your Data Chapter 2 cs542
Translation of ER-diagram into Relational Schema
The Entity-Relationship Model
Lec 3: Object-Oriented Data Modeling
The Entity-Relationship Model
Entity Relationship Diagrams
Entity-Relationship Modeling
الفصل الخامس قواعد البيانات Databases
Entity – Relationship Model
Entity-Relationship Modelling
The Entity-Relationship Model
Database Systems Instructor Name: Lecture-9.
Faloutsos - Pavlo C. Faloutsos - A. Pavlo Lecture#2: E-R diagrams
376a. Database Design Dept. of Computer Science Vassar College
Conceptual Data Modeling Using Entities & Relationships
Unit 7 Normalization (表格正規化).
Chapter 7: Entity-Relationship Model
The Entity-Relationship Model
The Entity-Relationship Model
CS4222 Principles of Database System
The Entity-Relationship (ER) Model
Presentation transcript:

Conceptual Modeling for XML Data Tok Wang Ling National University of Singapore DASFAA’2003 Panel Discussion

Outlines Why do we need conceptual modeling? What are the important semantic information to be captured? Uses of conceptual model for some XML research topics

Motivation: Why do we need to have a conceptual model to represent XML Data? <department number =“cs”> <name> computer science</name> <course number = “cs4221”> <name> Database </name> <student number = “1234” > <name> B.Y.Smith</name> <grade> 70</grade> </student> <student number=“1235”> <name> C.U.Brown </name> <grade> 60</grade> </course> </department> <! ELEMENT department (name,course+)> <! ATTLIST department number ID #REQUIRED> <! ELEMENT course (name, student*)> <! ATTLIST course <! ELEMENT student (name, grade?)> <! ATTLIST student number CDATA #REQUIRED> <! ELEMENT name (#PCDATA)> <! ELEMENT grade (#PCDATA)> (b) An XML DTD for (a) (a) XML document

▼♦ department ▼♦ student ▼ ♦ course ♦ name ♦ number ♦ grade Motivation (cont.) department cs student number name computer science &3 &1 course &20 &21 &22 grade 1234 B.Y.Smith 70 1235 C.U. Brown 60 cs4221 Database &26 &7 &4 &2 &5 &6 &23 &24 &25 &27 &28 (B) Dataguide (a) OEM Diagram Figure 1: Sample instance demonstrating OEM and dataguide

Motivation (continue) Q: What are the important semantic information and constraints cannot be captured by the DTD and Dataguide? What are the object classes? department, course, student? Attributes of object classes? Identifiers of object classes? What are the relationship types defined among object classes? e.g. Relationship types among department, course, student? What is “grade”? Object class? Attribute of student? Are there redundancies?

Semantic Information to be captured by an XML conceptual model Object class attributes of object class ordering on object class Relationship Type Represent hierarchical structure degree of n-ary relationship type participation constraints of object classes in relationship type attributes of relationship type disjunctive relationship type recursive relationship type Reference

Semantic Information to be captured by an XML conceptual model (cont.) Attribute key attribute / identifier composite attribute disjunctive attribute attributes with unknown structure fixed and default values of attribute derived attribute Functional dependencies and other constraints Inheritance hierarchy (class hierarchy) Semi-structured data instance representation

Attribute or object class Degree of relationship type Q: What are the semantic information cannot be represented by Dataguide, DTD, XML Schema? Attribute or object class Degree of relationship type Attibute of object class or relationship type Class hierarchy Functional dependency …

Figure 2: ORA-SS instance diagram A solution: ORA-SS, an object-relationship attribute model for semi-structured data. department Number: CS Name: Computer Science course course student student student Number Name Number: CS4221 Name: Database number department course 2, 1:n, 1:1 student name grade cs number: 1234 Name: B.Y. Smith number: 1235 Name: C.U. Brown Grade: 70 Grade: 60 Figure 2: ORA-SS instance diagram Figure 3: ORA-SS schema diagram

The data model of ORA-SS - Relationship Type attributes of relationship type degree of n-ary relationship type participation constraints of objects in relationship type disjunctive relationship type recursive relationship type project id name member job title publication number title 2, +, + 2, *, + p1 p2 p3 m1 m2 pub1 pub2 pub3 project member publication ♦ number ▼♦ publication ▼♦ project ▼♦ member ♦ name ♦ title ♦ job title ♦ id (a) ORA-SS Schema Diagram (b) Instance Relationship (c) Dataguide Figure 5: Representing binary relationship type

The data model of ORA-SS - Relationship Type (cont.) pub1 pub2 pub3 project member publication project id name member job title publication number title 2, +, + 3, *, + (b) Instance Relationship ♦ number ▼♦ publication ▼♦ project ▼♦ member ♦ name ♦ title ♦ job title ♦ id (a) ORA-SS Schema Diagram (c) Dataguide Figure 6: Representing ternary relationship type

The data model of ORA-SS - Attribute key attribute composite attribute disjunctive attribute attribute with unknown structure fixed and default values of attribute derived attribute course student * code title ANY dept prefix D: comp number first name last mark grade cs 2, 4:n, 3:8 hobby Figure 7: Object classes with relationship type and attributes in an ORA-SS schema diagram

Uses of the Conceptual model for XML research Normal form XML schema remove redundant data resolve multiple inheritance conflicts Storage structure for XML databases use Object Relational Model XML Views derived information from references and class hierarchy defining views materialized view maintenance view updates Integration of XML documents Evaluating XML queries on XML databases

Research Topics using ORA-SS Model Normal Form XML Schema Schema may have a lot of redundant data Update anomalies Normal Form schema is needed title author ISBN 2 professor textbook staff# name course C.Code +

Research Topics using ORA-SS Model NF XML Schema (cont.) Some better solutions: Redundancies are removed, in normal form ISBN title author professor textbookR staff# name courseR C.Code course textbook + textbook-Ref course-Ref C.Code title ISBN author professor textbookR name professorR course textbook + textbook-Ref professor-Ref staff#

Research Topics using ORA-SS Model Storage Structure for XML Databases Main Rules Each object class together with its attributes form a nested relation (object relation) Each relationship type together with its attributes form a nested relation (relationship relation) Nested relations can be handled by Object Relational model, e.g. ORACLE 8i.

Research Topics using ORA-SS Model Storage Structure for XML Databases Object Relations Supplier (S#, Name, (City)) Part (P#, Name, color) Project(J#, Name, Loc) Relationship relations SP (S#, P#, price) SPJ (S#, P#, J#, Qty) Constraint: SPJ[S#, P#]  SP[S#, P#] Price SP SPJ 3 Loc City Name J# 2 Supplier Project Part + Color Qty S# P#

Research Topics using ORA-SS Model XML Views What information can be directly derived from references and class hierarchy code faculty grade history fs cs fs, 2,1:n,1:n name title cs, 2,4:n,3:8 course studentR student number * street hostel home | street number name cs title course code student grade | hostel home Referencing an object class in an ORA-SS schema diagram

Research Topics using ORA-SS Model XML Views (cont.) Valid views of an ORA-SS schema Operations: selection, projection, join, up/down Supplier Part Project Project 2 2 2 Part Supplier Part Part 2 2 3 3 2 2 3 price price Project Project Supplier total_qty 2 3 3 3 price Qty Qty Qty View 1 View 2 View 3

Conclusion A good conceptual model is needed for XML database applications: * normal form schema * storage structure * view design and view updates * ….