Presentation is loading. Please wait.

Presentation is loading. Please wait.

An Ontology-Extended Relational Algebra Piero Bonatti Università di Napoli "Federico II" Yu Deng V.S. Subrahmanian University of Maryland College Park.

Similar presentations


Presentation on theme: "An Ontology-Extended Relational Algebra Piero Bonatti Università di Napoli "Federico II" Yu Deng V.S. Subrahmanian University of Maryland College Park."— Presentation transcript:

1 An Ontology-Extended Relational Algebra Piero Bonatti Università di Napoli "Federico II" Yu Deng V.S. Subrahmanian University of Maryland College Park

2 10/28/2003Ontology Extended Algebra2 Outline Problem statement Approach Motivating example Ontology-extended relational algebra HOME system Contributions Related work Future work

3 10/28/2003Ontology Extended Algebra3 Problem Statement Integrating heterogeneous data sources is an important problem. There are many projects in this area, but at syntactic level. Our goal: Integrate data sources with diverse structures and assumptions at the semantic level. Answer queries correctly under user’s assumptions of semantic meaning about the terms being used.

4 10/28/2003Ontology Extended Algebra4 Approach Associate ontologies to data sources. Ontology interoperation. Extend relational data model and relational algebra.

5 10/28/2003Ontology Extended Algebra5 Motivating Example Two parts relations: Relation Parts1 with the schema (Name, Cost, Shipping) Relation Parts2 with the schema (Item, Price, ShipCost) Two insurance claim relations: Relation Claims1 with the schema (ClaimId, Type, Cost) Relation Claims2 with the schema (ClaimNumber, Type, Value)

6 Parts1 and Parts2 Relations NameCostShipping Tire54.1920.05 Gasket3.051.55 Valve3.351.55 Brake pads78.508.50 Evaporator305.0011.50 ItemPriceShipCost Wheel50.0518.00 Air Gasket3.001.70 Valve3.351.55 Hubcap11.506.00 Spark Plug20.008.50 Parts1 relation Parts2 relation

7 10/28/2003Ontology Extended Algebra7 Problems (1) When users specify a query spanning these two relations, they may wonder: Do the fields Cost and Price mean the same thing? Is wheel a part of tire? Is air gasket a gasket? Furthermore, does the field Cost use the unit US dollar? Does the field Price use the unit Euro? Users may be at a loss to determine these by looking at the fields.

8 Claims1 and Claims2 Relation ClaimIdTypeCost 1burglary2000 2theft150 3mugging860 4arson1800 ClaimNumberTypeValue 1robbery400 2fire550 3auto accident 500 4burglary250 Claims1 relationClaims2 relation

9 10/28/2003Ontology Extended Algebra9 Problem (2) Users may have a query such as “Find all the thefts that involved a cost of over $1000 dollars”. The system should automatically recognize that burglaries, muggings and robberies count as thefts. In addition, conversions between units are needed if costs are represented in different units in above query.

10 10/28/2003Ontology Extended Algebra10 Ontology Extended Relation (OER) We use ontology to convey semantics about terms in a domain and associate ontologies with relations. Intuitively, an Ontology extended relation is an ordinary relation as well as an associated ontology.

11 10/28/2003Ontology Extended Algebra11 Ontology Suppose ∑ is some finite set of strings and S is some set. An ontology w.r.t. ∑ is a partial mapping Θ from ∑ to hierarchies for S. For example, ∑ = {isa, part_of, affects} A hierarchy can be regarded as a Hasse diagram associated with a partial ordering. We provide formal definition in our paper.

12 10/28/2003Ontology Extended Algebra12 Ontology Example theft mugging arson burglary Ontology associated with Claims1 relation (∑ = {isa}) Wheel Valve Air Gasket Hubcap Spark Plug Ontology associated with Parts2 relation (∑ = {part_of})

13 10/28/2003Ontology Extended Algebra13 Ontology Integration Example query: Find all the thefts that involved a cost of over $1000 dollars. Ontology integration is needed to answer this query when performing binary operations between two ontology extended relations. Interoperation constraints are needed to specify the connections between ontologies. We consider: x = y, x ≤ y, x ≠ y, x !≤ y, suppose x and y are from two different hierarchies.

14 10/28/2003Ontology Extended Algebra14 Definition of Hierarchy Integration Suppose (H i, ≤ i ), 1≤i ≤n are n different hierarchies and suppose IC is a finite set of interoperation constraints. A hierarchy (H, ≤) is said to be an integration of (H i, ≤ i ), 1≤i ≤n iff there are n injective mappings φ 1,…,φ n from H 1,…,H n respectively to H such that: (  i  {1,…,n} )x ≤ i y  φ i (x) ≤ φ i (y). (  x  H i )(  y  H j ) (x:i op y:j)  IC  φ i (x) op φ j (y). H1H1 H2H2 HnHn... H φ1φ1 φ2φ2 φnφn

15 Example of Hierarchy Integration theft mugging arson burglary isa hierarchy with Claims1 relation robbery fire auto-accident burglary isa hierarchy with Claims2 relation theft mugging arson burglary fireauto-accident Integrated isa hierarchy for Claims1 and Claims2 IC = {theft:1 = robbery:2, arson:1 ≤ fire:2} With the integrated hierarchy, system can recognize that burglaries, muggings and robberies count as thefts.

16 10/28/2003Ontology Extended Algebra16 Canonical Hierarchy Suppose (H i, ≤ i ), 1≤i ≤n are n different hierarchies and suppose IC is a finite set of interoperation constraints. The canonical hierarchy (H *, ≤ * ) of (H i, ≤ i ), 1≤i ≤n is defined as follows. H * is the set of all strongly connected components of the graph associated with (H i, ≤ i ), 1≤i ≤n. If x *, y *  H *, then x * ≤ * y * iff either x * = y * or there exists a directed path from x:i to y:j (for some x:i  x * and y:j  y * ) in the hierarchy graph associated with (H i, ≤ i ), 1≤i ≤n.

17 10/28/2003Ontology Extended Algebra17 Example of Canonical Hierarchy theft mugging arson burglary isa hierarchy with Claims1 relation robbery fire auto-accident burglary isa hierarchy with Claims2 relation Canonical Hierarchy with Claims1 and Claims2 IC = {theft:1 = robbery:2, arson:1 ≤ fire:2} theft robbery burglary mugging fire arson auto-accident

18 Theorems about Hierarchy Integrability Let (H i, ≤ i ), 1≤i ≤n be a family of hierarchies and suppose (H *, ≤ * ) is its canonical hierarchy. Suppose (H, ≤), φ 1,…,φ n is any arbitrary witness to the integration of (H i, ≤ i ), 1≤i ≤n. Then: [x:i] ≤ * [y:j]  φ i (x) ≤ φ j (y). A set (H i, ≤ i ), 1≤i ≤n of hierarchies is integrable if and only if the canonical witness of (H i, ≤ i ), 1≤i ≤n is a witness to the integrability of (H i, ≤ i ), 1≤i ≤n. This shows how to integrate hierarchies very efficiently: compute canonical hierarchy and check integrability.

19 10/28/2003Ontology Extended Algebra19 Definition of Ontology Integrability Suppose  is some finite set of strings, S is some set, and  1,…,  n are ontologies w.r.t. , S. Suppose IC is a finite set of interoperation constraints. The ontologies  1,…,  n are integrable iff for every x  ,  1 (x),…,  n (x) are integrable.

20 10/28/2003Ontology Extended Algebra20 Definition of OER An ontology extended relation is a triple (R, S, H isa ), where S is a schema (A 1 :  1, …,A n :  n ), H isa is an isa hierarchy and the following constraints are satisfied:  1,…,  n  T isa R  below H isa (  1 ) x … x below H isa (  n ) Below H (  ) = {  ’|  ’≤  }  dom(  )

21 10/28/2003Ontology Extended Algebra21 Ontology Extended Relational Algebra (1) Example query: Find the car parts from Parts1 relation which are more expensive than Wheel in Parts2 relation. Conversion function is needed to answer this query. Conversion Function: for each pair of types  i and  j, we assume there exists at most one conversion function  i 2  j : dom(  i )  dom(  j ) Given a term X, X t is defined as: t.A i, if X = A i, where t is a tuple of relation R. , if X = . v, if X = v: .

22 10/28/2003Ontology Extended Algebra22 Ontology Extended Relational Algebra (2) Operations in simple select conditions: X op Y, op  { =, <>,,  }: Let  be the least common supertype of X and Y, then (type(X)2  )(X t ) op (type(Y)2  )(Y t ) is true. X instance_of Y: Y t  T, type(X) ≤ H Y t, and X t  dom( Y t ). X subtype_of Y: X t  T, Y t  T, X t ≤ H Y t. If c 1, c 2 are select conditions, c 1  c 2, c 1  c 2, and  c 1 are select conditions. Complex operations in select conditions: X below Y: X instance_of Y  X subtype_of Y. X above Y: Y below X. The operators instance_of, subtype_of, below and above are applicable to arbitrary hierarchies.

23 10/28/2003Ontology Extended Algebra23 Ontology Extended Relational Algebra (3) Suppose (R 1, S 1, H 1 ),…,(R Z, S Z, H Z ) are ontology extended relations, F is a fusion of H 1,…,H Z via witness tr F. If E is a relation R i, [E] F = (R, S,  F ), where R = tr F (R i ), S = (A 1 :tr F (  1 ), …, A n : tr F (  n )). If E is  A i 1,…, A i k (E’) (1  i j  n, 1  j  k) and if [E’] F = (R’, (A 1 :  1, …, A n :  n ),  F ), then [E] F = (R, S,  F ), where R =  A i 1,…, A i k (R’) and S = (A i 1 :  i 1, …, A i k :  i k ). If E is E 1 x E 2 and [E i ] F = (R i, S i,  F ), (i = 1, 2), then [E] F = (R, S,  F ), where R = R 1 x R 2, S = S 1 S 2. If E is  c (E’), [E’] F = (R’, S,  F ), then [E] F = (R, S,  F ), where R = {t  R’  (R’, S,  F ), t |= c}.

24 10/28/2003Ontology Extended Algebra24 Example of Selection Example query: Find all the items from Parts1 relation which are parts of Tire. To answer this query: Ontology of Parts1 including part_of hierarchy. Retrieve the set of subtypes of Tire with regard to part_of relationship. Transform the query based on the set of subtypes.

25 10/28/2003Ontology Extended Algebra25 Example of Join Example query: Find the items from Claims2 relation which are a kind of theft and cost more than the item theft in Claims1 relation. To answer this query: Integrated ontology of Claims1 and Claims2 including isa hierarchy. Conversion function between the corresponding units. Transform the query with regard to the ontology and conversion function.

26 10/28/2003Ontology Extended Algebra26 Ontology Extended Relational Algebra (4) If E = E 1 op E 2 where op  { , ,  }, and [E i ] F = (R i, S i,  F ), (i=1,2), and S 1, S 2 have a least common super schema S, then [E] F = (R, S,  F ), where R = S 1 2S(R 1 ) op S 2 2S(R 2 ). If E = (S)E’, where S is a schema and [E’] F = (R, S’,  F ), then [E] F = (S’2S(R), S,  F ).

27 10/28/2003Ontology Extended Algebra27 Example of Union Example query: Find all the items from Claims1 and Claims2 that are a kind of theft and involve a cost of over $1000 dollars. To answer this query: Integrated ontology including isa hierarchy which contains not only values, but also field names, such as Cost and Value. Conversion function between corresponding units. Compute least common super schema of Claims1 and Claims2. Convert the selected records to the least common super schema and compute the union of them.

28 10/28/2003Ontology Extended Algebra28 HOME We built the HOME (Heterogeneous Ontology Management Engine) system to prove the proposed concepts and implement the algorithms. The main components in HOME: GUI Ontology maker Rule maker Ontology inference Query Executor

29 10/28/2003Ontology Extended Algebra29 Current Status of HOME HOME is implemented in Java. Briefly, HOME has the following major functionalities: Learn ontology from relational and XML data sources. Modify ontology with a rule maker. Browse ontology with zoomable interface. Import ontology from XML files and write ontology back to XML files. Ontology integration. Ontology extended query processing for relational data sources and XML sources.

30 Experimental Results (1) Performance of HOME for conjunctive selection queries based on GNIS data sets

31 Experimental Results (2) Performance of HOME for join queries based on GNIS data sets

32 Experimental Results (3) Join queries with varying selectivity and number of tuples based on GNIS data sets

33 Experimental Results (4) Performance of ontology integration algorithms

34 10/28/2003Ontology Extended Algebra34 Contributions Theory about ontologies and ontology integration. Theory about ontology extended relational algebra. HOME: a platform for ontology-based data integration.

35 10/28/2003Ontology Extended Algebra35 Related Work Integrate heterogeneous data sources: TSIMMIS from Stanford HERMES from UMD SIMS from USC DISCO from INRIA and UMD Ontology algebra Scalable Knowledge Composition Project from Stanford Focused on computing union, intersection, and difference of ontologies, instead of answering queries with ontologies. Did not consider embedding ontologies into existing data models.

36 10/28/2003Ontology Extended Algebra36 Future Work Integrate non-relational data sources, such as semi-structured sources, textual sources, etc. More effort on Semantic Web, DAML+OIL, RDF, metadata, etc. Extension to richer ontology structures. Indexing for ontology based data retrieval. Scaling ontology integration.

37 Finally Thank you!


Download ppt "An Ontology-Extended Relational Algebra Piero Bonatti Università di Napoli "Federico II" Yu Deng V.S. Subrahmanian University of Maryland College Park."

Similar presentations


Ads by Google