An Algebraic Approach for Specifying Compound Terms in Faceted Taxonomies Yannis Tzitzikas 1 Anastasia Analyti 2 Nicolas Spyratos 3 Panos Constantopoulos.

Slides:



Advertisements
Similar presentations
Efficient Keyword Search for Smallest LCAs in XML Database Yu Xu Department of Computer Science & Engineering University of California, San Diego Yannis.
Advertisements

IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
1 A Description Logic with Concrete Domains CS848 presentation Presenter: Yongjuan Zou.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
Introduction to Software Engineering Dr. Basem Alkazemi
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
TAXONOMY-BASED ANNOTATION OF XML DOCUMENTS Application to e-Learning Resources Nicolas Spyratos University of Paris-South France Joint work with B. Gueye.
FACT: A Learning Based Web Query Processing System Hongjun Lu, Yanlei Diao Hong Kong U. of Science & Technology Songting Chen, Zengping Tian Fudan University.
A Framework for Scalable Global IP-Anycast Sigcomm 2000, Dina Katabi Presented by Wei Yu.
Semantic Location Based Services for Smart Spaces Kostas Kolomvatsos, Vassilis Papataxiarhis, Vassileios Tsetsos P ervasive C omputing R esearch G roup.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Lecture 7 Access Control
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:
Recommender Systems on the Web: A Model-Driven Approach Gonzalo Rojas – Francisco Domínguez – Stefano Salvatori Department of Computer Science University.
Real-Time Concepts for Embedded Systems Author: Qing Li with Caroline Yao ISBN: CMPBooks.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
1999 Asian Women's Network Training Workshop. A short discussion about Information Architecture.
An affinity-driven clustering approach for service discovery and composition for pervasive computing J. Gaber and M.Bakhouya Laboratoire SeT Université.
2-Oct-15 Bojan Orlic, TU/e Informatica, System Architecture and Networking 12-Oct-151 Homework assignment 1 feedback Bojan Orlic Architecture.
Patterns, effective design patterns Describing patterns Types of patterns – Architecture, data, component, interface design, and webapp patterns – Creational,
CSCE350 Algorithms and Data Structure Lecture 17 Jianjun Hu Department of Computer Science and Engineering University of South Carolina
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
Skip Lists Mrutyunjay. Introduction ▪ Linked Lists Benefits & Drawbacks: – Benefits: – Easy Insert and Deletes, implementations. – Drawbacks: – Hard to.
Course Web Page Most information about the course (including the syllabus) will be posted on the course wiki:
July 30, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 8: Exploiting Memory Hierarchy: Virtual Memory * Jeremy R. Johnson Monday.
The 5th annual UK Workshop on Computational Intelligence London, 5-7 September 2005 The 5th annual UK Workshop on Computational Intelligence London, 5-7.
Book: Bayesian Networks : A practical guide to applications Paper-authors: Luis M. de Campos, Juan M. Fernandez-Luna, Juan F. Huete, Carlos Martine, Alfonso.
Efficient Instant-Fuzzy Search with Proximity Ranking Authors: Inci Centidil, Jamshid Esmaelnezhad, Taewoo Kim, and Chen Li IDCE Conference 2014 Presented.
CS 4850: Senior Project Fall 2014 Object-Oriented Design.
Search Engines.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
LRI Université Paris-Sud ORSAY Nicolas Spyratos Philippe Rigaux.
Computer Science: A Structured Programming Approach Using C Trees Trees are used extensively in computer science to represent algebraic formulas;
Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:
14.1/21 Part 5: protection and security Protection mechanisms control access to a system by limiting the types of file access permitted to users. In addition,
Acclimatizing Taxonomic Semantics for Hierarchical Content Categorization --- Lei Tang, Jianping Zhang and Huan Liu.
The Bloom Paradox Ori Rottenstreich Joint work with Yossi Kanizo and Isaac Keslassy Technion, Israel.
Algorithmic Detection of Semantic Similarity WWW 2005.
1 Compiler Design (40-414)  Main Text Book: Compilers: Principles, Techniques & Tools, 2 nd ed., Aho, Lam, Sethi, and Ullman, 2007  Evaluation:  Midterm.
An approach for Framework Construction and Instantiation Using Pattern Languages Rosana Teresinha Vaccare Braga Paulo Cesar Masiero ICMC-USP: Institute.
Requirements Validation
Distributed Models for Decision Support Jose Cuena & Sascha Ossowski Pesented by: Gal Moshitch & Rica Gonen.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Requirements Analysis
G042 - Lecture 09 Commencing Task A Mr C Johnston ICT Teacher
Trialogical E-Learning and Emergent Knowledge Artifacts Yannis Tzitzikas 1,2, Vassilis Christophides 1,2, Giorgos Flouris 2, Dimitris Kotzinos 1,2, Hannu.
Dominance and Indifference in Airline Planning Decisions NEXTOR Conference: INFORMS Aviation Session June 2 – 5, 2003 Amy Mainville Cohn, KoMing Liu, and.
Quality Assurance in the Presence of Variability Kim Lauenroth, Andreas Metzger, Klaus Pohl Institute for Computer Science and Business Information Systems.
Graph Indexing From managing and mining graph data.
June 30, 2005 Public Web Site Search Project Update: 6/30/2005 Linda Busdiecker & Andy Nguyen Department of Information Technology.
1 Agenda TMA02 M876 Block 4. 2 Model of database development data requirements conceptual data model logical schema schema and database establishing requirements.
The PLA Model: On the Combination of Product-Line Analyses 강태준.
Scalable Multi-match Packet Classification Using TCAM and SRAM Author: Yu-Chieh Cheng, Pi-Chung Wang Publisher: IEEE Transactions on Computers (2015) Presenter:
Testing and delivery Web design principles. Web development is software development.
Efficient Multi-User Indexing for Secure Keyword Search
SysML v2 Formalism: Requirements & Benefits
System Design and Modeling
Federated & Meta Search
The Use of Facets in Web Search Engines
ece 720 intelligent web: ontology and beyond
Chapter 1 Database Systems
Query Processing B.Ramamurthy Chapter 12 11/27/2018 B.Ramamurthy.
Indexing and Hashing Basic Concepts Ordered Indices
Chapter 1 Database Systems
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
Presentation transcript:

An Algebraic Approach for Specifying Compound Terms in Faceted Taxonomies Yannis Tzitzikas 1 Anastasia Analyti 2 Nicolas Spyratos 3 Panos Constantopoulos 2,4 1 Instituto di Scienza e Technologie dell’Informazione CNR-ISTI,Italy 2 Institute of Computer Science, ICS-FORTH, Greece 3 Laboratoire de Recherche en Informatique, Universite de Paris-Sud, France 4 Department of Computer Science, University of Crete, Greece

June 2003Yannis Tzitzikas et al., EJC'20032 Outline of the presentation Introduction - Motivation Faceted Classification and Faceted Taxonomies –Advantages and Problems Compound Terms and Compound Taxonomies The Algebra –Operations –Examples –Algorithms –Deriving Navigational Trees –Prototype implementation Concluding Remarks

June 2003Yannis Tzitzikas et al., EJC'20033 Introduction Existing ways to locate information in the Web –searching (using search engines like Google) –browsing (using catalogues like Yahoo!, ODP) Currently, the catalogues are also exploited by the search engines: –for improving the measuring of relevance –for giving to the user a set of related pages to each page of the answer –for limiting the scope of the search Web Catalogues (or indices using controlled structured vocabularies): [-]: index only a subset of the pages that are indexed by search engines [+]: ensure indexing consistency [+]: enable intelligent reasoning [+]: enable browsing

June 2003Yannis Tzitzikas et al., EJC'20034 Drawbacks of the taxonomies that are used by Web Catalogues Hard to understand Laborious browsing Laborious object indexing Hard to update/revise Large storage requirements (1) Big size (e.g. currently Open Directory has terms) (2) Inconsistent and incomplete terminology and structuring USER DESIGNER

June 2003Yannis Tzitzikas et al., EJC'20035 Faceted Classification and Faceted Taxonomies Faceted classification was developed, prior to the existence of computers,by S. R. Ranganathan ( ), a Hindu mathematician working as a librarian. Key point: Faceted taxonomies do not require an a priori division of concepts into subconcepts (only relationships between elemental concepts are stored) * A faceted taxonomy consists of a set of facets * Each facet is a group of elemental concepts * Each object is indexed by synthesizing elemental concepts Advantages of faceted taxonomies: they are easier to build and understand they require less storage space requirements they are more scalable

June 2003Yannis Tzitzikas et al., EJC'20036 Faceted Taxonomies Sports SeaSportsWinterSports Location Islands Mainland CretePilioOlympus

June 2003Yannis Tzitzikas et al., EJC'20037 Example of using one taxonomy 1 billion pages blocks of 10 pages 100 million indexing terms Complete and balanced decimal tree Total: 111,111,111 terms

June 2003Yannis Tzitzikas et al., EJC'20038 Example of using a faceted taxonomy consisting of 4 facets 1 billion pages blocks of 10 pages 100 million indexing terms Total: 444 terms 100 terms xx x 400 terms

June 2003Yannis Tzitzikas et al., EJC'20039 Example of using a faceted taxonomy consisting of 8 facets 1 billion pages blocks of 10 pages 100 million indexing terms Total: 88 terms! …… 10 terms x … x 80 terms …

June 2003Yannis Tzitzikas et al., EJC' Sports SeaSportsWinterSports Location Islands Mainland CretePilioOlympus The Problem of Faceted Taxonomies Consequences : laborious/erroneous object indexing difficulties in browsing Invalid compound terms may appear during object indexing or browsing/retrieval A compound term is invalid if it cannot be applied to any object of the domain

June 2003Yannis Tzitzikas et al., EJC' Valid and Invalid Compound Terms ValidInvalid Sports SeaSportsWinterSports Location Islands Mainland CretePilioOlympus F SeaSports.Olympus WinterSports.Islands WinterSports.Crete Invalid Compound Terms Sports.Location, Sports.Islands Sports.Crete Sports.Mainland Sports.Pilio Sports.Olymous SeaSports.Location, SeaSports.Islands SeaSports.Crete SeaSports.Mainland SeaSports.Pilio WinterSports.Location, WinterSports.Mainland WinterSports.Pilio WinterSports.Olympus Valid Compound Terms Example:

June 2003Yannis Tzitzikas et al., EJC' The Idea Define an algebra with operators that allow specifying the set of valid compound terms without having to enumerate all of the valid compound terms. Operations: unaryCombines terms from one facet plus negative modifiers self-minus-product unaryCombines terms from one facet plus positive modifiers self-plus-product unaryCombines terms from one facet self-product n-aryCombines terms from different facets plus negative modifiers minus-product n-aryCombines terms from different facets plus positive modifiers plus-product n-aryCombines terms from different facets product Initial Operands: Facet terminologies:

June 2003Yannis Tzitzikas et al., EJC' Compound Terms and Compound Taxonomies Compound term: any subset s of T Compound terminology S : a set of compound terms Compound taxonomy: a pair (S,  ) where –S is a compound terminology and – {Sports,Crete}  {Sports}, {Sports,Crete}  {Sports,Greece} Sports Greece Crete Example:

June 2003Yannis Tzitzikas et al., EJC' The Product Operation {Greece} {Islands} {Sports} {SeaSports}{Greece,Sports} {Islands,Sports}{Greece,SeaSorts} {Islands,SeaSorts} {Greece} {Islands} {Sports} {SeaSports} SS’

June 2003Yannis Tzitzikas et al., EJC' The Plus-Product Operation {Greece} {Islands} {Sports} {SeaSports} SS’ {WinterSports} {SnowSki} {Greece} {Islands} {Sports} {SeaSports} {Greece,Sports} {Islands,Sports}{Greece,SeaSports} {Islands,SeaSports} {WinterSports} {Greece,WinterSports} {SnowSki} {Greece,SnowSki} P={{Islands,SeaSports}, {Greece,SnowSki}}

June 2003Yannis Tzitzikas et al., EJC' The Minus-Product Operation {Greece} {Islands} {Sports} {SeaSports} SS’ {Greece} {Islands} {Sports} {SeaSports} {Greece,Sports} {Islands,Sports}{Greece,SeaSports} {Islands,SeaSports} {WinterSports} {SnowSki} {WinterSports} {Greece,WinterSports} {SnowSki} {Greece,SnowSki} N={{Islands, WinterSports}}

June 2003Yannis Tzitzikas et al., EJC' The Self-[Plus/Minus]-Product Operations Self-Product Self-Plus-Product Self-Minus-Product

June 2003Yannis Tzitzikas et al., EJC' The Self-Plus-Product: Example {Sports} {SeaSports} S {WinterSports} {SnowSki} P={{SeaSki,WindSurfing}, {SnowSki, SnowBoard}} {SeaSki}{Windsurfing}{SnowBoard} {Sports} {SeaSports}{WinterSports} {SnowSki}{SeaSki}{Windsurfing}{SnowBoard} {SeaSki,WindSurfing}{SnowSki,SnowBoard}

June 2003Yannis Tzitzikas et al., EJC' Expressions and Well-formed Expressions An expression e is well-formed if: (a) each basic compound terminology appears at most once in e, (b) the parameters P/N are subsets of the corresponding genuine compound terms In this way: no conflicts arise monotonic behavior The set of expressions over a facet set {F 1,…, F k } is defined according to the grammar:

June 2003Yannis Tzitzikas et al., EJC' Example: Building the catalog of a tourist portal Location Iraklion Outdoor AmmoudaraHersonissos Accommodation Furn. Appartments RoomsBungalows Facilities JacuzziSwimmingPool Indoor 3 facets, 13 terms, 890 compound terms from which only 96 are valid P = {{Iraklio, Furn.Appartments}, {Iraklio,Rooms}, {Ammoudara, Furn. Appartments}, {Ammoudara,Rooms}, {Hersonisson, Furn.Apartments}, {Ammoudara, Bungalows, Jacuzzi}, {Hersonissos,Rooms,Indoor}, {Hersonissos, Bungalows,Outdoor} } |P|=8 N = {{Iraklio, Bungalows}}, P = { {Hersonisson, Rooms, Indoor}, {Hersonissos, Bungalows,Outdoor}, {Ammoudara,Bungalows,Jacuzzi} } |P|+|N|=4

June 2003Yannis Tzitzikas et al., EJC' Checking the Validity of a Compound Term We provide an algorithm for checking whether s  S e without having to compute (and store) the entire S e. The time complexity for this algorithm is: Let S e be the compound terminology defined by an algebraic expression e. => Only F and e have to be stored

June 2003Yannis Tzitzikas et al., EJC' Generating Navigation Trees Objective: Given an expression e generate dynamically a navigation tree with nodes that correspond to valid compound terms only for using it during object indexing and browsing The navigation tree also contains nodes for facet crossing Sports SeaSports WinterSports byLocation Islands Mainland Crete Pilio Olympus byLocationMainland Pilio byLocation Islands Mainland Crete Olympus Pilio bySportsSeaSports WinterSports bySportsSeaSportsbyLocationCrete bySports SeaSports WinterSports Location

June 2003Yannis Tzitzikas et al., EJC' Application in Web Catalogues big, incomplete, scalability problems Taxonomies of existing catalogs P|N Navigation Trees dynamically small, clear, scalable Faceted Taxonomies + Algebra

June 2003Yannis Tzitzikas et al., EJC' Prototype Implementation using a RDBMS Three tables are used for storing the faceted taxonomy and the expression e. TERMS nameid SUBSUMPTION term1term2 PARAMETERS F1F2Fk... Expression Builder Storage Manager Validity Checker Nav. Tree Generator RDBMS DesignerIndexer/User Architecture

June 2003Yannis Tzitzikas et al., EJC' Concluding Remarks Faceted Taxonomies : [+] conceptual clarity (it is easier to understand) [+] compactness (it takes less space) [+] scalability (the update operations can be formulated easier and be performed more efficiently) [-] invalid compound terms may appear. The Proposed Algebra : [+] provides a solution to the problem of invalid compound terms [+] Aids indexing and browsing (and prevents errors)