Guided Interactive Discovery of e-Government Services Giovanni Maria Sacco Dipartimento di Informatica, Università di Torino Corso Svizzera 185, 10149.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

® IBM Research © 2006 IBM Corporation Faceted Logic, Ontologies, and Wikis: Possible Approaches for ONTOLOG Content John Boz Handy-Bosma, Ph.D., Senior.
Oyster, Edinburgh, May 2006 AIFB OYSTER - Sharing and Re-using Ontologies in a Peer-to-Peer Community Raul Palma 2, Peter Haase 1 1) Institute AIFB, University.
Language Technologies Reality and Promise in AKT Yorick Wilks and Fabio Ciravegna Department of Computer Science, University of Sheffield.
Ontology-enhanced retrieval (and Ontology-enhanced applications) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge Systems.
Chapter 11 user support. Issues –different types of support at different times –implementation and presentation both important –all need careful design.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Database Systems: Design, Implementation, and Management Tenth Edition
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Search Engines and Information Retrieval
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Help and Documentation zUser support issues ydifferent types of support at different times yimplementation and presentation both important yall need careful.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Web Mining Research: A Survey
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Visualization By: Simon Luangsisombath. Canonical Visualization  Architectural modeling notations are ways to organize information  Canonical notation.
Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:
Exploring Personal CoreSpace For DataSpace Management Li Yukun and Xiaofeng Meng WAMDM Lab Renmin University of China.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Controlled Vocabulary & Thesaurus Design Planning & Maintenance.
Search Engines and Information Retrieval Chapter 1.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
2 1 Chapter 2 Data Models Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Chapter 1 Introduction to Data Mining
Master Thesis Defense Jan Fiedler 04/17/98
Jessica Chen-Burger A Framework for Knowledge Sharing and Integrity Checking for Multi-Perspective Models Yun-Heh (Jessica) Chen-Burger Artificial Intelligence.
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Overview of MOT Knowledge representation system : Basic Modeling Editor LexiconGrammarSemantics Pragmatics MOT Editor.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Data Mining for Web Intelligence Presentation by Julia Erdman.
Semantic based P2P System for local e-Government Fernando Ortiz-Rodriguez 1, Raúl Palma de León 2 and Boris Villazón-Terrazas 2 1 1Universidad Tamaulipeca.
MS Access: Database Concepts Instructor: Vicki Weidler Assistant: Joaquin Obieta.
L. Ardissono, C. Barbero, A. Goy and G. Petrone Dipartimento di Informatica Universita’ di Torino, Torino, Italy
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
LoG: A Methodology for Metadata Registry-based Management of Scientific Data July 5, 2002 Doo-Kwon Baik
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
14. Information Search and Visualization
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Knowledge Modeling and Discovery. About Thetus Thetus develops knowledge modeling and discovery infrastructure software for customers who: Have high-value.
On-To-Knowledge review Juan-Les-Pins/France, October 06, 2000 Hans Akkermans, VUA Hans-Peter Schnurr, AIFB Rudi Studer, AIFB York Sure, AIFB KMKMMethodology.
Object-Oriented Software Engineering Practical Software Development using UML and Java Chapter 1: Software and Software Engineering.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Optimization of Association Rules Extraction Through Exploitation of Context Dependent Constraints Arianna Gallo, Roberto Esposito, Rosa Meo, Marco Botta.
Improvement of Semantic Interoperability based on Metadata Registry(MDR) Doo-Kwon Baik Dept. of CSE Korea University.
UNIVERSITY UTARA MALAYSIA COLLEGE OF ARTS & SCIENCES.
Data Models. 2 The Importance of Data Models Data models –Relatively simple representations, usually graphical, of complex real-world data structures.
Knowledge Engineering. Sources of Knowledge - Books - Journals - Manuals - Reports - Films - Databases - Pictures - Audio and Video Tapes - Flow Diagram.
WHIM- Spring ‘10 By:-Enza Desai. What is HCIR? Study of IR techniques that brings human intelligence into search process. Coined by Gary Marchionini.
Information and Information Technology 1. Information and employment 2.
Information Retrieval in Practice
Information Organization: Overview
Research on Knowledge Element Relation and Knowledge Service for Agricultural Literature Resource Xie nengfu; Sun wei and Zhang xuefu 3rd April 2017.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
ece 627 intelligent web: ontology and beyond
CSE 635 Multimedia Information Retrieval
Chapter 11 user support.
Web Mining Research: A Survey
Information Organization: Overview
Presentation transcript:

Guided Interactive Discovery of e-Government Services Giovanni Maria Sacco Dipartimento di Informatica, Università di Torino Corso Svizzera 185, Torino, Italy Where is the knowledge we have lost in information? T.S. Eliot, The Rock

e-Government Services for citizens represent one of the most frequent and critical points of contact between citizens and public administrations. THE PUBLIC FACE OF GOVERNMENT e-services represent the only practical way of providing incentives and support to specific classes of citizens. THE FRIENDLIER FACE OF GOVERNMENT

DISCOVERY of e-services rather than plain RETRIEVAL is a critical functionality in e-government systems But it is managed by search rather than explorative technology

TRADITIONAL SEARCH TECHNIQUES DO NOT WORK

Since the vast majority of information is essentially textual and unstructured in nature information retrieval techniques are extensively used both in pull and push strategies BUT…

1. almost 80% of relevant documents are not retrieved 2. extremely wide semantic gap between the user model (concepts) and the system model (words) 3. users have no or very little assistance in formulating queries 4. results are presented as a flat list with no systematic organization: browsing is difficult or impossible.

RICH SEMANTIC SCHEMATA (ONTOLOGIES) End-users do not understand them Agent mediators required: costly to implement, not transparent, hard to understand what they do Schemata hard to design and maintain

Traditional research has focussed on RETRIEVAL OF INFORMATION BUT The most common task is BROWSING: FIND RELATIONSHIPS THIN ALTERNATIVES OUT

Finding opportunities/services Finding a job Finding the laws and regulations that apply BUT ALSO Buying a digital camera Finding a restaurant for tonight Finding the cause of a malfunction Selecting a photo Finding a suspect/missing person from a photobank ….

REQUIRE A DIFFERENT INFORMATION ACCESS PARADIGM GUIDED EXPLORATION AND INFORMATION THINNING

Dynamic Taxonomies: the first model to fully exploit multidimensional and faceted classifications Sacco, G.M., “Dynamic taxonomies: a model for large information bases”, IEEE Trans. on Data and Knowledge Engineering, May/June 2000 US Patent n. 6,763,349 (EU pending) DYNAMIC TAXONOMIES

Representation Intension: The infobase is described by a taxonomy designed by an expert (the schema) Extension: Documents can be classified at any level of abstraction and each document is classified under n concepts (n>1) No relationships other than subsumptions (IS-A, PART-OF) need to be represented in the schema. DYNAMIC TAXONOMIES

What is a concept? A concept is a label which identifies a set of documents (classified under that concept) A nominalistic approach: concepts are described by instances rather than by properties Subsumptions require that an inclusion constraint is maintained: If D(C) denotes the set of documents classified under C and C’ is a descendant of C in the hierarchy, D(C’)  D(C) DYNAMIC TAXONOMIES

How do concepts relate? By subsumptions (IS-A, PART-OF) By the Extensional Inference Rule: Two concepts C and C’ are related if there is at least a document D which is classified both under C and C’ or one of their descendants Because of the inclusion constraint, IS-A, PART-OF relationships are a special case of the Extensional Inference Rule. DYNAMIC TAXONOMIES

Concepts extensionally related to G have a yellow background

DYNAMIC TAXONOMIES Concepts extensionally related to G have a yellow background

Important consequence: Relationships among concepts need not be anticipated but can be inferred from the actual classification Advantages: a simpler schema adapts to new relationships (dynamic) finds unexpected relationships (discovery) DYNAMIC TAXONOMIES

Putting it all together… The browsing system AN EXAMPLE

1.Initial step: Tree picture of the entire infobase AN EXAMPLE The infobase schema is used for browsing The initial focus is the entire infobase

2. Zoom on a concept and see related concepts AN EXAMPLE This is the central operation: 1.The new focus is ANDed with the previous focus 2.The entire infobase is reduced to the documents in the current focus 3.The taxonomy is reduced in order to show all and only those concepts which are extensionally related to the selected focus (filtering)

3. Iterate until the number of documents is sufficiently small AN EXAMPLE 3 zoom operations are sufficient to select an average 10 documents from infobases with 1,000,000 documents, described by a compact taxonomy with 1,000 concepts.

Simple and familiar interface (the only new operation is the Zoom, which is easily understood) The user is effectively guided to reach his goal: at each stage he has a complete list of all related concepts (i.e. a complete taxonomic summary of his current focus) Completely symmetric interaction: if A and B are related, the user will find B if he zooms on A, and A if he zooms on B (most systems are asymmetric) Discovery of unexpected relationships BENEFITS

TRANSPARENCY: the user is in charge and knows exactly what’s happening EXCELLENT CONVERGENCE  very few iterations needed BENEFITS

Easy multilingual support (just translate concept labels) Easy to unobtrusively gather user interests Easy to accommodate reviews, popularity, etc. Effective push strategies  dbworldx.di.unito.it BENEFITS

Simple integration with other retrieval techniques (IR, DB): dynamic taxonomies as a prefilter: they establish the context for the query dynamic taxonomies as a conceptual summary: they summarize long result lists BENEFITS

CONCLUSIONS Dynamic taxonomies provide a single and simple access model that solves the vast majority of the information dissemination needs of public administrations In fact they are so versatile that can be used for: laws and regulations, e-commerce, medical guidelines, human resource management, multimedia information bases…

Universal Knowledge Processor High-performance dynamic taxonomy engine Microsoft Windows environment A set of high performance multithreaded COM objects Intension and extension in RAM even for large databases (20Mb for 1M documents) Extremely fast operation: 327 reduced taxonomies per second on a 800K item infobase CONCLUSIONS

THE SYSTEM IS AVAILABLE AT om Thank you! CONCLUSIONS