An eGovernment system for temporal- and semantic-aware access to norms SWEG 2006 – The Semantic Web meets eGovernment 2006 AAAI Spring Symposium Series,

Slides:



Advertisements
Similar presentations
1 Ontolog Open Ontology Repository Review 19 February 2009.
Advertisements

XML: Extensible Markup Language
1 3D_XML A three-Dimensional XML-based Model Khadija Ali, Jaroslav Pokorný Czech Technical University Prague - Czech Republic.
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Xyleme A Dynamic Warehouse for XML Data of the Web.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
An Intelligent Broker Approach to Semantics-based Service Composition Yufeng Zhang National Lab. for Parallel and Distributed Processing Department of.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
DL systems DL and the Web Ilie Savga
Managing Master Data with MDS and Microsoft Excel
Database Systems Chapter 1 The Worlds of Database Systems.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Cluj Napoca, 28 August IEEE International Conference on Intelligent Computer Communication and Processing Digital Libraries Workshop Towards.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Search Engines and Information Retrieval Chapter 1.
Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.
Guided Interactive Discovery of e-Government Services Giovanni Maria Sacco Dipartimento di Informatica, Università di Torino Corso Svizzera 185,
ATLAS Demystified: A Practical Introduction Christophe Laprun, Jonathan Fiscus, John Garofolo, Sylvain Pajot National Institute of Standards and Technology.
WS-Security: SOAP Message Security Web-enhanced Information Management (WHIM) Justin R. Wang Professor Kaiser.
ApplicationsApplications Mills Davis Ana Cristina Garcia Peter Mika Gerti Orthofer Giovanni Sacco Maria A. Wimmer (Moderator)
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
Querying Structured Text in an XML Database By Xuemei Luo.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
University of Crete Department of Computer Science ΗΥ-561 Web Data Management XML Data Archiving Konstantinos Kouratoras.
Book: Bayesian Networks : A practical guide to applications Paper-authors: Luis M. de Campos, Juan M. Fernandez-Luna, Juan F. Huete, Carlos Martine, Alfonso.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
BNCOD07Indexing & Searching XML Documents based on Content and Structure Synopses1 Indexing and Searching XML Documents based on Content and Structure.
Semantic based P2P System for local e-Government Fernando Ortiz-Rodriguez 1, Raúl Palma de León 2 and Boris Villazón-Terrazas 2 1 1Universidad Tamaulipeca.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Semantic Web Techniques for Personalization of eGovernment Services SemWAT st International ER Workshop on Semantic Web Applications: Theory and.
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq
Managing Semi-Structured Data. Is the web a database?
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Rinke Hoekstra Use of OWL in the Legal Domain Statement of Interest OWLED 2008 DC, Gaithersburg.
Developing GRID Applications GRACE Project
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
CS 405G: Introduction to Database Systems
Fabio Grandi, Maria Rita Scalas,
A Generalized Modeling Framework for Schema Versioning Support
WEBIST 2005 – International Conference on Web Information Systems and Technologies Efficient Management Of Multi-Version XML Documents For E-Government.
Improving Data Discovery Through Semantic Search
Light-weight Ontology Versioning with Multi-temporal RDF Schema
Third International Conference on Health Informatics
Dynamic Multi-version Ontology-based Personalization
IADIS International Conference e-Society 2005
DEXA EGOV 2005 Conference Personalized Access to Multi-version Norm Texts in an eGovernment Scenario Fabio Grandi, Maria Rita Scalas Alma Mater Studiorum.
The Valid Ontology: a simple OWL Temporal Versioning Framework
Multi-temporal RDF Ontology Versioning
An eGovernment system for temporal- and semantic-aware access to norms
ece 627 intelligent web: ontology and beyond
Semantic Web Techniques for Personalization of eGovernment Services
Data Model.
Magnet & /facet Zheng Liang
Effective Representation and Efficient Management of Indeterminate Dates Fabio Grandi University of Bologna, Italy Federica Mandreoli University of.
Fabio Grandi DEIS - Univ. of Bologna, Italy
Presentation transcript:

An eGovernment system for temporal- and semantic-aware access to norms SWEG 2006 – The Semantic Web meets eGovernment 2006 AAAI Spring Symposium Series, Stanford University, CA, March 2006 Fabio Grandi Maria Rita Scalas Università degli Studi di Bologna Federica Mandreoli Riccardo Martoglia Enrico Ronchetti Paolo Tiberio Università degli Studi di Modena e Reggio Emilia

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Overview  Our research activities concern the implementation of Web information systems for eGovernment applications  Development of eGovernment initiatives: more and more on-line resources and services are being made available by Public Administrations (PAs)  We make use of temporal database and semantic Web techniques to provide personalized access to such resources and services  In particular, we consider multi-version norm texts (stored in XML format) available in Web repositories

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms time Original normative text 1 2 new version 3 Importance of versioning  Temporal concerns are ubiquitous in the law domain  Each normative text changes in time due to different modifications, but keeps its identity  The ability to model temporal dimensions is essential for the management of evolving norms  it is crucial to reconstruct the consolidated version of a norm  also past versions are still important

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Importance of versioning  Applicability (semantic) versioning also plays an important role  some norms or some of their parts have or acquire a limited applicability  personalized version of the norm  A version only containing articles which are applicable to a citizen’s personal case Self-employed Art. 1 (unemployed) xxy yyx yxyx yyyxx xyyx xxy yyx yxyx yyyxx xyyx Art. 2 (self-employed) aab bbab abab abba ab aab bbab abab abba ab Art. 3 (retired) qwqq ww wqqw wq ww qwqq ww wqqw wq ww

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Motivation  Large XML collections of norms are made available by the PA on the Web but personalization is:  Absent, e.g. (temporal versioning partially supported)  Predefined in the Website structure and contents, e.g. (hardwired by human experts following the life-events approach)  Lack of an effective, flexible, on-demand (“intelligent”, efficient) personalization facility

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Objectives  Development of an effective and efficient Web information system where:  norms are represented as XML documents  dynamics of norms in time is captured  limited applicability of norms (and their parts) is captured  selective access and reconstruction of versions is supported by a query engine  Aimed at:  enabling citizens to access personalized versions of multiversion resources  improving and optimizing the involvement of citizens in the eGovernance process

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Personalized access to multi-version norms Classification of the citizen wrt an ontology on the basis of his/her digital identity Retrieval and reconstruction of a personalized version of the norm to be delivered Citizen logged on to the Web repository looking for a norm of interest

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms The Technological Infrastructure WEB SERVICES OF PUBLIC ADMINISTRATION WEB SERVICES WITH ONTOLOGY O C XML REPOSITORY OF ANNOTATED NORMS SIMPLEELABORATIONUNIT 1 – identification phase: reconstruction on-the-fly of the digital identity of the authenticated user 1 class C x 2 – classification phase: use of the collected digital identity to classify the citizen with respect to the civic ontology O c 2 Public Administration DB creation /update 3 – querying phase: access and reconstruction of all and only norms which are applicable to the class C x 3 Querying phase

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms  Definition of a temporal XML model including  a temporal multi-version XML schema is based on the hierarchical organization of normative texts: contents-section-article-paragraph is based on the hierarchical organization of normative texts: contents-section-article-paragraph at each level of the hierarchy, the history of changes is represented by the (time-stamped) versions produced at each level of the hierarchy, the history of changes is represented by the (time-stamped) versions produced it supports ancestor-descendant inheritance it supports ancestor-descendant inheritance  temporal manipulation operations  Extension of the XML model with applicability annotations in order to support semantic versioning  Design, implementation and evaluation of system prototypes supporting the model Approach

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms The temporal XML schema 4 Temporal Dimensions: Publication time time of publication on the Official Journal Validity time time the norm is in force Efficacy time time the norm can be applied Transaction time time the norm is stored in the system Law TitleContents Publication – R Vt_Start – R Vt_End – O Tt_Start – R Tt_End – O Et_Start – R Et_End – O An_ref – O Ver Section Ver Article Ver Heading Paragraph Ver Heading Num – R An_ref – O Num – R An_ref – O Num – R An_ref – O Num – R Type – R Vt_Start – R Vt_End – O Tt_Start – R Tt_End – O Et_Start – R Et_End – O TA Vt_Start – R Vt_End – O Tt_Start – R Tt_End – O Et_Start – R Et_End – O TA Vt_Start – R Vt_End – O Tt_Start – R Tt_End – O Et_Start – R Et_End – O TA Vt_Start – R Vt_End – O Tt_Start – R Tt_End – O Et_Start – R Et_End – O TA

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Semantic versioning  Extension of the multi-version model based on temporal dimensions to include a semantic versioning dimension to provide personalized access to norm texts  Civic ontology: a classification of citizens based on the distinctions introduced by successive norms (founding acts) that imply some limitations in their applicability

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Semantic versioning   At this stage of the project, we manage “tree-like” ontologies   class taxonomies induced by the IS-A relationship   we exploit the pre-order and post-order properties of trees   New versioning dimension: applicability of different parts of a norm text to the relevant classes of the civic ontology   Applicability annotations (AA) are added to semantic versions

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Semantic versioning   Applicability is inherited by descendant nodes unless locally redefined   By means of redefinitions we can also introduce, for each part of a document, complex applicability properties   Restrictions with respect to ancestors   Extensions with respect to ancestors

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms  John Smith is a self-employed citizen.  He is interested in the text of all the norms... ... which contain paragraphs dealing with health care,... ... which were valid and in effect between 2002 and 2004,... ... and which are applicable to his case (civic class 7). Example of full search Structural constraint Textual constraint Temporal constraint Applicability constraint 4 orthogonal constraints

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms FOR $a IN norms WHERE textConstr ($a//paragraph//text(), ’health AND care’) AND tempConstr (’vTime OVERLAPS PERIOD(’ ’,’ ’)’) AND tempConstr (’eTime OVERLAPS PERIOD(’ ’,’ ’)’) AND applConstr (’class 7’) RETURN $a Example of full search Structural constraint Textual constraint Temporal constraint Applicability constraint 4 orthogonal constraints

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Norm Article 1 Par 1 Ver 1 AA=3 Ver 1 Par 2 Article 2 Health care… …text X Ver 2 Public health… …text Y Example of full search TA AA TA AA=4 TA Ver 1 AA=3,8 TA Health care… …text Z Civic ontology Normative DB …norm//paragraph//text()… ‘class 7’ …

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Our prototype system (“native” approach)  The query engine is able to access and retrieve only the strictly necessary data  selection relies on ad-hoc data structures supporting multi-versioning  storage granularity is finer than the entire documents used by standard XML engines  Only the parts which satisfy the temporal and applicability constraints are used for the reconstruction of the retrieved documents  There is no need to retrieve whole XML documents and build space- consuming structures such as DOM trees Enhanced query processing efficiency Reduced memory requirements

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Evaluation benchmark  Three XML document sets  5000 documents (120MB)  documents (240MB)  documents (480MB)  Variable document size  min = 2KB  avg = 24KB  max = 125KB  Five different query types  Queries on keywords (structural + textual constraints)  Q1 – keywords in contents  Q2 – keywords in type and contents  Temporal queries (structural + temporal constraints)  Q3 – conditions on publication, validity and transaction time  Mixed queries (structural + textual + temporal constraints)  Q4, Q5 – with keywords and temporal conditions  Five variants with semantic constraints  Qx-A – with additional applicability constraints PERSONALIZATION OF THE QUERIES

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Performance evaluation  Very high personalization query efficiency  The system is able to solve personalization problems by means of simple comparisons involving pre-post encodings  0.5-1% more time than for the original versions  3-4% storage space overhead

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Performance evaluation  Scalability tests  The computing time grows sublinearly with the number of documents  Good scalability of the system in every type of query context 5000 docs docs docs time 1046 msec 1366 msec 1741 msec

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Conclusions   We presented our research work concerning the design and implementation of efficient Web-based information systems for eGovernment applications   We introduced a personalized access to resources on the basis of the digital identity of citizens relying on semantic versioning and ontology mapping   We developed a efficient platform (“native” approach) for which a specialized Multi-version XML Query Processor has been designed and implemented   We proved our approach to be very efficient in a large set of experimental situations and showed excellent scale-up figures with varying load configurations

SWEG 2006 An eGovernment system for temporal- and semantic-aware access to norms Future Work   Extensions of the current framework   more advanced application requirements may include a more sophisticated ontology definition, possibly versioned, and more advanced reasoning services   Development of a complete technological infrastructure usable in a large Web-based eGovernment scenario, including   identification, classification and reconstruction services   Assessment of our prototype systems in a concrete working environment   with real users and with a large repository of real norms   Extension to a more general application domain (Web personalization via ontology-based user profiling)