Tuan-Dung CAO 1, Fabien GANDON 1,2 1 ACACIA Team, INRIA Sophia Antipolis 2 Computer School, Carnegie Mellon University Integrating.

Presentation transcript:

Tuan-Dung CAO 1, Fabien GANDON 1,2 1 ACACIA Team, INRIA Sophia Antipolis 2 Computer School, Carnegie Mellon University Integrating external sources in a corporate semantic web managed by a multi-agent system

2 Introduction and plan
Introduction (you are here):
- Research problems in the ACACIA team:
  - Organisations need to adapt to an ever-changing world
  - Nervous system: capture and diffuse knowledge
  - Persistent memory: store and/or index knowledge
- Study the issues raised by organisational memories
- Here: the special case of external organizational knowledge
Plan:
- Brief summary of previous work on CoMMA
  - Corporate semantic Webs as corporate memories
  - Multi-agent system as management architecture
- New society of wrappers / HTML scrapers
  - XML-based extraction process
  - Wrapper society: roles and interactions

3 Positioning and pointers
- Dynamically integrating heterogeneous sources of information: OBSERVER [Mena et al., 1996], InfoSleuth [Nodine et al., 1999], Carnot [Collet et al., 1991], InfoMaster [Genesereth et al., 1997], SIMS [Arens et al., 1996], RETSINA [Decker & Sycara, 1997], Manifold [Kirk et al., 1995]
- Assisting the management of digital libraries: SAIRE [Odubiyi et al., 1997], UMDL [Weinstein et al., 1999]
- Organisational knowledge management:
  - Collaborative gathering, filtering and profiling: CASMIR [Berney & Ferneley, 1999], Ricochet [Bothorel & Thomas, 1999]
  - Mobile access & domain model for document classification: KnowWeb [Dzbor et al., 2000]
  - Taxonomy of topics, profiling and push: RICA [Aguirre et al., 2000], CoMMA
  - Ontology and corporate memory, multiple ontologies: FRODO [Van Elst & Abecker, 2001]
  - Semantic intraweb, ontology, user profiling: CoMMA (you are here)

4 Implementation choices (1)
- Implementation choices:
  - Materialisation of memory: RDF(S) and its XML syntax (manipulated with CORESE)
[Figure: CORESE translation principle. The knowledge engineer's RDF Schema, the RDF annotations, the RDF rules and the user's RDF query are translated into conceptual graphs (CGs): support in CGs, base of CGs, CG rules and CG query. Projection and inferences give a result in CGs, which is translated back into a result in RDF for the user.]
[Figure: examples of a schema, an annotation ("Annual activity report of ACACIA"), a rule and a query (variables ?t and ?a).]
Example of rule: IF Group Include Person ?x Include Person ?y THEN Person ?x Colleague Person ?y
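The example rule on this slide (two persons included in the same group are colleagues) can be illustrated with a minimal sketch. This is not CORESE: the triple representation, the property names `type`, `include` and `colleague`, and the individuals `acacia`, `alice` and `bob` are all hypothetical. The sketch also assumes that ?x and ?y must be distinct, which the slide does not state.

```python
from itertools import permutations

# Hypothetical facts as (subject, property, object) triples; names are illustrative only.
triples = {
    ("acacia", "type", "Group"),
    ("acacia", "include", "alice"),
    ("acacia", "include", "bob"),
    ("alice", "type", "Person"),
    ("bob", "type", "Person"),
}


def infer_colleagues(facts):
    """Apply: IF a Group includes Person ?x and Person ?y THEN ?x colleague ?y."""
    groups = {s for (s, p, o) in facts if p == "type" and o == "Group"}
    persons = {s for (s, p, o) in facts if p == "type" and o == "Person"}
    inferred = set()
    for group in groups:
        members = sorted(
            o for (s, p, o) in facts
            if s == group and p == "include" and o in persons
        )
        # Assumption: ?x and ?y are distinct, so each ordered pair gives one triple.
        for x, y in permutations(members, 2):
            inferred.add((x, "colleague", y))
    return inferred


new_facts = infer_colleagues(triples)
```

A real engine would match the rule premise against the annotation base by graph projection; the nested loops here only stand in for that matching step.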

5 Implementation choices (2)
- Implementation choices:
  - Materialisation of memory: RDF(S) and its XML syntax (manipulated with CORESE)
  - Exploitation of memory (a distributed AI architecture is adequate for a distributed memory): multi-agent system (implemented with ) and machine learning techniques (implemented with WEKA)
[Figure: overview. A multi-agent system (user agents, profile agent, ontology and models agent, interconnection agent, with learning components) links the knowledge engineer, the authors and annotators of documents, and the end user to the corporate memory: annotated documents, the ontology and the models (enterprise model, users' profiles). Interactions shown: query, annotation, pull and push.]

6 Sub-societies of CoMMA
- Ontology & model society: replication
  - Ontologist: stores and provides the ontology
  - Corporate model archivist: stores and provides the structural model of the (human) organisation
- Annotations society: hierarchy
  - Archivist: handles local annotation sources / archives
  - Mediator: manages distributed processes for 2 tasks:
    - new annotation submissions (contract-net and semantic distance)
    - query solving processes (decomposition using URIs as cut/joint points)
- Matchmakers society: peer-to-peer
  - Directory facilitator: yellow pages service
  - Agent management system: white pages service
- User-dedicated sub-society: three main roles
  - Handling users' profiles: profile manager & profile archivist
  - Interface controller: graphical user interface
[Figure: two extracts from O'CoMMA, each showing the concepts document, chart, book, graph, diagram and booklet with the least common supertype (LCST) marked. Extract 1: Distance(diagram, graph) = 2. Extract 2: Distance(diagram, book) = 3.]
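The two distances quoted on this slide are consistent with counting subsumption links through the least common supertype (LCST). The sketch below works under that assumption; the hierarchy is a guess reconstructed from the concept names on the slide, not the actual O'CoMMA ontology, and the function names are hypothetical.

```python
# Hypothetical child -> parent links, guessed from the concepts named on the slide.
PARENT = {
    "chart": "document",
    "book": "document",
    "graph": "chart",
    "diagram": "chart",
    "booklet": "book",
}


def ancestors(concept):
    """Return the path from a concept up to the root, the concept included."""
    path = [concept]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def lcst(a, b):
    """Least common supertype: the first ancestor of a that is also an ancestor of b."""
    up_b = set(ancestors(b))
    for node in ancestors(a):
        if node in up_b:
            return node
    raise ValueError("no common supertype")


def distance(a, b):
    """Links from a up to the LCST plus links from b up to the LCST."""
    common = lcst(a, b)
    return ancestors(a).index(common) + ancestors(b).index(common)
```

With this guessed hierarchy, `distance("diagram", "graph")` goes through `chart` and `distance("diagram", "book")` goes through `document`. A mediator could use such a measure to compare a new annotation with the content of each archive, although the exact measure used in CoMMA may weight links differently.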

7 Roles and interactions
- Users' society: Profile Managers, Profiles Archivists, Interface Controllers
- Annotations society: Mediators, Archivists
- Ontology and Model society: Ontologist Agents
- Interconnection society: Federated Matchmakers
- Wrapper society, between the Web (www) and the other societies: roles and interactions still open (? ? ?)

8 Motivation for a new society of wrappers
- No organization is an island: it lives in a society, a market…
- Information resources on the open Web are relevant
- Integrate external resources in the memory, i.e. annotate them
- Corporate portal: outer → inner vs. inner → outer
- CoMMA: scenarios with annotation roles (e.g., librarians)
- Large sets of documents → annotating is a tedious, repetitive task; manual annotation is unrealistic
- Repositories, digital libraries, etc.: recurrent structural clues
- Annotations mainly based on information present in the page
- Automate some extractions with rules to scale up annotation
- For the pages of one site: “annotate one, extract all the others”
- Introduction of a new society of wrappers / scrapers
- Multiple ontologies & mapping: other on-going work
- Focus on one use scenario with one appropriate ontology

9 Annotation extraction process
- Two options for the annotation extraction:
  - On-the-fly conversion: generate annotations when needed (up-to-date, no duplication, slow, connected)
  - Local archive maintenance: generate and maintain a base (rapid, disconnected, needs monitoring, memory-consuming)
- Second option chosen: rapid & decouples the intraweb from the open Web
- XML-based approach:
[Figure: principle. HTML documents from the World Wide Web are transformed with JTidy into a temporary XHTML version. Through the extraction GUI, data located with XPath is associated with concepts & relations of the ontology, which gives an XSLT annotation extraction template. The XSLT engine applies the template and produces RDF annotations for the corporate memory.]
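The extraction step can be sketched as follows, assuming the page has already been tidied into well-formed XHTML. This is an illustration only: it uses Python's standard `xml.etree.ElementTree` (which supports a limited XPath subset) in place of an XSLT engine, and the sample page, the `ONTO` namespace, the property names `Title` and `CreatedBy`, and the report URL are all hypothetical.

```python
import xml.etree.ElementTree as ET

XHTML = "http://www.w3.org/1999/xhtml"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
ONTO = "http://example.org/ontology#"  # hypothetical ontology namespace

# Hypothetical XHTML page, standing in for the tidied version of an HTML page.
PAGE = """<html xmlns="http://www.w3.org/1999/xhtml"><body>
<h1>Methods and Tools for Corporate Knowledge Management</h1>
<ul class="authors"><li>Dieng, Rose</li><li>Corby, Olivier</li></ul>
</body></html>"""

# Extraction rules: hypothetical ontology property -> path locating the data.
RULES = {
    "Title": ".//x:h1",
    "CreatedBy": ".//x:ul[@class='authors']/x:li",
}


def extract(page, url, rules):
    """Build one RDF/XML description of the page from path-based rules."""
    root = ET.fromstring(page)
    report = ET.Element(f"{{{ONTO}}}ResearchReport", {f"{{{RDF}}}about": url})
    for prop, path in rules.items():
        for node in root.findall(path, {"x": XHTML}):
            child = ET.SubElement(report, f"{{{ONTO}}}{prop}")
            # Collapse runs of whitespace, as normalize-space() would in XPath.
            child.text = " ".join((node.text or "").split())
    return report


annotation = extract(PAGE, "http://example.org/report/1", RULES)
```

Because the rules only name paths and properties, the same rule set can be reapplied to every page of a site that shares the structure, which is the "annotate one, extract all the others" idea.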

10 XSLT for extraction rules
- XML-based intraweb
  - Ontology: XML syntax of RDF/S
  - Web page: XHTML document
  - Annotations: XML syntax of RDF
- XSLT
  - XML transformation language
  - XML syntax: communication of extraction rules
  - XPath expressions for data extraction
- Template construction
  - Translation of GUI manipulations
  - Built-in library of extraction templates, e.g., list extraction, keyword mapping
  - Transparent, embedded, combined
Example (surviving fragments of a list-extraction template):
<xsl:value-of select="$openning" disable-output-escaping="yes"/>
<xsl:value-of select="$closing" disable-output-escaping="yes"/>
<xsl:value-of select="$openning" disable-output-escaping="yes"/>
<xsl:value-of select="concat(normalize-space(substring-before($list, $delimiter)),' ')" />
<xsl:value-of select="$closing" disable-output-escaping="yes"/>
<xsl:value-of select="$openning" disable-output-escaping="yes"/>
<xsl:value-of select="concat(normalize-space($list),' ')" />
<xsl:value-of select="$closing" disable-output-escaping="yes"/>
<xsl:with-param name="list" select="substring-after($list, $delimiter)" />
<xsl:with-param name="delimiter" select="$delimiter" />
<xsl:with-param name="openning" select="$openning" />
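The XSLT fragments on this slide appear to come from a recursive list-extraction template: emit the first item wrapped in an opening and a closing tag, then call the template again on the rest of the list. The Python sketch below loosely mirrors that recursion under this reading. The function name and the `<c:Name>` tag are hypothetical, and the trailing space added by `concat(…, ' ')` in the fragments is not reproduced.

```python
def wrap_items(items, delimiter, opening, closing):
    """Wrap each delimited item in opening/closing markup, one recursive call per item."""
    if not items.strip():
        # Nothing left to emit, for example after a trailing delimiter.
        return ""
    if delimiter in items:
        # Same split as substring-before / substring-after in XPath.
        head, _, tail = items.partition(delimiter)
        return (
            opening
            + " ".join(head.split())  # whitespace collapsed, like normalize-space()
            + closing
            + wrap_items(tail, delimiter, opening, closing)
        )
    return opening + " ".join(items.split()) + closing


names = wrap_items("Kuhling H; Alm P ;  Olsson H", ";", "<c:Name>", "</c:Name>")
```

XSLT 1.0 has no loop over substrings, which is why a list held in a single text node is usually split with this kind of recursive template call.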

11 Example of extraction from the PubMed catalog
<c:ResearchReport rdf:about=" db=PubMed&list_uids= &dopt=Abstract\">
Expression of cyclins E, A, and B, and prognosis in lymph node-negative breast cancer.
Kuhling H; Alm P; Olsson H; Ferno M; Baldetorp B; Parwaresch R; Rudolph P.

12 Example of extraction from the INRIA librarian page
RR
Methods and Tools for Corporate Knowledge Management
Dieng, Rose; Corby, Olivier; Giboin, Alain; Ribière, Myriam
CORPORATE MEMORY; ORGANIZATIONAL MEMORY; TECHNICAL MEMORY; KNOWLEDGE MANAGEMENT

13 Roles and interactions
- Modified the interface controller to add a new GUI

14

15 Roles and interactions
- Modified the interface controller to add a new GUI
- The two roles of the wrapper society:
  - Annotation wrapper archivist (AWA): attached to 1 external source
  - Wrapper manager: creates and manages wrappers
- Hierarchical society
- AWA: 2 (sub-)roles involved in 2 societies
  - Annotation Wrapper role under Wrapper Manager supervision
  - Annotation Archivist role under Annotation Mediator supervision
[Figure: the Wrapper society (Managers, Wrappers), connected to the Web (www), alongside the Annotations society (Mediators, Archivists).]

16 Wrapper society (wrapper manager)
- Wrapper Manager (WM)
  - Contact point for other sub-societies
  - Currently only creates AWAs, but evolving toward managing the wrappers' life cycle
[Figure: WM and its acquaintances, the directory facilitator and the AWA.]

17 Wrapper society (annotation wrapper archivist)
- Annotation Wrapper Archivist (AWA)
  - Extraction and maintenance of a base of annotations
  - Makes annotations available for query solving
[Figure: AWA and its acquaintances, the wrapper manager and the archive of extracted annotations.]

18 Annotation-dedicated sub-society (archivist)
- Existing Archivist role (AA): manages a local archive
  - Attached to & exploits a local base
  - Answers queries as far as it can with local knowledge
  - Normally proposes archiving services
  - One sub-behavior overridden to refuse archiving services
[Figure: AA and its acquaintances, the annotation mediator and the annotation archive.]
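The reuse described on this slide, an existing archivist role with a single sub-behavior replaced, can be sketched with plain subclassing. This is not the CoMMA agent code: the class names, method names and data layout below are hypothetical, and real agents would exchange messages instead of calling methods directly.

```python
class Archivist:
    """Existing role: manages a local annotation archive."""

    def __init__(self, annotations):
        self.annotations = list(annotations)

    def answer(self, predicate):
        # Answer a query as far as local knowledge allows.
        return [a for a in self.annotations if predicate(a)]

    def propose_archiving(self, annotation):
        # Normal sub-behavior: accept and store a new annotation.
        self.annotations.append(annotation)
        return True


class AnnotationWrapperArchivist(Archivist):
    """Wrapper variant: its base is assumed to be filled by extraction only."""

    def propose_archiving(self, annotation):
        # Overridden sub-behavior: refuse archiving requests from the mediator.
        return False


awa = AnnotationWrapperArchivist([{"title": "Methods and Tools"}])
accepted = awa.propose_archiving({"title": "Submitted by a user"})
```

Query answering is inherited unchanged, so from the point of view of query solving a wrapper archivist looks like any other archivist.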

19 Annotation-dedicated sub-society (mediator)
- Existing Annotation Mediator (AM), untouched
  - Contact point for other sub-societies
  - Supervises the distribution of tasks for query solving
  - Allocates a new annotation to an archive (semantic distance)
  - Notifies the arrival of new annotations to trigger push functions
[Figure: AM and its acquaintances, the directory facilitator and the AA.]

20 Wrapping interaction protocol (acquaintance graph and interactions)
Participants: John: IC, localYP: DF, localWrapperMgr: WM, localAM: AM, WrapperArchivist: AWA, AMS, annotation base
Messages, in numbered order:
- 1: request, 2: agree, 3: inform (:protocol fipa-request :content :language FIPA-SL0 :ontology fipa-ag-man)
- 4: request, 5: agree (:protocol fipa-request :content :language FIPA-SL0 :ontology fipa-ag-man)
- (6: create)
- 7: inform
- 8: request, 9: agree (:protocol fipa-request :content :language CoMMA-RDF :ontology CoMMA-user)
- (10: create)
- 11: request, 12: agree, 13: inform (:protocol fipa-request :content :language FIPA-SL0 :ontology fipa-ag-man)
- 14: request, 15: agree, 16: inform (:protocol fipa-request :content :language FIPA-SL0 :ontology fipa-ag-man)
- 17: inform
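Each exchange on this slide follows the same shape: a request, an agree, and in most cases a final inform, all carrying the same protocol, language and ontology parameters. The sketch below checks that shape for a list of messages. It is a deliberate simplification of the FIPA request interaction protocol, which also allows refusals and failures, and the `AclMessage` type and both function names are hypothetical.

```python
from collections import namedtuple

# Hypothetical message record; the defaults mirror the parameters shown on the slide.
AclMessage = namedtuple(
    "AclMessage",
    "performative protocol language ontology",
    defaults=("fipa-request", "FIPA-SL0", "fipa-ag-man"),
)


def follows_request_pattern(messages):
    """True if the performatives read request, agree, then at most one inform."""
    performatives = [m.performative for m in messages]
    return performatives in (
        ["request", "agree"],
        ["request", "agree", "inform"],
    )


def same_conversation_parameters(messages):
    """True if all messages share the same protocol, language and ontology."""
    return len({(m.protocol, m.language, m.ontology) for m in messages}) <= 1


# A management-style exchange (default parameters) and a user-level one.
management = [AclMessage("request"), AclMessage("agree"), AclMessage("inform")]
user_level = [
    AclMessage("request", language="CoMMA-RDF", ontology="CoMMA-user"),
    AclMessage("agree", language="CoMMA-RDF", ontology="CoMMA-user"),
]
```

Which agent sends each message is not modelled here, since the sender and receiver of each numbered step are only visible in the original diagram.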

21 Conclusions and perspectives
- Tested on 3 different libraries: research reports of INRIA, technical reports of SCS – CMU, library of medicines of MedLINE (correctly annotated)
- Generate and feed a large annotation base for the performance evaluation of the CORESE search engine
- Improvements:
  - Relative XPaths, machine learning, multiple sources
  - Wrapper manager role: manage the wrapper archivists' life cycle (kill, restart, alerts to the administrator, etc.)
  - Annotation Wrapper Archivist role: registration services to notify changes in the structure or content, automatic adjustment of XPaths when the structure changes
- Applies to XML documents in general, including: RDF, XTopic, DAML+OIL, OWL, etc.
- General interest: flexible and modular, customization of the sub-behaviors of information agents, exchange of procedural knowledge, standards

22 Pointers
- More about current & future work on wrappers?
- More about CoMMA? See the PhD dissertation “Distributed Artificial Intelligence and Knowledge Management: ontologies and multi-agent systems for a corporate semantic web”
- Current work at C.M.U.: “myCampus”, semantic web services & agents for context-aware nomadic PDA-based access to the campus intraweb