1 ESCRIRE: Embedded Structured Content Representation In Repositories Jérôme Euzenat INRIA Rhône-Alpes

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

August 6, 2009 Joint Ontolog-OOR Panel 1 Ontology Repository Research Issues Joint Ontolog-OOR Panel Discussion Ken Baclawski August 6, 2009.
Ontology-enhanced retrieval (and Ontology-enhanced applications) Deborah L. McGuinness Associate Director and Senior Research Scientist Knowledge Systems.
ACACIA in short… Objectives: Offer methodological and software support (i.e. models, methods and tools) for construction, management and diffusion of.
Chapter 5: Introduction to Information Retrieval
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Jérôme Euzenat 655 avenue de l’Europe, Montbonnot Saint-Martin, France Enabling technologies for semantic interoperability.
Information Retrieval Review
Xyleme A Dynamic Warehouse for XML Data of the Web.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Information retrieval. perspectives: information retrieval application(s) - surveillance psychological - information overload experimental - search engines.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
What is a document? Information need: From where did the metaphor, doing X is like “herding cats”, arise? quotation? “Managing senior programmers is like.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Toward Semantic Web Information Extraction B. Popov, A. Kiryakov, D. Manov, A. Kirilov, D. Ognyanoff, M. Goranov Presenter: Yihong Ding.
CS580: Building Web Based Information Systems Roger Alexander & Adele Howe The purpose of the course is to teach theory and practice underlying the construction.
Alternatives to Metadata IMT 589 February 25, 2006.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Search Engines and Information Retrieval Chapter 1.
12 July 2002Colloquim on Applications of Natural Langauge Corpora, Saarland University Domain-specific Web Corpora and their Applications Gregor Erbach.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
The Semantic Web William M Baker
H. Lundbeck A/S3-Oct-151 Assessing the effectiveness of your current search and retrieval function Anna G. Eslau, Information Specialist, H. Lundbeck A/S.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Modern Information Retrieval Computer engineering department Fall 2005.
DL.org All WGs Meetings, Rome, May 2010 Quality Interoperability Approaches, case studies and open issues DL.org Quality Working Group Rome, 28 th.
WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Aquenergy Portal Elisabetta Zuanelli, University of Rome “Tor Vergata”, Italy E-Age 2014 Muscat december.
Semantic Technologies & GATE NSWI Jan Dědek.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
A Declarative Similarity Framework for Knowledge Intensive CBR by Díaz-Agudo and González-Calero Presented by Ida Sofie G Stenerud 25.October 2006.
Terminology and documentation*  Object of the study of terminology:  analysis and description of the units representing specialized knowledge in specialized.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Semantic Visualization What do we mean when we talk about visualization? - Understanding data - Showing the relationships between elements of data Overviews.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
INFO Week 8 Subject Indexing & Knowledge Representation Dr. Xia Lin Assistant Professor College of Information Science and Technology Drexel University.
Digital libraries and web- based information systems Mohsen Kamyar.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Measuring How Good Your Search Engine Is. *. Information System Evaluation l Before 1993 evaluations were done using a few small, well-known corpora of.
Topic Maps introduction Peter-Paul Kruijsen CTO, Morpheus software ISOC seminar, april 5 th 2005.
Information Retrieval
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Approach to building ontologies A high-level view Chris Wroe.
Co-funded by the European Union Semantic CMS Community Reference Architecture for Semantic CMS Copyright IKS Consortium 1 Lecturer Organization Date of.
Standards for representing meeting metadata and annotations in meeting databases Standards for representing meeting metadata and annotations in meeting.
Improvement of Semantic Interoperability based on Metadata Registry(MDR) Doo-Kwon Baik Dept. of CSE Korea University.
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Introduction Multimedia initial focus
SAMT 2006.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Lifecycle Metadata for Digital Objects
CSE 635 Multimedia Information Retrieval
Session 2: Metadata and Catalogues
Information Retrieval and Web Design
Information Retrieval and Web Design
Presentation transcript:

1 ESCRIRE: Embedded Structured Content Representation In Repositories Jérôme Euzenat INRIA Rhône-Alpes

2 ESCRIRE: Motivations Embedding a simplified but formal representation of content in documents : search on structured criteria; document comparison (genericity, similarity…); automatic classification and organisation.

3 Knowledge based queries (and book (about "Agatha Christie")) vs. book AND "Agatha Christie" (and flat (location "Alps")) …including those in Val d’Isère! (and bookshop (location "London")) …bookstore included.

4 Query languages level 3 Semiotic level 2 Semantic (F-logic, Escrire…) level 1 Structural (SQL, XQL) level 0 Full-text search

5 ESCRIRE: Goals Comparison of several knowledge representation techniques in order to find the type of situation to which they are most suited (indexing, classifying, filtering…).

6 ESCRIRE: Consortium “Coordinated research action (ARC)” involving Acacia (Sophia-Antipolis): conceptual graphs Sherpa/Exmo (Rhône-Alpes): object-based representations Orpailleur (Lorraine): terminological logics. Usinor: application.

7 “Ontology” Description ESCRIRE: Acquisition Global analysis Individual analysis Integration XML document Document Tr-schema Tr-object

8 ESCRIRE: Queries “Ontology” Query helper XML document Tr-schema Tr-queryTroeps XML document

9 ESCRIRE: Problem statement Given: A set of (HTML) documents annotated by a description of their content in a pivotal langage An ontology of the domain A set of queries about the subject. Retrieve: the adequate documents.

10 ESCRIRE: Software variation Knowledge representation + query evaluation Translated from a pivotal language in Conceptual graphs, Object-based representation, Description logic Translated by hand in CG, OKR, DL

11 ESCRIRE: Evaluation Two corpora Two (or three) sets (training and test) provided by an external user. Qualitative and quantitative evaluation (possibly external).

12 ESCRIRE: Quantitative criteria Precision: rate of correct answers Recall: rate of complete answers Acuracy=(precision+recall)/2 Performances in time Coverage of the query language Ordering of answers

13 ESCRIRE: Qualitative criteria Given by external users (query designers): Naturalness of queries Adequacy of answers Overall appreciation (aggregation).

14 ESCRIRE: Scaling Multiplying the size by orders of magnitude: Corpus Ontology Queries.

15 ESCRIRE: Reference comparisons Dublin core metadata Full-text search

16 ESCRIRE: Ontology elements (1) …

17 ESCRIRE: Ontology elements (2) … …

18 ESCRIRE: Content descriptions inhibition …

19 ESCRIRE: Knowledge embedding … … … … …

20 ESCRIRE: Queries Stated on objects, but results are documents (concerning these topics) Document similarity by content similarity

21 ESCRIRE: Query language SELECT / FROM / WHERE / ORDERBY + AND / OR / NOT / ALL / EXISTS | IN ALIKE

22 ESCRIRE: Corpus 1 Subject: genetic interaction Text source: MedLine abstracts Annotations: manual Ontology: Knife knowledge base + other

23 ESCRIRE: Corpus 2 Subject: Psychological stress Text source: MedLine abstracts Annotation: manual annotations Ontology: UMLS/MeSH

24 ESCRIRE: Where are we? Building translators from pivot to actual formats 1st part of Corpus 1 available (other data shall folow quikly)

25 ESCRIRE: Calls Other corpora Natural language technology Other representation systems starting from september 2000

26 For more information…