Evaluating XML retrieval: The INEX initiative Mounia Lalmas Queen Mary University of London

Slides:



Advertisements
Similar presentations
XIRQL: Eine Anfragesprache für Information Retrieval in XML-Dokumenten
Advertisements

COUNTER Update Peter Shepherd Project Director COUNTER STM Innovations Seminar, 2 December 2005.
INEX: Evaluating content-oriented XML retrieval Mounia Lalmas Queen Mary University of London
Evaluating content-oriented XML retrieval: The INEX initiative Mounia Lalmas Queen Mary University of London
XML Retrieval: from modelling to evaluation Mounia Lalmas Queen Mary University of London qmir.dcs.qmul.ac.uk.
Even More TopX: Relevance Feedback Ralf Schenkel Joint work with Osama Samodi, Martin Theobald.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Information Retrieval and Organisation Chapter 11 Probabilistic Information Retrieval Dell Zhang Birkbeck, University of London.
Traditional IR models Jian-Yun Nie.
Chapter 5: Introduction to Information Retrieval
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
Multimedia Database Systems
Modern Information Retrieval Chapter 1: Introduction
Query Languages. Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
XML Ranking Querying, Dagstuhl, 9-13 Mar, An Adaptive XML Retrieval System Yosi Mass, Michal Shmueli-Scheuer IBM Haifa Research Lab.
The COUNTER Code of Practice for Books and Reference Works Peter Shepherd Project Director COUNTER UKSG E-Books Seminar, 9 November 2005.
Overview of Collaborative Information Retrieval (CIR) at FIRE 2012 Debasis Ganguly, Johannes Leveling, Gareth Jones School of Computing, CNGL, Dublin City.
1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Search Engines and Information Retrieval
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Structure/XML Retrieval Mounia Lalmas Department of Computer Science Queen Mary University of London.
IR Models: Structural Models
DYNAMIC ELEMENT RETRIEVAL IN A STRUCTURED ENVIRONMENT MAYURI UMRANIKAR.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
Chapter 2Modeling 資工 4B 陳建勳. Introduction.  Traditional information retrieval systems usually adopt index terms to index and retrieve documents.
Reference Collections: Task Characteristics. TREC Collection Text REtrieval Conference (TREC) –sponsored by NIST and DARPA (1992-?) Comparing approaches.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
INEX 2003, Germany Searching in an XML Corpus Using Content and Structure INEX 2003, Germany Yiftah Ben-Aharon, Sara Cohen, Yael Grumbach, Yaron Kanza,
WXGB6106 INFORMATION RETRIEVAL Week 3 RETRIEVAL EVALUATION.
1 - Fuhr: Information Retrieval Methods for XML Documents XIRQL: Eine Anfragesprache für Information Retrieval in XML- Dokumenten Norbert Fuhr Universität.
Chapter 5: Information Retrieval and Web Search
XML Information Retrieval and INEX Norbert Fuhr University of Duisburg-Essen.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
INEX : Understanding XML Retrieval Evaluation Mounia Lalmas and Anastasios Tombros Queen Mary, University of London Norbert Fuhr University.
Search Engines and Information Retrieval Chapter 1.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
INEX – a broadly accepted data set for XML database processing? Pavel Loupal, Michal Valenta.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Semantic Learning Instructor: Professor Cercone Razieh Niazi.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
ISP 433/533 Week 11 XML Retrieval. Structured Information Traditional IR –Unit of information: terms and documents –No structure Need more granularity.
Controlling Overlap in Content-Oriented XML Retrieval Charles L. A. Clarke School of Computer Science University of Waterloo Waterloo, Canada.
Chapter 6: Information Retrieval and Web Search
Book: Bayesian Networks : A practical guide to applications Paper-authors: Luis M. de Campos, Juan M. Fernandez-Luna, Juan F. Huete, Carlos Martine, Alfonso.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Lecture 1: Overview of IR Maya Ramanath. Who hasn’t used Google? Why did Google return these results first ? Can we improve on it? Is this a good result.
Users and Assessors in the Context of INEX: Are Relevance Dimensions Relevant? Jovan Pehcevski, James A. Thom School of CS and IT, RMIT University, Australia.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Towards Contextual and Structural Relevance Feedback in XML Retrieval Lobna Hlaoua IRIT (Institut de Recherche en Informatique de Toulouse) Equipe SIG-RI.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
1 Information Retrieval LECTURE 1 : Introduction.
Information Retrieval
Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq
1 13/05/07 1/20 LIST – DTSI – Interfaces, Cognitics and Virtual Reality Unit The INFILE project: a crosslingual filtering systems evaluation campaign Romaric.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Information Retrieval Lecture 3 Introduction to Information Retrieval (Manning et al. 2007) Chapter 8 For the MSc Computer Science Programme Dell Zhang.
General Architecture of Retrieval Systems 1Adrienn Skrop.
Using Blog Properties to Improve Retrieval Gilad Mishne (ICWSM 2007)
Multimedia Information Retrieval
“INEX 2005: Playground for XML-retrieval” Sergey Chernov
موضوع پروژه : بازیابی اطلاعات Information Retrieval
COUNTER Update February 2006.
Introduction to Information Retrieval
Chapter 5: Information Retrieval and Web Search
“The need for Semantic Desktop Dataset” L3S and University of Hannover, Germany Sergey Chernov, Tereza Iofciu, Wolfgang Nejdl, Xuan Zhou (chernov, iofciu,
CoXML: A Cooperative XML Query Answering System
Presentation transcript:

Evaluating XML retrieval: The INEX initiative Mounia Lalmas Queen Mary University of London

Outline Information retrieval Information retrieval (Content-oriented) XML retrieval (Content-oriented) XML retrieval Evaluating information retrieval Evaluating information retrieval Evaluating XML retrieval: INEX Evaluating XML retrieval: INEX

Information retrieval Example of a user information need: Find all documents about sailing charter agencies that (1) offer sailing boats in the Greek islands, and (2) are registered with the RYA. The documents should contain boat specification, price per week, and other contact details. A formal representation of an information need constitutes a query

Information retrieval IR is concerned with the representation, storage, organisation, and access to repositories of information, usually under the form of documents. Primary goal of an IR system Retrieve all the documents which are relevant (useful) to a user query, while retrieving as few non-relevant documents as possible.

DocumentsQuery Document representation Retrieval results Query representation IndexingFormulation Retrieval function Relevance feedback Conceptual model for IR

Structured Document Retrieval Traditional IR is about finding relevant documents to a users information need, e.g. entire book. Traditional IR is about finding relevant documents to a users information need, e.g. entire book. SDR allows users to retrieve document components that are more focussed to their information needs, e.g a chapter of a book instead of an entire book. SDR allows users to retrieve document components that are more focussed to their information needs, e.g a chapter of a book instead of an entire book. The structure of documents is exploited to identify which document components to retrieve. The structure of documents is exploited to identify which document components to retrieve.

Structured Documents Linear order of words, sentences, paragraphs … Hierarchy or logical structure of a books chapters, sections … Links (hyperlink), cross- references, citations … Temporal and spatial relationships in multimedia documents Book Chapters Sections Paragraphs World Wide Web This is only only another to look one le to show the need an la a out structure of and more a document and so ass to it doe not necessary text a structured document have retrieval on the web is an it important topic of todays research it issues to make se last sentence..

Structured Documents Explicit structure formalised through document representation standards (Mark-up Languages) Explicit structure formalised through document representation standards (Mark-up Languages) Layout Layout LaTeX (publishing), HTML (Web publishing) Structure Structure SGML, XML (Web publishing, engineering), MPEG-7 (broadcasting) Content/Semantic Content/Semantic RDF, DAML + OIL, OWL (semantic web) World Wide Web This is only only another to look one le to show the need an la a out structure of and more a document and so ass to it doe not necessary text a structured document have retrieval on the web is an it important topic of todays research it issues to make se last sentence.. SDR … …

XML: eXtensible Mark-up Language Meta-language (user-defined tags) currently being adopted as the document format language by W3C Meta-language (user-defined tags) currently being adopted as the document format language by W3C Used to describe content and structure (and not layout) Used to describe content and structure (and not layout) Grammar described in DTD ( used for validation) Grammar described in DTD ( used for validation) Structured Document Retrieval Smith John Introduction into XML retrieval …. … …

XML: eXtensible Mark-up Language Use of XPath notation to refer to the XML structure chapter/title: title is a direct sub-component of chapter //title: any title chapter//title: title is a direct or indirect sub-component of chapter chapter/paragraph[2]: any direct second paragraph of any chapter chapter/*: all direct sub-components of a chapter Structured Document Retrieval Smith John Introduction into SDR …. …

Querying XML documents Content-only (CO) queries Content-only (CO) queries ' open standards for digital video in distance learning ' Content-and-structure (CAS) queries Content-and-structure (CAS) queries //article [about(., 'formal methods verify correctness aviation systems')] /body//section /body//section [about(.,'case study application model checking theorem proving')] [about(.,'case study application model checking theorem proving')] Structure-only (SA) queries Structure-only (SA) queries/article//*section/paragraph[2]

Conceptual model for XML retrieval Structured documents Content + structure Inverted file + structure index tf, idf, acc Matching content + structure Presentation of related components DocumentsQuery Document representation Retrieval results Query representation IndexingFormulation Retrieval function Relevance feedback

Content-oriented XML retrieval Return document components of varying granularity (e.g. a book, a chapter, a section, a paragraph, a table, a figure, etc), relevant to the users information need both with regards to content and structure.

Content-oriented XML retrieval Retrieve the best components according to content and structure criteria: INEX: most specific component that satisfies the query, while being exhaustive to the query INEX: most specific component that satisfies the query, while being exhaustive to the query Shakespeare study: best entry points, which are components from which many relevant components can be reached through browsing Shakespeare study: best entry points, which are components from which many relevant components can be reached through browsing ??? ???

Article ?XML,?retrieval Article ?XML,?retrieval ?authoring ?authoring 0.9 XML 0.5 XML 0.2 XML 0.9 XML 0.5 XML 0.2 XML 0.4 retrieval 0.7 authoring 0.4 retrieval 0.7 authoring Challenges Title Section 1 Section 2 No fixed retrieval unit + nested document components + different types of document components how to obtain document and collection statistics? which component is a good retrieval unit? which components contribute best to content of Article? how to estimate? how to aggregate?

Approaches … vector space model probabilistic model bayesian network language model extending DB model boolean model natural language processing cognitive model ontology parameter estimation tuning smoothing fusion phrase term statistics collection statistics component statistics proximity search logistic regression belief model relevance feedback

Evaluation The goal of an IR system The goal of an IR system retrieve as many relevant documents as possible and as few non- relevant documents as possible Comparative evaluation of technical performance of IR systems = effectiveness Comparative evaluation of technical performance of IR systems = effectiveness ability of the IR system to retrieve relevant documents and suppress non-relevant documents Effectiveness Effectiveness combination of recall and precision

Relevance A document is relevant if it has significant and demonstrable bearing on the matter at hand. A document is relevant if it has significant and demonstrable bearing on the matter at hand. Common assumptions: Common assumptions: Objectivity Objectivity Topicality Topicality Binary nature Binary nature Independence Independence

Recall / Precision Document collection Retrieved Relevant Retrieved and relevant

Recall / Precision relevant documents for a given query {d3, d5, d9, d25, d39, d44, d56, d71, d89, d123} rankdocprecisionrecallrankdocprecisionrecall d123d84d56D6d8d9d5111/12/33/61/102/103/ d129d187d25d48d250d113d34/105/144/105/10

Test collection Document collection = document themselves Document collection = document themselves depend on the task, e.g. evaluating web retrieval requires a collection of HTML documents. Queries / requests Queries / requests simulate real user information needs. Relevance judgements Relevance judgements stating for a query the relevant documents. See TREC, CLEF, etc See TREC, CLEF, etc

Evaluation of XML retrieval: INEX Evaluating the effectiveness of content-oriented XML retrieval approaches Evaluating the effectiveness of content-oriented XML retrieval approaches Collaborative effort participants contribute to the development of the collection Collaborative effort participants contribute to the development of the collectionqueries relevance assessments Similar methodology as for TREC, but adapted to XML retrieval Similar methodology as for TREC, but adapted to XML retrieval 40+ participants worldwide 40+ participants worldwide Workshop in Schloss Dagstuhl in December (20+ institutions) Workshop in Schloss Dagstuhl in December (20+ institutions)

INEX Test Collection Documents (~500MB), which consist of 12,107 articles in XML format from the IEEE Computer Society; 8 millions elements Documents (~500MB), which consist of 12,107 articles in XML format from the IEEE Computer Society; 8 millions elements INEX 2002 INEX CO and 30 CAS queries inex_eval metric INEX 2003 INEX CO and 30 CAS queries CAS queries are defined according to enhanced subset of XPath inex_eval and inex_eval_ng metrics INEX 2004 is just starting INEX 2004 is just starting

Relevance in XML A element is relevant if it has significant and demonstrable bearing on the matter at hand A element is relevant if it has significant and demonstrable bearing on the matter at hand Common assumptions in IR Common assumptions in IR Objectivity Objectivity Topicality Topicality Binary nature Binary nature Independence Independence section paragraph article

Relevance in INEX Exhaustivity Exhaustivity how exhaustively a document component discusses the query: 0, 1, 2, 3 Specificity Specificity how focused the component is on the query: 0, 1, 2, 3 Relevance Relevance (3,3), (2,3), (1,1), (0,0), … (3,3), (2,3), (1,1), (0,0), … section article all sections relevant article very relevant all sections relevant article better than sections one section relevant article less relevant one section relevant section better than article …

Relevance assessment task Completeness Completeness Element parent element, children element Element parent element, children element Consistency Consistency Parent of a relevant element must also be relevant, although to a different extent Parent of a relevant element must also be relevant, although to a different extent Exhaustivity increase going Exhaustivity increase going Specificity decrease going Specificity decrease going Use of an online interface Use of an online interface Assessing a query takes a week! Assessing a query takes a week! Average 2 topics per participants Average 2 topics per participants Only participants that complete the assessment task have access to the collection Only participants that complete the assessment task have access to the collection section paragraph article

Metrics Recall / precision - based Recall / precision - based quantisation functions to obtain one relevance value expected search length expected search length penalise overlap penalise overlap consider size consider size Others Others expected ratio of relevant cumulated gain-based metrics tolerance to irrelevance section article

Lessons learnt Good definition of relevance Good definition of relevance Expressing CAS queries was not easy Expressing CAS queries was not easy Relevance assessment process must be improved Relevance assessment process must be improved Further development on metrics needed Further development on metrics needed User studies required User studies required

Conclusion XML retrieval is not just about the effective retrieval of XML documents, but also about how to evaluate effectiveness XML retrieval is not just about the effective retrieval of XML documents, but also about how to evaluate effectiveness INEX 2004 tracks INEX 2004 tracks Relevance feedback Relevance feedback Interactive Interactive Heterogeneous collection Heterogeneous collection Natural language query Natural language query

Evaluating XML retrieval: The INEX initiative Mounia Lalmas Queen Mary University of London