LogCLEF 2009 Log Analysis for Digital Societies (LADS) Thomas Mandl, Maristella Agosti, Giorgio Maria Di Nunzio, Alexander Yeh, Inderjeet Mani, Christine.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
While You Were Out: How Students are Transforming Information and What it Means for Publishing Kate Wittenberg The Electronic Publishing Initiative at.
WEB USAGE MINING FRAMEWORK FOR MINING EVOLVING USER PROFILES IN DYNAMIC WEBSITE DONE BY: AYESHA NUSRATH 07L51A0517 FIRDOUSE AFREEN 07L51A0522.
Thomas Mandl, Julia Maria Schulz LREC 2010, Web Logs & QA, /10 Log-Based Evaluation Resources for Question Answering Thomas Mandl, Julia Maria.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Overview of Collaborative Information Retrieval (CIR) at FIRE 2012 Debasis Ganguly, Johannes Leveling, Gareth Jones School of Computing, CNGL, Dublin City.
Features and Uses of a Multilingual Full-Text Electronic Theses and Dissertations (ETDs) System Yin Zhang Kent State University Kyiho Lee, Bumjong You.
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
Search Engines and Information Retrieval
1 Adaptive Management Portal April
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
8/29/2000Database Management -- Fall R. Larson Database Management: Introduction University of California, Berkeley School of Information Management.
Web Mining Research: A Survey
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Web Logs and Question Answering Richard Sutcliffe 1, Udo Kruschwitz 2, Thomas Mandl University of Limerick, Ireland 2 - University of Essex, UK 3.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
8/28/2001Database Management -- Fall R. Larson Database Management: Introduction University of California, Berkeley School of Information Management.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
Knowledge is Power Marketing Information System (MIS) determines what information managers need and then gathers, sorts, analyzes, stores, and distributes.
 Ad-hoc - This track tests mono- and cross- language text retrieval. Tasks in 2009 will test both CL and IR aspects.
Europeana: Europe's Digital Library, Museum and Archive Ashley Carter and Dana Sagona.
 Official Site: facility.org/research/evaluation/clef-ip-10http:// facility.org/research/evaluation/clef-ip-10.
Databases & Data Warehouses Chapter 3 Database Processing.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Classroom User Training June 29, 2005 Presented by:
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Search Engines and Information Retrieval Chapter 1.
CONCLUSION & FUTURE WORK Normally, users perform triage tasks using multiple applications in concert: a search engine interface presents lists of potentially.
“Cross-Media and Personalized Learning Applications on top of Digital Libraries” 20 September 2007, Budapest, Hungary M. Agosti 1, T. Coppotelli 1, G.M.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
National Library of Estonia in the TEL-ME-MOR project IST4Balt workshop in Estonia June 2006 Baltic ICT Community.
User Behavior Analysis of Location Aware Search Engine Third international Conference of MDM, 2002 Takahiko Shintani, Iko Pramudiono NTT Information Sharing.
1 CS430: Information Discovery Lecture 18 Usability 3.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
L JSTOR Tools for Linguists 22nd June 2009 Michael Krot Clare Llewellyn Matt O’Donnell.
Which Log for which Information? Gathering Multilinguality Data from Different Log File Types Maria Gäde, Vivien Petras, and Juliane Stiller Humboldt-Universität.
MOVIE RETRIEVAL SYSTEM INFORMATION VISUALIZATION & PROPOSING NEW INTERFACE IAT 814 Adrian Bisek.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Thomas Mandl: GeoCLEF Track Overview Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) 8 th Workshop.
Information Retrieval
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Multilingual terminologies: the experience of Europeana Collection Athena Plus Workshop : “Innovative tools and pilots for access to digital.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Data mining in web applications
From CLEF to TrebleCLEF Promoting Technology Transfer
Usage scenarios, User Interface & tools
Joseph JaJa, Mike Smorul, and Sangchul Song
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
European Network of e-Lexicography
Boštjan Kožuh Statistical Office of the Republic of Slovenia,
Web Mining Department of Computer Science and Engg.
Introduction to Information Retrieval
Presentation transcript:

LogCLEF 2009 Log Analysis for Digital Societies (LADS) Thomas Mandl, Maristella Agosti, Giorgio Maria Di Nunzio, Alexander Yeh, Inderjeet Mani, Christine Doran, Julia Maria Schulz LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

Overview Task Data Participants Results LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

The LADS Task The aim of the LADS task is to analyze user behavior with a focus on multilingual search. User interaction with the portal at query time –e.g. how users interact with the search interface, what kind of search they perform –how many of them reformulate queries, browse results, leave the portal to follow the search in a national library. LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

The LADS Task LADS deals with logs from The European Library (TEL) TEL is a free service that offers access to the resources of 48 national libraries of Europe in 35 languages. Resources can be both digital (e.g. books, posters, maps, sound recordings, videos) and bibliographical. Quality and reliability are guaranteed by the 48 collaborating national libraries of Europe. LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

Goals This task was open to diverse approaches, in particular data mining techniques in order to extract knowledge from the data and find interesting user patterns: –user session reconstruction (necessary) –user interaction with the portal at query time –multilinguality and query reformulation –user context and user profile LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

TEL Environment LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

TEL Environment LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

Data The data used for the LADS task are search (“action”) logs of The European Library portal All the actions are logged and stored by TEL in a relational table –each record represents a user action. The most significant columns of the table are: –A numeric id, for identifying registered users or “guest” otherwise; –User’s IP address; –An automatically generated alphanumeric, identifying sequential actions of the same user (sessions) ; –Query contents; –Name of the action that a user performed; –The corresponding collection’s alphanumeric id; –Date and time of the action’s occurrence. LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

Data Action logs distributed to the participants of the task cover the period from 1st January 2007 until 30th June –1,866,330 records PostgreSQL table, csv file Description of the collection LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

Participants About 20 participants registered 4 participants submitted results –University of Sunderland –Trinity College Dublin –University of Hildesheim –CELI Research, Torino LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

Results CELI: identify translations of search queries. –The result is a list of pairs of queries in two languages. –Combined with session information, it is possible the check whether users translate their query within a session. University of Sunderland: users rarely switch the query language during their sessions. –They also found out that queries are typically submitted in the language of the interface which the user selects. Trinity College Dublin: thorough analysis of query reformulation, query length and activity sequence. –understanding of the behavior of users from different linguistic or cultural backgrounds. University of Hildesheim: sequences of interactions within the log file. –Visualized in an interactive user interface which allows the exploration of the sequences. University of Amsterdam: gain more context information –limited knowledge about the user which is inherent in log files needs to be tackled –semantic enrichment of the queries by linking them to digital objects [7]. LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop

Conclusions LogCLEF has provided an evaluation resource with log files of user activities in multilingual search environments: –the Tumba! Search engine and –The European Library (TEL) Web site. The results and approaches of the participants to the 2009 campaign will be helpful to define a more formal task in the next LogCLEF. Advertise better! –Workshop on Query Log Analyisis (TrebleCLEF 2009) –Workshop on Understanding the User Logging and interpreting user interactions in information search and retrieval (SIGIR 2009) Sharing resources and knowledge about log files, Collaborative User Log Analysis Pool –Mailing list –Web site LogCLEF Overview LADS Task October 1, 2009, Corfu, GR CLEF Workshop