Pisa, 19 February, 20011 CYCLADES The Personalization Service Fabrizio Sebastiani IEI-CNR (Italy)

Slides:



Advertisements
Similar presentations
February, CYCLADES An Open Collaborative Virtual Archive Environment Umberto Straccia IEI-CNR (Italy)
Advertisements

Michela MichilliStockolm 11 June Museums best practices.
Università di AnconaCoopIS01 September 6, C OOPERATION S TRATEGIES FOR I NFORMATION I NTEGRATION Maurizio Panti, Luca Spalazzi, Loris Penserini
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Universidade de São Paulo
SCC October, A Modern Interface For Library Public PCs Richard Wayne UT Southwestern Library utsouthwestern.edu.
OAF Workshop, May 13-14, 2002, Pisa.CYCLADES IST CYCLADES An Open Collaborative Virtual Archive Environment Umberto Straccia.
Lisbon, 7th December 2002 Open Archives Forum Workshop Cyclades E U V Framework Project – February 2001 IEI-CNR (Italy) UNIVERSITY OF DORTMUND (Germany)
Relational Database and Data Modeling
Monitoring very high speed links Gianluca Iannaccone Sprint ATL joint work with: Christophe Diot – Sprint ATL Ian Graham – University of Waikato Nick McKeown.
1 ICS-FORTH Dimitris Plexousakis, Pisa, February 2001 The CYCLADES Mediator Service Dimitris Plexousakis Computer Science Department, University.
The Cost of Authoring with a Knowledge Layer Judy Kay and Lichao Li School of Information Technologies The University of Sydney, Australia.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
TU/e technische universiteit eindhoven WebNet 2001October 26, XML to XML through XML Pim Lemmens Geert-Jan Houben Eindhoven University of Technology.
Database System Concepts and Architecture
Modern information retrieval Modelling. Introduction IR systems usually adopt index terms to process queries IR systems usually adopt index terms to process.
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
More Interfaces for Retrieval. Information Retrieval Activities Selecting a collection –Lists, overviews, wizards, automatic selection Submitting a request.
CS347 Review Slides (IR Part II) June 6, 2001 ©Prabhakar Raghavan.
Parliamentary Library Parliamentary Library Information, analysis and advice for the Parliament Parliament of Australia Department of Parliamentary Services.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Personalised Search on the World Wide Web Originally by Micarelli, Gasparetti, Sciarrone & Gauch
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
An investigation of query expansion terms Gheorghe Muresan Rutgers University, School of Communication, Information and Library Science 4 Huntington St.,
 IR: representation, storage, organization of, and access to information items  Focus is on the user information need  User information need:  Find.
INTRODUCTION TO DATABASE USING MS ACCESS 2013 PART 2 NOVEMBER 4, 2014.
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Modern Information Retrieval Computer engineering department Fall 2005.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
CYCLADES IST CYCLADES: A Personalised Collaborative Digital Library Environment Umberto Straccia I.S.T.I. - C.N.R. Pisa (ITALY)
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MetaLib 4 User Guide. 2 MetaLib 4 Access MetaLib at: – MetaLib may be used at two different levels –
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Recuperação de Informação Cap. 01: Introdução 21 de Fevereiro de 1999 Berthier Ribeiro-Neto.
SCALABLE INFORMATION-DRIVEN SENSOR QUERYING AND ROUTING FOR AD HOC HETEROGENEOUS SENSOR NETWORKS Paper By: Maurice Chu, Horst Haussecker, Feng Zhao Presented.
Software Configuration Management SEII-Lecture 21
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
DeepBET Reverse-Engineering the Behavioral Targeting mechanisms of Ad Networks via Deep Learning Sotirios Chatzis Cyprus University of Technology.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Module 5: Managing Content. Overview Publishing Content Executing Reports Creating Cached Instances Creating Snapshots and Report History Creating Subscriptions.
Topics Covered: File Components of file Components of file Terms used Terms used Types of business file Types of business file Operations on file Operations.
Integrated ISO ILL for staff users Borrowing requests – part two Yoel Kortick 2007.
Copyright 2006 John Wiley & Sons, Inc Chapter 5 – Cognitive Engineering HCI: Developing Effective Organizational Information Systems Dov Te’eni Jane Carey.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
WHIM- Spring ‘10 By:-Enza Desai. What is HCIR? Study of IR techniques that brings human intelligence into search process. Coined by Gary Marchionini.
DSS & Warehousing Systems
Working in the Forms Developer Environment
Modern Information Retrieval
System for Semi-automatic ontology construction
Education in the Information Era
Introduction lecture1.
Information Retrieval
Document Visualization at UMBC
ece 627 intelligent web: ontology and beyond
Data Model.
Data Warehousing Data Model –Part 1
CSE 635 Multimedia Information Retrieval
CYCLADES An Open Collaborative Virtual Archive Environment
Appendix D: Network Model
Information Retrieval and Web Design
Recuperação de Informação
Presentation transcript:

Pisa, 19 February, CYCLADES The Personalization Service Fabrizio Sebastiani IEI-CNR (Italy)

Pisa, 19 February, Main goal The Personalization Service (PS) is responsible for allowing a user to interact with the system in a flexible and highly personalized way.

Pisa, 19 February, Personalization as a topic-driven activity Personalization is viewed as a content-based notion; i.e. the interaction with the user must take into account the users interests. The users interests are viewed as a hierarchically structured set of topics, each of which represents the users subjective view of a discipline which is of interest to her. Personalization is achieved by –Creating and maintaining a set of topic profiles (i.e. internal representations of topics) –Letting the information seeking/presenting behaviour of the system be influenced by these profiles.

Pisa, 19 February, User topics and community topics Topics (and their profiles) may belong to: –a particular user (user-specific topics). –a particular community (community-specific topics). Any user has a default subscription to the topics belonging to a community she is a member of, and may selectively cancel these subscriptions. The topics that pertain to a user are thus the union of –the topics that belong to the user as an individual –the topics that belong to the communities the user is a member of.

Pisa, 19 February, An example topic hierarchy Fuzzy Logic Possibilistic Logics Digital Libraries Probabilistic Logics Logics for Uncertainty and Imprecision Umbertos CYCLADES folder Information Retrieval Logics of subjective probability Logics of objective probability

Pisa, 19 February, Topic-based personalization Personalization influences the behaviour of the system in –Seeking information –Presenting the retrieved information to the user both when it operates –in ad hoc mode –in on-demand mode

Pisa, 19 February, Personalization in the ad hoc mode Ad hoc (pull) mode: the system delivers to the user information it deems relevant to a specific request she has just explicitly issued. This modality is apt to serve user information needs of a contingent (temporally local) nature. For personalization, the topic profiles of the user and of the communities to which the user belongs are to be used globally (thus forming a user profile) as "background information" that allow the request to be understood in the context of the users long-term interests.

Pisa, 19 February, Personalization in the on demand mode On demand mode: the user asks the system to deliver to her any information that might have recently become available and that is relevant to her own interests. This modality is used to satisfy information needs of a permanent or semi-permanent (temporally global) nature. For personalization, each of these profiles is to be used as a standing (i.e. permanent) query.

Pisa, 19 February, Classification of results The PS further provides personalized automatic classification of the records resulting from the (either ad hoc or on-demand) information-providing activity, into a set of hierarchically organized folders, each corresponding to a topic. A topic profile is thus (the declarative part of) a classifier, i.e. a tool that can automatically decide whether an item is relevant or not to the topic Any automatic classifier is going to have a non-null error rate: the user is thus allowed to correct the classifiers decisions that she perceives as wrong

Pisa, 19 February, How are classifiers built and updated? The classifier is automatically built by learning from items the user has classified manually. The classifier is automatically updated by interpreting implicit user feedback, consisting of –moving an item from folder A to folder B –deleting an item from folder A –aliasing an item from folder A to folder B Updates reflect –errors by the classifier –semantic shifts in the meaning of the involved topics

Pisa, 19 February, Dynamic topic hierarchies The user (or the community owner) may decide herself how to structure the topic hierarchy initially; a single-node hierarchy is the default The topic hierarchy is dynamic, as the user (or the system, after user approval) may restructure it by –Splitting a topic into several subtopics –Collapsing the subtopics into a single topic –Deleting a topic

Pisa, 19 February, Some technical issues Topic profiles will be inductively built and updated by supervised learning techniques; we plan to use incremental classification techniques (e.g. Balanced Winnow) The hierarchical topic structure will be dynamically modified by unsupervised learning techniques (i.e. clustering) The classifiers will be evaluated and optimized using user- dependent utility measures (e.g. giving different penalties to errors of commission and errors of omission).

Pisa, 19 February, The PS and the Mediator Service When the MS interacts with user X, the PS feeds to the MS the topic profiles (resp. user profile) of X that the QBS uses to issue on-demand X queries (resp. to personalize ad hoc X queries) The QBS feeds to the PS the results of these queries that the PS classifies in the topic hierarchy specific to the user. The MS informs the PS of user actions (such as moving, deleting or aliasing records across folders) that the PS may use for updating topic profiles and/or restructuring (upon user approval) the structure of the topic hierarchy.