Information Retrieval Lab DiSCo – University of Milan Bicocca Viale Sarca 336 U14 Head: Prof. Gabriella Pasi.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Information Retrieval Lab DISCo – University of Milan Bicocca viale Sarca 336 U14 Head: Prof. Gabriella Pasi.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
OAF Workshop, May 13-14, 2002, Pisa.CYCLADES IST CYCLADES An Open Collaborative Virtual Archive Environment Umberto Straccia.
Lisbon, 7th December 2002 Open Archives Forum Workshop Cyclades E U V Framework Project – February 2001 IEI-CNR (Italy) UNIVERSITY OF DORTMUND (Germany)
Bringing It All Together: An Academic Viewpoint (What is needed and what is likely to come next?) Association of Information and Dissemination Centers.
A Domain Level Personalization Technique A. Campi, M. Mazuran, S. Ronchi.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Modern Information Retrieval Chapter 1: Introduction
Multi-Phase Reasoning of temporal semantic knowledge Sakirulai O. Isiaq and Taha Osman School of Computer and Informatics Nottingham Trent University Nottingham.
Torrossa The Casalini Libri Full Text Platform Featuring ebook and ejournal content from Romance language countries Moscow International.
Search Engines and Information Retrieval
Modern Information Retrieval Chapter 1: Introduction
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Advanced Distributed Learning. Conditions Before SCORM  Couldn’t move courses from one Learning Management System to another  Couldn’t reuse content.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Information Retrieval in Practice
Personalised Search on the World Wide Web Originally by Micarelli, Gasparetti, Sciarrone & Gauch
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Information Retrieval – Introduction and Survey Norbert Fuhr University of Duisburg-Essen Germany
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Some studies on Vietnamese multi-document summarization and semantic relation extraction Laboratory of Data Mining & Knowledge Science 9/4/20151 Laboratory.
Attention and Event Detection Identifying, attributing and describing spatial bursts Early online identification of attention items in social media Louis.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
revised CmpE 583 Fall 2006Discussion: OWL- 1 CmpE 583- Web Semantics: Theory and Practice DISCUSSION: OWL Atilla ELÇİ Computer Engineering.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Personalized Information Retrieval in Context David Vallet Universidad Autónoma de Madrid, Escuela Politécnica Superior,Spain.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Give Your Online Teaching a JOLT Michelle Pilati, PhD Professor of Psychology Rio Hondo College Edward H. Perry, PhD Professor of Mechanical Engineering.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Query Expansion By: Sean McGettrick. What is Query Expansion? Query Expansion is the term given when a search engine adding search terms to a user’s weighted.
Similar Document Search and Recommendation Vidhya Govindaraju, Krishnan Ramanathan HP Labs, Bangalore, India JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Toward the Next generation of Recommender systems
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
School of Electrical Engineering & Computer Science National University of Sciences & Technology (NUST), Pakistan Research Profile Sharifullah Khan.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Towards Contextual and Structural Relevance Feedback in XML Retrieval Lobna Hlaoua IRIT (Institut de Recherche en Informatique de Toulouse) Equipe SIG-RI.
Masoud Makrehchi, PAMI, UW Learning Object Metadata Masoud Makrehchi PAMI University of Waterloo August 2004.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
1 Information Retrieval LECTURE 1 : Introduction.
Information Retrieval
Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Contextual Text Cube Model and Aggregation Operator for Text OLAP
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
WP1: Plan for the remainder (1) Ontology –Finalise ontology and lexicons for the 2 nd domain (RTV) Changes agreed in Heraklion –Improvement to existing.
Information Retrieval in Practice
Information Retrieval and Web Search
Web Mining Ref:
Information Retrieval and Web Search
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
Information Retrieval and Web Search
Information Retrieval
CSE 635 Multimedia Information Retrieval
Magnet & /facet Zheng Liang
Haystack: an Adaptive Personalized Information Retrieval System
Information Retrieval and Web Design
Information Retrieval and Web Design
Information Retrieval and Web Search
Presentation transcript:

Information Retrieval Lab DiSCo – University of Milan Bicocca Viale Sarca 336 U14 Head: Prof. Gabriella Pasi

IR Lab people Gabriella Pasi Associate Professor and Head of the Laboratory Silvia Calegari Post-doc DISCO Stefania Marrara Post-doc UNIMI Célia Cristina Pereira Post-doc UNIMI

IR Lab numbers Small but active! ▫Two people (since January 2009) ▫Two external collaborators (since 2008) ▫Three workplaces for Students and Collaborators ▫About 60 articles in proceeding of international conferences and in international journals in the last three years ▫4-5 master students per year

The IR Lab in brief The Information Retrieval Group (IRG) was established in 2005 at DiSCo, University of Milan Bicocca. FOCUS: as the amount of information available on the Web has enormously increased in last years, there is need of effective systems that allow an easy and flexible access to information relevant to specific user’s needs. By flexibility is here meant the capability of the system to both manage imperfect (vague and/or uncertain) information, and to personalise its behaviour to the user context. AIM: the research activity undertaken by the IRG is aimed at defining models and techniques that improve the limitations of current systems for the Information Access to the main aim of offering personalised and flexible solutions to the problem of locating information relevant to specific user’s needs.

Research in IR: some main issues Improving indexing text representation is usually based on keywords extraction and weighting ▫how to improve document representations?  Conceptual indexing based on the use of conceptual structures  Latent semantic indexing  Metadata and the Semantic Web Modeling user preferences in query formulation usually based on selection criteria specified by terms ▫how to formulate queries that capture real users’ needs?  Modeling the user’s context  Accounting for vagueness  Defining mechanism for query reformulation, relevance feedback

Improving relevance estimate usually based on a measure of topicality, more recently Popularity (in search engines) ▫It should be based on additional criteria:  Novelty  Trust in information sources  Timeliness  Contextual information (geographic location, date, author, etc…) ▫It should be learnt on the basis of users needs/behavior  Application of machine learning techniques  Query reformulation Text classification Text summarization Research in IR: some main issues

IR Lab Activity Research areas: ▫Information Retrieval ▫Information Filtering ▫Document Clustering ▫Personalization ▫XML Retrieval Application Domains: ▫Large document repositories ▫World Wide Web

Ongoing and future research Definition of conceptual approaches to IR. Definition of flexible query languages for semi- structured documents (XML). Definition of models for multi-dimensional relevance assessment Definition of text clustering techniques Definition of techniques for assessment of text quality and their use for relevance assessment Web Service Retrieval

Personalized Information Access Personalization is the process of customizing search results according to the user’s interests and context. Approach: generation of user-tailored ontologies Aim: to model and learn the user context to personalize the search process at distinct levels: Document indexing Query formulation Relevance assessment

XML Retrieval In XML collections it is important to retrieve documents based on users’ constraints on both documents’ content and structure. Approach: 1) application of fuzzy set theory to define flexible extensions of existing XML query languages. 2) definition of ad hoc indexing strategies. Aim: to propose advanced solutions for storing, managing and retrieving semi-structured documents.

Projects Past. ▫STREP Project ▫STREP Project: PENG (Personalised News Content Programming) (Gabriella Pasi, Project Coordinator) (2004 – 2006) Submitted ▫PRIN ▫PRIN. Title: What, Where, When? (W3?): Recommendation of Information concerning specific topics and spatio-temporal contexts characterized by dynamicity and imprecision. ▫FIRB ▫FIRB. Title: A Cloud Service Stack for Personalized Semantic Information Retrieval. ▫Spanish Project ▫Spanish Project: High Performance processing for large data sets represented as Graphs (HIPERGRAPH) (Principal Investigator: Ricardo Baeza Yates – Yahoo! Research) ▫COST Action "Combining Soft Computing Techniques and Statistical Methods to Improve Data Analysis Solutions", coordinated by ECSC (ONGOING)

Collaborations At D.I.S.Co: Davide Ciucci Fabio Farina ITIS – SEQUOIAS (Information Quality; Web Service Retrieval) External Collaborations: CNR – IDPA, Italy European Center for Soft Computing (ECSC), Spain IRIT – Toulouse, France Iona College, NY, USA Università La Coruna, Spain

Conferences and Events Organization of: ▫The 2009 IEEE / WIC / ACM International Conferences on Web Intelligence (WI'09) and Intelligent Agent Technology (IAT'09), Milano, Italy, September 2009 ▫International Workshop on “Managing Vagueness and Uncertainty in the Semantic Web (VUSW’09)”, Milano, Italy, 15 September 2009 ▫Program Chair of the International conference RIAO 2010, Paris ▫Poster co-chair of ACM SIGIR Some Past Events (since 2005) ▫"Special Track on Information Access and Retrieval Systems”, within the “ACM Symposium on Applied Computing”, (Fortaleza, Ceará, Brazil, March , 2008, Dijon France March 2006, Santa Fe - New Mexico March 2005, Cyprus March 2004, Melbourne - Florida 9-12 March 2003, Madrid March 2002). IAR2008IAR2008 ▫International Workshop on Fuzzy Logic and Applications (WILF 2007), Hotel Portofino Kulm, Portofino Vetta - Ruta di Camogli, Genova (Italy) - July 7-10, 2007WILF 2007 ▫PhD School on Web Information Retrieval, WebBar 2007 Varenna, Italy, 26th August- 1st September 2007.WebBar 2007 ▫Seventh International Conference on Flexible Query Answering Systems (FQAS 2006), Milano, 2-10 June 2006.FQAS 2006 ▫“3rd International Summer School on Aggregation Operators”, Università della Svizzera Italiana (USI-Lugano), Lugano, July 2005

Publications from 2005 … some numbers 22Papers in International Journals: 22 4Special Issues in International Journals: 4 3Edited Volumes: 3 10Chapters of International Books: 10 40Proceedings for International Conferences: 40