Supporting Effective Access through User- and Topic- Based Language Models.

Slides:



Advertisements
Similar presentations
12/03/ Second International Workshop on New Generation Enterprise and Business Innovation NGEBIS 2013 Cross Domain Crawling for Innovation Pieruigi.
Advertisements

Verification and Validation
Metadata in Carrot II Current metadata –TF.IDF for both documents and collections –Full-text index –Metadata are transferred between different nodes Potential.
Normal Distributions: Finding Probabilities
Developing and Evaluating a Query Recommendation Feature to Assist Users with Online Information Seeking & Retrieval With graduate students: Karl Gyllstrom,
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Information Retrieval Visualization CPSC 533c Class Presentation Qixing Zheng March 22, 2004.
ISP 433/533 Week 2 IR Models.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Semantic (Language) Models: Robustness, Structure & Beyond Thomas Hofmann Department of Computer Science Brown University Chief Scientist.
Tags, Networks, Narrative Explorations in Folksonomy Sue Thomas and Bruce Mason IOCT, De Montfort University 30 th January 2007.
1 Querying the Web for Genealogical Information Troy Walker Spring Research Conference 2003 Research funded by NSF.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Problem Addressed The Navigation –Aided Retrieval tries to provide navigational aided query processing. It claims that the conventional Information Retrieval.
Personalizing the Digital Library Experience Nicholas J. Belkin, Jacek Gwizdka, Xiangmin Zhang SCILS, Rutgers University
An Overview of Relevance Feedback, by Priyesh Sudra 1 An Overview of Relevance Feedback PRIYESH SUDRA.
User Modeling Thoughts on LMs James Allan Center for Intelligent Information Retrieval University of Massachusetts, Amherst September 11, 2002.
Creating and Visualizing Document Classification J. Gelernter, D. Cao, R. Lu, E. Fink, J. Carbonell.
1 CS 502: Computing Methods for Digital Libraries Lecture 11 Information Retrieval I.
Online Search Evaluation with Interleaving Filip Radlinski Microsoft.
Understanding Data Analytics and Data Mining Introduction.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Query Relevance Feedback and Ontologies How to Make Queries Better.
Essential Standard 3.00 Understand the role marketing in business. 1.
Personalization of the Digital Library Experience: Progress and Prospects Nicholas J. Belkin Rutgers University, USA
CONCLUSION & FUTURE WORK Normally, users perform triage tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Information Retrieval and Web Search Text properties (Note: some of the slides in this set have been adapted from the course taught by Prof. James Allan.
The Development of a search engine & Comparison according to algorithms Sungsoo Kim Haebeom Lee The mid-term progress report.
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
Implicit Acquisition of Context for Personalization of Information Retrieval Systems Chang Liu, Nicholas J. Belkin School of Communication and Information.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Distributed Information Retrieval Using a Multi-Agent System and The Role of Logic Programming.
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
Distributed Information Retrieval Server Ranking for Distributed Text Retrieval Systems on the Internet B. Yuwono and D. Lee Siemens TREC-4 Report: Further.
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Task-oriented approach to information handling support within web-based education Lora M. Aroyo 15 November 2001.
Autumn Web Information retrieval (Web IR) Handout #1:Web characteristics Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Lecture 1: Overview of IR Maya Ramanath. Who hasn’t used Google? Why did Google return these results first ? Can we improve on it? Is this a good result.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
Computing Ontology Part II. So far, We have seen the history of the ACM computing classification system – What have you observed? – What topics from CS2013.
Essential Standard 3.00 Understand the role marketing in business. 1.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Essential Standard 3.00 Understand the role marketing in business. 1.
A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.
Understand the role marketing in business. 1. Understand principles of marketing. 2.
Relevance-Based Language Models Victor Lavrenko and W.Bruce Croft Department of Computer Science University of Massachusetts, Amherst, MA SIGIR 2001.
L&I SCI 110: Information science and information theory Instructor: Xiangming(Simon) Mu Sept. 9, 2004.
What Does the User Really Want ? Relevance, Precision and Recall.
Final Year Project 1 (FYP 1) CHAPTER 1 : INTRODUCTION
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
A code-centric cluster-based approach for searching online support forums for programmers Christopher Scaffidi, Christopher Chambers, Sheela Surisetty.
A Generation Model to Unify Topic Relevance and Lexicon-based Sentiment for Opinion Retrieval Min Zhang, Xinyao Ye Tsinghua University SIGIR
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
GENERATING RELEVANT AND DIVERSE QUERY PHRASE SUGGESTIONS USING TOPICAL N-GRAMS ELENA HIRST.
Automatic vs manual indexing Focus on subject indexing Not a relevant question? –Wherever full text is available, automatic methods predominate Simple.
Information Seeking from the User’s Perspective CMPT 455/826 - Week 11, Day 1 (Based on Kuhlthau) 1.
User Errors in Formulating Queries and IR Techniques to Overcome Them Birger Larsen Information Interaction and Information Architecture Royal School of.
Human Computer Interaction Lecture 21,22 User Support
Chapter 1 OBJECT-ORIENTED ANALYSIS AND DESIGN
Personalized Social Image Recommendation
Adaptive2 Language Model
Lecture 8 Information Retrieval Introduction
Junghoo “John” Cho UCLA
Information Retrieval and Web Design
Information Retrieval and Web Design
Presentation transcript:

Supporting Effective Access through User- and Topic- Based Language Models

Who we are… + =

People –Nick Belkin, Principle Investigator, Rutgers –Bruce Croft & James Allan, Principle Investigators, U Mass –Diane Kelly, Graduate Assistant, Rutgers – Jeong Hyun (Annie) Kim, Yang-woo Kim, Hyuk-Jin Lee, Anne Washington, Rutgers Funding –NSF # , Rutgers awarded $243,771 for a 3-year period Specifics…

The Problem Information retrieval and filtering tools make mistakes that are obvious and aggravating to users. Queries are often totally misinterpreted by search tools and, even when relevant documents are found, they are usually mixed with many others that are totally unrelated. WHY? Lack of adequate models of the user and of the domain.

General Solution Language Models –Assume that associated with every document or group of documents there are one or more probability distributions that model how the text in the document can be generated. –Can be viewed as models of the important topics that are covered by a document or a group of documents.

–Have the potential for representing topics of interest to a user or a group of users. –Assume that different types of information problems are expressed in language in different ways. These different ways could be characterized by language models understood as models of the user.

Types of Language Models Topic-based –Identify characteristics of the language that is used in a particular domain or topic. User-based –Identify characteristics of the language that a user or a group of users associate with a particular domain or topic.

Overview of the Retrieval Process

An Example Consider the following information needs that indicate a user’s interest in Java: –I want to find the best and least expensive way to have a 2 week holiday in Java. –I need to know where Java is. –I need to write a brief survey of the current political situation in Java.

–I need to learn the programming language Java. –I want to buy some dark roast Java coffee. Consider the following premise: Documents that are relevant to each of these kinds of queries will have different language characteristics associated with them.

Our Goal To identify the language characteristics of each of these types of needs (user) To identify the language characteristics of the groups of documents that satisfy each of these types of needs (topic) To match these needs and document clusters accordingly