CONCLUSION & FUTURE WORK Normally, users perform triage tasks using multiple applications in concert: a search engine interface presents lists of potentially.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

CONTRIBUTIONS Ground-truth dataset Simulated search tasks environment Multiple everyday applications (MS Word, MS PowerPoint, Mozilla Browser) Implicit.
Metadata in Carrot II Current metadata –TF.IDF for both documents and collections –Full-text index –Metadata are transferred between different nodes Potential.
Information retrieval – LSI, pLSI and LDA
Modern Information Retrieval Chapter 1: Introduction
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
Joint Sentiment/Topic Model for Sentiment Analysis Chenghua Lin & Yulan He CIKM09.
More Interfaces for Retrieval. Information Retrieval Activities Selecting a collection –Lists, overviews, wizards, automatic selection Submitting a request.
Interfaces for Selecting and Understanding Collections.
Latent Dirichlet Allocation a generative model for text
Intelligent User Interfaces Research Group Directed by: Frank Shipman.
Recognizing User Interest and Document Value from Reading and Organizing Activities in Document Triage Rajiv Badi, Soonil Bae, J. Michael Moore, Konstantinos.
LATENT DIRICHLET ALLOCATION. Outline Introduction Model Description Inference and Parameter Estimation Example Reference.
The Visual Knowledge Builder: A Second Generation Spatial Hypertext Frank M. Shipman III Haowei Hsieh Preetam Maloor J. Michael Moore.
Managing Software Projects in Spatial Hypertext : Experiences in Dogfooding Frank Shipman Department of Computer Science & Center for the Study of Digital.
Projects in the Intelligent User Interfaces Group Frank Shipman Associate Director, Center for the Study of Digital Libraries.
The 2nd International Conference of e-Learning and Distance Education, 21 to 23 February 2011, Riyadh, Saudi Arabia Prof. Dr. Torky Sultan Faculty of Computers.
Attention and Event Detection Identifying, attributing and describing spatial bursts Early online identification of attention items in social media Louis.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Personalization of the Digital Library Experience: Progress and Prospects Nicholas J. Belkin Rutgers University, USA
Search Engines and Information Retrieval Chapter 1.
Correlated Topic Models By Blei and Lafferty (NIPS 2005) Presented by Chunping Wang ECE, Duke University August 4 th, 2006.
Example 16,000 documents 100 topic Picked those with large p(w|z)
Evaluation Experiments and Experience from the Perspective of Interactive Information Retrieval Ross Wilkinson Mingfang Wu ICT Centre CSIRO, Australia.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Streaming Predictions of User Behavior in Real- Time Ethan DereszynskiEthan Dereszynski (Webtrends) Eric ButlerEric Butler (Cedexis) OSCON 2014.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Implicit Acquisition of Context for Personalization of Information Retrieval Systems Chang Liu, Nicholas J. Belkin School of Communication and Information.
27. May Topic Models Nam Khanh Tran L3S Research Center.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
CONCLUSION & FUTURE WORK Given a new user with an information gathering task consisting of document IDs and respective term vectors, this can be compared.
Context-Sensitive Information Retrieval Using Implicit Feedback Xuehua Shen : department of Computer Science University of Illinois at Urbana-Champaign.
Finding the Hidden Scenes Behind Android Applications Joey Allen Mentor: Xiangyu Niu CURENT REU Program: Final Presentation 7/16/2014.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Latent Dirichlet Allocation D. Blei, A. Ng, and M. Jordan. Journal of Machine Learning Research, 3: , January Jonathan Huang
CONCLUSIONS & CONTRIBUTIONS Ground-truth dataset, simulated search tasks environment Multiple everyday applications (MS Word, MS PowerPoint, Mozilla Browser)
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
MOVIE RETRIEVAL SYSTEM INFORMATION VISUALIZATION & PROPOSING NEW INTERFACE IAT 814 Adrian Bisek.
Topic Modeling using Latent Dirichlet Allocation
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Local Linear Matrix Factorization for Document Modeling Institute of Computing Technology, Chinese Academy of Sciences Lu Bai,
Unclassified//For Official Use Only 1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime.
Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Unified Relevance Feedback for Multi-Application User Interest Modeling Sampath Jayarathna PhD Candidate Computer Science & Engineering.
Acknowledgements : This research is supported by NSF grant INTRODUCTION MULTI LAYER PERCEPTRONS (MLP) DATA SET FOR TRAINING Learning weights using.
Recognizing Document Value from Reading and Organizing Activities in Document Triage Rajiv Badi, Soonil Bae, J. Michael Moore, Konstantinos Meintanis,
UNIVERSITY UTARA MALAYSIA COLLEGE OF ARTS & SCIENCES.
User Errors in Formulating Queries and IR Techniques to Overcome Them Birger Larsen Information Interaction and Information Architecture Royal School of.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
CONCLUSIONS & CONTRIBUTIONS Ground-truth dataset, simulated search tasks environment Implicit feedback, semi-explicit feedback (annotations), explicit.
Autonomous Interface Agents Henry Lieberman Media Laboratory, MIT Presented by Sumit Taank Vishal Mishra.
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
Information Overload on the Internet: The Web Mining Techniques Approach UNIVERSITI UTARA MALAYSIA COLLEGE OF ARTS AND SCIENCES RESEARCH METHODOLOGY (SZRZ6014)
Hierarchical Clustering & Topic Models
Visualizing User Activity History
Trevor Savage, Bogdan Dit, Malcom Gethers and Denys Poshyvanyk
What is Research? A research study is a study conducted to collect and analyse information in order to increase our understanding of a topic or an issue.
Matching Words with Pictures
Michal Rosen-Zvi University of California, Irvine
Presentation transcript:

CONCLUSION & FUTURE WORK Normally, users perform triage tasks using multiple applications in concert: a search engine interface presents lists of potentially relevant documents; a document reader displays their contents; and a third tool—a text editor or personal information management application—is used to record notes and assessments (MS Word and MS PowerPoint). An Interest Profile Manager infers users' interests from their interactions with the multi-applications, coupled with the characteristics of the interest modeling techniques. The resulting interest profile is used to generate visualizations that direct users' attention to documents or parts of documents that match their inferred interests. We present preliminary investigation of predicting, in real time, weather a user is about to switch interest - that is, weather the user is about to finish the current triage task and switch to another triage task. Acknowledgements : This research is supported by NSF grant UIMaP: Mining Short-Term User Interest from Personalized Triage Tasks Sampath Jayarathna, Atish Patra and Frank Shipman Computer Science & Engineering, Texas A&M University – College Station UIMaP: Mining Short-Term User Interest from Personalized Triage Tasks Sampath Jayarathna, Atish Patra and Frank Shipman Computer Science & Engineering, Texas A&M University – College Station ABSTRACT We are interested about open-ended information gathering tasks— document triage tasks in particular—in which people collect Web documents for interpretation and synthesis. User interests are usually distributed in different systems during a triage tasks. Traditional user interest modeling methods are not designed for integrating and analyzing interests from multiple sources, hence, they are not very effective for obtaining comparatively complete description of user interests in a multi- application environment. We propose UIMaP -User Interest Management and Personalization, to detect triage task sessions and group them according to common task or intent. Figure 1. Multi-Application Short-term Interest Modeling Even with the best search engine and the most effective query formulation, “triage tasks” require people to work through long lists of documents to synthesize the information they need; there is usually no single document containing one right answer. In fact, as people skim early documents, they may determine additional information needs that suggest further queries and results in even more documents to process. Problem Statement: Our task is, given a sequence of document triage tasks, predict whether the user will continue on the same triage session or switch to a new one. USER INTEREST MINING WebAnnotate - During information task, useful documents may be long, and cover multiple subtopics; users may read some segments and ignore others. In order to record which portion(s) of the document pique the user’s interests, an explicit interest expressions (e.g. annotations using WebAnnotate) capturing tool is used. Figure 2. Triage Session Segmentation Latent Dirichlet Allocation (LDA) -LDA performs unsupervised identification of hidden topics in a document triage task using a generative probabilistic model, so that each triage task can be seen as a mixture of words (bag of words) over these topics The resultant LDA output consists of a topic probability distribution per triage task and word probability distribution per topic. The topic probability distribution provides list of triage tasks and their corresponding topics with the given topic probability distribution. By using a simple heuristics (topic with the highest probability), it is possible to obtain a triage task clusters or the tasks associated with each of the topics. REFERENCES SEARCH AND RECOMMENDATIONS Our major contributions in this work can be summarized as: (1)A novel application to perform short-term interest mining based on evidence coming from multiple applications and segmenting triage tasks using topic modeling. (2)In the future we plan on creating a weighting schema to identify the importance of evidence coming from multiple applications like VKB, Web Browser, Word and PowerPoint. 1.Bae, S., Hsieh, H., Kim, D., Marshall, C.C., Meintanis, K., Moore, J.M., Zacchi, A. and Shipman, F.M. Supporting document triage via annotation-based visualizations. American Society for Information Science and Technology, 45 (1) Blei, David M., Andrew Y. Ng, and Michael I. Jordan. "Latent dirichlet allocation." the Journal of machine Learning research 3 (2003): Guo, Qi, and Eugene Agichtein. "Beyond session segmentation: predicting changes in search intent with client-side user interactions." Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval. ACM, Tolomei, G., Orlando, S. and Silvestri, F., Towards a task-based search and recommender systems. in In Proceedings of ICDE Workshops, (2010),