UPRM Computing Systems Research Group Prof. Bienvenido Vélez-Rivera – Leader José Enseñat – Graduate student Juan Torres – Undergraduate student.

Slides:



Advertisements
Similar presentations
Less is More Probabilistic Model for Retrieving Fewer Relevant Docuemtns Harr Chen and David R. Karger MIT CSAIL SIGIR2006 4/30/2007.
Advertisements

Mining Compressed Frequent- Pattern Sets Dong Xin, Jiawei Han, Xifeng Yan, Hong Cheng Department of Computer Science University of Illinois at Urbana-Champaign.
1 Very Large-Scale Incremental Clustering Berk Berker Mumin Cebe Ismet Zeki Yalniz 27 March 2007.
Personalized Query Classification Bin Cao, Qiang Yang, Derek Hao Hu, et al. Computer Science and Engineering Hong Kong UST.
Minimizing Seed Set for Viral Marketing Cheng Long & Raymond Chi-Wing Wong Presented by: Cheng Long 20-August-2011.
Engineering a Set Intersection Algorithm for Information Retrieval Alex Lopez-Ortiz UNB / InterNAP Joint work with Ian Munro and Erik Demaine.
The Literature Review in 3 Key Steps
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
Engineering Research Methodology
Distributed Search over the Hidden Web Hierarchical Database Sampling and Selection Panagiotis G. Ipeirotis Luis Gravano Computer Science Department Columbia.
Creating Concept Hierarchies in a Customer Self-Help System Bob Wall CS /29/05.
Integrated Web Environment for Walsaip Applications Research Assistant Viky Arnedo Martinez Advisor. José A. Borges HCIG Graduate Student.
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
1 MARG-DARSHAK: A Scrapbook on Web Search engines allow the users to enter keywords relating to a topic and retrieve information about internet sites (URLs)
Federated Search of Text Search Engines in Uncooperative Environments Luo Si Language Technology Institute School of Computer Science Carnegie Mellon University.
1 CS/INFO 430 Information Retrieval Lecture 23 Usability 1.
Query Planning for Searching Inter- Dependent Deep-Web Databases Fan Wang 1, Gagan Agrawal 1, Ruoming Jin 2 1 Department of Computer.
Search Result Diversification by M. Drosou and E. Pitoura Presenter: Bilge Koroglu June 14, 2011.
Search Engines and Information Retrieval Chapter 1.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
APPLICATIONS OF DATA MINING IN INFORMATION RETRIEVAL.
Bibliometrics toolkit: ISI products Website: Last edited: 11 Mar 2011 Thomson Reuters ISI product set is the market leader for.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Tag Data and Personalized Information Retrieval 1.
Presented by: Apeksha Khabia Guided by: Dr. M. B. Chandak
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Question Answering From Zero to Hero Elena Eneva 11 Oct 2001 Advanced IR Seminar.
« Pruning Policies for Two-Tiered Inverted Index with Correctness Guarantee » Proceedings of the 30th annual international ACM SIGIR, Amsterdam 2007) A.
Feedback through critical indicators of student performance: contributing to the assessment of high school education David González-Barreto, PhD Antonio.
Personalized Search Xiao Liu
CS 533 Information Retrieval Systems.  Introduction  Connectivity Analysis  Kleinberg’s Algorithm  Problems Encountered  Improved Connectivity Analysis.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Can Change this on the Master Slide Monday, August 20, 2007Can change this on the Master Slide0 A Distributed Ranking Algorithm for the iTrust Information.
Efficient Instant-Fuzzy Search with Proximity Ranking Authors: Inci Centidil, Jamshid Esmaelnezhad, Taewoo Kim, and Chen Li IDCE Conference 2014 Presented.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
International Conference on Engineering Education Education, Research and Development: Fostering Economic Growth July 23-28, 2006 San Juan, Puerto Rico.
Seismic Monitoring of the Northeastern Caribbean Region Christa G. von Hillebrandt-Andrade, Víctor Huérfano and Juan Lugo Puerto Rico Seismic Network,
Diversifying Search Results Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverson, Samuel Ieong Search Labs, Microsoft Research WSDM, February 10, 2009 TexPoint.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
A Novel Visualization Model for Web Search Results Nguyen T, and Zhang J IEEE Transactions on Visualization and Computer Graphics PAWS Meeting Presented.
- University of North Texas - DSCI 5240 Fall Graduate Presentation - Option A Slides Modified From 2008 Jones and Bartlett Publishers, Inc. Version.
Named Entity Recognition in Query Jiafeng Guo 1, Gu Xu 2, Xueqi Cheng 1,Hang Li 2 1 Institute of Computing Technology, CAS, China 2 Microsoft Research.
Query Execution on NetTraveler Angel L. Villalaín-García Manuel Rodríguez-Martínez University of Puerto Rico - Mayaguez Campus.
Intelligent Database Systems Lab Presenter: CHANG, SHIH-JIE Authors: Longzhuang Li, Yi Shang, Wei Zhang 2002.ACM. Improvement of HITS-based Algorithms.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
Generating Query Substitutions Alicia Wood. What is the problem to be solved?
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Applicant’s Profile Study for Improving Undergraduate Enrollment at the University of Puerto Rico at Mayagüez Dr. David González Barreto Dr. Antonio A.
Why IR test collections are so bad Mark Sanderson University of Sheffield.
Sigir’99 Inside Internet Search Engines: Products William Chang and Jan Pedersen.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
1 Learning to Impress in Sponsored Search Xin Supervisors: Prof. King and Prof. Lyu.
Automatic Categorization of Query Results Kaushik Chakrabarti, Surajit Chaudhuri, Seung-won Hwang Sushruth Puttaswamy.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Document Clustering for Natural Language Dialogue-based IR (Google for the Blind) Antoine Raux IR Seminar and Lab Fall 2003 Initial Presentation.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
FirstGov Search July 28th, 2004 NOAA Webshop
Federated & Meta Search
How does Google search for everything? Computer Science at Work
Marianela Santiago Dr. Bienvenido Vélez Faculty Advisor
Query Caching in Agent-based Distributed Information Retrieval
Information retrieval and PageRank
Panagiotis G. Ipeirotis Luis Gravano
Sensing Object Semantics for Interactive Multimedia Applications
CS/INFO 430 Information Retrieval
Presentation transcript:

UPRM Computing Systems Research Group Prof. Bienvenido Vélez-Rivera – Leader José Enseñat – Graduate student Juan Torres – Undergraduate student University of Puerto Rico Mayagüez Campus PRECISE Project Mayagüez, October 07, 2000

Problem Statement

Query-based Web Search large result-set short query BUT - queries hard to write -sequential access to result set inadequate

Proposed Solution

Inforadar’s Interactive query hierarchies seed query result set for selected query dynamic categories are queries selected query

Inforadar’s Interactive query hierarchies colors indicate node status level 2 categories icons mark documents read or in-basket

Theoretical Formulation

(a) low information loss high redundancy Coverage-based Category Evaluation Metric Goal: Avoid Redundancy and Information Loss q q2q2 q1q1 (b) high information loss low redundancy (c) better Ideal: Select categories that best approximate a partition But: This is an NP-complete problem seed

CTS: A greedy approximation algorithm for category selection Approach: CTS picks term f i maximizing: C = set of documents covered by previously selected terms winning category! low coverage high redundancy Goal: Pick best term among { t 1, t 2, t 3 } C D(q ^ t 3 ) D(q ^ t 2 ) D(q ^ t 1 ) D(q)

Experimental Plan InforadarImplement Inforadar site indexing ALL website data at UPRM InforadarMake Inforadar the official search engine for the UPRM web site Conduct usability study Analyze real user feedback Incorporate feedback into an improved design

References Query Lookahead for Query-Based Document Categorization. –Ph.D. Thesis –Massachusetts Institute of Technology –September 1999 Fast and Effective Query Refinement –Bienvenido Vélez, Ron Weiss, Mark Sheldon and David K. Gifford –ACM Conference in Research and Development in Information Retrieval (SIGIR 97) HyPursuit: A network search engine exploiting concent-link similarity –R. Weiss, B. Vélez, M. Sheldon, C. Namprempre, P. Szilagy and D. K. Gifford.. –ACM Conference on Hypertext (HyperText 96)