Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information.


Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information Nanyang Technological University

Presentation Overview
- Background
- Query Previews and Collaborative Filtering
- Collaborative Query Previews (CQPs)
- System Design and Implementation
- Advantages of the System
- Future Work

Background
- Information Overload:
  - World Wide Web
  - Digital libraries
- Information Seeking:
  - Information seeking is a broad term encompassing the ways individuals articulate their information needs, seek, evaluate, select and use information (Lokman & Stephanie, 2001)
  - Collaboration and communication are important
- Pre-Query Information (PQI):
  - Information needs
  - Information system
  - Knowledge of the collection

Use of PQI in Information Retrieval
[Diagram. Labels: Information Systems; Physical Collections; Digital Library; Target Information; Pre-Query Information; Information Needs; Collection Knowledge; Information Systems; Query; Structure of the Collection; Domain knowledge]

Example of Collection Knowledge
- Suppose a user wants to find a paper on overview+detail style interfaces, does not know its title, and is a novice in the field.
- The user enters “interface” or “overview, detail” as the query. However, nothing in the top 50 results rings a bell.
- Someone else searching for the same paper might remember its title clearly (“Reading of Electronic Documents: The Usability of Linear, Fisheye, and Overview+Detail Interfaces”). That person knows that using “fisheye, overview, detail” as the query keywords will yield a good result.

Concept 1: Query Previews
- Definition:
  - Query previews provide an overview of the data distribution in a data collection (Greene et al., 1999).
  - Overviews are represented as aggregate information on attributes of the collection, known as summary data.
  - The summary data is displayed using visualization techniques such as histograms and timelines.
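The idea of summary data can be illustrated with a minimal Python sketch. It is not taken from the presentation: the records, the attribute names (`year`, `type`) and the helper functions are all hypothetical, and the text histogram merely stands in for the graphical visualizations the slides mention.

```python
from collections import Counter

# Hypothetical metadata records; field names and values are assumptions.
RECORDS = [
    {"title": "Harbour at dawn", "year": 1998, "type": "photo"},
    {"title": "Market street", "year": 1999, "type": "photo"},
    {"title": "Council minutes", "year": 1999, "type": "document"},
    {"title": "Oral history 12", "year": 2001, "type": "audio"},
]


def summarize(records, attribute):
    """Build summary data: how many records carry each value of an attribute."""
    return Counter(r[attribute] for r in records if attribute in r)


def preview_hits(records, **filters):
    """Count the records matching every filter, so a user can see the
    size of a result set before actually submitting the query."""
    return sum(
        all(r.get(key) == value for key, value in filters.items())
        for r in records
    )


def render_histogram(counts, width=20):
    """Render summary data as a text histogram, one bar per attribute value."""
    if not counts:
        return ""
    peak = max(counts.values())
    lines = []
    for value in sorted(counts, key=str):
        bar = "#" * max(1, round(width * counts[value] / peak))
        lines.append(f"{value!s:>8} | {bar} {counts[value]}")
    return "\n".join(lines)
```

Calling `render_histogram(summarize(RECORDS, "year"))` shows where the collection is dense, and `preview_hits(RECORDS, year=2000)` reveals a zero-hit query before it is run.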

Query Preview Example

Advantages of Query Previews:
- Reduce queries with zero hits or a very large number of hits.
- Prevent the retrieval of undesired records.
- Represent statistical information about the database visually.

Concept 2: Collaborative Filtering
- Definition:
  - Collaborative filtering is a technique for recommending items to a user based on similarities between the past behavior of the user and that of like-minded people (Chun & Hong, 2001)
- Examples:
  - Tapestry: a system that can filter information according to other users’ annotations (Goldberg, Nichols, Oki & Terry, 1992)
  - GroupLens: a recommender system using user ratings of documents (Resnick, Courtiat & Villemur, 2001)
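As a rough illustration of the definition above, the following Python sketch implements a basic user-based collaborative filter. It is a generic textbook formulation, not the method of Tapestry or GroupLens; the ratings, user names and function names are invented for the example, and cosine similarity is one assumed choice among several possible similarity measures.

```python
from math import sqrt

# Hypothetical user -> {document: rating} data, on an assumed 1-5 scale.
RATINGS = {
    "ann": {"doc1": 5, "doc2": 3, "doc3": 4},
    "bob": {"doc1": 4, "doc2": 3, "doc4": 5},
    "cat": {"doc2": 1, "doc3": 2, "doc4": 1},
}


def cosine(a, b):
    """Cosine similarity between two users' rating dictionaries."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[item] * b[item] for item in shared)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def predict(user, ratings):
    """Predict ratings for items the user has not rated, as a
    similarity-weighted average of other users' ratings."""
    totals, weights = {}, {}
    for other, theirs in ratings.items():
        if other == user:
            continue
        sim = cosine(ratings[user], theirs)
        if sim <= 0:
            continue
        for item, rating in theirs.items():
            if item in ratings[user]:
                continue
            totals[item] = totals.get(item, 0.0) + sim * rating
            weights[item] = weights.get(item, 0.0) + sim
    return {item: totals[item] / weights[item] for item in totals}


def recommend(user, ratings, top_n=3):
    """Return up to top_n unseen items, highest predicted rating first."""
    predicted = predict(user, ratings)
    return sorted(predicted, key=lambda item: (-predicted[item], item))[:top_n]
```

Here `recommend("ann", RATINGS)` can only suggest `doc4`, the one document Ann has not rated; its predicted rating blends Bob's and Cat's ratings in proportion to how similar each is to Ann.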

Advantages of Collaborative Filtering
- Use the community for knowledge sharing.
- Select high quality items from a large information stream.

Limitations of Existing Techniques
- Query Previews:
  - Lack of support for communication and collaboration.
- Collaborative Filtering:
  - Lack of support for gathering PQI.

Collaborative Query Previews (CQPs)
- CQP is an integrated approach that augments information seeking by supporting collaboration and communication during the process of gathering PQI.
- CQPs generate an overview of a data collection through a set of aggregate information.
- CQPs introduce a collaborative aspect by providing recommendations of queries.

Collaborative Query Previews (CQPs)
- Direct Previews of the Data Collection:
  - Through the aggregate information on selected attributes, users can become familiar with the structure of the database.
- Recommendation of Queries:
  - Through collaborative filtering techniques, CQPs recommend related queries previously executed by other users. These help the current user see how the document collection met past information needs that coincide with the present one.

Design and Implementation
- Introduction:
  - ZWE (Zoomable Work Environment) provides an integrated platform, built on a spatial metaphor, for supporting a variety of scholarly tasks including browsing, querying, organizing and annotating information resources (Goh, Fu & Foo, 2002).
  - ZWE supports the entire process of information seeking by incorporating CQPs.

Design and Implementation
[Screenshot of the interface. Labels: Tabs; Query previews; Artifacts (photos, metadata, annotations); Browsing tree; Query area; Work area; Popup menu; Recommended queries; Result lists]

Design and Implementation
[Architecture diagram. Labels: Multimedia Repository; Past Queries Repository; User Profiles Repository; Searching; Browsing; Query Previews; Recommendation; Zoomable Work Environment; Authoring; Metadata Repository; Feature Extraction; Display; User Management]

Design and Implementation
- JAZZ: a Zoomable User Interface (ZUI) API that allows developers to quickly and easily build zoomable information spaces.

Design and Implementation
- Tamino XML Server: a platform for building an XML-based information retrieval system.
[Figure. Labels: Database Schema, XML, Tamino Manager, Schema Editor, Interactive Tools, X-Query Tools]

Design and Implementation
- For the query recommendation module, we proposed a hybrid approach (Fu, Goh & Foo, 2003a, 2003b) that clusters past queries and uses the clusters to find past queries similar to a given query.
- Experiments show that our hybrid algorithm outperforms the existing query clustering approach.
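The slides do not spell out the hybrid algorithm, so the sketch below does not reproduce it. It only illustrates one plausible reading of "hybrid": blending a content-based score (overlap of query terms) with a result-based score (overlap of returned documents). The weight `alpha`, the `threshold`, the Jaccard measure, the sample query log and all function names are assumptions made for illustration, and the clustering step is omitted in favour of direct ranking.

```python
def jaccard(a, b):
    """Jaccard overlap between two collections, treated as sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def hybrid_similarity(q1, q2, alpha=0.5):
    """Blend term overlap (content-based) with result overlap (result-based).
    alpha is an assumed weight: 1.0 uses terms only, 0.0 uses results only."""
    content = jaccard(q1["terms"], q2["terms"])
    result = jaccard(q1["results"], q2["results"])
    return alpha * content + (1 - alpha) * result


# Hypothetical log of past queries with the document IDs each one returned.
PAST_QUERIES = [
    {"text": "fisheye, overview, detail",
     "terms": ["fisheye", "overview", "detail"],
     "results": ["d1", "d2", "d3"]},
    {"text": "zoomable interface",
     "terms": ["zoomable", "interface"],
     "results": ["d2", "d7"]},
    {"text": "xml retrieval",
     "terms": ["xml", "retrieval"],
     "results": ["d8", "d9"]},
]


def recommend_queries(current, past, alpha=0.5, threshold=0.2):
    """Return texts of past queries whose hybrid similarity to the current
    query reaches the threshold, most similar first."""
    scored = [(hybrid_similarity(current, p, alpha), p["text"]) for p in past]
    ranked = sorted(scored, key=lambda pair: (-pair[0], pair[1]))
    return [text for score, text in ranked if score >= threshold]
```

For a current query with terms `["overview", "detail"]` and results `["d1", "d2", "d5"]`, the first past query scores highest because it shares both terms and documents, while "zoomable interface" is related only through one shared result and appears only if the threshold is lowered.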

Advantages of Proposed System
- Integrated work environment: more interactive, zoomable. Multifaceted information artifacts. Generic framework.
- CQPs support the information seeking process from two perspectives:
  - From direct previews of the data collection.
  - From queries issued previously by others.

Future Work
- With the initial prototype developed, the next phase of this work will focus on the evaluation of CQPs by users of the digital library.
- Research is also continuing on improving query clustering by further investigating hybrid approaches that combine content-based, feedback-based and result-based techniques.

Thank You For more information Schubert Foo