Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.

Slides:



Advertisements
Similar presentations
Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.
Advertisements

Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
Processing XML Keyword Search by Constructing Effective Structured Queries Jianxin Li, Chengfei Liu, Rui Zhou and Bo Ning Swinburne University of Technology,
13/04/20151 SPARK: Top- k Keyword Query in Relational Database Wei Wang University of New South Wales Australia.
Diversity Maximization Under Matroid Constraints Date : 2013/11/06 Source : KDD’13 Authors : Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur Advisor :
Efficient IR-Style Keyword Search over Relational Databases Vagelis Hristidis University of California, San Diego Luis Gravano Columbia University Yannis.
Date : 2014/12/04 Author : Parikshit Sondhi, ChengXiang Zhai Source : CIKM’14 Advisor : Jia-ling Koh Speaker : Sz-Han,Wang.
Date : 2013/05/27 Author : Anish Das Sarma, Lujun Fang, Nitin Gupta, Alon Halevy, Hongrae Lee, Fei Wu, Reynold Xin, Gong Yu Source : SIGMOD’12 Speaker.
A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy Date : 2014/04/15 Source : KDD’13 Authors : Chi Wang, Marina Danilevsky, Nihit.
Querying for Information Integration: How to go from an Imprecise Intent to a Precise Query? Aditya Telang Sharma Chakravarthy, Chengkai Li.
Effective Keyword Search in Relational Databases Fang Liu (University of Illinois at Chicago) Clement Yu (University of Illinois at Chicago) Weiyi Meng.
DOMAIN DEPENDENT QUERY REFORMULATION FOR WEB SEARCH Date : 2013/06/17 Author : Van Dang, Giridhar Kumaran, Adam Troy Source : CIKM’12 Advisor : Dr. Jia-Ling.
CS 101 Sect 7 – Databases (DB) Why databases Difference between a DB and a Web search What is a DB An hands-on case: the JCU Library 1
1 Ranked Queries over sources with Boolean Query Interfaces without Ranking Support Vagelis Hristidis, Florida International University Yuheng Hu, Arizona.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Authors: Bhavana Bharat Dalvi, Meghana Kshirsagar, S. Sudarshan Presented By: Aruna Keyword Search on External Memory Data Graphs.
NUITS: A Novel User Interface for Efficient Keyword Search over Databases The integration of DB and IR provides users with a wide range of high quality.
Search Engines and Information Retrieval Chapter 1.
Leveraging Conceptual Lexicon : Query Disambiguation using Proximity Information for Patent Retrieval Date : 2013/10/30 Author : Parvaz Mahdabi, Shima.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
A Markov Random Field Model for Term Dependencies Donald Metzler W. Bruce Croft Present by Chia-Hao Lee.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
1 Efficient Search Ranking in Social Network ACM CIKM2007 Monique V. Vieira, Bruno M. Fonseca, Rodrigo Damazio, Paulo B. Golgher, Davi de Castro Reis,
Querying Structured Text in an XML Database By Xuemei Luo.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
KEYS 2012 May 20, 2012, Scottsdale, Arizona, USA The Third International Workshop on Keyword Search on Structured Data.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
EASE: An Effective 3-in-1 Keyword Search Method for Unstructured, Semi-structured and Structured Data Cuoliang Li, Beng Chin Ooi, Jianhua Feng, Jianyong.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
RANKING SUPPORT FOR KEYWORD SEARCH ON STRUCTURED DATA USING RELEVANCE MODEL Date: 2012/06/04 Source: Veli Bicer(CIKM’11) Speaker: Er-gang Liu Advisor:
University of Malta CSA3080: Lecture 4 © Chris Staff 1 of 14 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
FINDING RELEVANT INFORMATION OF CERTAIN TYPES FROM ENTERPRISE DATA Date: 2012/04/30 Source: Xitong Liu (CIKM’11) Speaker: Er-gang Liu Advisor: Dr. Jia-ling.
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Date : 2012/10/25 Author : Yosi Mass, Yehoshua Sagiv Source : WSDM’12 Speaker : Er-Gang Liu Advisor : Dr. Jia-ling Koh 1.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology SIGIR1 Improving Web Search Results Using Affinity Graph.
BioSnowball: Automated Population of Wikis (KDD ‘10) Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/11/30 1.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Ranking CSCI 572: Information Retrieval and Search Engines Summer 2010.
LOGO 1 Corroborate and Learn Facts from the Web Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Shubin Zhao, Jonathan Betz (KDD '07 )
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Date: 2012/07/02 Source: Marina Drosou, Evaggelia Pitoura (CIKM’11) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.
Effective Keyword-Based Selection of Relational Databases By Bei Yu, Guoliang Li, Karen Sollins & Anthony K. H. Tung Presented by Deborah Kallina.
 Frequent Word Combinations Mining and Indexing on HBase Hemanth Gokavarapu Santhosh Kumar Saminathan.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Date: 2013/6/10 Author: Shiwen Cheng, Arash Termehchy, Vagelis Hristidis Source: CIKM’12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Predicting the Effectiveness.
Post-Ranking query suggestion by diversifying search Chao Wang.
Date: 2012/11/29 Author: Chen Wang, Keping Bi, Yunhua Hu, Hang Li, Guihong Cao Source: WSDM’12 Advisor: Jia-ling, Koh Speaker: Shun-Chen, Cheng.
Compact Query Term Selection Using Topically Related Text Date : 2013/10/09 Source : SIGIR’13 Authors : K. Tamsin Maxwell, W. Bruce Croft Advisor : Dr.Jia-ling,
Date: 2013/4/1 Author: Jaime I. Lopez-Veyna, Victor J. Sosa-Sosa, Ivan Lopez-Arevalo Source: KEYS’12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang KESOSD.
Date: 2012/5/28 Source: Alexander Kotov. al(CIKM’11) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou Interactive Sense Feedback for Difficult Queries.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
TO Each His Own: Personalized Content Selection Based on Text Comprehensibility Date: 2013/01/24 Author: Chenhao Tan, Evgeniy Gabrilovich, Bo Pang Source:
Search Result Diversification in Resource Selection for Federated Search Date : 2014/06/17 Author : Dzung Hong, Luo Si Source : SIGIR’13 Advisor: Jia-ling.
Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
ClusCite:Effective Citation Recommendation by Information Network-Based Clustering Date: 2014/10/16 Author: Xiang Ren, Jialu Liu,Xiao Yu, Urvashi Khandelwal,
QUERY-PERFORMANCE PREDICTION: SETTING THE EXPECTATIONS STRAIGHT Date : 2014/08/18 Author : Fiana Raiber, Oren Kurland Source : SIGIR’14 Advisor : Jia-ling.
Intent-Aware Semantic Query Annotation
Date : 2013/1/10 Author : Lanbo Zhang, Yi Zhang, Yunfei Chen
Heterogeneous Graph Attention Network
Presentation transcript:

Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 2

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 3

4 Motivation Unstructured Accessed by keywords Large user population Text documents: Ranking

5 Motivation Database Easy to use Increasing the DB usability Enabling interesting and unexpected discoveries

6 Goal Phase 1 Phase 2 Phase 3 System takes a keyword query as input Generates a set of possible interpretations sorted by their relevance to the query Generated interpretations are presented to the user User can choose which interpretation(s) best match his/her search intention System translates the selected interpretations into a set of SQL queries Retrieving the results.

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 7

8

9

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 10

11 Keyword Node Detection Paper title Author name Conference name Figure out all the semantics of each keyword in the query

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 12

Keyword Node Combination 13 Combination Taking all the sets of keyword nodes and generate all possible combinations of keyword nodes from each set. All possible combinations

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 14

15 Interpretation Generation Processing inputs each combination of keyword nodes and finds a set of connected graphs of relations that link all the keywords. QI 1 QI 2 a paper whose title contains “Markov” is accepted in conference “SIGIR”; a “Markov” paper cites another paper which is accepted in conference “SIGIR”.

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 16

17 Keyword semantics confidence Connected structure confidence Interpretation Ranking Guideline 1: The more tuples match the keyword with some semantics. Guideline 2: The more discriminative the keyword is across the relation. Guideline 3: The more important the relation.

Keyword semantics confidence Indicating that a query keyword tends to refer to the semantics with many tuples that contain the keyword and match that semantics. N Ri ‧ aj (k) : number of tuples in R i that contain k in attribute A j N ”Paper” ‧ ”Title” (Markov) =2 N ”Author” ‧ ”Name” (Markov) =1

Keyword semantics confidence N Ri : number of tuples in R i N ”Paper” =4 N ”Author” =4 Reducing the confidence of keyword referring to the attribute of relation with the increasing number of tuples that contain keyword in attribute

Keyword semantics confidence Im(Ri) represents the importance of relation, it is set by experts.

Keyword semantics confidence dl(k) : average size of attribute values of tuples that contain k in R i Penalize the attribute length since longer attribute values have a larger chance to contain the keyword. avdl : average size of attribute values of all tuples in R i s1 : a constant

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 22

23 Connected structure confidence The larger size of the interpretation, the less useful it tends to be The more cohesive the keywords are in the query interpretation, the more useful it tends to be.

24 Interpretation Ranking QI a QI b

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 25

Data Set Internet Movie Database(IMDB), where around 200,000 movies of recent years are selected in our dataset DBLP, which contains publications since Query Set Setting 18 queries for each of the datasets. Dividing these queries into three clusters: short queries (2 or 3) keywords, medium queries (4 or 5) and long queries with Experiment Plan

Processing Time Run 10 times and collect the average processing time. Compare two processing Paper Algorithm Search Engine Quality of the Query Interpretations The participants are asked to score the quality of each query interpretation (from 0 to 5 points, 5 means best while 0 means worst) 27 Experiment Plan

Outline Introduction Proposed Framework Overview Keyword Node Detection Keyword Node Combination Interpretation Generation Interpretation Ranking Keyword semantics confidence Connected structure confidence Experiment Plan Conclusion 28

Conclusion 29 Query interpretation is a critical issue in keyword search over relational databases that has yet received very little attention. This paper proposed a 3 phase keyword search paradigm that focused on query interpretation generating and ranking. Analyzing the challenges to rank all the interpretations and proposed some guidelines to compute the confidence of an interpretation to be the user desired.