Huizhong Doan, Yonbo Cao, Chin-Yew Lin and Yong Yu Shanghai Jiao Tong University & MSRA ACL 2008 2008/7/91Rick Liu.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Incorporating Participant Reputation in Community-driven Question Answering Systems Liangjie Hong, Zaihan Yang and Brian D. Davison Computer Science and.
Chapter 5: Introduction to Information Retrieval
Automatic Question Generation from Queries Natural Language Computing, Microsoft Research Asia Chin-Yew LIN
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
Effective Keyword Search in Relational Databases Fang Liu (University of Illinois at Chicago) Clement Yu (University of Illinois at Chicago) Weiyi Meng.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Vikas BhardwajColumbia University NLP for the Web – Spring 2010 Improving QA Accuracy by Question Inversion Prager et al. IBM T.J. Watson Res. Ctr. 02/18/2010.
Presented by Li-Tal Mashiach Learning to Rank: A Machine Learning Approach to Static Ranking Algorithms for Large Data Sets Student Symposium.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
Using TF-IDF to Determine Word Relevance in Document Queries
Information Retrieval Ch Information retrieval Goal: Finding documents Search engines on the world wide web IR system characters Document collection.
Sigir’99 Inside Internet Search Engines: Search Jan Pedersen and William Chang.
Affinity Rank Yi Liu, Benyu Zhang, Zheng Chen MSRA.
1 CS 430 / INFO 430 Information Retrieval Lecture 3 Searching Full Text 3.
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Quality-aware Collaborative Question Answering: Methods and Evaluation Maggy Anastasia Suryanto, Ee-Peng Lim Singapore Management University Aixin Sun.
Quality-Aware Collaborative Question Answering: Methods and Evaluation Maggy Anastasia Suryanto, Ee-Peng Lim, Aixin Sun, and Roger H. L. Chiang. In Proceedings.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding Delphine Bernhard and Iryna Gurevvch Ubiquitous.
1 Cross-Lingual Query Suggestion Using Query Logs of Different Languages SIGIR 07.
Finding Similar Questions in Large Question and Answer Archives Jiwoon Jeon, W. Bruce Croft and Joon Ho Lee Retrieval Models for Question and Answer Archives.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.
11 Learning to Suggest Questions in Online Learning to Suggest Questions in Online Forums Tom Chao Zhou, Chin-Yew Lin, Irwin King Michael R.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Beyond Search Engines: Advanced Web Searching Subject Directories  Librarians’ Index to the Internet  Infomine Finding Databases on a Subject  The Invisible.
Focused Crawling for both Topical Relevance and Quality of Medical Information By Tim Tang, David Hawking, Nick Craswell, Kathy Griffiths CIKM ’05 November,
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
INTRODUCING THE WEB INTELLIGENCE (WIT) GROUP Microsoft Research Asia.
Document Clustering 文件分類 林頌堅 世新大學圖書資訊學系 Sung-Chien Lin Department of Library and Information Studies Shih-Hsin University.
A General Optimization Framework for Smoothing Language Models on Graph Structures Qiaozhu Mei, Duo Zhang, ChengXiang Zhai University of Illinois at Urbana-Champaign.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Generic text summarization using relevance measure and latent semantic analysis Gong Yihong and Xin Liu SIGIR, April 2015 Yubin Lim.
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Answering Similar Region Search Queries Chang Sheng, Yu Zheng.
Hypersearching the Web, Chakrabarti, Soumen Presented By Ray Yamada.
Question Routing in Community Question Answering: Putting Category in Its Place 1 The Chinese University of Hong Kong, Shatin, N.T., Hong Kong 2 AT&T Labs.
Information Retrieval at NLC Jianfeng Gao NLC Group, Microsoft Research China.
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
Research © 2008 Yahoo! Generating Succinct Titles for Web URLs Kunal Punera joint work with Deepayan Chakrabarti and Ravi Kumar Yahoo! Research.
Linking Organizational Social Networking Profiles PROJECT ID: H JEROME CHENG ZHI KAI (A H ) 1.
Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.
August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Performance Measures. Why to Conduct Performance Evaluation? 2 n Evaluation is the key to building effective & efficient IR (information retrieval) systems.
1 Evaluating High Accuracy Retrieval Techniques Chirag Shah,W. Bruce Croft Center for Intelligent Information Retrieval Department of Computer Science.
A Probabilistic Model for Fine-Grained Expert Search Shenghua Bao, Huizhong Duan, Qi Zhou, Miao Xiong, Yunbo Cao, Yong Yu June , 2008, Columbus Ohio.
Using Social Annotations to Improve Language Model for Information Retrieval Shengliang Xu, Shenghua Bao, Yong Yu Shanghai Jiao Tong University Yunbo Cao.
ASSOCIATIVE BROWSING Evaluating 1 Jinyoung Kim / W. Bruce Croft / David Smith for Personal Information.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
The Loquacious ( 愛說話 ) User: A Document-Independent Source of Terms for Query Expansion Diane Kelly et al. University of North Carolina at Chapel Hill.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
11 A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 1, Michael R. Lyu 1, Irwin King 1,2 1 The Chinese.
Self-taught Clustering – an instance of Transfer Unsupervised Learning † Wenyuan Dai joint work with ‡ Qiang Yang, † Gui-Rong Xue, and † Yong Yu † Shanghai.
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Automatic Categorization of Query Results Kaushik Chakrabarti, Surajit Chaudhuri, Seung-won Hwang Sushruth Puttaswamy.
Databases Computer Technology. First Record Last Record New Record Previous Record Current Record Next Record Working with Microsoft Access (Database)
Usefulness of Quality Click- through Data for Training Craig Macdonald, ladh Ounis Department of Computing Science University of Glasgow, Scotland, UK.
Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,
Wen Chan 1 , Jintao Du 1, Weidong Yang 1, Jinhui Tang 2, Xiangdong Zhou 1 1 School of Computer Science, Shanghai Key Laboratory of Data Science, Fudan.
Evaluation of IR Performance
Evaluating Information Retrieval Systems
Category-Sensitive Question Routing in Community Question Answering
财务管理案例教学法 研究及示例 ——王遐昌 2006/11/10.
Presentation transcript:

Huizhong Doan, Yonbo Cao, Chin-Yew Lin and Yong Yu Shanghai Jiao Tong University & MSRA ACL /7/91Rick Liu

2008/7/92Rick Liu

 Question Search  Help users to search previous answers 2008/7/93Rick Liu  Any nice hotels in Berlin or Hamburg?  How long does it take to Hamburg from Berlin?  Cheap hotels in Berlin?

2008/7/94Rick Liu

 Identifying question topic & focus  Question tree  Determining the tree cut  Modeling question topic & focus for search  Language model 2008/7/9Rick Liu5

 Topic terms  BaseNP, WH-ngram  Topic profile  probability distribution of categories  Specificity  inverse of the entropy of the topic profile  Topic chain  topic terms ordered by specificity value (desc)  Topic tree 2008/7/9Rick Liu6

2008/7/9Rick Liu7

 M = ( Γ, θ )  Γ = [ C1, C2,.. Ck ], tree cut  Θ = [ P(C1), P(C2),.. P(Ck) ], prob param vector  A cut is any set of nodes  Σ i=1..k P( Ci ) = /7/9Rick Liu8

2008/7/9Rick Liu9 [n 0, n 11 ], [n 12, n 21, n 22, n 23 ], [n 13, n 24 ] [n 11, n 21, n 22, n 23, n 24 ]

2008/7/9Rick Liu10  Minimum Description Length Ref : Li and Abe, 1998

2008/7/9Rick Liu11

 P( q | q )  q : queried question  q : targeted question 2008/7/9Rick Liu12 ~ ~

 Yahoo! Answers  Resolved questions  travel : 314,616 items  computers & internet : 210,785 items  Tree fields  title ( only used )  description  answers 2008/7/9Rick Liu13

 Employed Vector Space Model  Manual judgments : relevant / irrelevant  Baseline : VSM, LMIR  Evaluation : MAP, R-precision, MRR 2008/7/9Rick Liu14

2008/7/9Rick Liu15

2008/7/9Rick Liu16

2008/7/9Rick Liu17

 Examine the correctness of question topics and question foci  200 queried question => 69 question incorrect  (a) Only have the head part ( 59 )  (b) Incorrect order ( 10 )  (a) explains why λ is /7/9Rick Liu18

 FAQ data  Community based  Jeon et al., 2005  Compared four different retrieval methods ▪ Vector space model ▪ Okapi ▪ Language model ▪ Translation-based model  Translation-based model performed the best 2008/7/9Rick Liu19

 Lexical chasm  Where to stay in Hamburg?  The best hotel in Hamburg?  IBM model 1  Use question titles and question description as the parallel corpus 2008/7/9Rick Liu20

2008/7/9Rick Liu21

1) Data Structure 2) Use MDL-based Tree Cut Model to Identify 3) A new form of language modeling for question search 4) Extensive experiments 2008/7/9Rick Liu22 Now only community-based From forum sites / FAQ sites

2008/7/9Rick Liu23

2008/7/9Rick Liu24

2008/7/9Rick Liu25