Volunteer?. What’s the population of Raleigh? How many people live in Raleigh?

Slides:

Advertisements

Similar presentations

Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.

Advertisements

Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.

© 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Validity and Reliability Chapter Eight.

4 Why Should we Believe Politicians? Lupia and McCubbins – The Democratic Dilemma GV917.

Statistical Issues in Research Planning and Evaluation

WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.

Business Statistics for Managerial Decision

Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.

April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.

Detecting Near Duplicates for Web Crawling Authors : Gurmeet Singh Mank Arvind Jain Anish Das Sarma Presented by Chintan Udeshi 6/28/ Udeshi-CS572.

Extensions to Consumer theory Inter-temporal choice Uncertainty Revealed preferences.

ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.

8-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft.

Copyright ©2011 Pearson Education 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft Excel 6 th Global Edition.

The Analysis of Variance

1 Seventh Lecture Error Analysis Instrumentation and Product Testing.

Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.

1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.

Quality-aware Collaborative Question Answering: Methods and Evaluation Maggy Anastasia Suryanto, Ee-Peng Lim Singapore Management University Aixin Sun.

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.

Virginia Mathematics 2009 Grade 5 Standards of Learning Virginia Department of Education K-12 Mathematics Standards of Learning Institutes October 2009.

«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,

10.1: Estimating with Confidence Statistical inference uses language of probability to express the strength of our conclusions 2 types of formal statistical.

Confidence Interval Estimation

Probabilistic Model for Definitional Question Answering Kyoung-Soo Han, Young-In Song, and Hae-Chang Rim Korea University SIGIR 2006.

1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor ： Jia Ling, Koh Speaker ： SHENG HONG, CHUNG.

10.2 Estimating a Population Mean when σ is Unknown When we don’t know σ, we substitute the standard error of for its standard deviation. This is the standard.

CHAPTER 4 Engineering Communication

Comparing Two Population Means

Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.

Confidence Interval Estimation

Tomek Strzalkowski & Sharon G. Small ILS Institute, SUNY Albany LAANCOR May 22, 2010 (Tacitly) Collaborative Question Answering Utilizing Web Trails 5/22/10.

PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.

When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.

CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.

MODULE: INFORMATION QUALITY: RESEARCH METHODS DRAFT 23 Everett Street Cambridge, MA 02138, USA

XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 1. The Statistical Imagination.

Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.

Facilitating Document Annotation using Content and Querying Value.

Understanding User’s Query Intent with Wikipedia G 여 승 후.

21/11/20151Gianluca Demartini Ranking Clusters for Web Search Gianluca Demartini Paul–Alexandru Chirita Ingo Brunkhorst Wolfgang Nejdl L3S Info Lunch Hannover,

Meet the web: First impressions How big is the web and how do you measure it? How many people use the web? How many use search engines? What is the shape.

2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )

CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.

Information Integration By Neel Bavishi. Mediator Introduction A mediator supports a virtual view or collection of views that integrates several sources.

Chap 8-1 Chapter 8 Confidence Interval Estimation Statistics for Managers Using Microsoft Excel 7 th Edition, Global Edition Copyright ©2014 Pearson Education.

Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.

1 BA 555 Practical Business Analysis Linear Programming (LP) Sensitivity Analysis Simulation Agenda.

The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.

26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.

Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.

Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.

CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.

Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.

Finding similar items by leveraging social tag clouds Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: SAC 2012’ Date: October 4, 2012.

Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.

Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.

The Law of Averages. What does the law of average say? We know that, from the definition of probability, in the long run the frequency of some event will.

CHAPTER 9 Testing a Claim

Unit 5: Hypothesis Testing

Warm Up Check your understanding P. 586 (You have 5 minutes to complete) I WILL be collecting these.

CHAPTER 9 Testing a Claim

Chapter 8: Estimating with Confidence

Chapter 10: Estimating with Confidence

Significance Tests: The Basics

CHAPTER 9 Testing a Claim

CHAPTER 9 Testing a Claim

Information Retrieval and Web Design

Presentation transcript:

Volunteer?

What’s the population of Raleigh?

How many people live in Raleigh?

Paraphrasing Invariance Coefficient -- Measuring Para-Query Invariance of Search Engines Tomasz Imielinski, Jinyun Yan

Outline Motivation –Why do we want to measure the semanticity of search engines Methods –How to measure –What do we need to prepare Future Work –what should be solved

Search Engines are becoming semantic Google’s Eric Schmidt on the topic of the long term goals of Google search tells Techcrunch: ( )Techcrunch “Wouldn’t it be nice if Google understood the meaning of your phrase rather than just the words that are in that phrase? We have a lot of discoveries in that area that are going to roll out in the next little while.” Semantic Search: Search beyond words

Are they really semantic? Semantic Search is hot –Microsoft acquired semantic search engine PowerSet in 2008 –Baidu published “box computing” in 2009 –Facebook seeks to build the semantic search engine in 2010 Many search engine companies use the word “semantic”. Are they really semantic ?

Today: Users have to ask nicely Search Engines are still quite sensitive to the way queries are formulated Semantically equivalent queries usually will get different results! Example: –1. why do I get canker sores –2. what causes canker sores –3. why canker sores –4. the reason of canker sores

Returned Results Every URL is different. Such result is not semantic

Why? What does Semantic mean? –Understand the meaning behind words Should Semantic Search Engines understand the meaning of query? –Yes If they understand the meaning of query, are they supposed to recognize semantically equivalent queries? –Yes What should be the corresponding response when seeing semantically equivalent queries? –Return the same answer!  It’s necessary for Semantic Search Engines to recognize para- queries, although not sufficient!

Measure Metric Semantic Invariance –Be invariant to semantically equivalent queries Equivalent queries => para-queries –Alternative ways to express the same search desire “Population of Raleigh” vs “How many people live in Raleigh” One feasible method to measure the semanticity –Return same results for para-queries. Necessary but not sufficient Semantic => Semantically invariant Semantically invariant Semantic

Data Collection Objective tests need a large amount of para-queries –How to ? Paraphrase generation –Para-query is a sub-class of paraphrase –Paraphrase: restatement of one phrase using different words or same words in different orders. –Borrow an idea from paraphrase generation Lexical Variation: Numeric Synonym replacement (1623 pairs) –Top 10 songs  Top ten songs –Top 8 myspace  Top eight myspace Para-query has its own characteristics – short, few content words

Data Collection Extra Information –Tom Hanks  Tom Hanks actor –8 categories extracted from wikipedia, 7198 pairs Query Reformulation –Similar to query reformulation: expand query using related words –Query reformulation doesn’t promise equivalent meaning

Collective Wisdom Rephraser Game –Duration: 15 minutes for each round –Round: 430 rounds –Input: a start query, a hidden phrase which is not visible to players –Goal: paraphrase the start query to gain score –Win policy: If your phrase is proposed by others later: Votes If your phrase matches the hidden phrase: Jack pot –Player: no limitation to the number of players

Rephraser Game

Amodal Perception In our game, every player independently plays the game – Can’t see others’ input –Can’t chat with other players Why such independence is important? –Some games with similar purposes provide assistant information –Amodal theory in psychology Extra information will change the interpretation of the motion of objects

Visible Objects

Occluded by a Pipe

Generate para-Queries from the Game Votes –For each phrase proposed by players, it has a corresponding Votes score, which displays how many other players agreed on it –Filtering by Votes, we collect para-queries with high probability Votes >= 3 human acceptability ratio = 89% –Human acceptability ratio = #(agreed persons)/ #(voting persons) Stronger confidence –Absent with all choices (human acceptability ratio) –Not satisfy p, 1-p distribution (Kappa) –K players independently come up with a phrase p, in total M phrases, the confidence of p? Templates –Select start queries to form templates Who is the governor of [Alaska] now? – who is the governor of [X] now? –Find substitutions for argument slots

Measurement Goal: Are search engines semantically invariant? Experimental Set-Up Total of 19,518 pairs of (start query, para-query) Measured by paraphrasing invariant coefficient and entropy Tested on top 4 search engines:

Paraphrasing Invariant Coefficient Associated Queries –Queries are associated by Search Engine Return Page –Query q1 and q2 can have the same SERP, or not Para-Queries –Suppose q1 and q2 are para-queries, if they have the same returned top K URLs by search engine S, we assume: Search engine S recognized that they are para-queries –Relax to consider the top result URL Paraphrasing Invariant Coefficient –Given a pair of para-queries, what’s the probability for a search engine S to return the same top result URL. –It’s estimated from our test set : #(pairs with same top URL) / #(pairs)

Result quite low => high probability that you get different results for para-queries varies in different domains

Entropy Uncertainty –A standard in Information Theory to measure the level of uncertainty –Use it to measure how uncertain a search engine’s results are for a set of para-queries (q1…qn) Calculation –Given a set of para-queries (q1…qn), it has a corresponding top URL set (u1…um) returned by search engine S –If m = 1: S recognized that they are para-queries (best condition) –If m=n: every query got different result (worst condition) –If m<n: some queries are considered as para-queries

Result The best value for entropy is 0 The last column demonstrates the result for the worst condition Close to the worst condition, far away from best condition

Conclusion Semantic Search Engines need to understand the meaning of queries. If they could understand the meaning of queries, they should recognize para-queries. However, right now, they can’t.

Future Work How to recognize para-queries? What’s other measure metrics for semantic search engines? –Recognizing para-queries is necessary but not sufficient.

Thanks for your attention Questions ?