Information Retrieval Performance Measurement Using Extrapolated Precision William C. Dimm DESI VI June 8, 2015.

Slides:



Advertisements
Similar presentations
Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
Advertisements

Average Annual Percent Cost Increase since 1999: EID Water, cost for 1500 cf per month & national CPI-U detail for water, wastewater, & solid waste.
The ISA for Physics What you need to revise.
Retrieval Evaluation J. H. Wang Mar. 18, Outline Chap. 3, Retrieval Evaluation –Retrieval Performance Evaluation –Reference Collections.
Chapter 5: Confidence Intervals.
Shape Analysis and Retrieval ( ) (Michael) Misha Kazhdan.
Evaluation.  Allan, Ballesteros, Croft, and/or Turtle Types of Evaluation Might evaluate several aspects Evaluation generally comparative –System A vs.
Local Affine Feature Tracking in Films/Sitcoms Chunhui Gu CS Final Presentation Dec. 13, 2006.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Retrieval Evaluation: Precision and Recall. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity.
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
When Small Effects are Impressive Deborah Prentice & Dale Miller.
Relationships. Direct Proportion Two quantities are directly proportional if an increase in one causes an increase in the other. Example: y = 2 x Example:
Experimental Design. Learning Targets I can… Identify the three types of variables in an experiment Identify quantitative and qualitative data Decide.
The College of Saint Rose CSC 460 / CIS 560 – Search and Information Retrieval David Goldschmidt, Ph.D. from Search Engines: Information Retrieval in Practice,
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Statistics Pooled Examples.
Demand Elasticities and Related Coefficients. Demand Curve Demand curves are assumed to be downward sloping, but the responsiveness of quantity (Q) to.
IR Evaluation Evaluate what? –user satisfaction on specific task –speed –presentation (interface) issue –etc. My focus today: –comparative performance.
Unit Three Ratios and Proportional Relationships Why do we learn vocabulary in math??
Microsoft Research1 A Statistical Analysis of the Precision-Recall Graph Ralf Herbrich, Hugo Zaragoza, Simon Hill. Microsoft Research, Cambridge University,
Graphing of Data Why do we display data with graphs?
IR System Evaluation Farhad Oroumchian. IR System Evaluation System-centered strategy –Given documents, queries, and relevance judgments –Try several.
Parallel and Distributed Searching. Lecture Objectives Review Boolean Searching Indicate how Searches may be carried out in parallel Overview Distributed.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
A graph represents the relationship between a pair of variables.
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
Threshold Setting and Performance Monitoring for Novel Text Mining Wenyin Tang and Flora S. Tsai School of Electrical and Electronic Engineering Nanyang.
Math 10 Lesson #2 Inverse Variation Mrs. Goodman.
What Does the User Really Want ? Relevance, Precision and Recall.
1 Performance Measures for Machine Learning. 2 Performance Measures Accuracy Weighted (Cost-Sensitive) Accuracy Lift Precision/Recall –F –Break Even Point.
Quiz 1 review. Evaluating Classifiers Reading: T. Fawcett paper, link on class website, Sections 1-4 Optional reading: Davis and Goadrich paper, link.
Chapter 24 Monetary and Fiscal Policy in the ISLM Model © 2005 Pearson Education Canada Inc.
Recall Gravitational PE n The work done to lift a ball of mass 1 kg a vertical height of 1 m is 10 J n What is the work done to lift 5 bowling balls?
Evaluating Classifiers Reading: T. Fawcett, An introduction to ROC analysis, Sections 1-4, 7 (linked from class website)An introduction to ROC analysis.
Information Retrieval (based on Jurafsky and Martin) Miriam Butt October 2003.
+ EXPERIMENTAL INVESTIGATIONS An experimental investigation is one in which a control is identified. The variables are measured in an effort to gather.
Matching not patching: primary maths and children’s thinking Anne Watson June 2009.
Information Retrieval Lecture 3 Introduction to Information Retrieval (Manning et al. 2007) Chapter 8 For the MSc Computer Science Programme Dell Zhang.
Proportionality SPH4U. Introduction In physics, we are often interested in how one variable affects another.
1 Math Supplement The Proportionality A “is proportional to” B.
Finding the Constant of Proportionality Math 7 Unit 1 Lesson 4.
Inverse Variation 2.5. In direct variation, if one variable increases, so does the other. Inverse variation is the opposite.
Graphing Techniques and Interpreting Graphs. Introduction A graph of your data allows you to see the following: Patterns Trends Shows Relationships between.
AGENDA LESSON 62 CORRECTIONS LESSON 63 QUESTIONS LESSON 64.
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval Chengxiang Zhai, John Lafferty School of Computer Science Carnegie.
Chapter 24 Monetary and Fiscal Policy in the ISLM Model.
Evaluating Classifiers
Aim: How do you construct a proper physics graph?
Rules for Graphing.
I need to use which graph?
تحليل الحساسية Sensitive Analysis.
3.2 Motion With Constant Acceleration
Graphing Techniques.
Scientific Measurement
Recall that a proportional relationship is a relationship between two quantities in which the ratio of one quantity to the.
Proportionality SPH4U.
The Method of Science.
Does age have a strong positive correlation with height? Explain.
Name:___________________________ Date:______________
Does age have a strong positive correlation with height? Explain.
Review 1+3= 4 7+3= = 5 7+4= = = 6 7+6= = = 7+7+7=
Chapter 6: Probability.
Introduction to Physics
Parabolic Curve Equation Relationship y=
Precision and Recall Reminder:
EXHIBIT 1 Three Categories of Resources
Observation Information we get from our senses alone.
ROC Curves and Operating Points
Presentation transcript:

Information Retrieval Performance Measurement Using Extrapolated Precision William C. Dimm DESI VI June 8, 2015

Comparing Precision-Recall Curves

What if we only know one point?

Comparing Precision-Recall Points

F 1 Comparison: Wrong Conclusion!

F 1 Depends Strongly on Recall

F 1 Contours

Danger Zone

Want Contours Like P-R Curves

Less Cutting Across Contours

How to Quantify?

Precision's Relationship with Cost ● Precision is meaningful – inversely proportional to number of docs to review: n = ρNR/P

Extrapolated Precision

X is Fairly Independent of R

Precision-Recall Curve Math

Actual Precision-Recall Curves

Actual Probability Curves

Model Probability vs Actual

Model Precision vs Actual

Extrapolation Limitations ● P < 0.99 ● R < 0.99 ● P >= 2ρ / (1 + ρ + R*(1 - ρ))

Summary ● Proportionality dictates recall – Need performance measure less sensitive to recall ● Extrapolate precision-recall point to target recall level using model curves ● Model precision-recall curves are constant performance contours ● When close to target recall, performance measure inversely proportional to review cost