FACTORS AFFECTING RESPONSE RATES FOR REAL-LIFE MIR QUERIES Jin Ha LeeM. Cameron JonesJ. Stephen Downie Graduate School of Library and Information Science.

Slides:



Advertisements
Similar presentations
Analyzing Significant Differences between Means
Advertisements

Dummy Dependent variable Models
Ordinary least Squares
Logistic Regression Psy 524 Ainsworth.
Research Methodology For reader assistance, have an introductory paragraph in which attention is given to the organization of the section in relation to.
The Role of Library School STEM Curriculum in Academic Job Placement Kelli Trei, Biosciences Librarian, University of Illinois at Urbana-Champaign Introduction.
Trashball: A Logistic Regression Classroom Activity Christopher Morrell (Joint work with Richard Auer) Mathematics and Statistics Department Loyola University.
Logistic Regression Part I - Introduction. Logistic Regression Regression where the response variable is dichotomous (not continuous) Examples –effect.
Stat 217 – Day 24 Analysis of Variance Have yesterday’s handout handy.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
SLIDE 1IS 240 – Spring 2010 Logistic Regression The logistic function: The logistic function is useful because it can take as an input any.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Stat 217 – Day 27 Chi-square tests (Topic 25). The Plan Exam 2 returned at end of class today  Mean.80 (36/45)  Solutions with commentary online  Discuss.
Handling Categorical Data. Learning Outcomes At the end of this session and with additional reading you will be able to: – Understand when and how to.
Multiple Regression – Basic Relationships
BA 427 – Assurance and Attestation Services
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Inferential Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Data and Data Collection Questionnaire
Survey Of Music Information Needs, Uses, and Seeking Behaviors Jin Ha Lee J. Stephen Downie Graduate School of Library and Information Science University.
Advanced Statistics for Researchers Meta-analysis and Systematic Review Avoiding bias in literature review and calculating effect sizes Dr. Chris Rakes.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Minorities and Retirement Security (MRS) Minorities and Retirement Security (MRS) Dr. Hervani (PI) Saeid Delnavaz (RA) Fourth Seminar May 30, 2014 Chicago.
1 A Collaborative Music Information Retrieval / Music Digital Library Research Model: A Briefing J. Stephen Downie, Ph.D. Graduate School of Library &
Exploiting Recommended Usage Metadata: Exploratory Analyses Xiao Hu, J. Stephen Downie, Andreas Ehmann THE ANDREW W. MELLON FOUNDATION The International.
PS 225 Lecture 20 Linear Regression Equation and Prediction.
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Chapter 16 Data Analysis: Testing for Associations.
THE IRISH SOFTWARE ENGINEERING RESEARCH CENTRELERO© What we currently know about software fault prediction: A systematic review of the fault prediction.
Categorical data analysis: An overview of statistical techniques AnnMaria De Mars The Julia Group AnnMaria De Mars The Julia Group.
SCIENCE FAIR PROJECT TITLE Name(s) Mrs. Rubio Period _____.
Science Process Skills Vocabulary 8/17/15. Predicting Forming an idea of an expected result. Based on inferences.
Scientific Inquiry What is Science?  A body of knowledge  A set of theories that describes the world  A way of learning about the world  A method.
Information Systems Today: Managing in the Digital World TB3-1 3 Technology Briefing Database Management “Modern organizations are said to be drowning.
EXPERIMENTS AND EXPERIMENTAL DESIGN
Selecting Good Expansion Terms for Pseudo-Relevance Feedback Guihong Cao, Jian-Yun Nie, Jianfeng Gao, Stephen Robertson 2008 SIGIR reporter: Chen, Yi-wen.
+ Chapter Scientific Method variable is the factor that changes in an experiment in order to test a hypothesis. To test for one variable, scientists.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
THE 2006 MUSIC INFORMATION RETRIEVAL EVALUATION EXCHANGE (MIREX 2006) RESULTS OVERVIEW The IMIRSEL Group led by J. Stephen Downie Graduate School of Library.
BACKGROUND The Web is a global information resource Web users that seek information vary, culturally and ethnically Users of different cultural backgrounds.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
A Framework for Detection and Measurement of Phishing Attacks Reporter: Li, Fong Ruei National Taiwan University of Science and Technology 2/25/2016 Slide.
To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent Presented by Jaime Teevan, Susan T. Dumais, Daniel J. Liebling Microsoft.
PSYCH 610 Entire Course (UOP) For more course tutorials visit  PSYCH 610 Week 1 Individual Assignment Research Studies Questionnaire.
Predicting Winning Price In Real Time Bidding With Censored Data Tejaswini Veena Sambamurthy Weicong Chen.
Alvin CHAN Kay CHEUNG Alex YING Relationship between Twitter Events and Real-life.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
The Math Studies Project for Internal Assessment   Personal research   Relate to student’s own interest   Use mathematical tools from the syllabus.
Introduction to Power and Effect Size  More to life than statistical significance  Reporting effect size  Assessing power.
Analysis of Fastenal Quoting Practices
Consumer Behavior MKTG 302-PSYC 335 Consumer Research
Understanding Results
Data and Data Collection
Rainfall Example The data set contains cord yield (bushes per acre) and rainfall (inches) in six US corn-producing states (Iowa, Nebraska, Illinois, Indiana,
Steps of answering a scientific question
Hypothesis Construction
Nature of Science.
Incremental Partitioning of Variance (aka Hierarchical Regression)
Lecture11 review for final examination
Mathematics Benchmark
Assessed practical: Resistance of a wire
First and Last name homeroom
Presentation transcript:

FACTORS AFFECTING RESPONSE RATES FOR REAL-LIFE MIR QUERIES Jin Ha LeeM. Cameron JonesJ. Stephen Downie Graduate School of Library and Information Science University of Illinois at Urbana-Champaign Possible Explanations: Some questions are either impossible or difficult to answer given the query statement, regardless of the price. Users may also need more words to adequately describe their information needs for difficult questions that are less likely to get answered. As users provide more information they may also increase the chance that they are providing incorrect information which may result in search failure. Discussion Conclusion: Our results show that price has no effect and adding more words has a negative effect, although minimal, on the likelihood of a query being answered. Future Work: A qualitative analysis of the queries to determine which of the answered queries were answered correctly and also assess the level of accuracy of user provided information features for answered queries. Special Thanks to: The Andrew W. Mellon Foundation and the National Science Foundation (Grant No. NSF IIS ) Conclusion and Future Work Results The increase in the proportion of queries answered drops rapidly over time. More than half of all answered queries (51.69%) are answered within 2 hours of being posted and 83.71% within the first 24 hours. Difference in the mean price: (Δµ = 0.097) Logistic regression model: logit(π) = X The query price has no statistically significant effect on the probability of getting an answer (likelihood-ratio test statistic = 0.013; df = 1; p = 0.908). Difference in the mean query length: (Δµ = 4.553) Logistic regression model: logit(π) = X The word count has a statistically significant effect on the probability of getting an answer (likelihood-ratio test statistic = ; df = 1; p = 0.000), but the substantive difference is very small. Price Query Length Response Rate and Time Total number of queries: 2,208 Number of answered queries: 1,062 Number of unanswered queries: 1,146 Overall probability of a query being answered ≈ Project: This poster reports on research conducted as part of the Human Use of Music Information Retrieval Systems (HUMIRS) project. Objective: To improve our understanding of the factors related to a query being answered. Analysis: First, we examine how the proportion of queries answered varies over time. Then we compare selected features of answered and unanswered queries, namely the price offered for the answer and the length of the query, in order to understand if these variables affect the probability of a query being answered. Introduction Data: 2,208 real-life music-related queries collected from the Google Answers website’s music category. Features Examined: (1) if the query was answered or not; (2) the elapsed time in minutes between when the query was posted and when it was answered; (3) the number of content words in the query (i.e., stop words removed); and (4) the price offered for the answer. Statistical Analysis: A logistic regression model describes how the probability of a query being answered depends on the explanatory variables. We test if that probability is independent of variable X by the likelihood- ratio test. A well-fitting model is significant at the.05 level. Data and Analysis