How Opinions are Received by Online Communities A Case Study on Amazon.com Helpfulness Votes Cristian Danescu-Niculescu-Mizil 1, Gueorgi Kossinets 2, Jon.

Slides:



Advertisements
Similar presentations
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Advertisements

Cristian Danescu-Niculescu-Mizil Dept. of Computer Science Cornell University Gueorgi Kossinets Google Inc. Jon Kleinberg Dept. of Computer Science Cornell.
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Correlation Chapter 9.
The Wisdom of the Few A Collaborative Filtering Approach Based on Expert Opinions from the Web Xavier Amatriain Telefonica Research Nuria Oliver Telefonica.
Cristian Danescu-Niculescu-Mizil 1, Gueorgi Kossinets 2, Jon Kleinberg 1, Lillian Lee 1 1 Dept. of Computer Science, Cornell University, 2 Google Inc.
Unit 2: Research Methods in Psychology
Agenda for January 25 th Administrative Items/Announcements Attendance Handouts: course enrollment, RPP instructions Course packs available for sale in.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Experimental Evaluation
TESTING A HYPOTHESIS RELATING TO THE POPULATION MEAN 1 This sequence describes the testing of a hypothesis at the 5% and 1% significance levels. It also.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Chapter 2 Research Methods. The Scientific Approach: A Search for Laws Empiricism: testing hypothesis Basic assumption: events are governed by some lawful.
Variance and Standard Deviation. Variance: a measure of how data points differ from the mean Data Set 1: 3, 5, 7, 10, 10 Data Set 2: 7, 7, 7, 7, 7 What.
Correlation and Linear Regression
Chapter 2: The Research Enterprise in Psychology
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Quality Control McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 2: The Research Enterprise in Psychology
Chapter 2 Research Methods. The Scientific Approach: A Search for Laws Empiricism: testing hypothesis Basic assumption: events are governed by some lawful.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Chapter 2 The Research Enterprise in Psychology. n Basic assumption: events are governed by some lawful order  Goals: Measurement and description Understanding.
PowerPoint presentation to accompany Research Design Explained 6th edition ; ©2007 Mark Mitchell & Janina Jolley Chapter 7 Introduction to Descriptive.
Correlation.
Copyright © Allyn & Bacon 2007 Chapter 2: Research Methods.
Automatically Identifying Localizable Queries Center for E-Business Technology Seoul National University Seoul, Korea Nam, Kwang-hyun Intelligent Database.
The Research Enterprise in Psychology. The Scientific Method: Terminology Operational definitions are used to clarify precisely what is meant by each.
User Study Evaluation Human-Computer Interaction.
Nature of Science Science Nature of Science Scientific methods Formulation of a hypothesis Formulation of a hypothesis Survey literature/Archives.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Analyzing and Interpreting Quantitative Data
Yes….. I expect you to read it! Maybe more than once!
Measures of Variability Variability: describes the spread or dispersion of scores for a set of data.
Chapter 2 AP Psychology Outline
1 Discovering Authorities in Question Answer Communities by Using Link Analysis Pawel Jurczyk, Eugene Agichtein (CIKM 2007)
Chapter 2 The Research Enterprise in Psychology. Table of Contents The Scientific Approach: A Search for Laws Basic assumption: events are governed by.
Designing Ranking Systems for Consumer Reviews: The Economic Impact of Customer Sentiment in Electronic Markets Anindya Ghose Panagiotis Ipeirotis Stern.
Validity and Reliability Edgar Degas: Portraits in a New Orleans Cotton Office, 1873.
Correlation & Regression
Managerial Economics Demand Estimation & Forecasting.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.
Interpersonal Relationships in Group Interaction in CSCW Environments Yang Cao, Golha Sharifi, Yamini Upadrashta, Julita Vassileva University of Saskatchewan,
Chapter 2 The Research Enterprise in Psychology. Table of Contents The Scientific Approach: A Search for Laws Basic assumption: events are governed by.
Chapter 6: Analyzing and Interpreting Quantitative Data
A New Approach to Utterance Verification Based on Neighborhood Information in Model Space Author :Hui Jiang, Chin-Hui Lee Reporter : 陳燦輝.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
Research in Psychology Chapter Two 8-10% of Exam AP Psychology.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 1 Research: An Overview.
1 Collecting and Interpreting Quantitative Data Deborah K. van Alphen and Robert W. Lingard California State University, Northridge.
Hypothesis Testing and Statistical Significance
Introduction Dispersion 1 Central Tendency alone does not explain the observations fully as it does reveal the degree of spread or variability of individual.
BUS 308 Entire Course (Ash Course) For more course tutorials visit BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Lecture Slides Elementary Statistics Twelfth Edition
RESEARCH METHODS 8-10% 250$ 250$ 250$ 250$ 500$ 500$ 500$ 500$ 750$
Correlation and Regression
iSRD Spam Review Detection with Imbalanced Data Distributions
Collecting and Interpreting Quantitative Data
Presentation transcript:

How Opinions are Received by Online Communities A Case Study on Amazon.com Helpfulness Votes Cristian Danescu-Niculescu-Mizil 1, Gueorgi Kossinets 2, Jon Kleinberg 1, Lillian Lee 1 1 Dept. of Computer Science, Cornell University, 2 Google Inc. WWW IDS Lab. Hwang Inbeom

Copyright  2009 by CEBT Outline  Users’ evaluation on online reviews: Helpfulness votes Observation of behaviors Making some hypothesis and proving their validity Coming up with a mathematical model explains these behaviors 2

Copyright  2009 by CEBT Introduction Opinion What did Y think of X? 3

Copyright  2009 by CEBT Introduction Meta-Opinion What did Z think of Y’s opinion of X? 4

Copyright  2009 by CEBT The Helpfulness of Reviews  Widely-used web sites include not just reviews, but also evaluations of the helpfulness of the reviews The helpfulness vote – “Was this review helpful to you?” Helpfulness ratio: – “a out of b people found the review itself helpful” 5

Copyright  2009 by CEBT Amazon.com Helpfulness Votes Data  4,000,000 reviews about roughly 700,000 books, including average star ratings and helpfulness ratios 6 Average star rating Helpfulness ratio

Copyright  2009 by CEBT Definitions of “Helpfulness”  Helpfulness in the narrow sense: “Does this review help you in making a purchase decision?” Liu’s work: annotation and classification of review helpfulness Annotators’ evaluation differed significantly from the helpfulness votes  Helpfulness “in the wild” The way Amazon users evaluate each others’ reviews Intertwined with complex social feedback mechanisms 7

Copyright  2009 by CEBT Flow of Presentation HypothesizingVerifyingModeling 8

Copyright  2009 by CEBT Flow of Presentation Hypothesizing Conformity Individual-bias Brilliant-but-cruel Quality-only VerifyingModeling 9

Copyright  2009 by CEBT Hypotheses: Social Mechanisms underlying  Well-studied hypotheses for how social effects influence group’s reaction to an opinion The conformity hypothesis The individual-bias hypothesis The brilliant-but-cruel hypothesis The quality-only straw-man hypothesis 10

Copyright  2009 by CEBT Hypotheses  The conformity hypothesis Review is evaluated as more helpful when its star rating is closer to the consensus star rating – Helpfulness ratio will be the highest of which reviews have star rating equal to overall average  The individual-bias hypothesis When a user considers a review, he or she will rate it more highly if it expresses an opinion that he or she agrees with 11

Copyright  2009 by CEBT Hypotheses (contd.)  The brilliant-but-cruel hypothesis Negative reviewers are perceived as more intelligent, competent, and expert than positive reviewers  The Quality-only straw-man hypothesis Helpfulness is being evaluated purely based on the textual content of reviews Non-textual factors are simply correlates of textual quality 12

Copyright  2009 by CEBT Flow of Presentation Hypothesizing Verifying Absolute deviation of helpfulness ratio Signed deviation of helpfulness ratio Variance of star rating and helpfulness ratio Making use of plagiarism Modeling 13

Copyright  2009 by CEBT Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 14

Copyright  2009 by CEBT Absolute Deviation from Average  Consistent with conformity hypothesis Strong inverse correlation between the median helpfulness ratio and the absolute deviation Reviews with star rating close to the average gets higher helpfulness ratio 15

Copyright  2009 by CEBT Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 16

Copyright  2009 by CEBT Signed Deviation from Average  Not consistent with brilliant-but-cruel hypothesis There is tendency towards positivity Black lines should not be sloped that way if it is valid hypothesis 17

Copyright  2009 by CEBT Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 18

Copyright  2009 by CEBT Addressing Individual-bias Effects  It is hard to distinguish between the conformity and the individual-bias hypothesis  We need to examine cases in which individual people’s opinions do not come from exactly the same distribution Cases in which there is high variance in star ratings Otherwise conformity and individual-bias are indistinguishable – Everyone has same opinion 19

Copyright  2009 by CEBT Variance of Star Rating and Helpfulness Ratio 20 Helpfulness ratio is the highest with reviews of which rating is slightly- above the average Two-humped camel plots: local minimum around average Helpfulness ratio is the highest when star ratings of reviews have average value

Copyright  2009 by CEBT Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 21

Copyright  2009 by CEBT Plagiarism  Making use of plagiarism is effective way to control for the effect of review text  Definition of plagiarized pair(s) of reviews Two or more reviews of different products With near-complete textual overlap 22

Copyright  2009 by CEBT An Example Skull-splitting headache guaranteed! If you enjoy thumping, skull splitting migraine headache, then Sing N Learn is for you. As a longtime language instructor, I agree with the attempt and effort that this series makes, but it is the execution that ultimately weakens Sing N Learn Chinese. To be sure, there are much, much better ways to learn Chinese. In fact, I would recommend this title only as a last resort and after you’ve thoroughly exhausted traditional ways to learn Chinese … Migraine Headache at No Extra Charge If you enjoy a thumping, skull splitting migraine headache, then the Sing N Learn series is for you. As a longtime language instructor, I agree with the effort that this series makes, but it is the execution that ultimately weakens Sing N Learn series. To be sure, there are much, much better ways to learn a foreign language. In fact, I would recommend this title only as a last resort and after you’ve thoroughly exhausted traditional ways to learn Korean … 23

Copyright  2009 by CEBT Plagiarism (contd.)  Plagiarized reviews Almost(not exact) same text – More possibly, same text could be considered as spam reviews Different non-textual information  If the quality-only straw man hypothesis holds, helpfulness ratios of documents in each pair should be the same  Possible other methods Human annotation – Could be subjective Classification using machine learning methods – We cannot guarantee the accuracies of algorithms 24

Copyright  2009 by CEBT Experiments with Plagiarism  Text quality is not the only explanatory factor Statistically significant difference between the helpfulness ratios of plagiarized pairs 25 The plagiarized reviews with deviation 1 is significantly more helpful than those with deviation 1.5

Copyright  2009 by CEBT Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 26

Copyright  2009 by CEBT Flow of Presentation HypothesizingVerifying Modeling Based on individual bias and mixtures of distributions 27

Copyright  2009 by CEBT Authors’ Model  Based on individual bias and mixtures of distributions  Two distributions: one for positive, one for negative evaluators Balance between positive and negative evaluators: Controversy level: – Density function of helpfulness ratios of positive evaluators – Gaussian distribution of which average is -centered – Density function of helpfulness ratios of negative evaluators – Gaussian distribution of which average is -centered 28

Copyright  2009 by CEBT Validity of the Model  Empirical observation and model generated 29

Copyright  2009 by CEBT Conclusion  A review’s perceived helpfulness depends not just on its content, but also the relation of its score to other scores  The dependence of the score is consistent with a simple and natural model of individual-bias in the presence of a mixture of opinion distributions  Directions for further research Variations in the effect can be used to form hypotheses about differences in the collective behaviors of the underlying populations It would be interesting to consider social feedback mechanisms that might be capable of modifying the effects authors observed here Considering possible outcomes of design problem for systems enabling the expression and dissemination of opinions 30

Copyright  2009 by CEBT Discussions  So, how can we use this? In which cases would this information be helpful? Available information is very limited – Star ratings – Helpfulness ratios  Conclusion is rather trivial Does not present new discoveries 31