The Ratings Game: Scoring Washington Reds
Christopher Bitter, University of Washington
Introduction
Motivation
- U.S. consumers are "buying based on points" / ratings have a huge impact on wine sales
- Is this a viable strategy? How relevant are ratings?
Data
- 1,293 Washington State red wines rated by Wine Advocate, Wine Enthusiast, and Wine Spectator (3,879 total ratings)
- 2007-2012 vintages; 11 varietals; 8 AVAs; $11 to $150 (median $45); average score: 90.7 points
Questions
- Do the publications agree with one another?
- Are the differences in scoring systematic? In other words, can they be explained by subjective preferences?
Simplicity
- We all know that a single number can't capture the nuances of a wine or the circumstances surrounding its enjoyment
- Can it help us choose higher-quality wines that we will enjoy more?
Prior Work
U.S. Wine Competitions
- Hodgson (2008; 2009); Ashton (2012); Cao (2014); etc.
- Low correlations in scoring across judges – a lack of consensus
- Judges also lack reliability – unable to replicate scores in subsequent tastings of the same wine
Bordeaux en Primeur Tastings
- Moderate degree of consensus (Ashton 2013, etc.)
- Differences are systematic – indicative of subjectivity (Masset et al. 2015; Cardebat & Vivat 2016)
Both are unique settings – not entirely relevant to the typical U.S. wine drinker, so the ability to generalize results is uncertain
Stuen et al. (2015) – study of CA and WA wines
Agreement? Scoring Distributions
- Wine Enthusiast gives the highest scores / Wine Spectator the lowest (bias)
- Wine Spectator uses a narrower scoring range – 98% of its scores fall within a 9-point span (discriminates less)
- Do they use the 100-point scale in a consistent manner?
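A minimal sketch of how these distributional comparisons could be computed, assuming a hypothetical DataFrame `scores` with one row per wine and columns 'advocate', 'enthusiast', and 'spectator' holding the 100-point scores (the column names are illustrative, not from the paper):

```python
# Compare score distributions across the three publications.
import pandas as pd

def distribution_summary(scores: pd.DataFrame) -> pd.DataFrame:
    """Mean, standard deviation, and width of the central 98% of each publication's scores."""
    pubs = ['advocate', 'enthusiast', 'spectator']
    rows = []
    for pub in pubs:
        s = scores[pub].dropna()
        rows.append({
            'publication': pub,
            'mean': s.mean(),
            'std': s.std(),
            # Width of the interval containing 98% of scores (1st to 99th percentile)
            'central_98_range': s.quantile(0.99) - s.quantile(0.01),
        })
    return pd.DataFrame(rows)
```

A higher mean indicates upward bias; a narrower central range indicates less discrimination among wines.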
Agreement? Correlations
- Low-to-moderate degree of consensus regarding wine quality
- Correlations intermediate between the wine competition and Bordeaux settings
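A sketch of the pairwise agreement calculation, under the same assumed `scores` DataFrame as above:

```python
# Pairwise agreement between publications on the wines both have rated.
import pandas as pd
from itertools import combinations

def pairwise_correlations(scores: pd.DataFrame) -> pd.DataFrame:
    """Pearson and Spearman correlations for each pair of publications."""
    pubs = ['advocate', 'enthusiast', 'spectator']
    rows = []
    for a, b in combinations(pubs, 2):
        pair = scores[[a, b]].dropna()  # keep only wines rated by both publications
        rows.append({
            'pair': f'{a} vs {b}',
            'n_wines': len(pair),
            'pearson': pair[a].corr(pair[b], method='pearson'),
            'spearman': pair[a].corr(pair[b], method='spearman'),
        })
    return pd.DataFrame(rows)
```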
Agreement? Variation in Scores
- Mean standard deviation across publications is 1.40 points for the 1,293 wines
- The range (highest minus lowest score) is 4 points or more 40% of the time
- The range is the more intuitive summary, so the focus here is on it
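These per-wine dispersion measures could be computed along the following lines, again assuming the hypothetical `scores` DataFrame introduced above:

```python
# Per-wine disagreement across the three publications.
import pandas as pd

def score_dispersion(scores: pd.DataFrame) -> pd.Series:
    """Mean per-wine standard deviation and share of wines with a range of 4+ points."""
    pubs = ['advocate', 'enthusiast', 'spectator']
    rated = scores[pubs].dropna()                        # wines rated by all three
    per_wine_std = rated.std(axis=1)                     # std. dev. across the three scores
    per_wine_range = rated.max(axis=1) - rated.min(axis=1)
    return pd.Series({
        'mean_std': per_wine_std.mean(),
        'share_range_ge_4': (per_wine_range >= 4).mean(),
    })
```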
Disagreement
Potential causes of disagreement in scoring
- Lack of accuracy / reliability
- Subjective preferences
Testing for subjectivity
- If preferences play a role, scoring differences should be systematically related to wine attributes
- The difference in score between two publications is modelled as a function of price, vintage, varietal, appellation, and winery
- Ordinary least squares estimation (see the sketch below)
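A sketch of one such score-difference regression, assuming a hypothetical DataFrame `wines` with columns 'advocate', 'spectator', 'price', 'vintage', 'varietal', 'ava', and 'winery' (one row per wine); the column names and the log-price specification are illustrative, not the paper's exact model:

```python
# OLS regression of the Advocate-minus-Spectator score gap on wine attributes.
import numpy as np
import statsmodels.formula.api as smf

def score_diff_model(wines):
    """Fit the score-difference regression and return the fitted model."""
    df = wines.copy()
    df['diff'] = df['advocate'] - df['spectator']
    model = smf.ols(
        'diff ~ np.log(price) + C(vintage) + C(varietal) + C(ava) + C(winery)',
        data=df,
    ).fit()
    return model  # model.rsquared gives the share of variation explained

# Example usage: print(score_diff_model(wines).summary())
```

The same specification would be re-estimated for each pair of publications; a statistically meaningful R-squared indicates that the disagreement is systematic rather than random.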
Regression Results
Price and label attributes explain the following shares of the variation in score differences:
- 33% of the difference between Advocate & Enthusiast
- 43% of the difference between Advocate & Spectator
- 21% of the difference between Enthusiast & Spectator
Implications
Consumers
- A single score is not always representative of consensus opinion – this limits its relevance; better to consider multiple scores
- 63% of all wines in the $15-$25 range achieved a maximum score of 90 or above, but only 9% had a "consensus" score of 90
- Subjectivity is not necessarily negative, but it implies that some ratings may be more relevant than others
- Ratings are relevant – but a blunt instrument
Producers
- Good producers should be rewarded in the end, but variability in scoring favors those with better access to the review system
- The probability of getting a 90-point score in the $15-$25 category improves from 28% with 1 rating to 63% with 3
- Opportunity to exploit knowledge of scoring differences and preferences in order to improve ratings and sales? "Superscoring" (see the sketch below)
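One way to illustrate the "superscoring" effect is to compare, within a price band, the share of wines whose best score clears 90 against the share whose lowest score does. This is a hedged sketch using the same hypothetical `scores` DataFrame plus a 'price' column; the talk's 28%/63%/9% figures come from the underlying data, and this only shows the style of calculation, not the paper's exact definition of a consensus score:

```python
# Best-score vs. consensus-score shares in a price band.
import pandas as pd

def superscoring_share(scores: pd.DataFrame,
                       low: float = 15, high: float = 25,
                       threshold: int = 90) -> pd.Series:
    """Share of wines in [low, high] reaching `threshold` on their best vs. their lowest score."""
    pubs = ['advocate', 'enthusiast', 'spectator']
    band = scores[(scores['price'] >= low) & (scores['price'] <= high)].dropna(subset=pubs)
    best = band[pubs].max(axis=1)    # the score a "superscoring" producer would advertise
    lowest = band[pubs].min(axis=1)  # clearing 90 here means all three publications agree
    return pd.Series({
        'share_best_ge_threshold': (best >= threshold).mean(),
        'share_consensus_ge_threshold': (lowest >= threshold).mean(),
    })
```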
The End. Email Bitter@UW.edu for a copy of the paper or more information
Regression Coefficients: Raw Score Models