An empirical study on Perception of Correlation using Scatter Plots Created by:- Varshita Sher.

Slides:



Advertisements
Similar presentations
Scaling. Scaling seeks to discover how varying the physical parameters of the stimulus affects the psychological parameters. In general, scaling is concerned.
Advertisements

Design of Experiments Lecture I
Correlation Oh yeah!.
Correlation Chapter 6. Assumptions for Pearson r X and Y should be interval or ratio. X and Y should be normally distributed. Each X should be independent.
Inference for Regression
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Statistical Issues in Research Planning and Evaluation
Scatter Diagrams and Linear Correlation
A Short Introduction to Curve Fitting and Regression by Brad Morantz
Correlation CJ 526 Statistical Analysis in Criminal Justice.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Excellence Justify the choice of your model by commenting on at least 3 points. Your comments could include the following: a)Relate the solution to the.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Correlation and Regression Analysis
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Relationships Among Variables
Assessment Statements  The Internal Assessment (IA) Rubric IS the assessment statement.
Lecture 15 Basics of Regression Analysis
Inference for regression - Simple linear regression
3 CHAPTER Cost Behavior 3-1.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Hypothesis Testing in Linear Regression Analysis
Simple Linear Regression Models
CORRELATION & REGRESSION
Correlation.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
The Scientific Method Honors Biology Laboratory Skills.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Correlation & Regression
Examining Relationships in Quantitative Research
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Examining Relationships in Quantitative Research
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Correlation & Regression Analysis
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Absolute Threshold PSY 3215 – Perception – Appalachian State University.
CORRELATION ANALYSIS.
Psy Psychology of Hearing Psychophysics and Detection Theory Neal Viemeister
Principal Component Analysis
Chapter 13 Understanding research results: statistical inference.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
BPS - 5th Ed. Chapter 231 Inference for Regression.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Linear Regression 1 Sociology 5811 Lecture 19 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Stats Methods at IC Lecture 3: Regression.
Statistical analysis.
SUR-2250 Error Theory.
Regression Analysis.
Correlation and Simple Linear Regression
Statistical analysis.
Regression Analysis.
12 Inferential Analysis.
CHAPTER fourteen Correlation and Regression Analysis
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
12 Inferential Analysis.
Basic Practice of Statistics - 3rd Edition Inference for Regression
Simple Linear Regression and Correlation
Product moment correlation
Presentation transcript:

An empirical study on Perception of Correlation using Scatter Plots Created by:- Varshita Sher

Introduction  Limited knowledge regarding metrics underlying the perception of SCPs.  Need to assess the presence of significant difference between subjective and objective correlation values w.r.t to different  Correlation indices  Data distribution  Symmetry of data enclosure  Density (no. of data points used to plot SCP)  Discuss the presence of any single linear or non-linear regression pattern or mathematical model to which the human perception conforms.

Aim  To study whether the “accuracy” and “precision” are systematically linked via Weber’s law where former refers to the ability of the user to guess the correlation simply by looking at the SCP and latter refers to the ability of the user to detect a presence of a difference in correlation coefficient of the two scatter plots.  Falsify the claims of a previous journal paper which states the applicability of Weber’s law to model the relation between human perception of correlation coefficient and mathematical formula calculated correlation value.  To study whether the JND (Just Noticeable Difference) is dependent on the reference correlation (positive or negative) chosen

Just Noticeable Difference  It is the quantity by which a given stimulus must increase or decrease before humans can reliably detect changes.  In simple terms the minimum amount of adjustment needed to produce a noticeable variation in sensory experience  For example, JND is be the minimum amount by which a person must raise his voice in order to be audible in a noisy room.  Ernst Weber was the first one to discover that this minimum amount is lawfully related to initial stimulus magnitude and coined the term, Weber’s law, as explained next.

Weber’s Law  Psychophysical law quantifying the perception of change in a given stimulus.  It states that the amount by which a physical stimulus must be increased in order for it to be detected by an observer is a constant fraction of the intensity of the original stimulus.  For example, in a quiet room it is easy to hear someone whisper but for a person to be audible in a noisy room he has to shout.  Hence, in context of JND: ΔI = K I where, K is the Weber’s fraction and I is the reference stimulus.

Tasks  Task JND: 2 adjacent SCPs with elliptical distribution, varying in respect to the correlation by an amount d. Cases considered :- d= 0.05, 0.1, 0.15, 0.25, 0.3, 0.35  Task Weber: elliptical distribution of SCP where the change in correlation was brought about by varying the semi-minor axis (b). Cases considered :- R = -0.9, -0.7,.., 0, 0.1, 0.2, …, 0.9

Tasks  Task Density: vary the number of points used to plot a SCP for three reference correlation values i.e. R = 0.3, 0.7 and 0.9 R = 0.5

Tasks  Task Distribution: the data points are distributed such that there are several different clusters – one along 45 degree line and two along -45 degree line. These two act as outliers and test the user’s ability to correctly estimate the correlation.

Tasks  Task Progressive Symmetry: change in variance of the variables at higher values. In this case we consider SCP where variance between data points increased at higher levels of first variable (x) and found that users chose to concentrate on majority of the x level where the data points were concentrated around the regression line and ignore higher dispersions at upper end of the x-axis. R = 0.7

Tasks  Task Reflective Asymmetry: study the effect of varying the symmetry of points on either side of the 45 degree line. R = 0.5

Software Implementation

Masking Effect

Software Implementation  Deployed Agile Software Development process - “incremental model”  Written using JavaScript, HTML, PHP  Collects user response in form of subjective correlation values when presented with different SCPs in accordance with the principal of repeated measures.  Procedure:  Pre-study presentation  Collecting demographic information using radio buttons, text boxes and Likert scaling for familiarity ratings  Training session providing feedback  Testing session for the main trials  Display 2 SCPs on the screen  Use respective slider bars to record answers for each one  Press ‘Next’ button to move onto next trial  Timely break intervals to reduce fatigue and boredom  Check points to monitor participant performance and detect random-clicks.  Masking screen to relax the eye vision  Amazon voucher as a thank-you token  Feedback survey

Back end processing done by software  The software calculates certain other metrics such as difference between objective and subjective correlation, amount of overestimation, underestimation, response time, etc.  It creates session variables for the demographic information entered by the user as they tend to remain constant for each individual.  The remaining information such as start time, stop time, estimated correlation, actual correlation (elements susceptible to change w.r.t individual trial) are stored as global variables.  A record is saved for each trial that consists of User ID, stimuli number, actual correlation, estimated correlation  2 ways to store user file  a personalized.txt file is generated (by the user ID of the participant). After each trial, data is appended to it with a new line character between each record.  a.json file is created as a backup copy. Using jStorage.set() data is stored on the web and accessed later on using jStorage.get().

Estimated Correlation Ideal correlation curve with weber fraction k Estimated correlation curve Falsifying Weber’s law

Ideal curve Obtained curve

Results  Used SPSS to perform hypothesis testing with significance level alpha= 0:05  Preliminary analysis – Friedman analysis to show presence of overall significance in data  Secondary analysis – Wilcoxon Signed-Rank test to detect source of significant difference  The human perception is affected by variation in data distribution, density and symmetricity, which means Weber’s law is insufficient to fit the human perception of “accuracy”  The “precision” also cannot be modelled as it was observed that the JND is dependent on the initial reference point chosen.  Task specific observations:  The JND lies in the range 0.05 < JND < 0.10 irrespective of the reference point chosen.  There isn’t a statistically significant difference between estimation of pos vs. neg correlations (elliptical configurations).  As the distribution levels increases, the accuracy in user performance decreases  Lower density values (such as 40) render biased results compared to 60, 80, 100 and 120. Also for each of them, error rate is max at higher R (R=0.7)  Reflective Asymmetry has no impact on lower correlation values (R<=0.3) but affects depiction of higher correlation coefficients.  Progressive Symmetry also leads to a decrease in accuracy of user judgment of correlation

Conclusions  As standalone quantities, the statistical indictor (Pearson or Spearman correlation coefficient) as well as the graphical indicator (SCPs) of correlation is insufficient and unreliable.  They must be considered as a ‘married couple’, complementing each other to depict necessary  We claim to disregard the common notion that ‘given a scatter plot, correlation can be easily perceived from it by anyone with just a little training.’

Thank you