How to Avoid the Lies and Damned Lies: Pitfalls of Data Analysis Clay Helberg Special Topics in Marketing Research Dr. Charles Trappey Summarized by Kevin.

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

Statistical Decision Making
AP Statistics – Chapter 9 Test Review
Topic 6: Introduction to Hypothesis Testing
Empirical Analysis Doing and interpreting empirical work.
ProportionMisc.Grab BagRatiosIntro.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Writing a Research Protocol Michael Aronica MD Program Director Internal Medicine-Pediatrics.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Research Curriculum Session III – Estimating Sample Size and Power Jim Quinn MD MS Research Director, Division of Emergency Medicine Stanford University.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Today Concepts underlying inferential statistics
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Hypothesis Testing Methodology Z Test for the Mean (  Known) p-Value Approach to Hypothesis Testing.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
On Comparing Classifiers: Pitfalls to Avoid and Recommended Approach Published by Steven L. Salzberg Presented by Prakash Tilwani MACS 598 April 25 th.
The t Tests Independent Samples.
Chapter 14 Inferential Data Analysis
Descriptive Statistics
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 7 Sampling, Significance Levels, and Hypothesis Testing Three scientific traditions critical.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Educational Research Chapter 13 Post-Analysis Considerations Gay, Mills, and Airasian.
Choosing Statistical Procedures
AM Recitation 2/10/11.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Chapter 8 Introduction to Hypothesis Testing
Variation, Validity, & Variables Lesson 3. Research Methods & Statistics n Integral relationship l Must consider both during planning n Research Methods.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 23, Slide 1 Chapter 23 Comparing Means.
Academic Viva POWER and ERROR T R Wilson. Impact Factor Measure reflecting the average number of citations to recent articles published in that journal.
Sample Size Determination Donna McClish. Issues in sample size determination Sample size formulas depend on –Study design –Outcome measure Dichotomous.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Individual values of X Frequency How many individuals   Distribution of a population.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
Significance Toolbox 1) Identify the population of interest (What is the topic of discussion?) and parameter (mean, standard deviation, probability) you.
LECTURER PROF.Dr. DEMIR BAYKA AUTOMOTIVE ENGINEERING LABORATORY I.
Chapter 1 Measurement, Statistics, and Research. What is Measurement? Measurement is the process of comparing a value to a standard Measurement is the.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
SINTEF Telecom and Informatics EuroSPI’99 Workshop on Data Analysis Popular Pitfalls of Data Analysis Tore Dybå, M.Sc. Research Scientist, SINTEF.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
1 Statistics in Research & Things to Consider for Your Proposal May 2, 2007.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
Misuses of Quantitative Research V. Darleen Opfer.
Chapter 3: Statistical Significance Testing Warner (2007). Applied statistics: From bivariate through multivariate. Sage Publications, Inc.
©2010 John Wiley and Sons Chapter 2 Research Methods in Human-Computer Interaction Chapter 2- Experimental Research.
Issues of Quantitative Research: Part 1 V. Darleen Opfer.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 7 Sampling, Significance Levels, and Hypothesis Testing Three scientific traditions.
Reasoning in Psychology Using Statistics Psychology
Analyzing Statistical Inferences July 30, Inferential Statistics? When? When you infer from a sample to a population Generalize sample results to.
Introduction to Validity True Experiment – searching for causality What effect does the I.V. have on the D.V. Correlation Design – searching for an association.
Descriptive and Inferential Statistics Descriptive statistics The science of describing distributions of samples or populations Inferential statistics.
T tests comparing two means t tests comparing two means.
BHS Methods in Behavioral Sciences I May 9, 2003 Chapter 6 and 7 (Ray) Control: The Keystone of the Experimental Method.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
More about tests and intervals CHAPTER 21. Do not state your claim as the null hypothesis, instead make what you’re trying to prove the alternative. The.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Statistics for Decision Making Hypothesis Testing QM Fall 2003 Instructor: John Seydel, Ph.D.
CRITICALLY APPRAISING EVIDENCE Lisa Broughton, PhD, RN, CCRN.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Understanding Results
Hypothesis Tests: One Sample
12 Inferential Analysis.
Chapter 6 Making Sense of Statistical Significance: Decision Errors, Effect Size and Statistical Power Part 1: Sept. 18, 2014.
Do you know population SD? Use Z Test Are there only 2 groups to
12 Inferential Analysis.
More About Tests Notes from
Rest of lecture 4 (Chapter 5: pg ) Statistical Inferences
Presentation transcript:

How to Avoid the Lies and Damned Lies: Pitfalls of Data Analysis Clay Helberg Special Topics in Marketing Research Dr. Charles Trappey Summarized by Kevin Beyer

The Problem with Statistics “There are three kinds of lies; lies, damned lies and statistics.” Twain or Desraeli Statistics requires the ability to consider things from a probabilistic perspective. Non-mathematicians view numbers as if they must be right, and therefore anything that isn’t ‘right’ must be ‘wrong’. Statistical Pitfalls: I.Sources of Bias, II.Errors in Methodology, III.Interpretation of Results

I. Sources of Bias 1. Representative Sampling –The observed sample must represent the target population Problematic sample = one that doesn’t parallel the population –Can’t always control for all of the key characteristics

I. Sources of Bias 2.Statistical Assumption –The validity of statistical procedure depends on statistical assumptions ANOVA depends on the assumption of normality and independence Creates a temptation to ignore any non-normality. Should try to find reasons why; measurement artifact -> develop a better measuring tool. –Assumption of independence is often violated Observations that are linked in some way may show dependencies Aggregating cases to the higher level is one way around this.

II. Errors in Methodology 1. Statistical Power Vertical dotted line reps. the point-null hypothesis Solid vertical line represents a criterion for significance Alpha = probability of a Type I error (reject null when shouldn’t) Beta = probability of a Type II error (don’t reject null when should) Power refers to your ability to avoid a Type II error – Depends on sample size, effect size, alpha, variability

II. Errors in Methodology 1. Statistical Power Cont’d Too little power, you run the risk of missing the effect you’re trying to find Important if you’re looking to claim ‘no difference’ - it may be there, but the sample size may be too small. Too much power can result in tiny or meaningless differences being statistically significant.

II. Errors in Methodology 2. Multiple Comparisons

II. Errors in Methodology 2. Multiple Comparisons

II. Errors in Methodology 3. Measurement Error Occurs especially is ‘noisy’ data, like surveys Important characteristics of measurement are reliability and validity Reliability is the ability of instrument to measure the same thing each time Validity is the extent in which the indicator is able to measure the thing it is meant to measure

III. Problems with Interpretation 1. Confusion Over Significance Statistical significance and practical significance are not the same 2. Precision and Accuracy Precision = how finely a specimen is specified (4.097 is more precise than 4.0) Accuracy = how close an estimate is to the true value Estimates can be precise without being accurate Don’t report more decimal places than are meaningful

III. Problems with Interpretation 3. Causality The bottom line on causality; you must have random assignment The experimenter must be assigning values of predicator variables to cases. A -> B, B -> A, A B 4. Graphic Representations It is easy to confuse readers when presenting quantitative info. Graphically