The Art of Deceptive Statistics Using statistics as a dishonest tool to achieve desired results, and how to determine the validity of statistical results.

Slides:



Advertisements
Similar presentations
Authority 2. HW 8: AGAIN HW 8 I wanted to bring up a couple of issues from grading HW 8. Even people who got problem #1 exactly right didn’t think about.
Advertisements

Measures of Center: Mean, Median, Mode Please view this tutorial and answer the follow-up questions on loose leaf to turn in to your teacher.
June 9, 2008Stat Lecture 8 - Sampling Distributions 1 Introduction to Inference Sampling Distributions Statistics Lecture 8.
Chapter 7: Data for Decisions Lesson Plan
Ch. 21 Practice.
1.3: Uses and Abuses of Statistics
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
+ Chapter 10 Section 10.4 Part 2 – Inference as Decision.
Chapter 18 Sampling Distribution Models
Chapter 19 Confidence Intervals for Proportions.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample
Or How to make the numbers say whatever you want..
Estimation from Samples Find a likely range of values for a population parameter (e.g. average, %) Find a likely range of values for a population parameter.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
On Comparing Classifiers: Pitfalls to Avoid and Recommended Approach Published by Steven L. Salzberg Presented by Prakash Tilwani MACS 598 April 25 th.
Statistics and Research in Psychology Essential Question: How do psychologists refer to the their research methodology, data and results?
Measures of Central Tendency
Personal Financial Literacy What do you know? Is the $1000 her gross or net income?gross net 1. Janie gets a check after her first two weeks at work,
Chapter 10 Hypothesis Testing
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
March  There is a maximum of one obtuse angle in a triangle, but can you prove it?  To prove something like this, we mathematicians must do a.
4.1Introduction The field of statistical inference consist of those methods used to make decisions or to draw conclusions about a population. These methods.
Grade 10 MCAS OPEN RESPONSE QUESTION SPRING 2001 Exam, #40
Dr. Serhat Eren DESCRIPTIVE STATISTICS FOR GROUPED DATA If there were 30 observations of weekly sales then you had all 30 numbers available to you.
PARAMETRIC STATISTICAL INFERENCE
Statistical Process Control
Understanding Numerical Data
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Measures of Variability Variability: describes the spread or dispersion of scores for a set of data.
90288 – Select a Sample and Make Inferences from Data The Mayor’s Claim.
FUNDAMENTAL STATISTIC
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Understanding Randomness
Errors & Power. 2 Results of Significant Test 1. P-value < alpha 2. P-value > alpha Reject H o & conclude H a in context Fail to reject H o & cannot conclude.
By: Abby, Fiona, Gabby, and Willa. Look for the Facts Since the U.S. first created a legal drinking age many people have been trying to change and break.
Introduction Hypothesis testing for one mean Hypothesis testing for one proportion Hypothesis testing for two mean (difference) Hypothesis testing for.
Means & Medians Chapter 4. Parameter - Fixed value about a population Typical unknown.
Type I and Type II Error AP Stat Review, April 18, 2009.
Chapter Part 1 Sampling Distribution Models for.
Estimation of a Population Mean
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Minimum Sample Size Proportions on the TI Section
Section Estimating a Proportion with Confidence Objectives: 1.To find a confidence interval graphically 2.Understand a confidence interval as consisting.
Math 4030 – 9a Introduction to Hypothesis Testing
CONFIDENCE INTERVALS.
MAT 1000 Mathematics in Today's World. Last Time 1.Collecting data with experiments 2.Practical problems with experiments.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
By: Abby, Fiona, Gabby, and Willa. Look for the Facts Since the U.S. first created a legal drinking age many people have been trying to change and break.
Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.
Ch. 18 – Sampling Distribution Models (Day 1 – Sample Proportions) Part V – From the Data at Hand to the World at Large.
Understanding Numerical Data. Statistics Statistics is a tool used to answer general questions on the basis of a limited amount of specific data. Statistics.
Confidence Interval Estimation For statistical inference in decision making: Chapter 9.
4.1Introduction The field of statistical inference consist of those methods used to make decisions or to draw conclusions about a population. These methods.
Introduction to Inference Sampling Distributions.
DO NOW: On your DO NOW sheet in the Tuesday box: What is a common diagram psychologists use to describe data? Why can this graph be misleading?
President UniversityErwin SitompulPBST 10/1 Lecture 10 Probability and Statistics Dr.-Ing. Erwin Sitompul President University
Plan for Today: Chapter 1: Where Do Data Come From? Chapter 2: Samples, Good and Bad Chapter 3: What Do Samples Tell US? Chapter 4: Sample Surveys in the.
Introduction to Hypothesis Testing Chapter 10. What is a Hypothesis? A tentative explanation for an observation, phenomenon, or scientific problem that.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
By: Abby, Fiona, Gabby, and Willa. Look for the Facts Since the U.S. first created a legal drinking age many people have been trying to change and break.
Introduction to Inference
Ch. 18 – Sampling Distribution Models (Day 1 – Sample Proportions)
Testing Hypotheses about Proportions
1.3: Uses and Abuses of Statistics
Unit 7 Today we will look at: Normal distributions
Bell Ringer Open your student workbook and turn to pages 27 and 28.
WARM - UP Is the coin Fair?
Mean Deviation Standard Deviation Variance.
Presentation transcript:

The Art of Deceptive Statistics Using statistics as a dishonest tool to achieve desired results, and how to determine the validity of statistical results

Ways to use statistics for your own good Some ways that people use statistics for their own purposes are: Using a nonrandom sample that supports their conclusion Using different measures of center to describe similar data sets Reporting only some findings instead of all findings. Changing definitions to include more or less people

Not using a random sample Suppose a city council wanted to request federal assistance for their city. They would then want to show that their community is poor. One measure of which would be the value of their houses. If they decided to only take the price of houses from a very poor community, the average price of a home of the sample would be much lower than the true average price of a home. It’s not that they didn’t calculate the sample mean correctly, but they took a sample that wasn’t representative of the entire city. This is a prime example of purposely using misleading statistics.

What should have been done Houses from all over the city should have been taken, to get an accurate representation of the city’s true economic housing climate. The city could get in trouble if someone found out they deliberately used a nonrandom sample to report their findings.

Using different measures of center Another situation that statistics could be used to deceive people is when using measures of center. Remember that the three measures of center are mean(or average), median, and mode. Depending on which one you use, the center of a data set could look drastically different.

ExampleExample We have two datasets, A and B A is the set { 2,3,4,4,5,6,90 } B is the set { 1,2,2,4,6,8,11 } B is the set { 1,2,2,4,6,8,11 } What does it look like if we use the mean, median, or mode to describe these sets?

Example continued Mean of A: ( )/7 = 114/7 = Mean of B: ( )/7 = 34/7 = 4.86 The means paint the picture that these two data sets are drastically different, even though the large observation in A is the biggest difference Median of A: 4 Median of B: 4 If we use the median, then the two data sets look like they have the same center.

Example continued again... Mode for A: 4 occurs twice, so our mode is 4 Mode for B: 2 occurs twice, so our mode is 2 Using mode, A seems to be bigger than B. All three of these measures of centers paint different pictures of the data sets. None of them are wrong, but if you want it to look a certain way, you could purposely choose one that would back up your conclusion.

Underreporting your findings If you fail to report all of your statistical results from a study, this is also a form of deceptive statistics. One result could make your company or yourself look good, whereas the other would make you look bad. Obviously, you would only want to choose the report the results that help you out.

Underreporting Example A company that makes helmets releases a report that their new helmets are making the army safer, citing that deaths from head wounds are down 50%. What they don’t report is that people are still getting injured in the head, but since the helmets are better, they do not die. There are still plenty of people getting hurt, but its just in the form of injury, not death.

Changing Definitions Companies may also choose to include or remove certain demographics in a study that contradicts the conclusions they wish to make. Once again, this is not incorrect, but the purposeful removal of these groups make the act deceptive.

Changed Definitions Example How do you define a car accident as caused by alcohol? You could say if there was any alcohol consumed by anyone involved in the crash, it is alcohol related. Or, you could say that if the driver was legally intoxicated, then it is caused by alcohol.

Alcohol Example Continued Using these two definitions, we will get very different results if we want to determine the proportion of car accidents that are alcohol related. If an alcohol producer was making this report, they would clearly choose the definition that led to the lowest proportion of alcohol related accidents.

Ways to identify misleading stats You always want to look at the sample. How large is it? Who was in it? Is it representative? Investigate the methods used. Did they use the same measure of center for all datasets? Are all assumptions met for whatever method they used? Be aware of any excluded results that should be there.