Or How to make the numbers say whatever you want..

Slides:



Advertisements
Similar presentations
Mean = 4.8 Median = 3 Mode = 2. Correlation measures the strength of a relationship between two variables.
Advertisements

Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.
Table of Contents Exit Appendix Behavioral Statistics.
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Or How to make the numbers say whatever you want..
Calculating & Reporting Healthcare Statistics
Descriptive Statistics
Biostatistics Unit 2 Descriptive Biostatistics 1.
1.2: Describing Distributions
Danny Oppenheimer Kirsten Thorpe. Any modality of communication between persons or groups. Materials that hold data in any form or that allow data to.
A primer in Biostatistics
Measures of Variability: Range, Variance, and Standard Deviation
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Measures of Central Tendency
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Today: Central Tendency & Dispersion
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Summarizing Scores With Measures of Central Tendency
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Frequency Distribution I. How Many People Made Each Possible Score? A. This is something I show you for each quiz.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Psychometrics.
Statistical Analysis How do we make sense of the data we collect during a study or an experiment?
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Quantitative Skills: Data Analysis
Statistics Recording the results from our studies.
Descriptive Statistics Descriptive Statistics describe a set of data.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Research Methods Chapter 8 Data Analysis. Two Types of Statistics Descriptive –Allows you to describe relationships between variables Inferential –Allows.
Descriptive Statistics Used to describe or summarize sets of data to make them more understandable Used to describe or summarize sets of data to make them.
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Descriptive Statistics
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Describing distributions with numbers
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Descriptive Statistics Descriptive Statistics describe a set of data.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
According to researchers, the average American guy is 31 years old, 5 feet 10 inches, 172 pounds, works 6.1 hours daily, and sleeps 7.7 hours. These numbers.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Practice Page 65 –2.1 Positive Skew Note Slides online.
Measures of Central Tendency: The Mean, Median, and Mode
Central Tendency & Dispersion
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
CHAPTER 3 : DESCRIPTIVE STATISTIC : NUMERICAL MEASURES (STATISTICS)
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Statistics Unit 9 only requires us to do Sections 1 & 2. * If we have time, there are some topics in Sections 3 & 4, that I will also cover. They tie in.
LIS 570 Summarising and presenting data - Univariate analysis.
STATISTICS STATISTICS Numerical data. How Do We Make Sense of the Data? descriptively Researchers use statistics for two major purposes: (1) descriptively.
Descriptive Statistics Research Writing Aiden Yeh, PhD.
Organizing and Analyzing Data. Types of statistical analysis DESCRIPTIVE STATISTICS: Organizes data measures of central tendency mean, median, mode measures.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Outline Sampling Measurement Descriptive Statistics:
Summarizing Scores With Measures of Central Tendency
Statistics: the language of psychological research
Descriptive Statistics
Characteristics of the Mean
Descriptive Statistics: Describing Data
Presentation transcript:

or How to make the numbers say whatever you want.

Acknowledgements: Darrell Huff’s book: How to Lie With Statistics (First published in 1954) Danny Oppenheimer’s psychology course Other books that may be interesting: Statistical Tricks and Traps - by Ennis C. Almer.

“There are lies, damned lies, and statistics” -- Disraeli

Anecdotal evidence is unreliable Why does the phone always ring when you’re in the shower?

Determining the difference between chance and real effects

Practically all statistics are based on a sample of a population. So…... –how was the sample chosen? –how big is the sample? –what population does it claim to represent? –what population does it actually represent?

Data = Signal + Noise Signal = What we’re trying to measure Noise = Error in our measurement If noise is random, then as the sample size increases, noise tends to cancel, leaving only signal.

Flip a coin 5 times –Heads four times –80 % heads Flip 100 times –Same results??? In general, the larger the sample size, the better the estimation.

A telephone poll was taken during the U.S. presidential campaign of Franklin D. Roosevelt (1932). Based on the results of that poll, the pollsters predicted that FDR would not win. FDR did win, however. The poll did NOT accurately reflect all of the voters because the opinions of only one part of the population (wealthy people with telephones) were taken into account.

Center Mode Median Mean Spread Variability Standard Deviation Range

Mode The mode is any value that occurs most frequently The mode in this case is 50

Median (M) The midpoint of a distribution. Half (50%) of the observations are larger, half (50%) are smaller.

Steps to Find Median Arrange all the observations in order of size, from smallest to largest. Find the position of the median by counting the number of observations and finding the one in the middle. If there is an odd number of observations, the median will be one of your observations If there is an even number of observations, the median is the average of the center observations in the ordered list and will not be an observation.

Example How many CD’s do you own?

Mean Steps –Add the values of the observations –Divide the total by the number of observations –That’s the mean!

Mean – arithmetic average =  x/n Median – the halfway point Mode – the most common answer Average could mean any of them…

Incomes: $9000 $12,000 $120,000 $85,000 $15,000 Mean = $37,000 Median = $12,000 Mode = $9000 Each is a legitimate average but can serve conflicting purposes

Range – Overall difference between the highest and lowest scores. Variance – Average difference from the mean.

Standard Deviation Measures the spread by looking at how far the observations are from their mean. Measures the average distance of the observations from their mean. Variance (s 2 ) is the average of the squared distances from the mean. Standard Deviation is the square root of the variance.

Identical Range

Identical Variance

Median vs. Mean The midpoint and mean both describe the center, but which is better? The mean is strongly influenced by a few extreme observations and the median is not

The Normal Curve

Central Limit Theorem Any time you have a measure which is created by summing several individual trials of data (Signal + Noise) you will end up with a normal curve

Bimodal Distributions

Skewed Distributions

Uniform distributions

Correlation measures the strength of a relationship between two variables.

Positive Correlation 0 < R < 1 No Correlation R = 0 Negative Correlation -1 < R < 0

Correlation does not imply causation

A person wearing red is 129 times more likely to be gored by a charging bull! But what is the base rate? (more likely than what or who?)

98% of Americans have internet access available. The average temperature in Oklahoma City over the past 100 years is 63.4 degrees. (and each year has been within 1 degree of that average)

Consumer spending on prescription drugs has doubled since % of all car accidents occur within 10 miles of the home. 10 times as many people die each year by plane crashes than train crashes.

During the Spanish-American war, the death rate for soldiers was 9 per The death rate for civilians in New York City was 16 per 1000.

1 minute2 minutes Fluffy O’s Cereal Gives a body Energy!! 10 5 ??

Earnings

Even when the scale is fine, graphs can still be deceptive

As you can see, Bob earns substantially more than Joe Joe’s incomeBob’s Income

Is it really a two to one ratio that’s being displayed?

Portion of US income going to federal taxes Federal spending has become equal to the total income of the people of the black states.

Percent of US income going to federal taxes

Tylenol is used by 90% doctors for their own aches and pains! So are Aspirin, Ibuprofen, and Bayer!

When Dewey was elected Governor in 1942, the minimum teacher’s salary in some districts was as low as $900 per year. Upon Governor Dewey’s recommendation… the Legislature in 1947 appropriated $32,000,000 out of state surplus to provide an increase in teacher’s salaries. As a result, the minimum salaries of teachers in New York City range from $2500 to $5325.

1) Stanford is the #1 program in psychology 2) I am the only (and therefore #1) student at Stanford studying Decision Errors Therefore: I am the top student studying Decision Errors in the country.

1)Statistics are useful for figuring out random noise from real effects 2) Numbers are not absolute, and they can be easily manipulated 3) Always scrutinize data closely, and draw your own conclusions. 4) 85% of all statistics are made up on the spot: the rest are all wrong