Basic Statistical Concepts

Slides:



Advertisements
Similar presentations
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Advertisements

Overview Correlation Regression -Definition
Basic Statistical Concepts
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Statistics for the Social Sciences
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
The Simple Regression Model
Statistics Psych 231: Research Methods in Psychology.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Statistics for the Social Sciences Psychology 340 Fall 2006 Relationships between variables.
CHAPTER 3 Describing Relationships
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Lecture 17: Correlations – Describing Relationships Between Two Variables 2011, 11, 22.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
STAT 211 – 019 Dan Piett West Virginia University Lecture 2.
Correlation and regression 1: Correlation Coefficient
Statistics for the Behavioral Sciences Second Edition Chapter 4: Central Tendency and Variability iClicker Questions Copyright © 2012 by Worth Publishers.
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Covariance and correlation
Numerical Descriptive Techniques
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Descriptive Statistics Descriptive Statistics describe a set of data.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Relationships between variables Statistics for the Social Sciences Psychology 340 Spring 2010.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Descriptive Statistics Descriptive Statistics describe a set of data.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Numerical Measures of Variability
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
Data Analysis.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Today: Standard Deviations & Z-Scores Any questions from last time?
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Descriptive Statistics Psych 231: Research Methods in Psychology.
Descriptive Statistics ( )
Chapter 12 Understanding Research Results: Description and Correlation
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Reasoning in Psychology Using Statistics
Univariate Statistics
Reasoning in Psychology Using Statistics
Central Tendency and Variability
Statistics for the Social Sciences
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Understanding Research Results: Description and Correlation
Numerical Descriptive Measures
Small-N designs & Basic Statistical Concepts
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Descriptive Statistics
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Inferential Statistics
CHAPTER 3 Describing Relationships
Descriptive Statistics
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Warsaw Summer School 2017, OSU Study Abroad Program
CHAPTER 3 Describing Relationships
Presentation transcript:

Basic Statistical Concepts Psych 231: Research Methods in Psychology

Properties of distributions: Center There are three main measures of center Mean (M): the arithmetic average Add up all of the scores and divide by the total number Most used measure of center Median (Mdn): the middle score in terms of location The score that cuts off the top 50% of the from the bottom 50% Good for skewed distributions (e.g. net worth) Mode: the most frequent score Good for nominal scales (e.g. eye color) A must for multi-modal distributions

The Mean The most commonly used measure of center The arithmetic average Computing the mean Divide by the total number in the population The formula for the population mean is (a parameter): Add up all of the X’s The formula for the sample mean is (a statistic): Divide by the total number in the sample

Spread (Variability) How similar are the scores? Range: the maximum value - minimum value Only takes two scores from the distribution into account Influenced by extreme values (outliers) Standard deviation (SD): (essentially) the average amount that the scores in the distribution deviate from the mean Takes all of the scores into account Also influenced by extreme values (but not as much as the range) Variance: standard deviation squared

Variability Low variability The scores are fairly similar High variability The scores are fairly dissimilar mean mean

Standard deviation The standard deviation is the most popular and most important measure of variability. The standard deviation measures how far off all of the individuals in the distribution are from a standard, where that standard is the mean of the distribution. Essentially, the average of the deviations. m

An Example: Computing the Mean Our population 2, 4, 6, 8 1 2 3 4 5 6 7 8 9 10 m

An Example: Computing Standard Deviation (population) Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution. Our population 2, 4, 6, 8 1 2 3 4 5 6 7 8 9 10 m -3 X -  = deviation scores 2 - 5 = -3

An Example: Computing Standard Deviation (population) Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution. Our population 2, 4, 6, 8 1 2 3 4 5 6 7 8 9 10 m -1 X -  = deviation scores 2 - 5 = -3 4 - 5 = -1

An Example: Computing Standard Deviation (population) Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution. Our population 2, 4, 6, 8 1 2 3 4 5 6 7 8 9 10 m 1 X -  = deviation scores 2 - 5 = -3 6 - 5 = +1 4 - 5 = -1

An Example: Computing Standard Deviation (population) Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution. Our population 2, 4, 6, 8 1 2 3 4 5 6 7 8 9 10 m 3 X -  = deviation scores Notice that if you add up all of the deviations they must equal 0. 2 - 5 = -3 6 - 5 = +1 4 - 5 = -1 8 - 5 = +3

An Example: Computing Standard Deviation (population) Step 2: So what we have to do is get rid of the negative signs. We do this by squaring the deviations and then taking the square root of the sum of the squared deviations (SS). SS =  (X - )2 2 - 5 = -3 4 - 5 = -1 6 - 5 = +1 8 - 5 = +3 X -  = deviation scores = (-3)2 + (-1)2 + (+1)2 + (+3)2 = 9 + 1 + 1 + 9 = 20

An Example: Computing Standard Deviation (population) Step 3: ComputeVariance (which is simply the average of the squared deviations (SS)) So to get the mean, we need to divide by the number of individuals in the population. variance = 2 = SS/N SS = 20, N = 4 2 = 20/4 = 5.0

An Example: Computing Standard Deviation (population) Step 4: Compute Standard Deviation To get this we need to take the square root of the population variance. standard deviation =  =

An Example: Computing Standard Deviation (population) To review: Step 1: Compute deviation scores Step 2: Compute the SS Step 3: Determine the variance Take the average of the squared deviations Divide the SS by the N Step 4: Determine the standard deviation Take the square root of the variance

An Example: Computing Standard Deviation (SAMPLE) To review: Step 1: Compute deviation scores Step 2: Compute the SS Step 3: Determine the variance Take the average of the squared deviations Divide the SS by (n-1) Step 4: Determine the standard deviation Take the square root of the variance

Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score typically is. This suggests that there is a relationship between study time and test performance. We call this relationship a correlation.

Relationships between variables Properties of a correlation Form (linear or non-linear) Direction (positive or negative) Strength (none, weak, strong, perfect) To examine this relationship you should: Make a scatterplot Compute the Correlation Coefficient

Scatterplot Plots one variable against the other Useful for “seeing” the relationship Form, Direction, and Strength Each point corresponds to a different individual Imagine a line through the data points

Scatterplot Y X 1 2 3 4 5 6 Hours study X Exam perf. Y 6 1 2 5 3 4

Correlation Coefficient A numerical description of the relationship between two variables For relationship between two continuous variables we use Pearson’s r It basically tells us how much our two variables vary together As X goes up, what does Y typically do X, Y X, Y X, Y

Form Linear Non-linear

Direction Positive Negative Y X Y X As X goes up, Y goes up X & Y vary in the same direction positive Pearson’s r As X goes up, Y goes down X & Y vary in opposite directions negative Pearson’s r

Strength Zero means “no relationship”. The farther the r is from zero, the stronger the relationship The strength of the relationship Spread around the line (note the axis scales)

Strength r = 0.0 “no relationship” r = 1.0 “perfect positive corr.” r = -1.0 “perfect negative corr.” -1.0 0.0 +1.0 The farther from zero, the stronger the relationship

Strength Rel A Rel B r = -0.8 r = 0.5 -.8 .5 -1.0 0.0 +1.0 Which relationship is stronger? Rel A, -0.8 is stronger than +0.5

Regression Compute the equation for the line that best fits the data points Y X 1 2 3 4 5 6 Y = (X)(slope) + (intercept) 0.5 2.0 Change in Y Change in X = slope

Regression Can make specific predictions about Y based on X X = 5 Y = (X)(.5) + (2.0) Y X 1 2 3 4 5 6 Y = (5)(.5) + (2.0) Y = 2.5 + 2 = 4.5 4.5

Regression Also need a measure of error Y = X(.5) + (2.0) + error Same line, but different relationships (strength difference) Y X 1 2 3 4 5 6 Y X 1 2 3 4 5 6

Cautions with correlation & regression Don’t make causal claims Don’t extrapolate Extreme scores (outliers) can strongly influence the calculated relationship