Download presentation
Presentation is loading. Please wait.
Published byRoger Mills Modified over 9 years ago
1
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 2, 2009
2
Previously on “let’s learn statistics in five weeks” the logic of research – samples, populations, and variables descriptive and inferential statistics – statistics and parameters understanding experiments – experimental and correlational studies – independent and dependent variables characteristics of scores – nominal, ordinal, interval, and ratio scales – continuous and discrete
3
Which Scale? Does the variable have an intrinsic value? Does the variable have equal values between scores? Does the variable have a real zero point? Nominal YES Ordinal NO YESNO YESNO IntervalRatio
4
Continuous A continuous scale allows for fractional amounts – it ‘continues’ between the whole-number amount – decimals make sense Examples: – Height – Weight – IQ
5
Discrete In a discrete scale, only whole-number amounts can be measured – decimals do not make sense – usually, nominal and ordinal scales are discrete – some interval and ratio variables are also discrete number of children in a family Special type of discrete variable: dichotomous – only two amounts or categories – pass/fail; living/dead; male/female
6
Today…. Why graphical representations of data? Stem and leaf plots. Box plots. Frequency – what is it – how a frequency distribution is created Graphing frequency distributions – bar graphs, histograms, polygons Types of distribution – normal, skewed, bimodal Relative frequency and the normal curve – percentiles, area under the normal curve
7
“… look at the data” (Robert Bolles, 1998) Raw data is often messy, overwhelming, and un-interpretable. Many data sets can have thousands of measurements and hundreds of variables. Graphical representations of data can make data interpretable Looking at the data can inspire ideas.
8
What in the world could these data mean? Imagine over 30,000 observations TimeLatLong 930485:23:06.860000132.20497-111.028 930497:04:34.7732.20482-111.028 930497:04:59.759999832.20487-111.028 930497:05:46.760000232.20485-111.029 930497:06:05.760000232.20578-111.029 930497:06:16.760000232.20678-111.029 930497:06:28.759999832.20698-111.028 930497:09:31.7732.20687-110.999 930497:09:58.7732.2055-110.993 930497:10:07.7732.20555-110.992 930497:10:37.7732.20687-110.986 930497:11:38.7732.20672-110.979
9
After plotting those data By plotting the data and superimposing it on map data, suddenly the previous slide’s data can tell a story Of course not all data can tell such a story People have developed various ways to visualize their data graphically
10
Stem and Leaf Plots 5 | 4 6 7 9 9 5 6 | 3 4 6 8 8 5 7 | 2 2 5 6 4 8 | 1 4 8 3 9 | 0 10 | 6 1 N = 18 data - 54, 56, 57, 59, 59, 63, 64, 66, 68, 72 … preserves the data in tact. is a way to see the distribution numbers on the left of the line are called the stems and represent the leading edge of each of the numbers numbers on the right of the line are called the leaves and represent the individual numbers indicate their value by completing the stem.
11
Box Plots Each of the lines in a box plot represents either quartiles or the range of the data. In this particular plot the dots represent outliers.
12
Frequency distributions - why? Standard method for graphing data – easy way of visualizing group data Introduction to the Normal Distribution – underlies all of the statistical tests we will be studying this semester – understanding the concepts behind statistical testing will make life a lot easier later on
13
Frequency
14
Frequency - some definitions Raw scores are the scores we initially measure in a study The number of times a score occurs in a set is the score’s frequency A distribution is the general name for any organized set of data A frequency distribution organizes the scores based on each score’s frequency N is the total number of scores in the data
15
Understanding Frequency Distributions A frequency distribution table shows the number of times each score occurs in a set of data The symbol for a score’s frequency is simply f N = ∑f
16
Raw Scores The following is a data set of raw scores. We will use these raw scores to construct a frequency distribution table. 14 13151115 131012131413 14151714 15
17
Frequency Distribution Table
18
Frequency Distribution Table - Example Make a frequency distribution table for the following scores: 5, 7, 4, 5, 6, 5, 4
19
Frequency Distribution Table - Example Make a frequency distribution table for the following scores: 5, 7, 4, 5, 6, 5, 4 Value Frequency 71
20
Frequency Distribution Table - Example Make a frequency distribution table for the following scores: 5, 7, 4, 5, 6, 5, 4 Value Frequency 71 61
21
Frequency Distribution Table - Example Make a frequency distribution table for the following scores: 5, 7, 4, 5, 6, 5, 4 Value Frequency 71 61 53
22
Frequency Distribution Table - Example Make a frequency distribution table for the following scores: 5, 7, 4, 5, 6, 5, 4 Value Frequency 71 61 53 42
23
Frequency Distribution Table - Example Make a frequency distribution table for the following scores: 5, 7, 4, 5, 6, 5, 4 X f 71 61 53 42
24
Learning more about our data What are the values for N and ∑X for the scores below? 14 13151115 131012131413 14151714 15
25
Results via Frequency Distribution Table What is N? N = ∑f
26
Results via Frequency Distribution Table What is ∑X?
27
Results via Frequency Distribution Table What is ∑X? (17 * 1) = 17 (16 * 0) = 0 (15 * 4) = 60 (14 * 6) = 84 (13 * 4) = 52 (12 * 1) = 12 (11 * 1) = 11 (10 * 1) = 10 __________ Total = 246
28
Graphing Frequency Distributions
29
A frequency distribution graph shows the scores on the X axis and their frequency on the Y axis
30
Graphing Frequency Distributions A frequency distribution graph shows the scores on the X axis and their frequency on the Y axis Why? – Because it’s not easy to make sense of this:
31
Graphing Frequency Distributions A frequency distribution graph shows the scores on the X axis and their frequency on the Y axis Why? – Because it’s not easy to make sense of this: On a scale of 0-10, how excited are you about this class: __________ 0=absolutely dreading it10=extremely excited/highlight of my semester Data (raw scores) 5 7 2 3 5 5 5 8 7 7 4 5 10 7 5 4 5 5 7 3 6 2 6 3 5 5 7 2 4 6 3 7 5 5 7 3 5 6 5 5 8 6 7 5 3 5 7 2 3 5 4 5 4 8 3 6 5 5 5 1 2 4 7 5 5 4 3 3 7 5 8 6 3 5 10 0 6 6 3 8 5 4 3 2 4 6 3 7 5 5 7 5 7 5 10 7 5 4 5 5 7 6 3 8 1 5 5 6 4 9 8 5 8 5 7 5 10 7 5 4 5 5 7 4 8 4 5 8 5 5 7 5 5 5 2 4 6 3 7 5 2 4 6 3 7 5 8 6 3 5 10 0 6 7 2 8 8 5 5 8 6 3 6 2 6 3 5 5 7 2 5 10 7 5 4 5 5 7 5 7 5 10 7 5 4 5 5 5 7 2 3 3 7 5 8 6 3 5 10 0 6
32
Graphing Frequency Distributions Xf 104 97 835 740 633 543 411 311 23 16 04 0 1 2 3 4 5 6 7 8 9 10
33
Graphing Frequency Distributions A frequency distribution graph shows the scores on the X axis and their frequency on the Y axis The type of measurement scale (nominal, ordinal, interval, or ratio) determines whether we use: – a bar graph – a histogram – a frequency polygon
34
Graphs - bar graph A frequency bar graph is used for nominal and ordinal data
35
Graphs - bar graph A frequency bar graph is used for nominal and ordinal data Values on the x-axis
36
Graphs - bar graph A frequency bar graph is used for nominal and ordinal data Frequencies on the y-axis
37
Graphs - bar graph A frequency bar graph is used for nominal and ordinal data In a bar graph, bars do not touch
38
Graphs - histogram A histogram is used for a small range of different interval or ratio scores
39
Graphs - histogram A histogram is used for a small range of different interval or ratio scores Values on the x-axis
40
Graphs - histogram A histogram is used for a small range of different interval or ratio scores Frequencies on the y-axis
41
Graphs - histogram A histogram is used for a small range of different interval or ratio scores In a histogram, adjacent bars touch
42
Graphs - frequency polygon A frequency polygon is used for a large range of different scores
43
Graphs - frequency polygon A frequency polygon is used for a large range of different scores In a freq. polygon, there are many scores on the x-axis
44
Constructing a Frequency Distribution Step 1: make a frequency table Step 2: put values along x-axis (bottom of page) Step 3: put a scale of frequencies along y-axis (left edge of page) Step 4 (bar graphs and histograms) – make a bar for each value Step 4 (frequency polygons) – mark a point above each value with a height for the frequency of that value – connect the points with lines
45
Graphing - example A researcher observes driving behavior on a road, noting the gender of drivers, type of vehicle driven, and the speed at which they are traveling. Which type of graph should be used for each variable? Gender? nominal: bar graph Vehicle Type? nominal: bar graph Speed? ratio: frequency polygon
46
Use and Misuse of Graphs -2
47
Use and Misuse of Graphs Which graph is correct? Neither does a very good job at summarizing the data Beware of graphing tricks
48
Types of Distributions
49
Distributions Frequency tables, bar-graphs, histograms and frequency polygons describe frequency distributions
50
Distributions - Why? Describing the shape of this frequency distribution is important for both descriptive and inferential statistics The benefit of descriptive statistics is being able to understand a set of data without examining every score
51
Distributions : The Normal Curve It turns out that many, many variables have a distribution that looks the same. This has been called the ‘normal distribution’. A bell-shaped curve Symmetrical Extreme scores have a low frequency – extreme scores: scores that are relatively far above or far below the middle score
52
The Ideal Normal Curve
53
Symmetrical
54
The Ideal Normal Curve Most scores in middle range
55
The Ideal Normal Curve Few extreme scores
56
The Ideal Normal Curve In theory, tails never reach the x-axis
57
Normal Curve - height
58
Normal Curve - hours slept 0 1 2 3 4 5 6 7 8 9 10 11 12
59
Normal Curve - GPA
60
Normal Distributions While the scores in the population may approximate a normal distribution, it is not necessarily so for a sample of scores
61
Skewed Distributions A skewed distribution is not symmetrical. It has only one pronounced tail A distribution may be either negatively skewed or positively skewed Negative or positive depends on whether the tail slopes towards or away from zero – the side with the longer tail describes the distribution Tail on negative side : negatively skewed Tail on positive side : positively skewed
62
Negatively Skewed Distributions
63
Tail on negative side: Negatively skewed
64
Negatively Skewed Distributions Contains extreme low scores
65
Negatively Skewed Distributions Does not contain extreme high scores
66
Negatively Skewed Distributions Can occur due to a “ceiling effect”
67
Positively Skewed Distributions
68
Tail on positive side: Positively skewed
69
Positively Skewed Distributions Contains extreme high scores
70
Positively Skewed Distributions Does not contain extreme low scores
71
Positively Skewed Distributions Can occur due to a “floor effect”
72
Positively Skewed Distributions
73
Bimodal Distributions a symmetrical distribution containing two distinct humps
74
Bimodal - birth month
75
Distributions - data How many alcoholic drinks do you have per week?
76
Distributions - data How many alcoholic drinks do you have per week?
77
Distributions - data How many alcoholic drinks do you have per week? Positively skewed
78
Distributions - data How much did you spend on textbooks for this semester?
79
Distributions - data How much did you spend on textbooks for this semester?
80
Distributions - data How much did you spend on textbooks for this semester? Normal – one outlier
81
Kurtosis meso- Forming chiefly scientific terms with the sense ‘middle, intermediate’ lepto- Small, fine, thin, delicate platy- Forming nouns and adjectives, particularly in biology and anatomy, with the sense ‘broad, flat ’
82
Relative Frequency and the Normal Curve
83
Relative Frequency Another way to organize scores is by relative frequency Relative frequency is the proportion of time that a particular score occurs – remember: a proportion is a number between 0 and 1 Simple frequency: the number of times a score occurs Relative frequency: the proportion of times a score occurs
84
Relative Frequency - Why? We are still asking how often certain scores occurred Sometimes, relative frequency is easier to interpret than simple frequency Example: 82 people in the class reported drinking no alcohol weekly – Simple frequency 0.42 of the class (42%) reported drinking no alcohol – Relative frequency
85
Relative Frequency The formula for a score’s relative frequency is: relative frequency =
86
Relative Frequency Distribution
87
Example Using the following data set, find the relative frequency of the score 12 14 13151115 131012131413 14151714 15
88
Example The frequency table for this set of data is: 14 13151115 131012131413 14151714 15
89
Example The frequency for the score of 12 is 1, and N = 18 Therefore, the relative frequency of 12 is:
90
Example The frequency for the score of 12 is 1, and N = 18 Therefore, the relative frequency of 12 is:
91
Relative Frequencies We can also add relative frequencies together. – For example, what proportion of people scored a passing mark in this exam (>3): Value FrequencyRelative Frequency 65 5/18 = 0.28 56 6/18 = 0.33 43 3/18 = 0.17 32 2/18 = 0.11 21 1/18 = 0.06 11 1/18 = 0.06 N=18 Total=1.00
92
Relative Frequencies We can also add relative frequencies together. – For example, what proportion of people scored a passing mark in this exam (>3): 0.28+0.33+0.17=0.78 Value FrequencyRelative Frequency 65 5/18 = 0.28 56 6/18 = 0.33 43 3/18 = 0.17 32 2/18 = 0.11 21 1/18 = 0.06 11 1/18 = 0.06 N=18 Total=1.00
93
Relative Frequency and the Normal Curve When the data are normally distributed (as most data are), we can use the normal curve directly to determine relative frequency. There is a known proportion of scores above or below any point For example, exactly 0.50 of the scores lie above the mean
94
Relative Frequency and the Normal Curve The proportion of the total area under the normal curve at certain scores corresponds to the relative frequency of those scores.
95
Relative Frequency and the Normal Curve Normal distribution showing the area under the curve to the left of selected scores
96
Percentiles A percentile is the percent of all scores in the data that are at or below a score – Example: 98th percentile - 98% of the scores lie below this.
97
Homework Complete exercises 1, 6, and 9 for chapter 3. Read chapter 4 and 5 for next week.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.