Download presentation
Presentation is loading. Please wait.
Published byOsborn May Modified over 9 years ago
2
2 Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo, M.A., Mitchell, M., & Shavelson, R.J. (1996). Student guide for Shavelson statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Textbook Credits
3
3 Overview Example data on Teacher Expectancy Research Frequency Distributions Measures of Central Tendency & Variability The Normal Distribution
4
4 Teacher Expectancy Data
5
5 Research on Teacher Expectancy Study Design A 2 x 6 x 2 (treatment x grade level x test occasions) randomized experiment Schematic of design: Grade 1 2 3 4 5 6 Treatment Experimental X (“Bloomers”) Control Randomly Assigned Occasion 1 IQ Pretest Occasion 2 IQ Posttest
6
6 Teacher Expectancy Data Matrix Another convenient way to easily depict the data
7
7 Teacher Expectancy Frequency Distribution Posttest scores from the treatment group Sum = 30
8
8 Frequency Distribution Using Class Intervals Treatment group posttest scores divided into 11 class intervals Each class interval size is 3 (score values 123, 124, 125, …) Clear patterns emerge. Look @ interval 114-116 with highest f
9
9 Frequency Distribution Using Class Intervals Use 11 intervals Number of Class Intervals: Highest – Lowest score: 125-95 = 30 Class intervals size (i) = H–L / # of class intervals: 30/11 = 2.7(round to 3) Rule: Lowest interval score must be divisible by interval class size. Lowest score 95 is not divisible by 3 so subtract 1. 94 is still not divisible by 3 so subtract 1. 93 is divisible by 3 so lowest class interval score begins with 93.
10
10 Teacher Expectancy: Histogram Histogram showing the class interval posttest scores on the abscissa and frequency on the ordinate Lower and upper limit scores with zero values are shown.
11
11 Teacher Expectancy: Polygon Polygon showing the class interval posttest scores on the abscissa and frequency on the ordinate Lower and upper midpoint values with zero f are shown.
12
12 Teacher Expectancy: Polygon Polygon showing the class interval posttest scores on the abscissa and frequency on the ordinate Lower and upper midpoint values with zero f are shown.
13
13 Teacher Expectancy: Polygon
14
14 Teacher Expectancy: Stem-and-leaf plot Stem-and-leaf plot containing the data matrix posttest scores in increments of 5’s
15
15 Common Frequency Distribution Shapes Normal Distribution(bell shape) Unimodal distribution : 1 peak Symmetric about the mean Kurtosis (peakedness): leptokurtic Symmetric about the mean Bimodal Distribution: 2 peaks Multimodal Distribution: > 2 peaks Positively SkewedNegatively Skewed Rectangular Distribution(no peaks) Kurtosis (peakedness): platykurtic
16
16 Score frequencies are shown as a proportion of the total number of frequencies in the sample: RF = f / total # of subjects The Relative Frequency (Probability) Distribution Sum = 10Sum = 20 Total Sum = 30
17
17 Relative frequency polygons are constructed as the frequency polygons except the relative frequency is listed in the ordinate The Relative Frequency Polygon
18
18 The cumulative frequency distribution shows the number of scores falling below a certain point on the scale of scores The cf of a score is defined as the number of cases falling below the upper real limit of the class interval The Cumulative Frequency Distribution
19
19 The cumulative frequency polygon uses upper real limits and cumulative frequency The Cumulative Frequency Polygon
20
20 80% of the subjects in the experimental group of the expectancy study received a posttest score below 119.5 CP = CF / total number of subjects C% = CP x 100 Cumulative Proportions and Percentiles
21
21 Example: A raw score of 113.5 has a percentile rank of 57 Percentile Scores
22
22 Measures of Central Tendency The central tendency of the set of measurements - that is, the tendency of the data to cluster, or center, about certain numerical values. Central Tendency (Location)
23
23 Measures of Central Tendency The variability of the set of measurements–that is, the spread of the data. Variation (Dispersion)
24
24 Standard Notation MeasureSamplePopulation Mean Size nN
25
25 Mean Most common measure of central tendency Acts as ‘balance point’ Affected by extreme values (‘outliers’) Denoted as where
26
26 Mean Example
27
27 Median
28
28 Median Example: Odd Size Sample Raw Data:24.122.621.523.722.6 Ordered:21.522.622.623.724.1 Position:12345
29
29 Median Example: Even Size Sample
30
30 Mode Measure of central tendency Value that occurs most often Not affected by extreme values May be no mode or several modes May be used for quantitative or qualitative data
31
31 Mode Example No Mode Raw Data:10.34.98.911.76.37.7 One Mode Raw Data:6.34.98.9 6.3 4.94.9 More Than 1 Mode Raw Data:212828414343
32
32 Measures of Variability Measure of dispersion Difference between largest & smallest observations Range = x largest – x smallest Ignores how data are distributed
33
33 Standard Notation
34
34 Sample & Population Variance n – 1 in the denominator! The variance is the average of the squares of the distance each value is from the mean. N in the denominator!
35
35 Standard Deviation The standard deviation is the square root of the variance.
36
36 Sample Standard Deviation Formula = 2.523 S =
37
37 Overview of the Normal Distribution Serves as a reasonable good model of many natural phenomena Provides a good model for the frequency distribution of scores Of particular importance is in inferential statistics as a probability distribution There exists a close connection between the sample size and the distribution of means calculated for many samples of subjects drawn from the same population As the sample size increases, the distribution of scores becomes normal May provide a good approximation to probabilities of other distributions that are more difficult to work with
38
38 Properties of the Normal Distribution It is unimodal, observing the value of X and the mean It is symmetric about the mean; ½ the scores fall below the mean and ½ the scores fall above the mean The mean, mode, and median are all equal It is asymptotic (never touches the abscissa) It is continuous for all values of X from - ∞ to +∞
39
39 Properties of the Normal Distribution Unimodal Symmetric Mode=median=mean Asymptotic
40
40 Empirical Rule of the Normal Distribution Areas
41
41 Interpretation of z-Scores Example Approximately 68% of the measurements will have a z-score between –1 and 1. Approximately 95% of the measurements will have a z-score between –2 and 2. Approximately 99.7% of the measurements will have a z-score between –3 and 3.
42
42 Empirical Rule Example
43
43 Computing the z-Score
44
44 z-Score Example 1
45
45 Z-score Example 1 Find the area between the mean and a given raw score z score: Mean 0, s = 1 –Distance between a score (X) and the mean of a distribution in standard deviation (s) units –Used to display and interpret areas of the normal distribution Assume score is 9, mean = 8, s = 2 z = (x - mean) / s z = (9-8)/2 = 0.5 Next, find the area between the mean and z = 0.5 From Table B Column 2 in Appendix II we find: 0.1915 0.1915 or 19.15% of the cases
46
46 Z-score Example 2 Find the area below a given raw score z score: Mean 0, s = 1 –Distance between a score (X) and the mean of a distribution in standard deviation (s) units –Used to display and interpret areas of the normal distribution Assume score is 9, mean = 8, s = 2 z = (x - mean) / s z = (9-8)/2 = 0.5 Mark off area in the ND Find area below z = 0.5 From Table B column 3 we find: 0.6915 or 69.15% of the cases Below!
47
47 Z-score Example 3 Find the area above a given raw score z score: Mean 0, s = 1 –Distance between a score (X) and the mean of a distribution in standard deviation (s) units –Used to display and interpret areas of the normal distribution Assume score is 9, mean = 8, s = 2 z = (x - mean) / s z = (9-8)/2 = 0.5 Mark off area in the ND Find area above z = 0.5 From Table B column 3 we find: 0.3085 or 30.85% of the cases Above!
48
48 Z-score Example 3 Find the area between two given raw scores z score: Mean 0, s = 1 –Distance between a score (X) and the mean of a distribution in standard deviation (s) units –Used to display and interpret areas of the normal distribution Assume 1st score is 9, 2nd score is 5.8, mean = 8, s = 2 z = (x - mean) / s z = (9-8)/2 = 0.5 z = (x - mean) / s z = (5.8-8)/2 = -1.1 Find area between mean and z = 0.5 Find area between mean and z = -1.1 Total = 0.1915 + 0.3643 = 0.5558 0.5558 or 55.58% of the cases Between ! z X=9 =.5z X=5.8 = -1.1
49
49 Practice Exercises 1.Select a hypothetical product or a process and create some test data of your choice (plausible, no more than 10) as shown in textbook/class 2.Show your type of experimental approach 3.Create a detailed table of frequency distributions 4.Display your data with different types of graphs 5.Calculate the measures of central tendency and variability 6.Calculate the Z-score(s) and indicate the relative position in the normal distribution. 7.Provide any other pertinent information as a result
50
50 Questions ?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.