Psych 230 Psychological Measurement and Statistics Pedro Wolf September 9, 2009.

Presentation transcript:

So Far Stem and leaf plots Bar plots Summarizing scores using Frequency – how a frequency distribution is created Graphing frequency distributions – bar graphs, histograms, polygons Types of distribution – normal, skewed, bimodal Relative frequency and the normal curve

the Normal Curve How likely is it that a certain score will occur?

Today…. Summarizing scores using central tendency – what is central tendency? The Mode – what it is, how to calculate it, & when to use it The Median – what it is, how to calculate it, & when to use it The Mean – what it is, how to calculate it, & when to use it – applying the mean to research

Range – what it is & how to calculate it Variance – what it is & how to calculate it Standard Deviation – what it is & how to calculate it Variability and the Normal Distribution Population Variance and Standard Deviation

Central Tendency

Why do we need a measure of Central Tendency? Often we would like to know the most typical or representative score of a dataset – How many drinks do students consume a week? – What are the political beliefs of students? – What is people’s favorite color? – How much do lawyers get paid? – What is the temperature in London? There are different ways to calculate a typical score. – Each way has advantages and disadvantages. Depends on: Type of data Distribution of data

What is a Measure of Central Tendency? Measures of central tendency answer the question: – “Are the scores generally high scores or generally low scores?” Allow us to compare values: – Average high / low temp in May in Tucson: 90ºF / 53ºF – Average high / low temp in May in St. Petersburg: 59ºF / 42ºF A statistic that indicates where the center of the distribution tends to be located

Measures of Central Tendency There are three commonly used measures of central tendency Mode Median Mean There is no single, perfect, measure of central tendency

Measures of Central Tendency

Example The following are the salaries of the 15 employees of a small consulting company: $82,000, $64,000, $36,400, $34,000, $29,200, $29,200, $29,200, $28,000, $26,800, $26,800, $26,800, $24,400, $24,400, $24,400, $24,400. What is the typical salary of an employee in this company? How can different measures of central tendency be used to make different arguments?

The Mode

What is the Mode? The mode is the score that has the highest frequency in the data The mode is always used to describe central tendency when the scores reflect a nominal scale of measurement Can also be used for other scales of measurement Scores: 2,3,4,4,5,5,5 Mode=5

How to find the Mode Can find the mode by inspection (as opposed to computation) Simply the score with the highest frequency
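Finding the mode by inspection amounts to counting how often each score occurs and picking the score (or scores) with the highest count. The following is a minimal Python sketch of that idea; the function name `modes` and the color data are illustrative assumptions, not part of the lecture. It returns a list because a dataset can have more than one mode.

```python
from collections import Counter

def modes(scores):
    """Return every score tied for the highest frequency, in sorted order."""
    counts = Counter(scores)          # score -> frequency
    top = max(counts.values())        # highest frequency in the data
    return sorted(score for score, freq in counts.items() if freq == top)

# Scores from the "What is the Mode?" slide
print(modes([2, 3, 4, 4, 5, 5, 5]))                     # [5]

# Hypothetical nominal data with a tie (bimodal)
print(modes(["red", "blue", "blue", "red", "green"]))   # ['blue', 'red']
```

Because the function only counts occurrences, it works for nominal data such as color names as well as for numbers.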

Example - Mode from Raw Scores What is the mode of the following data:

Example - Mode from Raw Scores What is the mode of the following data: Mode = 14

Example - Mode from Frequency Table What is the mode of the following data: Value Frequency

Example - Mode from Frequency Table What is the mode of the following data: Value Frequency Mode =

Example - Mode from Graphs Quiz #1 Scores

Example - Mode from Graphs Mode = 5 Quiz #1 Scores

Unimodal Distributions When a graph has one hump (such as on the normal curve) the distribution is called unimodal

Bimodal Distributions When a graph shows two scores that are tied for the most frequently occurring score, it is called bimodal.

Example The following are the salaries of the 15 employees of a small consulting company: $82,000, $64,000, $36,400, $34,000, $29,200, $29,200, $29,200, $28,000, $26,800, $26,800, $26,800, $24,400, $24,400, $24,400, $24,400. What is the modal salary? $24,400. Is this a good description of the typical salary?

Mode Advantages: – can be used with nominal data – easily identified – unaffected by extreme scores – bimodal datasets may suggest interesting subgroups Disadvantages – not necessarily a unique score – not very precise – cannot be manipulated mathematically

The Median

What is the Median? The median is the middle score of the data; the score that divides the data in half The median is the score at the 50th percentile – you did this in your homework when calculating the quartiles The median is used to summarize ordinal or highly skewed interval or ratio scores

How to Find the Median When data are normally distributed, the median is the same score as the mode. When data are not normally distributed, use the following procedure: – arrange the scores from lowest to highest. – if there is an odd number of scores, the median is the score in the middle position. – if there is an even number of scores, the median is the average of the two scores in the middle. – Position of the median = (N+1)/2 (this gives the position of the median in the ordered list, not the median score itself)
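The procedure above can be sketched in a few lines of Python. This is only an illustration under simple assumptions: the function name `median` and the second example dataset are not from the lecture.

```python
def median(scores):
    """Middle score of the ordered data; average of the two middle scores if N is even."""
    ordered = sorted(scores)              # arrange from lowest to highest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:                        # odd N: single middle score
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2   # even N: average the middle two

print(median([2, 3, 4, 4, 5, 5, 5]))   # 4   (N = 7, position (7+1)/2 = 4)
print(median([4, 6, 8, 10]))           # 7.0 (N = 4, position 2.5: average of 6 and 8)
```

Note that (N+1)/2 counts positions starting at 1, while Python lists are indexed from 0, which is why the code uses `n // 2`.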

Example - Median from Raw Scores What is the median of the following data:

Example - Median from Raw Scores What is the median of the following data: First, arrange in order of magnitude

Example - Median from Raw Scores Number of scores: N = 24. N is even, so the median is the average of the middle two scores. Position of the median: (N+1)/2 = (24+1)/2 = 25/2 = 12.5, so take the average of the 12th and 13th scores: ( ) / 2 = 13.5

Example - Median from Frequency Table What is the median of the following data: X fX f

Example - Median from Frequency Table What is the median of the following data: X f N = 17

Example - Median from Frequency Table What is the median of the following data: X f N = 17 Position of the median = (N+1)/2 = (17+1)/2 = 9, so the median is the 9th score

Example - Median from Frequency Table What is the median of the following data: X f N = 17 Position of the median = (N+1)/2 = (17+1)/2 = 9, so the median is the 9th score Median = 51

Median Advantages: – useful for skewed distributions – unaffected by extreme scores – useful for dividing sets of scores into two halves (for example, high and low scorers in an exam) Disadvantages – does not take into account extreme scores – cannot be manipulated mathematically

The Mean

What is the Mean? The mean is the score located at the exact mathematical center of a distribution – the “average” The mean is used to summarize interval or ratio data in situations when the distribution is symmetrical and unimodal By far the most commonly used measure of central tendency

How to Find the Mean The symbol for the sample mean is $\bar{X}$. The formula for the sample mean is: $\bar{X} = \frac{\sum X}{N}$

Example Calculate the mean of the following data: 12, 15, 17, 12, 13, 9, 1, 6, 3, 12, 12, 16, 17 Mode = 12 Median = 12 N = 13 $\sum X = 145$ $\bar{X} = \sum X / N = 145 / 13 = 11.15$

Example The following are the salaries of the 15 employees of a small consulting company: $82,000, $64,000, $36,400, $34,000, $29,200, $29,200, $29,200, $28,000, $26,800, $26,800, $26,800, $24,400, $24,400, $24,400, $24,400. What is the mean salary? $510,000 / 15 = $34,000
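The salary example shows how the three measures can support different arguments about the same data. As a rough check, the sketch below uses Python's standard `statistics` module on the 15 salaries from the slide; the variable names are illustrative.

```python
import statistics

# The 15 salaries from the consulting-company example
salaries = [82000, 64000, 36400, 34000, 29200, 29200, 29200, 28000,
            26800, 26800, 26800, 24400, 24400, 24400, 24400]

mode_salary = statistics.mode(salaries)      # most frequent salary: 24400
median_salary = statistics.median(salaries)  # 8th of 15 ordered salaries: 28000
mean_salary = statistics.mean(salaries)      # 510000 / 15 = 34000

print(mode_salary, median_salary, mean_salary)
```

The two highest salaries pull the mean ($34,000) well above the median ($28,000) and the mode ($24,400), which is why the choice of measure matters for a skewed dataset like this one.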

Sample Mean vs. Population Mean $\bar{X}$ is the sample mean. This is a sample statistic. The mean of a population is a parameter. It is symbolized by $\mu$ (pronounced “mew”). $\bar{X}$ is used to estimate the corresponding population mean $\mu$.

Your Turn - Mean For the mean, we need $\sum X$ and N. We know that N = 18. What is $\sum X$?

Your Turn - Mean For the mean, we need $\sum X$ and N. We know that N = 18. What is $\sum X$? $\sum X = 246$

Your Turn - Mean

Weighted mean The mean of a group of means. Sometimes you want to compare groups with different numbers of scores. Suppose you have 4 class averages: 75, 78, 72, 80. How do you find the mean? (75 + 78 + 72 + 80)/4 = 76.25. This only works if every class has the same number of people.

Formula for Weighted Mean $\bar{X}_w = \frac{\sum fX}{N_{tot}}$

Weighted mean If the Ns of the groups of scores differ…
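When the groups differ in size, each group mean has to be weighted by the number of scores behind it. The sketch below assumes that, in the weighted-mean formula, f is the number of scores in a group and X is that group's mean. The four class averages come from the slide; the class sizes are hypothetical values chosen only for illustration.

```python
class_means = [75, 78, 72, 80]   # class averages from the slide
class_sizes = [20, 30, 25, 25]   # hypothetical class sizes (not from the lecture)

def weighted_mean(means, sizes):
    """Sum of (group size * group mean), divided by the total number of scores."""
    total_n = sum(sizes)
    return sum(f * m for f, m in zip(sizes, means)) / total_n

simple = sum(class_means) / len(class_means)        # 76.25, ignores class size
weighted = weighted_mean(class_means, class_sizes)  # 7640 / 100 = 76.4

print(simple, weighted)
```

With equal class sizes the two results coincide; here the larger class with the average of 78 pulls the weighted mean slightly above the simple mean.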

Which measure to use?

Central Tendency - Normal Distributions On a perfect normal distribution, all three measures of central tendency are located at the same score: mean=median=mode

Central Tendency - Normal Distributions As the mean uses all of the information in the data, it is the preferred one to use in this case

Central Tendency - Skewed Distributions Use the median to summarize a highly skewed distribution

Which measure to use? Generally, the mean is the best measure of central tendency – All scores count in computing the mean Unless: Nominal data: use the mode Highly skewed data: use the median

Samples and Populations The sample mean provides a better estimate of the central tendency of a population than the sample median. The more observations in our sample, the closer the sample mean will be to the population mean. Often, the sample mean will be more accurate than any one individual

Applying the Mean to Research

Why the Mean? So, the mean is usually the best “model” of our data – The best summary of the distribution – And, we can do statistics on it – This makes it ideal for comparing groups

Using the mean Prediction – without knowing anything else, the mean is our best estimate Describing a score’s location – a deviation score indicates a raw score’s location and frequency Describing the population mean – ultimately we want to describe the population Summarizing Experiments

We compute the mean every time we have a sample of normally distributed scores. The first step in an experiment is usually to compute the mean of the dependent variable

Example An experiment was conducted to test the efficacy of a new diet drug. The drug was administered to rats in various dosages and the rat’s food consumption measured. The data are as follows: Rat 0mg 2mg 4mg 6mg A B C D E 18 10 7 4

Example An experiment was conducted to test the efficacy of a new diet drug. The drug was administered to rats in various dosages and the rat’s food consumption measured. The data are as follows: Rat 0mg 2mg 4mg 6mg A B C D E $\bar{X}$ (0mg) = 12.4, $\bar{X}$ (2mg) = 10, $\bar{X}$ (4mg) = 10, $\bar{X}$ (6mg) = 5.6

Graphing Experimental Results Plot the independent variable on the X axis and the dependent variable on the Y axis Create a bar graph when the independent variable is a nominal or ordinal variable Create a line graph when the independent variable is an interval or a ratio variable

Bar Graphs The bar above each condition on the X axis is placed to the height on the Y axis that corresponds to the mean score for that condition

Line Graphs A line graph uses straight lines to connect adjacent data points

Today…. Understanding variability – what can it tell us Range – what it is & how to calculate it Variance – what it is & how to calculate it Standard Deviation – what it is & how to calculate it Variability and the Normal Distribution Population Variance and Standard Deviation

Understanding Variability

The mean gives us a good measure of the central tendency of our data - the average value Mean age in the class = Does this tell us all we need to know about the distribution of ages in the class? We also need a measure of how spread out the scores are

Understanding Variability Measures of variability describe the extent to which scores in a distribution differ from each other The mean is our best estimate of central tendency. What is our best estimate of variability?

Variability - Example Same means but different variability

Variability - Example Same means but different variability

Variability - Example Three variations of the normal distribution

The Range

What is the Range? The range indicates the distance between the two most extreme scores in a distribution Range = highest score – lowest score

The Range - Example What is the range of the following dataset? 12, 16, 18, 23, 11, 10, 9, 4, 23, 15, 14, 13 Range = highest score – lowest score Range = 23 – 4 Range = 19

The Range - Your turn What are the ranges of the following datasets? – 2, 6, 12, 10, 0 – 4, 7, 5, 8, 6 – 6, 6, 6, 6, 6 1) 12 – 0 = 12 2) 8 – 4 = 4 3) 6 – 6 = 0
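The range calculation is short enough to check directly. A minimal Python sketch follows; the function name `score_range` is an illustrative choice.

```python
def score_range(scores):
    """Range = highest score - lowest score."""
    return max(scores) - min(scores)

print(score_range([12, 16, 18, 23, 11, 10, 9, 4, 23, 15, 14, 13]))  # 19
print(score_range([2, 6, 12, 10, 0]))   # 12
print(score_range([4, 7, 5, 8, 6]))     # 4
print(score_range([6, 6, 6, 6, 6]))     # 0
```

Only the two extreme scores enter the calculation, which is why the range is a crude measure of spread.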

The Range Though the range does give some idea of the spread of data, it is quite a crude measure Based on highest and lowest values, and so reflects the least typical scores

Variance and Standard Deviation

Most psychological research involves interval or ratio scores which approximate a normal distribution In these situations, we use two, similar, measures of variability, known as the variance and the standard deviation A measure of how different the scores are from each other

Variance and Standard Deviation A measure of how different the scores are from each other – calculate by measuring how much the scores differ from the mean. (Remember: the mean is our best estimate of central tendency) The variance and standard deviation indicate how much the scores are spread out around the mean Mean hours of tv watched weekly = 7.99 Are the number of hours of tv between 6 and 8 hours or between 1 and 30 hours?

Variance and Standard Deviation We want a measure of how much the scores are spread out around the mean Why not just take an average of the distance between each score and the mean? – Data: 4, 6, 8, 10, 12 – Mean = 8 Cannot do this, because the sum of the deviations always equals 0 – positive deviations cancel out negative deviations

Variance and Standard Deviation So, we want a measure which is like the average of the deviations, but which is calculated differently Conceptually, we can think of the variance and standard deviation as the typical amount that each score differs from the mean

The Variance One solution to the problem of deviations canceling out is to square the deviations. Why? All the positive deviations will stay positive and all negative deviations will become positive - no canceling out. Data: 4, 6, 8, 10, 12 Mean = 8 Sum of the deviations: $\sum (X - \bar{X}) = (4-8) + (6-8) + (8-8) + (10-8) + (12-8) = (-4) + (-2) + 0 + 2 + 4 = 0$

The Variance What happens when we square the deviations? Data: 4, 6, 8, 10, 12 Mean = 8 Sum of the squared deviations: $\sum (X - \bar{X})^2 = (4-8)^2 + (6-8)^2 + (8-8)^2 + (10-8)^2 + (12-8)^2 = (-4)^2 + (-2)^2 + 0^2 + 2^2 + 4^2 = 16 + 4 + 0 + 4 + 16 = 40$

The Variance Does 40 reflect the average deviation from the mean? To get an average, we should divide by the number of scores (N): $\frac{\sum (X - \bar{X})^2}{N} = \frac{40}{5} = 8$ This statistic is known as the variance ($S_X^2$)
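The steps for the dataset 4, 6, 8, 10, 12 can be traced in Python. This is a sketch of the definitional approach described in the slides, dividing by N; the variable names are illustrative. For comparison it also calls `statistics.pvariance`, the standard-library function that divides by N (the separate `statistics.variance` divides by N - 1 and would give a different value).

```python
import statistics

data = [4, 6, 8, 10, 12]
n = len(data)
mean = sum(data) / n                                   # 8.0

deviations = [x - mean for x in data]                  # [-4.0, -2.0, 0.0, 2.0, 4.0]
sum_of_deviations = sum(deviations)                    # 0.0: positives cancel negatives
sum_of_squares = sum(d ** 2 for d in deviations)       # 40.0
variance = sum_of_squares / n                          # 40 / 5 = 8.0

print(sum_of_deviations, sum_of_squares, variance)
print(statistics.pvariance(data))                      # same value, divides by N
```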

How to find the Variance The sample variance is the average of the squared deviations of scores around the sample mean

How to find the Variance The sample variance is the average of the squared deviations of scores around the sample mean Note: we can use a simpler formula to calculate the variance by hand

How to find the Variance - Example Estimates of professor’s age: Data: 36, 34, 45, 31, 35, 38, 36 N = 7 $\bar{X} = (36 + 34 + 45 + 31 + 35 + 38 + 36) / 7 = 255 / 7 = 36.43$

How to find the Variance - Example $\sum (X - \bar{X})^2 = (36 - 36.43)^2 + (34 - 36.43)^2 + (45 - 36.43)^2 + (31 - 36.43)^2 + (35 - 36.43)^2 + (38 - 36.43)^2 + (36 - 36.43)^2 = (-0.43)^2 + (-2.43)^2 + (8.57)^2 + (-5.43)^2 + (-1.43)^2 + (1.57)^2 + (-0.43)^2 = 113.71$

How to find the Variance - Example N = 7 $\bar{X} = 36.43$ $\sum (X - \bar{X})^2 = 113.71$ $S_X^2 = \frac{\sum (X - \bar{X})^2}{N} = \frac{113.71}{7} = 16.244$

How to find the Variance - Your Turn Find the variance of the following dataset: 3,3,5,9

How to find the Variance - Your Turn Find the variance of the following dataset: 3, 3, 5, 9 N = 4 $\bar{X} = 5$ $\sum (X - \bar{X})^2 = (3-5)^2 + (3-5)^2 + (5-5)^2 + (9-5)^2 = (-2)^2 + (-2)^2 + 0^2 + 4^2 = 4 + 4 + 0 + 16 = 24$ $S_X^2 = \frac{\sum (X - \bar{X})^2}{N} = \frac{24}{4} = 6$

The Variance - pros and cons Pros: The variance is a legitimate measure of variability Usefully communicates the relative variability of scores We will use it extensively in further statistics

The Variance - pros and cons Cons: The variance doesn’t make much sense as a measure of the “average deviation” – we have squared all of the deviations, so the result is in squared units and is unrealistically large – Professor’s Age example: Data: 36, 34, 45, 31, 35, 38, 36 Mean = 36.43 Variance ≈ 16 (16 what? 16 squared years)

What is the Standard Deviation? The variance is a squared deviation score. To convert it back to the original scale, we can take the square root of the variance. This is known as the standard deviation ($S_X$). The standard deviation indicates the “average deviation” from the mean, the consistency in the scores, and how far scores are spread out around the mean. The larger the value of $S_X$, the more the scores are spread out around the mean, and the wider the distribution

How to find the Standard Deviation The sample standard deviation is the square root of the variance Note: You can use a simpler formula to calculate the variance by hand.

Standard Deviation - Example Find the standard deviation of the following dataset: 3,3,5,9

Standard Deviation - Example Find the standard deviation of the following dataset: 3, 3, 5, 9 N = 4 $\bar{X} = 5$ $\sum (X - \bar{X})^2 = (3-5)^2 + (3-5)^2 + (5-5)^2 + (9-5)^2 = (-2)^2 + (-2)^2 + 0^2 + 4^2 = 4 + 4 + 0 + 16 = 24$ $S_X = \sqrt{24 / 4} = \sqrt{6} = 2.45$

Standard Deviation - Example Estimates of professor’s age: Data: 36, 34, 45, 31, 35, 38, 36

Standard Deviation - Example Estimates of professor’s age: Data: 36, 34, 45, 31, 35, 38, 36 N = 7 $\bar{X} = 36.43$ $\sum (X - \bar{X})^2 = 113.71$ $S_X = \sqrt{\frac{113.71}{7}} = \sqrt{16.244} = 4.03$
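Both standard deviation examples can be reproduced with a short Python sketch of the definitional formula (square root of the sum of squared deviations divided by N). The function name `sample_sd` is an illustrative choice; dividing by N follows the slides, whereas many software packages divide by N - 1 by default.

```python
import math

def sample_sd(scores):
    """Square root of the average squared deviation from the mean (divides by N)."""
    n = len(scores)
    mean = sum(scores) / n
    sum_of_squares = sum((x - mean) ** 2 for x in scores)
    return math.sqrt(sum_of_squares / n)

ages = [36, 34, 45, 31, 35, 38, 36]      # estimates of the professor's age
print(round(sample_sd([3, 3, 5, 9]), 2))  # 2.45 (square root of 6)
print(round(sample_sd(ages), 2))          # 4.03
```

Unlike the variance, the result is in the original units, so 4.03 can be read as roughly four years of typical deviation from the mean estimate.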

Normal Distribution and the Standard Deviation The standard deviation is a measure of how far scores are from the mean, on average

Normal Distribution and the Standard Deviation The standard deviation is related to the normal distribution On any normal distribution, approximately 0.34 of the scores lie between the mean and the score one standard deviation higher than the mean And, of course, 0.34 lie between the mean and the score one standard deviation lower than the mean So, 0.68 of the scores (or 68%) are within one standard deviation of the mean

Normal Distribution and the Standard Deviation Approximately 34% of the scores in a perfect normal distribution are between the mean and the score that is one standard deviation from the mean.

Normal Distribution and the Standard Deviation Why is this useful? You score 85 on an exam - is this good? If you know scores on an exam are normally distributed, that the mean score was 80 and the standard deviation was 5, what percentage of people did you score better than?

Normal Distribution and the Standard Deviation Why is this useful? You score 85 on an exam - is this good? If you know scores on an exam are normally distributed, that the mean score was 80 and the standard deviation was 5, what percentage of people did you score better than? You were above the mean: better than 50% You are one standard deviation above the mean: better than another 34% Therefore, you scored better than 84% of people
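The 50% + 34% reasoning can be compared with the exact proportion under a normal curve. The sketch below uses `statistics.NormalDist` from the Python standard library (available in Python 3.8 and later), with the mean of 80 and standard deviation of 5 taken from the exam example; the variable names are illustrative.

```python
from statistics import NormalDist

exam = NormalDist(mu=80, sigma=5)   # exam scores: mean 80, standard deviation 5

exact = exam.cdf(85)                # proportion of scores below 85
rule_of_thumb = 0.50 + 0.34         # below the mean + within one SD above it

print(round(exact, 4))              # 0.8413
print(rule_of_thumb)                # 0.84
```

The approximate figures used in the slides (34% and 84%) agree with the exact values to two decimal places.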

Normal Distribution and the Standard Deviation

Mean=66.51 Var= StdDev=3.827

Normal Distribution and the Standard Deviation Mean=66.51 Var= StdDev=3.827

Computing the Sample Variance and Standard Deviation

Computing the Variance and Standard Deviation The previous formulas we used for calculating variance and standard deviation showed that we were computing a measure of the average deviation of scores from the mean However, if you have to do it by hand you can use some quicker and easier formulas for calculating these statistics.

How to find the Variance The computing formula for the sample variance is: $S_X^2 = \frac{\sum X^2 - \frac{(\sum X)^2}{N}}{N}$

Computing the Variance - Example Estimates of professor’s age: Data: 36, 34, 45, 31, 35, 38, 36 N = 7 $\sum X^2 = 9403$ $(\sum X)^2 = 65025$ $S_X^2 = \frac{9403 - (65025 / 7)}{7} = \frac{9403 - 9289.29}{7} = \frac{113.71}{7} = 16.244$

New Terminology and Notation Review The Sum of Squared Xs, $\sum X^2$ – first square each raw score and then sum the squared Xs. The Sum of Xs, Squared, $(\sum X)^2$ – first sum the raw scores and then square that sum.

Review Find $\sum X^2$ for the following data: 4, 6, 7, 3, 2, 7, 2, 5 – first square each raw score and then sum the squared Xs. $4^2 + 6^2 + 7^2 + 3^2 + 2^2 + 7^2 + 2^2 + 5^2 = 16 + 36 + 49 + 9 + 4 + 49 + 4 + 25 = 192$

Review Find $(\sum X)^2$ for the following data: 4, 6, 7, 3, 2, 7, 2, 5 – first sum the raw scores and then square that sum. $(4 + 6 + 7 + 3 + 2 + 7 + 2 + 5)^2 = 36^2 = 1296$
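The two quantities are easy to confuse, so it may help to see them computed side by side. A minimal Python sketch with the review data follows; the variable names are illustrative.

```python
data = [4, 6, 7, 3, 2, 7, 2, 5]

# Sum of squared Xs: square each raw score first, then add the squares
sum_of_squared_x = sum(x ** 2 for x in data)   # 16+36+49+9+4+49+4+25 = 192

# Sum of Xs, squared: add the raw scores first, then square the total
squared_sum_of_x = sum(data) ** 2              # 36 ** 2 = 1296

print(sum_of_squared_x, squared_sum_of_x)
```

The order of operations is the only difference between the two lines, yet the results differ by a factor of almost seven for this dataset.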

How to find the Variance $(\sum X)^2 = 65025$ $\sum X^2 = 9403$ N = 7 $S_X^2 = \frac{9403 - 65025/7}{7} = \frac{9403 - 9289.29}{7} = \frac{113.71}{7} = 16.244$

How to find the Standard Deviation The computing formula for the sample standard deviation is: $S_X = \sqrt{\frac{\sum X^2 - \frac{(\sum X)^2}{N}}{N}}$

Computing the Standard Deviation - Example Estimates of professor’s age: Data: 36, 34, 45, 31, 35, 38, 36 N = 7 $\sum X^2 = 9403$ $(\sum X)^2 = 65025$ $S_X^2 = 16.244$ $S_X = \sqrt{16.244} = 4.03$
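The computing formula should give the same answer as the definitional formula, since the two are algebraically equivalent. The Python sketch below checks this for the professor's age data; the function names are illustrative and both versions divide by N, as in the slides.

```python
import math

def variance_definitional(scores):
    """Average squared deviation from the mean (divides by N)."""
    n = len(scores)
    mean = sum(scores) / n
    return sum((x - mean) ** 2 for x in scores) / n

def variance_computational(scores):
    """(sum of squared Xs - (sum of Xs) squared / N) / N."""
    n = len(scores)
    sum_x = sum(scores)                       # 255 for the age data
    sum_x_squared = sum(x * x for x in scores)  # 9403 for the age data
    return (sum_x_squared - sum_x ** 2 / n) / n

ages = [36, 34, 45, 31, 35, 38, 36]
var_def = variance_definitional(ages)
var_comp = variance_computational(ages)

print(math.isclose(var_def, var_comp))      # True
print(round(math.sqrt(var_comp), 2))        # 4.03
```

The computational version needs only two running totals, which is what makes it convenient for hand calculation.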

Applying Variability to Research

Example An experiment was conducted to test the efficacy of a new diet drug. The drug was administered to rats in various dosages and the rat’s food consumption measured. The data are as follows: Rat 0mg 2mg 4mg 6mg A B C D E $\bar{X}$ (0mg) = 12.4, $S_X$ = 3.13; $\bar{X}$ (2mg) = 10, $S_X$ = 0; $\bar{X}$ (4mg) = 10, $S_X$ = 2.52; $\bar{X}$ (6mg) = 5.6, $S_X$ = 1.2

Rat data - population distributions [Figure: one distribution per dose; panel labels: mg, Mean = 5.8; 2mg, Mean = 10; 0mg, Mean = 12.4]

Putting it all together…. Descriptive statistics - the three steps of analyzing any set of data are: 1. Consider the scale of measurement and the shape of the distribution 2. Describe where most participants scored, usually by computing the mean 3. Describe the variability of the scores, usually by computing the sample standard deviation