Chapter 1 The mean, the number of observations, the variance and the standard deviation.

Slides:



Advertisements
Similar presentations
Chapter 5 Introduction to Inferential Statistics.
Advertisements

QUANTITATIVE DATA ANALYSIS
Chapter 3 The Normal Curve.
The standard error of the sample mean and confidence intervals
The standard error of the sample mean and confidence intervals
Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum.
Chapter 5 Introduction to Inferential Statistics.
Chapter 1 The mean, the number of observations, the variance and the standard deviation.
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
PSY 307 – Statistics for the Behavioral Sciences
Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Chapter 5 Introduction to Inferential Statistics.
Chapter 1 The mean, the number of observations, the variance and the standard deviation.
Chapter 1-6 Review Chapter 1 The mean, variance and minimizing error.
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
Central Tendency and Variability
Measures of Variability: Range, Variance, and Standard Deviation
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Today: Central Tendency & Dispersion
Summarizing Scores With Measures of Central Tendency
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter 3 Descriptive Measures
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Smith/Davis (c) 2005 Prentice Hall Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part A n Measures of.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Chapter 3 Central Tendency and Variability. Characterizing Distributions - Central Tendency Most people know these as “averages” scores near the center.
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Statistics for Psychology CHAPTER SIXTH EDITION Statistics for Psychology, Sixth Edition Arthur Aron | Elliot J. Coups | Elaine N. Aron Copyright © 2013.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Measures of Central Tendency: The Mean, Median, and Mode
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Variability Pick up little assignments from Wed. class.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Univariate Descriptive Research Recall that the objective of univariate descriptive research is to describe a single psychological variable.
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
Central Tendency & Dispersion
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Central Tendency. Variables have distributions A variable is something that changes or has different values (e.g., anger). A distribution is a collection.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western /Thomson Learning.
Today: Standard Deviations & Z-Scores Any questions from last time?
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Chapter 4: Variability. Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 2 The Mean, Variance, Standard.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Things you will need in class. zLecture notes from the my website on the internet. yGo to and look for the latest set of.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Statistics: The Z score and the normal distribution
Central Tendency and Variability
Summarizing Scores With Measures of Central Tendency
Summary descriptive statistics: means and standard deviations:
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores
Central Tendency.
Chapter 3.
Variance Variance: Standard deviation:
Summary descriptive statistics: means and standard deviations:
Numerical Descriptive Measures
Presentation transcript:

Chapter 1 The mean, the number of observations, the variance and the standard deviation

Some definitions zData - observations, measurements, scores zStatistics - a series of rules and methods that can be used to organize and interpret data. zDescriptive Statistics - methods to summarize large amounts of data with just a few numbers. zInferential Statistics - mathematical procedures to make statements of a population based on a sample.

More Definitions zParameter - a number that summarizes or describes some aspect of a population. zSample statistic - An estimate of a population parameter based on a random sample taken from the population. zSampling Error - the difference between a sample statistic that estimates a population parameter and the actual parameter. zNon-parametric Statistics - statistics for observations that do not allow the estimation of the population mean and variance.

Sampling Error - the difference between a sample statistic that estimates a population parameter and the actual parameter. Differences between sample statistics and population parameters are largely a function of stable, random individual differences and measurement problems

Where we are going

Descriptive Statistics zNumber of Observations zMeasures of Central Tendency zMeasures of Variability

Observations zEach score is represented by the letter X. zThe total number of observations is represented by N.

Measures of Central Tendency Finding the most typical score ymedian - the middle score ymode - the most frequent score ymean - the average score xIn this course, the mean will be our most important measure of central tendency

zGreek letters are used to represent population parameters. z z (mu) is the mathematical symbol for the mean. z z is the mathematical symbol for summation. zFormula -  = (  X) / N zEnglish: To calculate the mean, first add up all the scores, then divide by the number of scores you added up. Calculating the Mean

The mode, the median and the mean Ages of people retiring from Rutgers this year  X = 548 N = 9 Mean  = Mode is 60.Median is 63.

Measures of Variability – less important zRange - the distance from the highest to the lowest score. zInter-quartile Range - the distance from the top 25% to the bottom 25%. zSum of Squares (SS) – the total squared distance of all scores from the mean. You calculate it by finding the distance of each score from the mean, squared and then summed over all the scores.

Measures of Variability – more important zVariance (  2 )- also called sigma 2. The variance is the average squared distance of scores from mu. It is found by dividing the total squared distance of all the scores from the mean and then dividing by the number of scores (  2 =SS/N) zStandard Deviation (  )- also called sigma. The standard deviation is the square root of the variance. It is the average unsquared distance of scores in the population from their mean. (That is almost, but not exactly like saying that the standard deviation is the average distance of scores from the population mean.)

Computing the variance and the standard deviation Scores on a 10 question Psychology quiz Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X -   (X-  ) = 0.00 (X -  )  (X-  ) 2 = SS =  2 = SS/N = 3.20  = = 1.79

The variance is our most basic and important measure of variability zThe variance ( =sigma squared) is the average squared distance of individual scores from the population mean. zOther indices of variation are derived from the variance. zFor example,. as noted above, sigma is the average unsquared distance of scores from mu is the standard deviation. To find it you compute the square root of the variance.

Other measures of variability derived from the variance zWe can randomly choose scores from a population to form a random sample and then find the mean of such samples. zEach score you add to a sample tends to correct the sample mean back toward the population mean, mu. zThe average squared distance of sample means from the population mean is the variance divided by n, the size of the sample. zTo find the average unsquared distance of sample means from mu divide the variance by n, then take the square root. The result is called the standard error of the sample mean or, more briefly, the standard error of the mean. We’ll see more of this in Ch. 4.

Making predictions (1) zWithout any other information, the population mean (mu) is the best prediction of each and every person’s score. zSo you should predict that everyone will score precisely at the population mean. zWhy? Because the mean is an unbiased predictor or estimate. The mean is as close to the high as to the low scores in the population. zThis is mathematically proven by the fact that deviations around the mean sum to zero.

You should also predict that everyone will score right at the mean because: zThe mean is the number that is the smallest average squared distance from all the scores in the distribution. zThus, the mean is your best prediction, because it is a least squares, unbiased predictor.

What happens if we make a prediction other than mu. Scores on a Psychology quiz (mu = 6.00) What if we predict everyone will score 5.50? Deviations don’t sum to zero and the average squared distance of scores from the prediction increases Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X  (X- ?) = 2.50 (X -  )  (X- ?) 2 = SS =  2 = SS/N = 3.45  = = 1.86 X (X ) 2

Compare that to predicting that everyone will score right at the mean (mu). Scores on a 10 question Psychology quiz Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X -   (X-  ) = 0.00 (X -  )  (X-  ) 2 = SS =  2 = SS/N = 3.20  = = 1.79

But when you predict that everyone will score at the mean, you will be wrong. In fact, it is often the case that no one will score precisely at the mean. zIn statistics, we don’t expect our predictions to be precisely right. zWe want to make predictions that are wrong in a particular way. zWe want our predictions to be as close to the high scores as to the low scores in the population. zThe mean is the only number that is an unbiased predictor, it is the only number around which deviations sum to zero.

We want to be wrong by the least amount possible zIn statistics, we consider error to be the squared distance between a prediction and the actual score. zThe mean is the least average squared distance from all the scores in the population. zThe number that is the least average squared distance from the scores in the population is the prediction that is least wrong, the least in error. zThus, saying that everyone will score at the mean (even if no one does!) is the prediction that gives you the smallest amount of error.

Why doesn’t everyone score right at the mean? zSources of Error yIndividual differences – people have stable differences from one another. They differ in an infinite number of ways and combination of ways. yPROOF OF THAT: AREN’T YOU ARE MORE LIKE WHO YOU WILL BE IN 5 MINUTES THAN YOU ARE LIKE THE PERSON NEXT TO YOU??!

AND – THERE ARE ALWAYS MEASUREMENT PROBLEMS! Instruments are imperfect, scores get mistranscribed, participants may be uninterested or have a stomach ache, etc. etc. etc. …

Remember: THERE ARE ALWAYS MEASUREMENT PROBLEMS NO MEASUREMENT DEVICE IS EVER PERFECTLY ACCURATE, WHETHER IT IS A HIGHLY ACCURATE SCALE OR A 12 QUESTION QUESTIONNAIRE

Additionally, transient situational factors make measurement inaccurate This is especially true when we measure people. Let’s say we are measuring something relatively easy to measure, such as verbal ability. When we are measuring people, lots of transient factors (such as mood, events, time, motivation etc.) all change an individual’s responses and combine to make our measurement of verbal ability imperfect.

The mean square for error We call the average squared error of prediction when we use the mean as our prediction the “mean square for error”. It tells us how much (squared) error we make, on the average, when we predict that everyone will score precisely at the mean.

Mean square for error = the variance (sigma 2 ) zIf we predict that everyone will score right at the mean, how much error do you make on the average? To find out, find the distance of each score from the mean, square that distance and divide by the number of scores to find the average error. zWHOOPS: THAT’S SIGMA 2.

Questions and answers – the mean. zWHAT QUALITIES OF THE MEAN (MU) MAKE IT THE BEST PREDICTION YOU CAN MAKE OF WHERE EVERYONE WILL SCORE? zThe mean is an unbiased predictor or estimate, because the deviations around the mean always sum to zero. zThe mean is a least squares predictor because it is the smallest squared distance on the average from all the scores in the population.

So the variance has a third name. zThe variance is called the mean square for error as well as being called sigma 2. zAs the mean square for error, the variance is our numerical index of the effects of individual differences and measurement problems.

Q & A: the mean zWHY WOULD YOU PREDICT THAT EVERYONE WILL SCORE AT THE MEAN WHEN, IN FACT, OFTEN NO ONE CAN POSSIBLY SCORE PRECISELY AT THE MEAN? zIn statistics, we don’t expect our predictions to be precisely right. zWe want to make predictions that are close and wrong in a particular way. zWe want least squares, unbiased predictors.

Q & A: The variance zWHAT ARE THE OTHER NAMES FOR THE VARIANCE? zSigma 2 and the mean square for error. zWHAT OTHER MEASURES OF VARIABILITY CAN BE EASILY COMPUTED ONCE YOU KNOW THE VARIANCE? zThe standard deviation and the standard error of the sample mean.

How do you compute zTHE VARIANCE? Find the distance of each score from the mean, square it, sum them up and divide by the number of scores in the population. zTHE STANDARD DEVIATION? Compute the square root of the variance. zTHE STANDARD ERROR OF THE SAMPLE MEAN? Divide the variance by n, the size of the sample, and then take a square root.

END CHAPTER 1 SLIDES