Descriptive Statistics

Slides:



Advertisements
Similar presentations
Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
Advertisements

Descriptive Statistics
Statistics.
QUANTITATIVE DATA ANALYSIS
PRED 354 TEACH. PROBILITY & STATIS. FOR PRIMARY MATH
Descriptive Statistics Chapter 3 Numerical Scales Nominal scale-Uses numbers for identification (student ID numbers) Ordinal scale- Uses numbers for.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
EdPsy 511 August 28, Common Research Designs Correlational –Do two qualities “go together”. Comparing intact groups –a.k.a. causal-comparative and.
Descriptive Statistics
Intro to Descriptive Statistics
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Measures of Central Tendency
Measures of Central Tendency
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
Descriptive Statistics
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
EDPSY Chp. 2: Measurement and Statistical Notation.
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
Psychology’s Statistics. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
INVESTIGATION 1.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Agenda Descriptive Statistics Measures of Spread - Variability.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
CHAPTER 3  Descriptive Statistics Measures of Central Tendency 1.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Chapter 4: Variability. Variability Provides a quantitative measure of the degree to which scores in a distribution are spread out or clustered together.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Descriptive Statistics. My immediate family includes my wife Barbara, my sons Adam and Devon, and myself. I am 62, Barbara is 61, and the boys are both.
Summary Statistics: Measures of Location and Dispersion.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Descriptive Statistics(Summary and Variability measures)
Welcome to… The Exciting World of Descriptive Statistics in Educational Assessment!
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
A QUANTITATIVE RESEARCH PROJECT -
Descriptive Statistics ( )
Doc.RNDr.Iveta Bedáňová, Ph.D.
Descriptive Statistics: Overview
Tips for exam 1- Complete all the exercises from the back of each chapter. 2- Make sure you re-do the ones you got wrong! 3- Just before the exam, re-read.
Central Tendency and Variability
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Descriptive Statistics
Description of Data (Summary and Variability measures)
Numerical Descriptive Measures
Theme 4 Describing Variables Numerically
Central tendency and spread
Central Tendency.
Chapter 3.
Introduction to Statistics
Numerical Descriptive Measures
Preview Bem, 2011, claimed that through nine experiments he had demonstrated the existence of precognition Failure to replicate: “Across seven experiments.
Ms. Saint-Paul A.P. Psychology
15.1 The Role of Statistics in the Research Process
CHAPTER 2: Basic Summary Statistics
Numerical Descriptive Measures
Presentation transcript:

CHAPTER 1 Basic Concepts CHAPTER 2 Describing and Exploring Data Part B

Descriptive Statistics Measures of Central Tendency

Measures of Central Tendency Mean--------Interval or Ratio scale Polygon The sum of the values divided by the number of values--often called the "average." μ=ΣX/N Add all of the values together. Divide by the number of values to obtain the mean. Example: X 7 12 24 20 19 ????

Descriptive Statistics The Mean is: μ=ΣX/N= 82/5=16.4 (7 + 12 + 24 + 20 + 19) / 5 = 16.4.

The Characteristics of Mean 1. Changing a score in a distribution will change the mean 2. Introducing or removing a score from the distribution will change the mean 3. Adding or subtracting a constant from each score will change the mean 4. Multiplying or dividing each score by a constant will change the mean 5. Adding a score which is same as the mean will not change the mean

Measures of Central Tendency Median/MiddleOrdinal ScaleBar/Histogram Divides the values into two equal halves, with half of the values being lower than the median and half higher than the median. Sort the values into ascending order. If you have an odd number of values, the median is the middle value. If you have an even number of values, the median is the arithmetic mean (see above) of the two middle values. Example: The median of the same five numbers (7, 12, 24, 20, 19) is ???.

Measures of Central Tendency The median is 19. ModeNominal Scale Bar/Histogram The most frequently-occurring value (or values). Calculate the frequencies for all of the values in the data. The mode is the value (or values) with the highest frequency. Example: For individuals having the following ages -- 18, 18, 19, 20, 20, 20, 21, and 23, the mode is ????

CHARACTERISTICS OF MODE Nominal Scale Discrete Variable Describing Shape

WHEN TO USE WHICH MEASURE Measure of Central Tendency Level of Measurement Use When Examples Mode Nominal Data are categorical Eye color, party affiliation Median Ordinal Data include extreme scores Rank in class, birth order, income Mean Interval and ratio You can, and the data fit Speed of response, age in years

Variability

MEASURES OF VARIABILITY Variability--The degree of spread or dispersion in a set of scores Range—The difference between highest and lowest score +1 Standard Deviation—The average difference of each score from mean

Variability Variability is a measure of dispersion or spreading of scores around the mean, and has 2 purposes: 1. Describes the distribution Next slide

Variability 2. How well an individual score (or group of scores) represents the entire distribution. i.e. Z Score Ex. In inferential statistics we collect information from a small sample then, generalize the results obtained from the sample to the entire population.

Range, Interquartile Range, Semi-Interquartile Range, Standard Deviation, and Variance are the Measures of Variability The Range: The Range is the difference between the highest number –lowest number +1 2, 4, 7, 8, and 10 -> Discrete Numbers 2, 4.6, 7.3, 8.4, and 10 -> Continues Numbers The difference between the upper real limit of the highest number and the lower real limit of the lowest number.

Interquartile Range (IQR) Assesses the distance between the scores at the 75th and 25th percentile ranks. See next slide IQR = Q3-Q1

Interquartile Range (IQR) IQR is the range covered by the middle 50% of the distribution. IQR is the distance between the 3rd Quartile and 1st Quartile.

Interquartile Range (IQR) In descriptive statistics, the Interquartile Range (IQR), also called the midspread or middle fifty, is a measure of statistical dispersion, being equal to the difference between the upper and lower quartiles. (Q3 − Q1)=IQR

Semi-Interquartile Range (SIQR) Assesses the distance between the scores at the 75th and 25th percentile ranks divided by 2. SIQR = (Q3-Q1)/2

Semi-Interquartile Range (SIQR) SIQR is ½ or half of the Interquartile Range. It is used when our data are open ended (i.e., in research we continue to receive more extreme numbers). It is lowest measure of variability. SIQR = (Q3-Q1)/2

Variability SS, Standard Deviations and Variances X σ² = ss/N Variance Pop 1 σ = √ss/N Standard Deviation 2 4 s² = ss/n-1 or ss/df Variance Sample 5 s = √ss/df Standard Deviation SS=Σx²-[(Σx)²/N]  Computation SS=Σ( x-μ)²  Definition Sum of Squared Deviation from Mean Variance (σ²) is the Mean of Squared Deviations=MS Used in ANOVA

Practical Implication for Test Construction Variance and Covariance measure the quality of each item in a test. Reliability and validity measure the quality of the entire test. σ²=SS/N  used for one set of data Variance is the degree of variability of scores from mean. Correlation is based on a statistic called Covariance (Cov xy or S xy) COVxy=SP/N-1  used for 2 sets of data Covariance is a number that reflects the degree to which 2 variables vary together. r=sp/√ssx.ssy

Variance X σ² = ss/N Pop 1 s² = ss/n-1 or ss/df Sample 2 4 5 SS=Σx²-(Σx)²/N SS=Σ( x-μ)² Sum of Squared Deviation from Mean

COMPUTING THE STANDARD DEVIATION List scores and compute mean X 13 14 15 12 16 9 X = 13.4

COMPUTING THE STANDARD DEVIATION X (X-X) 13 -0.4 14 0.6 15 1.6 12 -1.4 16 2.6 9 -4.4 X = 0 List scores and compute mean Subtract mean from each score (Deviation) X = 13.4

COMPUTING THE STANDARD DEVIATION X 13 -0.4 0.16 14 0.6 0.36 15 1.6 2.56 12 -1.4 1.96 16 2.6 6.76 9 -4.4 19.36 X =13.4  X = 0   (X – X) (X – X)2 List scores and compute mean Subtract mean from each score (Deviation) Square each (Deviation)  

COMPUTING THE STANDARD DEVIATION X 13 -0.4 0.16 14 0.6 0.36 15 1.6 2.56 12 -1.4 1.96 16 2.6 6.76 9 -4.4 19.36 X =13.4  X = 0  X2 = 34.4 (X – X) (X – X)2 List scores and compute mean Subtract mean from each score Square each Deviation Sum Squared Deviations

COMPUTING THE STANDARD DEVIATION X 13 -0.4 0.16 14 0.6 0.36 15 1.6 2.56 12 -1.4 1.96 16 2.6 6.76 9 -4.4 19.36 X =13.4  X = 0  X2 = 34.4 (X – X) (X – X)2 List scores and compute mean Subtract mean from each score Square each deviation Sum squared deviations Divide sum of squared deviation by n – 1 34.4/9 = 3.82 (= s2) Compute square root of step 5 3.82 = 1.95  

@Suppose you earned a score of X = 54 on an exam. Which set of parameters would give you the highest grade? a. μ= 50 and σ= 2 σ²=4 b. μ= 50 and σ= 4 σ²=16 c. μ= 54 and σ= 2 σ²=4 d. μ= 54 and σ= 4 σ²=16

Suppose you earned a score of X = 46 on an exam. Which set of parameters would give you the highest grade? a. μ= 50 and σ= 2 σ²=4 b. μ= 50 and σ= 4 σ²=16 c. μ= 54 and σ= 2 σ²=4 d. μ= 54 and σ= 4 σ²=16

Covariance Correlation is based on a statistic called Covariance (Cov xy or S xy) ….. COVxy=SP/N-1 Correlation-- r=sp/√ssx.ssy Covariance is a number that reflects the degree to which 2 variables vary together. Original Data X Y 1 3 2 6 4 4 5 7

Covariance Correlation is based on a statistic called Covariance (Cov xy or S xy) ….. COVxy=SP/N-1 Correlation-- r=sp/√ssx.ssy Covariance is a number that reflects the degree to which 2 variables vary together. Original Data X Y 8 1 1 0 3 6 0 1

Covariance  

Descriptive Statistics for Non-dichotomous Variables

X6 X7 5 1 Calcúlate the VaríAnCe 5 1

Descriptive Statistics for Dichotomous Data (students assignments)

X6 X7 0 1 Calcúlate the VaríAnCe 0 1

Descriptive Statistics for Dichotomous Data Item Variance & Covariance

FACTORS THAT AFFECT VARIABILITY 1. Extreme Scores i.e. 1, 3, 8, 11, 1,000,000. We can’t use the Range in this situation but we can use the other measures of variability. 2. Sample Size If we increase the sample size will change the Range therefore we can’t use the Range in this situation but we can use the other measures of variability (i.e., IQR) 3. Stability Under Sampling (see next slide) The S and S² for all samples of a population should be the same because they come from the same population (all slices of a pizza should taste the same). 4. Open-Ended Distribution When we don’t know the highest score and lowest score in a distribution (because we keep adding to the sample) we use SIQR.

Variability

DATA ENTERY

Sample Survey Questioner

Bootstrap Bootstrapping is a method for deriving robust estimates of standard errors and confidence intervals for estimates such as the mean, median, proportion, odds ratio, correlation coefficient or regression coefficient.

Please read the sample review questions and take the Quiz 2.