Quantitative Methods in Social Research 2010/11 Week 5 (morning) session 11th February 2011 Descriptive Statistics.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Richard M. Jacobs, OSA, Ph.D.
Chapter 3, Numerical Descriptive Measures
Chapter 16: Correlation.
Descriptive Measures MARE 250 Dr. Jason Turner.
Measures of Dispersion
Descriptive Statistics
© 2002 Prentice-Hall, Inc.Chap 3-1 Basic Business Statistics (8 th Edition) Chapter 3 Numerical Descriptive Measures.
Calculating & Reporting Healthcare Statistics
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Measures of Dispersion CJ 526 Statistical Analysis in Criminal Justice.
Analysis of Research Data
Statistics for Managers Using Microsoft® Excel 4th Edition
Basic Business Statistics 10th Edition
CHAPTER 3: Statistical Description of Data
Measures of Dispersion
DTC Quantitative Research Methods Descriptive Statistics Thursday 16 th October 2014.
© 2003 Prentice-Hall, Inc.Chap 3-1 Business Statistics: A First Course (3 rd Edition) Chapter 3 Numerical Descriptive Measures.
Data observation and Descriptive Statistics
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Chapter 5 – 1 Chapter 5: Measures of Variability The Importance of Measuring Variability The Range IQR (Inter-Quartile Range) Variance Standard Deviation.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Correlation and regression 1: Correlation Coefficient
Numerical Descriptive Measures
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Statistics for Managers.
Chapter 3 Statistical Concepts.
Numerical Descriptive Techniques
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
Measures of Dispersion
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Thinking About Psychology: The Science of Mind and Behavior 2e Charles T. Blair-Broeker Randal M. Ernst.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Descriptive Statistics
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
UTOPPS—Fall 2004 Teaching Statistics in Psychology.
INVESTIGATION 1.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Numerical Measures of Variability
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
LIS 570 Summarising and presenting data - Univariate analysis.
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Descriptive Statistics(Summary and Variability measures)
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 18.
Descriptive Statistics ( )
Statistical analysis.
DTC Quantitative Methods Descriptive Statistics Thursday 26th January  
Business and Economics 6th Edition
Statistical analysis.
Teaching Statistics in Psychology
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Description of Data (Summary and Variability measures)
Chapter 5: Describing Distributions Numerically
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Advanced Algebra Unit 1 Vocabulary
Business and Economics 7th Edition
Presentation transcript:

Quantitative Methods in Social Research 2010/11 Week 5 (morning) session 11th February 2011 Descriptive Statistics

Some relevant online course extracts Diamond and Jefferies (2001) Chapter 5. - Measures and displays of spread Sarantakos (2007) Chapter 5. - Graphical displays. Huizingh (2007) Chapter 12. - SPSS material.

Descriptive statistics... ... are data summaries which provide an alternative to graphical representations of distributions of values ... aim to describe key aspects of distributions of values ... are of most relevance when we are thinking about interval-level variables (scales)

Types of descriptive statistics Measures of ... ... location (averages) ... spread ... skewness (asymmetry) ... kurtosis We typically want to know about the first two, sometimes about the third, and rarely about the fourth!

What is ‘kurtosis’ anyway? Increasing kurtosis is associated with the “movement of probability mass from the shoulders of a distribution into its center and tails.” (Balanda, K.P. and MacGillivray, H.L. 1988. ‘Kurtosis: A Critical Review’, The American Statistician 42:2: 111–119. Below, kurtosis increases from left to right...

Measures of location Mean (the arithmetic average of the values, i.e. the result of dividing the sum of the values by the total number of cases) Median (the middle value, when the values are ranked/ordered) Mode (the most common value)

... and measures of spread Standard deviation (and Variance) (This is linked with the mean, as it is based on averaging [squared] deviations from it. The variance is simply the standard deviation squared). Interquartile range / Quartile deviation (These are linked with the median, as they are also based on the values placed in order).

Measures of location and spread: an example (household size) Mean = 2.94, Median = 2, Mode = 2 Mean = 2.96, Median = 3, Mode = 2 s.d. = 1.93, skewness = 2.10; kurtosis = 5.54 s.d. = 1.58, skewness = 1.27; kurtosis = 2.24 West Midlands London

Why is the standard deviation so important? The standard deviation (or, more precisely, the variance) is important because it introduces the idea of summarising variation in terms of summed, squared deviations. And it is also central to some of the statistical theory used in statistical testing/statistical inference...

An example of the calculation of a standard deviation Number of seminars attended by a sample of undergraduates: 5, 4, 4, 7, 9, 8, 9, 4, 6, 5 Mean = 61/10 = 6.1 Variance = ((5 – 6.1)2 + (4 – 6.1)2 + (4 – 6.1)2 + (7 – 6.1)2 + (9 – 6.1)2 + (8 – 6.1)2 + (9 – 6.1)2 + (4 – 6.1)2 + (6 – 6.1)2 + (5 – 6.1)2)/(10 – 1) = 36.9 /9 = 4.1 Standard deviation = Square root of variance = 2.025

The Empire Median Strikes Back! Comparing descriptive statistics between groups can be done graphically in a rather nice way using a form of display called a ‘boxplot’. Boxplots are based on medians and quartiles rather than on the more commonly found mean and standard deviation.

Example of a boxplot

Moving on to bivariate ‘descriptive statistics'... These are referred to as ‘Measures of association’, as they quantify the (strength of the) association between two variables The most well-known of these is the (Pearson) correlation coefficient, often referred to as ‘the correlation coefficient’, or even ‘the correlation’.

Positive and negative relationships Positive or direct relationships If the points cluster around a line that runs from the lower left to upper right of the graph area, then the relationship between the two variables is positive or direct. An increase in the value of x is more likely to be associated with an increase in the value of y. The closer the points are to the line, the stronger the relationship. Negative or inverse relationships If the points tend to cluster around a line that runs from the upper left to lower right of the graph, then the relationship between the two variables is negative or inverse. An increase in the value of x is more likely to be associated with a decrease in the value of y.

Working out the correlation coefficient (Pearson’s r) Pearson’s r tells us how much one variable changes as the values of another changes – their covariation. Variation is measured with the standard deviation. This measures average variation of each variable from the mean for that variable. Covariation is measured by calculating the amount by which each value of X varies from the mean of X, and the amount by which each value of Y varies from the mean of Y and multiplying the differences together and finding the average (by dividing by n-1). Pearson’s r is calculated by dividing this by (SD of x) x (SD of y) in order to standardize it.

Working out the correlation coefficient (Pearson’s r) Because r is standardized it will always fall between +1 and -1. A correlation of either 1 or -1 means perfect association between the two variables. A correlation of 0 means that there is no association. Note: correlation does not mean causation. We can only investigate causation by reference to our theory. However (thinking about it the other way round) there is unlikely to be causation if there is not correlation.

Example of calculating a correlation coefficient X = 5, 4, 4, 7, 9, 8, 9, 4, 6, 5 Mean(X) = 6.1 Y = 8, 7, 9, 7, 8, 8, 8, 5, 5, 6 Mean(Y) = 7.1 (5 - 6.1)(8 – 7.1) + (4 – 6.1)(7 – 7.1) ... etc. -0.99 + 0.21 + ... = 7.9 (Covariation) S.D. (X) = 2.02 ; S.D. (Y) = 1.37 (7.9 / 9) / (2.02 x 1.37) = 0.317

Another measure of association Later in the module we will look at the calculation of a different measure of association for cross-tabulated variables, Cramér’s V Like the Pearson correlation coefficient, it has a maximum of 1, and 0 indicates no relationship, but it can only take on positive values, and makes no assumption of linearity.

An example of Cramér’s V