DESCRIPTIVE STATISTICS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.

Slides:



Advertisements
Similar presentations
Rubric Unit Plan Univariate Bivariate Examples Resources Curriculum Statistically Thinking A study of univariate and bivariate statistics.
Advertisements

Richard M. Jacobs, OSA, Ph.D.
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Unit 1.1 Investigating Data 1. Frequency and Histograms CCSS: S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box.
IB Math Studies – Topic 6 Statistics.
Descriptive Statistics
QUANTITATIVE DATA ANALYSIS
Lesson Fourteen Interpreting Scores. Contents Five Questions about Test Scores 1. The general pattern of the set of scores  How do scores run or what.
Types of Data Displays Based on the 2008 AZ State Mathematics Standard.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
A quick introduction to the analysis of questionnaire data John Richardson.
Analysis of Research Data
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Social Research Methods
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
STAT 211 – 019 Dan Piett West Virginia University Lecture 2.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 16 Descriptive Statistics.
Numerical Descriptive Techniques
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Data Handbook Chapter 4 & 5. Data A series of readings that represents a natural population parameter A series of readings that represents a natural population.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Describing Data.
Thinking About Psychology: The Science of Mind and Behavior 2e Charles T. Blair-Broeker Randal M. Ernst.
UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Chapter 2 Describing Data.
Descriptive Statistics
Research Methodology Lecture No :24. Recap Lecture In the last lecture we discussed about: Frequencies Bar charts and pie charts Histogram Stem and leaf.
Counseling Research: Quantitative, Qualitative, and Mixed Methods, 1e © 2010 Pearson Education, Inc. All rights reserved. Basic Statistical Concepts Sang.
Lecture 3 Describing Data Using Numerical Measures.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Numerical Measures of Variability
Types of Graphs.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Descriptive Statistics – Graphic Guidelines
LIS 570 Summarising and presenting data - Univariate analysis.
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Presenting Data Descriptive Statistics. Chapter- Presentation of Data Mona Kapoor.
Chapter 2 Describing and Presenting a Distribution of Scores.
Summation Notation, Percentiles and Measures of Central Tendency Overheads 3.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Central Tendency  Key Learnings: Statistics is a branch of mathematics that involves collecting, organizing, interpreting, and making predictions from.
Chapter 11 Summarizing & Reporting Descriptive Data.
Exploratory Data Analysis
Methods for Describing Sets of Data
EHS 655 Lecture 4: Descriptive statistics, censored data
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Chapter 12 Understanding Research Results: Description and Correlation
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
DESCRIPTIVE STATISTICS
Description of Data (Summary and Variability measures)
Social Research Methods
Chapter 3 Describing Data Using Numerical Measures
Describing Data: Displaying and Exploring Data
Understanding Research Results: Description and Correlation
Lesson 12: Presentation and Analysis of Data
Warm Up # 3: Answer each question to the best of your knowledge.
15.1 The Role of Statistics in the Research Process
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Presentation transcript:

DESCRIPTIVE STATISTICS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON

STRUCTURE OF THE CHAPTER Frequencies, percentages and crosstabulations Measures of central tendency and dispersal Taking stock Correlations and measures of association Partial correlations Reliability

FREQUENCIES AND PERCENTAGES Graphical forms of data presentation: –Frequency and percentage tables; –Bar charts (for nominal and ordinal data); –Histograms (for continuous – interval and ratio – data); –Line graphs; –Pie charts; –High and low charts; –Scatterplots; –Stem and leaf displays; –Boxplots (box and whisker plots).

FREQUENCIES AND PERCENTAGES Bar charts for presenting categorical and discrete data, highest and lowest; Avoid using a third dimension (e.g. depth) in a graph when it is unnecessary; a third dimension to a graph must provide additional information; Histograms for presenting continuous data; Line graphs for showing trends, particularly in continuous data, for one or more variables at a time; Multiple line graphs for showing trends in continuous data on several variables in the same graph;

FREQUENCIES AND PERCENTAGES Pie charts and bar charts for showing proportions; Interdependence can be shown through cross- tabulations; Boxplots for showing the distribution of values for several variables in a single chart, together with their range and medians; Stacked bar charts for showing the frequencies of different groups within a specific variable for two or more variables in the same chart; Scatterplots for showing the relationship between two variables or several sets of two or more variables on the same chart.

A crosstabulation is a presentational device. –Rows for nominal data, columns for ordinal data. –Independent variables as row data, dependent variables as column data. CROSSTABULATIONS

BIVARIATE CROSSTABULATION

TRIVARIATE CROSSTABULATION Acceptability of formal, written public examinations Traditionalist Progressivist/ child-centred Formal, written public exams Socially advantaged Socially disadvantaged Socially advantaged Socially disadvantaged In favour65%70%35%20% Against35%30%65%80% Total per cent 100%

MEASURES OF CENTRAL TENDENCY AND DISPERSAL The mode (the score obtained by the greatest number of people); –For categorical (nominal) and ordinal data The mean (the average score); –For continuous data –Used if the data are not skewed –Used if there are no outliers

MEASURES OF CENTRAL TENDENCY AND DISPERSAL The median (the score obtained by the middle person in a ranked group of people, i.e. it has an equal number of scores above it and below it); –For continuous data –Used of the data are skewed –Used if there are outliers

MEASURES OF CENTRAL TENDENCY AND DISPERSAL Standard deviation (the average distance of each score from the mean, the average difference between each score and the mean, and how much, the scores, as a group, deviate from the mean. –A standardized measure of dispersal. –For interval and ratio data

STANDARD DEVIATION The standard deviation is calculated, in its most simplified form as: or d 2 = the deviation of the score from the mean (average), squared  = the sum of N = the number of cases A low standard deviation indicates that the scores cluster together, whilst a high standard deviation indicates that the scores are widely dispersed.

9 8 Mean 7 | 6 | 5 | 4 | 3 | 2 | 1 XXXX|X Mean = 6 High standard deviation

9 8 Mean 7 | 6 | 5 | 4 | 3 | 2 | 1 XXXXX Mean = 6 Moderately high standard deviation

9 8 Mean 7| 6| 5| 4| 3X 2X 1XXX Mean = 6 Low standard deviation

THE RANGE AND INTERQUARTILE RANGE The range: –The difference between the minimum and maximum score. –A measure of dispersal. –Outliers exert a disproportionate effect. The interquartile range: –The difference between the first and the third quartile, the difference between the 25 th and the 75 th percentile, i.e. the middle 50 per cent of scores (the second and third quartiles). –Overcomes problems of outliers/extreme scores.

CORRELATION Measure of association between two variables. Note the direction of the correlation: –Positive: As one variable increases, the other variables increases –Negative: As one variable increases, the other variable decreases –The strongest positive correlation coefficient is +1. –The strongest negative correlation coefficient is -1.

CORRELATION Note the magnitude of the correlation coefficient: − 0.20 to 0.35: slight association − 0.35 to 0.65: sufficient for crude prediction − 0.65 to 0.85: sufficient for accurate prediction − >0.85: strong correlation Ensure that the relationships are linear and not curvilinear (i.e. the line reaches an inflection point)

CURVILINEAR RELATIONSHIP

CORRELATION Foot size Hand size12345 Perfect positive correlation: + 1

CORRELATION Foot size Hand size Perfect negative correlation: + 1

CORRELATION Hand sizeFoot size Positive correlation: <+1

PERFECT POSITIVE CORRELATION

PERFECT NEGATIVE CORRELATION

MIXED CORRELATION

CORRELATIONS Correlations –Spearman correlation for nominal and ordinal data –Pearson correlation for interval and ratio data

BIVARIATE CORRELATIONS Correlations –Spearman correlation for nominal and ordinal data –Pearson correlation for interval and ratio data

MULTIPLE AND PARTIAL CORRELATIONS Multiple correlation: –The degree of association between three or more variables simultaneously. Partial correlation: –The degree of association between two variables after the influence of a third has been controlled or partialled out. –controlling for the effects of a third variable means holding it constant whilst manipulating the other two variables.

RELIABILITY Split-half reliability (correlation between one half of a test and the other matched half) The alpha coefficient

SPLIT-HALF RELIABILITY (Spearman-Brown) Reliability = r = the actual correlation between the two halves of the instrument (e.g. 0.85); Reliability = = = (very high)

CRONBACH ALPHA Reliability as internal consistency: Cronbach’s alpha (the alpha coefficient of reliability). A coefficient of inter-item correlations. It calculates the average of all possible split half reliability coefficients.

INTERPRETING THE RELIABILITY COEFFICIENT Maximum is +1 >.90very highly reliable highly reliable reliable marginally/minimally reliable <.60unacceptably low reliability