LIS 570 Summarising and presenting data - Univariate analysis.

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

Unit 1.1 Investigating Data 1. Frequency and Histograms CCSS: S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box.
Measures of Dispersion
Descriptive Statistics
Statistics.
Session 6.1 Univariate Data Analysis
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
Descriptive Statistics Chapter 3 Numerical Scales Nominal scale-Uses numbers for identification (student ID numbers) Ordinal scale- Uses numbers for.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Descriptive Statistics
Intro to Descriptive Statistics
Measures of Dispersion
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Descriptive Statistics PowerPoint Prepared by Alfred.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Measures of Central Tendency
Today: Central Tendency & Dispersion
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Chapter 2 Describing Data with Numerical Measurements
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Statistics for Linguistics Students Michaelmas 2004 Week 1 Bettina Braun.
Describing distributions with numbers
Objective To understand measures of central tendency and use them to analyze data.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Measures of Central Tendency or Measures of Location or Measures of Averages.
Chapters 1 & 2 Displaying Order; Central Tendency & Variability Thurs. Aug 21, 2014.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Chapter 2 Describing Data.
Describing distributions with numbers
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Categorical vs. Quantitative…
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Data Quantitative data are numerical observation –age of students in a class. Age is quantitative data because it quantifies the age of a person Qualitative.
Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.
Measures of Central Tendency or Measures of Location or Measures of Averages.
Numerical Measures of Variability
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Summary Statistics: Measures of Location and Dispersion.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Descriptive Statistics(Summary and Variability measures)
Welcome to… The Exciting World of Descriptive Statistics in Educational Assessment!
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Statistical Methods Michael J. Watts
Statistical Methods Michael J. Watts
Statistics Unit Test Review
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Description of Data (Summary and Variability measures)
Descriptive Statistics
Descriptive Statistics
Basic Statistical Terms
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Univariate Statistics
Advanced Algebra Unit 1 Vocabulary
Central Tendency & Variability
Presentation transcript:

LIS 570 Summarising and presenting data - Univariate analysis

Summary  Basic definitions  Descriptive statistics  Describing frequency distributions  shape  central tendency  dispersion

Selecting analysis and statistical techniques De Vaus p133

 Values : the categories developed for a variable Nominal Ordinal Interval  Data : Observations (Measurements) taken on the units of analysis Basic Definitions

Basic definitions  Statistics - Methods for dealing with data Descriptive statistics summarise sample or census data Inferential statistics Draw conclusions about the population from the results of a random sample drawn from that population

Methods of analysis (De Vaus, 134)

Frequency Distributions  Ungrouped frequency distribution A list of each of the values of the variable The number of times and/or the percent of times each value occurs  Grouped frequency distribution A table or graph which shows the frequencies or percent for ranges of values

Frequency distributions

 Required information for frequency tables table number and title labels for the categories of the variables column headings the number of missing cases

Histograms

Describing Frequency Distributions  Shape  Symmetrical (Mirror image) Skewed Negative skew  tail toward lower scores Positive skew  tail toward higher scores  Dispersion  Central tendency

Shape - for ordinal or interval variables Positively skewed distribution Cluster towards the low end of the variable

Shape - for ordinal or interval variables Negatively skewed distribution Cluster towards the high end of the variable

Shape - Symmetry

Central Tendency  Typical or representative value or score Mean (arithmetic mean)( x ) Sum all the observations / n Use for interval variables when appropriate Median Value that divides the distribution so that an equal number of values are above the median and an equal number below Mode Value with the greatest frequency Uni-modal, bi-modal etc.

Mode  Best for nominal variables  Problems most common may not measure typicality may be more than one mode unstable - can be manipulated  Dispersion variation ratio (v) % of people not in the modal category

Median  Preferred for ordinal variables people are ranked from low to high median is the middle case the median category is the one that the middle person belongs to

Dispersion  The cth percentile of a set of numbers is a value such that c percent of the numbers fall below it and the rest fall above. The median is the 50th percentile The lower quartile is the 25th percentile The upper quartile is the 75th percentile  five number summary Median, quartiles and extremes

Dispersion Lower quartile Median Upper quartile

Boxplot Variable 1 Variable 2 Variable 3 Interquartile range IQR

Mean  uses the actual numerical values of the observations  most common measure of centre  makes sense only of interval or ratio data,  frequently computed for ordinal variables as well.

Dispersion  The standard deviation and variance measure spread about the mean as centre.  Variance mean of the squares of the deviations of the observations from the mean.  Standard deviation the positive square root of the variance

Example Data (6,7,5,3,4)  = =25 = 5 5 Variance (S 2 ) Calculate the mean for the variable Take each observation and subtract the mean from it Square the result from the above Add (sum) all the individual results Divide by n

Variance (s 2 ) Variance = sum of the sq deviations = 10 = 2 number of observation 5

Standard deviation (s)  Square root of the variance  2 = 1.4  an average deviation of the observations from their mean  influenced by outliers  best used with symmetrical distributions

Summary  Determine if variable is nominal, ordinal or interval Nominal Frequency tables Mode Ordinal Frequency tables (grouped frequency tables histogram Median and five number summary plus IQR Mode

Summary Interval Determine whether the distribution is skewed or symmetrical Compare median and mean Use the mean and the standard deviation if the distribution is not markedly skewed Otherwise use median and five number summary plus IQR Use the mode in addition if it adds anything.