Level of Measurement Data is generally represented as numbers, but the numbers do not always have the same meaning and cannot be used in the same way.

Slides:



Advertisements
Similar presentations
Level of Measurement Problems
Advertisements

Sections 1.3 Types of Data.
SW388R7 Data Analysis & Computers II Slide 1 Solving Problems in SPSS The data sets Options for variable lists in statistical procedures Options for variable.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
TYPES OF DATA. Qualitative vs. Quantitative Data A qualitative variable is one in which the “true” or naturally occurring levels or categories taken by.
Math 2260 Introduction to Probability and Statistics
EDU 660 Methods of Educational Research Descriptive Statistics John Wilson Ph.D.
Introduction to Statistics & Measurement
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Detecting univariate outliers Detecting multivariate outliers
Research Methods in MIS
Nominal Level Measurement n numbers used as ways to identify or name categories n numbers do not indicate degrees of a variable but simple groupings of.
Chi-square Test of Independence
Multiple Regression – Basic Relationships
8/2/2015Slide 1 SPSS does not calculate confidence intervals for proportions. The Excel spreadsheet that I used to calculate the proportions can be downloaded.
Assumption of Homoscedasticity
LEVEL OF MEASUREMENT Data is generally represented as numbers, but the numbers do not always have the same meaning and cannot be used in the same way.
Problem 1: Relationship between Two Variables-1 (1)
SW388R7 Data Analysis & Computers II Slide 1 Assumption of normality Transformations Assumption of normality script Practice problems.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Basic Relationships Purpose of multiple regression Different types of multiple regression.
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
1 COMM 301: Empirical Research in Communication Kwan M Lee Lect3_1.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Measures of Central Tendency
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
SW388R7 Data Analysis & Computers II Slide 1 Assumption of Homoscedasticity Homoscedasticity (aka homogeneity or uniformity of variance) Transformations.
9/18/2015Slide 1 The homework problems on comparing central tendency and variability extend the focus central tendency and variability to a comparison.
Section 1.2 Data Classification.
Sections 1-3 Types of Data. PARAMETERS AND STATISTICS Parameter: a numerical measurement describing some characteristic of a population. Statistic: a.
Hierarchical Binary Logistic Regression
9/23/2015Slide 1 Published reports of research usually contain a section which describes key characteristics of the sample included in the study. The “key”
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
Chi-Square Test of Independence Practice Problem – 1
Eng.Mosab I. Tabash Applied Statistics. Eng.Mosab I. Tabash Session 1 : Lesson 1 IntroductiontoStatisticsIntroductiontoStatistics.
Statistics for the Social Sciences Psychology 340 Spring 2009 Review of SPSS basics.
110/10/2015Slide 1 The homework problems on comparing central tendency and variability extend our focus on central tendency and variability to a comparison.
1 Concepts of Variables Greg C Elvers, Ph.D.. 2 Levels of Measurement When we observe and record a variable, it has characteristics that influence the.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
SW318 Social Work Statistics Slide 1 Compare Central Tendency & Variability Group comparison of central tendency? Measurement Level? Badly Skewed? MedianMeanMedian.
As shown in Table 1, the groups differed in terms of language skills and the type of job last held. The intake form asked the client to indicate languages.
SW388R6 Data Analysis and Computers I Slide 1 Independent Samples T-Test of Population Means Key Points about Statistical Test Sample Homework Problem.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 1-3 Types of Data.
SW318 Social Work Statistics Slide 1 Get ready to work on practice problems 1. Create a directory and subdirectory on your computer named C:\StudentData\SW318_Spring_2004.
A Simple Guide to Using SPSS ( Statistical Package for the Social Sciences) for Windows.
SW318 Social Work Statistics Slide 1 Frequency: Nominal Variable Practice Problem This question asks the frequency of widowed respondents of the survey.
11/16/2015Slide 1 We will use a two-sample test of proportions to test whether or not there are group differences in the proportions of cases that have.
Chi-square Test of Independence
SW388R6 Data Analysis and Computers I Slide 1 Percentiles and Standard Scores Sample Percentile Homework Problem Solving the Percentile Problem with SPSS.
SW388R7 Data Analysis & Computers II Slide 1 Detecting Outliers Detecting univariate outliers Detecting multivariate outliers.
Variables It is very important in research to see variables, define them, and control or measure them.
Overview and Types of Data
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Set-up a Data Entry Page Section 3. Set Up Columns Switch to Variable View. At the bottom left of your screen there are two tabs (Data View and Variable.
1/5/2016Slide 1 We will use a one-sample test of proportions to test whether or not our sample proportion supports the population proportion from which.
Chapter 2: Levels of Measurement. Researchers classify variables according to the extent to which the values of the variable measure the intended characteristics.
SW388R7 Data Analysis & Computers II Slide 1 Incorporating Nonmetric Data with Dummy Variables The logic of dummy-coding Dummy-coding in SPSS.
SW388R6 Data Analysis and Computers I Slide 1 Comparing Central Tendency and Variability across Groups Impact of Missing Data on Group Comparisons Sample.
The frequency distribution
SW388R7 Data Analysis & Computers II Slide 1 Solving Homework Problems in SPSS The data sets Options for variable lists in statistical procedures Options.
Extracting Information from an Excel List The purpose of creating a database, or list in Excel, is to be able to manipulate the data elements in ways that.
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
Biostatistics Introduction Article for Review.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Data Entry, Coding & Cleaning SPSS Training Thomas Joshua, MS July, 2008.
SPSS For a Beginner CHAR By Adebisi A. Abdullateef
LEVEL OF MEASUREMENT Data is generally represented as numbers, but the numbers do not always have the same meaning and cannot be used in the same way.
Presentation transcript:

Level of Measurement Data is generally represented as numbers, but the numbers do not always have the same meaning and cannot be used in the same way. To distinguish the different ways in which numbers are used, we traditionally have identified the level of measurement of the variables as: nominal, ordinal, interval, and ratio. These levels were developed by Stanley Smith Stevens. Nominal Level Variables For nominal level variables, the numbers are shorthand for the categories of a variable, e.g. – 1 represents married persons, – 2 represents divorced persons, – 3 represents persons who have never been married, etc. The assignment of numbers to the categories is arbitrary and can be changed with no loss of meaning. The only legitimate mathematical operation we can do with nominal level data is count the number of times the different categories appear in the data set. Slide 1

Ordinal Level Variables Ordinal level variables are usually associated with labels as well, but the assignment of numbers to the categories is ordered, from low to high, e.g. – 1 is assigned to high school graduates, – 2 is assigned to junior college graduates, – 3 is assigned to college graduates, – 4 is assigned to graduates with a masters degree, etc. or from high to low: – 1 is assigned to graduates with a masters degree, – 2 is assigned to college graduates, – 3 is assigned to junior college graduates, – 4 is assigned to high school graduates, etc. The ordering of the numbers tracks the hierarchy of the labels. Though we can change the actual numbers used, the number assigned each higher level degree must consistently be a bigger number than what was used to represent lower ranked degrees (or a smaller number if ordered from high to low) Slide 2

The legitimate mathematical operations we can perform on ordinal data is sorting or ranking, as well as counting. Interval Level Variables Interval level variables have the additional characteristic that the difference between numbers is the same for all possible combinations, e.g.: – the difference between 1 and 2 years of age is the same amount as the difference between 21 and 22 years of age, or 50 and 51, or 65 and 66. – the difference between a height of 60 inches and a height of 55 inches is the same amount of difference as a height of 72 inches and a height of 67 inches. For interval level variables, it is mathematically legitimate to do arithmetic (add, subtract, multiple, and divide) as well as count the values, and sort or rank the values. Ratio Level Variables Ratio level variables have the additional property of having a true zero value so that ratios between values are meaningful, but practically speaking, ratio level data is treated the same as interval level. The commonly cited example is temperature. Slide 3

Quantitative and Categorical Variables The distinction between nominal and interval levels of variables is substantial. Computing an average marital status (treating a nominal level variable as interval) does not produce a meaningful result, and can be downright embarrassing. Presenting a count of all of the possible ages of the subjects in a data set (using only the nominal level property of an interval variable) does not communicate as much information as saying the average age was 27.5, with a range from 21 to 57. The differences in the use of data at these levels has led many authors to collapse the number of levels of measurement to two, substituting terms like: – quantitative or metric level instead of interval – categorical, qualitative, or non-metric instead of nominal In practice, ordinal level variable are sometimes treated as quantitative and at other times as categorical. The numeric codes for scale variables (1=disagree, 2=neutral, 3=agree) are generally treated as quantitative data and averaged. The numeric codes for year in school (1=freshman, 2=sophomore, 3=junior, 4=senior) are often not used, and comparisons are made using the categorical labels, e.g. the number (count) of seniors with some characteristic versus the number of juniors with the same characteristic. Slide 4

When ordinal level variables are used as quantitative variables, we are emphasizing the rank order of the categories, e.g. 3 ranks higher than 2 or 1, and 2 ranks higher than 1. Since the ranks themselves are interval level data, it is argued that arithmetic on the ordinal values is acceptable. Multiple Variables Measuring the Same Construct The same construct can be represented by variables at different levels of measurement. Education can be represented as – years of school (quantitative), – diploma such as high school, college, or post-graduate (categorical, though we could come up with a numbering scheme that made it quantitative) The implication of these different representations is that we cannot base a correct conclusion on the name of the variable or the construct it represents. A correct understanding of a variable’s level of measurement requires that we look at the numbers in the data set and the coding scheme (numeric codes and labels) applied to the variable. The authors of the text for this course use the labels: quantitative and categorical. We will use their terminology in the first set of homework problems. Slide 5

Dichotomous variables Dichotomous (two-category) variables can be used in special ways in statistical analysis. For example, they can be treated as an interval or ordinal level predictor in relationships that call for interval level predictors. Examples of dichotomous variables include: – gender: male, female – marital status: married, not married – support capital punishment: yes, no This will be explored in greater detail later in the course. For now, we will treat dichotomous variables as nominal categories. Slide 6

Homework problems This week’s homework problems are on level of measurement. Does the variable have value labels? – If there are no value labels, we will treat it as interval level, it can be used as a quantitative variable, and SPSS would categorize it as Scale. – If there are value labels, the variable is either nominal or ordinal. If the value labels have an order or hierarchy from low to high (or high to low), the variable is ordinal, it can be used either as a quantitative variable or a categorical variable, and SPSS would categorize it as Ordinal. If the value labels do not have an order to them, the variable is nominal, it can only be used as a categorical variable, and SPSS would characterize it as Nominal. Note: – Dichotomous variables should be answered as nominal level variables. – NA is not used as a response in these problems. Slide 7

Slide 8 In the SPSS Variable View, the level of measurement can be entered in the Measure column as Nominal, Ordinal, and Scale (for interval level variables), but this does not have much practical significance. Note: all of the variables in my datasets are types as Scale. This MAY or MAY NOT be the correct answer in a homework problem.

Slide 9 This problem is a level of measurement homework problem in Moodle. The Notes at the bottom of the problem tell you the data set that is being used, with the name of the variable in SPSS in square brackets. The introductory statement includes the type of problem or the statistical technique to be used, in this example: level of measurement.

Slide 10 The first question asks about the level of measurement of the variable for employment status, using Stevens’ framework. The SPSS variable for employment status in GSS2000R is wrkstat.

Slide 11 In the Data View of the data editor, we see that wrkstat contains numbers, but we cannot tell whether they are measures or codes.

Slide 12 First, to see what labels have been assigned to the variable, we click on the Variable View tab. Second, we look to see what numeric codes are used for missing data for the wrkstat row, in the Missing column. These are values that will not be used in the analysis and labels that we ignore in determining the level of measurement.

Slide 13 To see the values have been assigned labels, we look in the Variable View tab, and click in the right side of the cell on the row for wrkstat, in the column called Values.

Slide 14 When we clicked on the right end of the cell, the Value Labels dialog box opened. Ignoring the 0 and 9 which were coded as missing data, we see eight entries for work status.

Slide 15 Since there are value labels, the variable is either nominal or ordinal. If it might be ordinal, we examine the labels for order. I tried to think of them as describing the amount worked, but that clearly doesn’t work for retired, school, and keeping house. Since I find no plausible order, the variable is nominal rather than ordinal. To close the dialog box, click on the OK button.

Slide 16 We select nominal from the drop down list. The next sentence asks us to characterize it using the terminology of the text book. Since it is nominal, it can only be used as a categorical variable.

Slide 17 The next sentence asks us to characterize it using SPSS’s terminology. SPSS uses the nominal label for variables that are nominal level in Stevens’ scheme.

Slide 18 The next question asks about the level of measurement of the variable for number of hours worked in the past week, using Stevens’ framework. The SPSS variable for number of hours worked in the past week in GSS2000R is hrs1.

Slide 19 In the visible rows of the Data View, we see values that range from 38 to 60. Based on the variable label and the data values shown, my initial assessment is that this is an interval level variable.

Slide 20 The variable hrs1 uses three numeric codes for missing data: - 1, 98, and 99.

Slide 21 The only numbers assigned labels are the codes for missing data. There are no value labels for hrs1. Hrs1 is an interval level variable. Click at the right end of the Values cell for hrs1 to open the Value Labels dialog box.

Slide 22 We select interval from the drop down list. The next sentence asks us to characterize the variable using the terminology of the text book. Since it is interval, it qualifies as a quantitative variable.

Slide 23 The next sentence asks us to characterize it using SPSS’s terminology. SPSS uses the scale label for variables that are interval level in Stevens’ scheme.

Slide 24 The next question asks about the level of measurement of the variable for self-employment, using Stevens’ framework. The SPSS variable for self-employment in GSS2000R is wrkslf.

Slide 25 In the Data View, we see very restricted options for values of wrkself: 1, 2, and 9.

Slide 26 The variable wrkslf uses three numeric codes for missing data: 0, 8, and 9.

Slide 27 If we eliminate the codes for missing data (0, 8, and 9), there are only two valid values (1 and 2). While labels have been assigned to the values for this variable, it has only two categories and we characterize it as a special dichotomous variable. Following the directions for these problems, we choose nominal as the measurement level.

Slide 28 The next sentence asks us to characterize the variable using the terminology of the text book. Since it is nominal, it qualifies as a categorical variable.

Slide 29 The next sentence asks us to characterize it using SPSS’s terminology. SPSS uses the nominal label for variables that are nominal level in Stevens’ scheme.

Slide 30 The next question asks about the level of measurement of the variable for highest academic degree, using Stevens’ framework. The SPSS variable for highest academic degree in GSS2000R is degree.

Slide 31 In the Data View, we see restricted options for values of degree: 0, 1, 2, 3, 4, and 9.

Slide 32 The variable degree uses three numeric codes for missing data: 7, 8, and 9.

Slide 33 If we eliminate the codes for missing data (7, 8, and 9), there are five valid values. The are ordered by level of academic achievement and the number of years it takes to complete the degree. Graduate degrees take more years of school than bachelor degrees, which take more years of school than junior college degrees, etc. There is a clear order to the codes and labels.

Slide 34 The next sentence asks us to characterize the variable using the terminology of the text book. Since it is ordinal, it can be used both as a categorical variable and a quantitative variable.

Slide 35 The next sentence asks us to characterize it using SPSS’s terminology. SPSS uses the ordinal label for variables that are ordinal level in Stevens’ scheme.

Slide 36 We have entered a response for each part of the problem. To grade our answers, we click on the Submit button.

Slide 37 The green shading indicates that all of our responses are correct.