Constructing/transforming Variables Still preliminary to data analysis (statistics) Would fit comfortably under Measurement A bit more advanced is all.

Slides:



Advertisements
Similar presentations
Standardized Scales.
Advertisements

Developing a Questionnaire
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Mean, Proportion, CLT Bootstrap
Conceptualization and Measurement
Who Has a Good Opportunity for Phased Retirement? Robert Hutchens.
Dummy Variables Dummy variables refers to the technique of using a dichotomous variable (coded 0 or 1) to represent the separate categories of a nominal.
The Research Consumer Evaluates Measurement Reliability and Validity
Basic Issues in Measurements P M V Subbarao Professor Mechanical Engineering Department How to generate reliable Numbers?????
Bivariate Analysis Cross-tabulation and chi-square.
Reliability, the Properties of Random Errors, and Composite Scores.
RESEARCH METHODS Lecture 30. DATA TRANSFORMATION.
Getting Started With STATA How do I do this? It probably opened automatically, but you may have to save it to the desktop, and double-click it to open.
Effect Size and Meta-Analysis
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
Spring INTRODUCTION There exists a lot of methods used for identifying high risk locations or sites that experience more crashes than one would.
Beginning the Research Design
1 Measurement Measurement Rules. 2 Measurement Components CONCEPTUALIZATION CONCEPTUALIZATION NOMINAL DEFINITION NOMINAL DEFINITION OPERATIONAL DEFINITION.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.
Chapter 7 Correlational Research Gay, Mills, and Airasian
LEVEL OF MEASUREMENT Data is generally represented as numbers, but the numbers do not always have the same meaning and cannot be used in the same way.
Questionnaire Development Part II: SPSS, Reliability, and Validity Personality Lab October 11, 2010.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Internal Consistency Reliability Analysis PowerPoint.
Measuring Social Life Ch. 5, pp
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
8/20/2015Slide 1 SOLVING THE PROBLEM The two-sample t-test compare the means for two groups on a single variable. the The paired t-test compares the means.
Measurement The hardest part of doing research? You’ll see when we begin operationalizing concepts May seem easy/trivial/even boring, but it is crucial.
M28- Categorical Analysis 1  Department of ISM, University of Alabama, Categorical Data.
Near East University Department of English Language Teaching Advanced Research Techniques Correlational Studies Abdalmonam H. Elkorbow.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Goals for Today Review the basics of an experiment Learn how to create a unit-weighted composite variable and how/why it is used in psychology. Learn how.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 24 Designing a Quantitative Analysis Strategy: From Data Collection to Interpretation.
What is a Measurement? Concept of measurement is intuitively simple  Measure something two concepts involved  The thing you are measuring  The measurement.
Crosstabs How do we assess the relationship between two variables? (We’ll bring in more variables later.) Various ways, especially with interval-level.
CHAPTER 6, INDEXES, SCALES, AND TYPOLOGIES
Chapter 1:Statistics: The Art and Science of Learning from Data 1.1: How Can You Investigate Using Data? 1.2: We Learn about Populations Using Samples.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Chapter 12 Survey Research.
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
Variables and their Operational Definitions
Quantitative research – variables, measurement levels, samples, populations HEM 4112 – Research methods I Martina Vukasovic.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Scaling and Index Construction
The Practice of Social Research Chapter 6 – Indexes, Scales, and Typologies.
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
Lab 2: Descriptive Statistics. Today’s Activities Complete Brief Personality Inventory – Must put PID on Scantron Calculate Scales and Compute Descriptive.
1-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Indexes Anthony Sealey University of Toronto This material is distributed under an Attribution-NonCommercial-ShareAlike 3.0 Unported Creative Commons License,
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Course: Research in Biomedicine and Health III Seminar 5: Critical assessment of evidence.
Educational Research Chapter 8. Tools of Research Scales and instruments – measure complex characteristics such as intelligence and achievement Scales.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
O Make Copies of: o Areas under the Normal Curve o Appendix B.1, page 784 o (Student’s t distribution) o Appendix B.2, page 785 o Binomial Probability.
Measurement Chapter 6. Measuring Variables Measurement Classifying units of analysis by categories to represent variable concepts.
Data Entry, Coding & Cleaning SPSS Training Thomas Joshua, MS July, 2008.
Chapter 26: Generalizations and Surveys. Inductive Generalizations (pp ) Arguments to a general conclusion are fairly common. Some people claim.
Final Project Reminder
Final Project Reminder
CHAPTER 6, INDEXES, SCALES, AND TYPOLOGIES
Coding Manual and Process
Measuring Social Life: How Many? How Much? What Type?
RESEARCH METHODS Lecture 30
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Reliability, the Properties of Random Errors, and Composite Scores
Chapter 6 Indexes, Scales, and Typologies
Presentation transcript:

Constructing/transforming Variables Still preliminary to data analysis (statistics) Would fit comfortably under Measurement A bit more advanced is all All the earlier material about operationalization (importance of it, difficulty of doing well, need to think about and assess reliability and validity) apply

How (and why) change variables? Collapse codes Reverse coding Combine items Standardize items

Collapse codes Example: Party identification 1 = SD, 2 = WD, 3 = ID, 4 = Ind, 5 = IR, 6 = WR, 7 = SR Collapse to 1 = Dem (1-2 above), 2 = Ind (3-5), 3 = Rep (6-7)

Collapse codes (cont.) Why collapse codes? For theoretical reasons (In a given instance) we think only direction of partisanship matters To combine small categories Another option is to delete those cases To get a reasonable # of categories Age 18, 19, 20…97 = 80 categories

Reverse coding Examples: 1 = Disagree, 2 = Neutral, 3 = Agree Reverse so 1 = Agree, 2 = Neutral, 3 = Disagree 1 = Low…10 = High Reverse so 1 = High…10 = Low

Reverse coding (cont.) Why reverse coding? Questions are reversed in surveys Example: from homework, next slide Indicators have “opposite” meanings Example: unemployment (up is “bad”); increase in income (up is “good”) With reversal: It’s often easier to interpret items Combining items make sense

Tolerance questions Members of the [least-liked group] should be banned from being president of the U.S. Disagree is the “tolerant” response. Members of the [least-liked group] should be allowed to teach in public schools. Agree is the “tolerant” response.

Combine items Examples can be complex (we’ll see later) Simple example: Count the number of correct, or tolerant, or liberal, or … responses As is Knowledge and Tolerance scales (in homework)

Combine items (cont.) Why combine items? Multi-variable items are usually more valid Many concepts—e.g., type of election system, tolerance, knowledge—are hard to measure with a single question or indi- cator (they contain multiple components). In a combined measure, we can include items that measure all of these components

Combine items (cont.) Combined (multi-variable) items typically increase reliability as well Random error that affects individual items is averaged out Combining items also yield more refined measurement Simply put, we get more categories

Ex: Pro-business support Conceptual definition is simple: how favorable toward business are members of Congress? More specifically, rank U.S. House members by their favorability toward business. How might we do this?

Standardize items Why standardize items? To account for different numbers of base items. To compare or combine items measured on different scales altogether. Save this for later.

Standardize items Examples Percentaging (e.g., 90% is “equivalent” despite different length tests). Making measures comparable by deflating for changing bases (e.g., increased media coverage over time).

Media coverage over time: More pages & more periodicals

Deflating for total volume matters Downward trend in coverage of auto safety masked by increasing media volume.

A reminder Composite (multi-variable) measures are not always better (more valid/reliable). Results can be artifact of how you construct your variables.

Real world example of a composite (better?) measure Likelihood of voting (in election polls) Need to estimate because many who are interviewed will not vote People won’t/can’t estimate own behavior Pollsters use information about past voting, whether registered, interest in the race, etc. But: polls vary (in part) because different pollsters use different sets of questions

Why change variables? Additional reasons To change the nature of the variable Example: log transformation (age often done this way) Create new variables “Just” combining items again, but it can be very complex Example: next slide

Example: Clarity of responsibility for government decisions (Powell) Idea is that in some instances it is easy to assign credit or blame for what a government does; other times not Powell builds an index using measures of: Presence of minority government Whether there is bicameral opposition Number of political parties in the legislature Degree of cohesion among the parties Strength of committee chairs in the legislature

The text on combining/altering variables in SPSS Text talks about Recode and Compute operations for simple purposes Collapsing Additive index (e.g., # of yes answers) Useful but far from all that you can do Mentions transformations Refers to “a dizzying variety of complex transformations” possible” (text)

Comment/advice: If you want to do it, you almost certainly can do it in SPSS—and you can probably do it quite easily. Ex: Make individuals (in a survey) a 1 if var 5 + abs. value of var 6 + 2(var17) + log(var 19) is greater than 24.5 OR if var 3 equals 4; otherwise make them 0

Final note on SPSS When you recode: Save in a new variable almost always Checking after each operation More on changing variables in the lab sessions