After all the coding is done... Harry Ganzeboom Center for Survey Research – Academia Sinica July 24-25 2008.

Slides:



Advertisements
Similar presentations
David Hunter ILO Department of Statistics
Advertisements

Conceptualization, Operationalization, and Measurement
Occupations in ESS R1-R51 Coding and Scaling Occupations in ESS R1-R5 Harry B.G. Ganzeboom Ingrid Workshop, UvA Amsterdam, February
“... providing timely, accurate, and useful statistics in service to U.S. agriculture.” Using Mixed Methods to Evaluate Survey Questionnaires Heather Ridolfo,
Conceptualization and Measurement
Professor Gary Merlo Westfield State College
Brandon Walcutt Hankuk University of Foreign Studies Seoul, Korea ALLIED ACADEMIES CONFERENCE April 2010 WAGE DISCRIMINATION IN KOREA’S ESL INDUSTRY 1.
Reliability, the Properties of Random Errors, and Composite Scores.
Business Statistics for Managerial Decision
International Standard Classification of Occupations (ISCO 2008) and the measurement of cultural employment UIS Interagency Meeting on Cultural Employment.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Measurement in Survey Research Developing Questionnaire Items with Respect to Content and Analysis.
MEASUREMENT. Measurement “If you can’t measure it, you can’t manage it.” Bob Donath, Consultant.
Errors in Factual Questions and their Consequences Annette Scherpenzeel QMSS Seminar Lugano.
What are competencies – some definitions ……… Competencies are the characteristics of an employee that lead to the demonstration of skills & abilities,
5. The Structuring of Beliefs. Are Many Opinions Random? (re Converse) GV917.
Scaling and Attitude Measurement in Travel and Hospitality Research Research Methodologies CHAPTER 11.
Research Methods in MIS
Social Mobility & Status Attainment I Three Generations of Comparative Intergenerational Stratification Research.
Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.
LEVEL OF MEASUREMENT Data is generally represented as numbers, but the numbers do not always have the same meaning and cannot be used in the same way.
RECRUITMENT AND SELECTION OF STAFF
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Measurement and Data Quality
Some comments on ESEC outline of the talk Theoretical aspects, objectives : what we agree on, what we suggest Measurement issues Coding ISCO Additional.
Critical Analysis. Key Ideas When evaluating claims based on statistical studies, you must assess the methods used for collecting and analysing the data.
1 The ICT Statistics of Business Sector in China By Lu Haiqi International Statistical Information Center, National Bureau of Statistics People’s Republic.
ISCO88, ISCO08 and ESeC Regional Meeting, 9 December 2005 Presentation of Hungary.
Chapter 1: Introduction to Statistics
USE OF LITHUANIAN CLASSIFICATION OF OCCUPATIONS ISCO 88, ISCO 2008 and the Development of the ESeC Regional Meeting, Oslo, 7 June 2005 Violeta Skamarociene.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Presented by: Kira N. Deane Statistician II (Ag.) Barbados Statistical Service (CARIBBEAN SEMINAR ON INTERNATIONAL CLASSIFICATIONS FOR STATISTICAL AND.
GEODE, 16 Jan 2007 Occupational Analysis – Issues and Examples Grid Enabled Occupational Data Environment GEODE Project workshop, 16 th January 2007 Vernon.
Validating ESeC using Round 1 of the European Social Survey ESeC Validation Conference, Lisbon, January 2006 Eric Harrison & David Rose ISER, University.
政府統計處 Census and Statistics Department Introduction to Statistical Work.
European conference on quality in official statistics Rome, 8-10 July 2008 How to assess the quality of the Italian classification of occupations Francesca.
Coding of parental occupations ICCS Marker Training Hamburg, July 2007.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Measurement and Data Quality.
DRAFT Module 3: Measuring the ICT sector Ms Sheridan Roberts, Consultant Information Society Statistics Sunday 8 March 2009.
ISCO-08 - Current Status and plans to support implementation David Hunter Department of Statistics International Labour Office United Nations Expert Group.
Using EseC to look across and within classes Workshop on Application of ESeC Lake Bled, June 2006 Eric Harrison & David Rose ISER, University of.
Do’s and dont’s of occupation coding Harry B.G. Ganzeboom Center for Survey Research Academia Sinica July
1 ISCO USE IN CYPRUS. 2 ISCO 58 in the 1960´s ISCO 68 in the 1970´s & 1980´s ISCO 88 in the 1990´s ISCO 88.COM since 2000.
Constructing/transforming Variables Still preliminary to data analysis (statistics) Would fit comfortably under Measurement A bit more advanced is all.
1 Assessing inconsistencies in reported job characteristics of employed stayers: An analysis on two-wave panels from the Italian Labour Force Survey,
European Socio-economic Classification: Finalising the Matrix David Rose Institute for Social and Economic Research University of Essex.
Market research for a start-up. LEARNING OUTCOMES By the end of this lesson I will be able to: –Define and explain market research –Distinguish between.
Mihkel Reispass Methodologist Methodology Department.
Over-skilling and Over- education Peter J Sloane, Director, WELMERC, School of Business and Economics, Swansea University, IZA, Bonn and University of.
Validation studies : project using French data Assessing the consistency of ESeC with theoretical framework “à la Goldthorpe” Pointing out using ISCO as.
Reliability, the Properties of Random Errors, and Composite Scores Week 7, Psych R. Chris Fraley
Question paper 1997.
Defining ‘managers’ The terms “manager” and “professional” have no equivalent in French The name “professional” does not exit The translation of ‘manager’
SOCW 671: #5 Measurement Levels, Reliability, Validity, & Classic Measurement Theory.
The effects of Peer Pressure, Living Standards and Gender on Underage Drinking Psychologist- Kanari zukoshi.
ICCS Marker Training Hamburg July 2008 Final note on marking Reliability marking report from WinDEM will include record of scores for double-marked items,
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
Sampling Design & Measurement Scaling
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
Reliability a measure is reliable if it gives the same information every time it is used. reliability is assessed by a number – typically a correlation.
13-Jul-07 State of the art of the ISCO-08 implementation.
ESSnet on the harmonisation and implementation of a European socio- economic classification Workpackage 2 – Expertise of the basic variables ___________________.
July 14, 2010 SHRM Poll: Green Jobs—Are They Here Yet?
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.
CEDEFOP Session 2: Closer look at the roadmap Forecasting occupations and qualifications Production of Skills Supply and Demand Forecasts Alphametrics.
پرسشنامه کارگاه.
Workshop on the data collection of occupational data 28 November 2008
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Concepts of industry, occupation and status in employment - Overview
representing the International Labour Office
Presentation transcript:

After all the coding is done... Harry Ganzeboom Center for Survey Research – Academia Sinica July

Analysing occupational information2 Scaling occupations Detailed occupation codes have various uses, but for most applications they are condensed again into social status scales. There is a great variety of national and international social status scales and ways they are constructed. Main division: –Nominal categories: EGP (Goldthorpe), Wright, Esping-Andersen. –Continuous scales: Prestige, Socio-economic Index [SEI] Each of these have their own theoretical backgrounds. The varieties of social status scales can only be compared when you have access to detailed occupations (and more).

Analysing occupational information3 Tools for ISCO-88 This webpage contains several useful [SPSS] tools to work with ISCO-88 codes: –ADD VALUE LABELS for all occupations –RECODE for EGP social classses –RECODE for SIOPS [Treiman’s] prestige scale –RECODE for ISEI [Ganzeboom et al.’s] SEI scale Note that the tools will work (A) for multiple occupations, and (B) for all levels of detail of coding (providing you have used trailing zeroes). There are also tools for ISCO-68 and will be for ISCO-08.

Analysing occupational information4 ISEI (1) A SEI [socio-economic index or Duncan] score scales occupation by averaging status characteristics of job holders, most often their education and earnings. Often the criterion information is taken from census data. ISEI was created for ISCO-88 using criterium information for educational and earnings ranks on a ‘world-wide’ sample of men from 17 countries.

Analysing occupational information5 ISEI (2) ISEI was constructed as an optimal scaling of (detailed) occupations as an intervening variable between education and earnings: “Occupation is what you do to convert your qualifications into income”. Metric between 10-90, but this is entirely arbitrary. ISEI was originally developed for ISCO-68, but its second generation version (for ISCO-88) has become widely used, also outside sociology.

Analysing occupational information6 Prestige Prestige: popular evalation of occupational status, i.e. you ask respondents to value occupations. Many local versions have been integrated by Treiman (1977) into the Standard International Occupational Prestige Score [SIOPS], related to ISCO-68. The version on my website is a mapping of the original SIOPS to ISCO-88.

Analysing occupational information7 EGP EGP class typology combines detailed occupation codes with measures on self- employment and supervising status. This leads to a nominal (partly ordered) set of distinctions: categories. EGP has become the de facto standard for stratification research. Much used.

Analysing occupational information8 Relationships EGP, ISEI, SIOPS All these measures are strongly associated. You need a lot of data if you are going to argue about the differences. EGP and ISEI resemble each other more than SIOPS. SIOPS [prestige] is theoretically the best idea, but it does not work well in practice. I prefer to use ISEI for my further discussion here.

Analysing occupational information9 Checks to be run... Use value labels to see whether the coders have indeed entered only valid codes. It is surprising to learn how often this check has not been run! It is even more surprising to learn how often this is the only check ever run!!!

Analysing occupational information10 MTMM-models Multi-Trait Multi-Method models were developed in psychometrics to estimate the reliability and validity of attitude items. The idea is that you can learn about reliability and validity (both!!) when you apply multiple methods (e.g. respons formats) to multiple [related] traits (e.g. personality characteristics. Remember: –Reliability: lack of random errors –Validity: lack of systematic error

Analysing occupational information11 MTMM model FOCC ROCC FISEI1FISEI2RISEI1RISEI2

Analysing occupational information12 Estimating MTMM for two coders The elementary MTMM model for two traits (occupations) and two methods (coders) has 7 parameters. The data generate only 6 degrees of freedom. However, by contraining (equalizing) the parameters, we can find the following interesting information: –How random error each coder has coded relative to the other. –Whether FOCC and ROCC differ in the amount of random error. –How much systematic bias each coder has added to their codes. –Degree of attention brought about by the coding unreliability; corrected (disattenuatud) correlation between FOCC and ROCC.

Analysing occupational information13

Analysing occupational information14 What are we learning by staring at these correlations? Within-coder correlation at best This means 0.90 index of reliability. Coders agree slightly less on father’s occ than respondent’s. Loss is around Within- and cross-coder intergenerational correlations are around 0.33 and fairly homogenous. Coder 1 has created slightly more consistency between father and respondent.

Analysing occupational information15 MTMM assumptions Coders are equally reliable for fathers and respondents. However, fathers’ occupations may be easier to code (less) reliably than respondents’ occupations. Systematic error is the same for all coders.

Analysing occupational information16 If estimated by SEM (Lisrel), we learn: Reliability coder 1/coder 2: / (NS). Reliability FOCC/ROCC: / (NS). Coder unique consistency: (significant). Corrected intergenerational correlation: The interesting conclusion for this (Italian) example is clearly the corrected intergenerational correlation. Note that this is even so with high coder reliability!

Analysing occupational information17 Conclusions Even if coders do a decent and honest job, they introduce random and systematic error. These errors are in the coding process, not by the data collection! If coders introduce only 10% error, they bring down the intergenerational correlation by 20%!

Analysing occupational information18 More sources of measurement problems.. and their repairs It is important to see that coder errors are just one single source of bad measurement. It might be true that even bigger trouble is created by what the respondents say. If you want to assess measurement error at the respondents level, you need to ask the question twice: –Within the same interview –From different sources (e.g. spouses about each other). –At diffent interviews, e.g. in panel designs.

Analysing occupational information19 Another source of error: the respondent. Note that all of the above is about errors generated in the coding proces. Occupational measures also contain other errors, most prominently generated by the respondent / interviewer. This type of error can only be estimated by asking the question again: –In the same interview. –From a different source (e.g spouses about each other). –In a different interview (panel).

Analysing occupational information20 Can you ask the occupation question again in the same interview? Yes, an acceptable way for respondents is to ask an open question (see above) and a closed question. Closed questions may not be as valid and flexible as open questions, but they may be more reliable. At least they do not suffer from coding error... This type of multiple measurement has been tried in ISSP87 for four countries and six Dutch surveys. It will be replicated in ISSP09.

Analysing occupational information21 Main conclusions on double measurement Crude closed questions are slightly more reliable than detailed open question. Crude questions suffer slightly more from systematic error than detailed questions: –Correlated error (‘echo effects’) –Education bias. However, the main boost comes from using multiple indicators, that leads to disattenuation. Estimates from ISSP and Dutch data suggest measurement relationships of around This would suggest that coding error is the major source of random error.

Analysing occupational information22 ISCO 2008 ILO has recently revised the ISCO to ISCO- 08. Current situation is that the new classification has been fixed and published. However, there are no definitions or manuals available yet. For previous versions it laster 1-2 years before these became available.

Analysing occupational information23 Stated goals of ISCO-08 Bring occupational classification in line with changed technologies and division of labor (e.g. ICT/IT). Make ISCO applicable in a wider range of countries and economies. To mend often noted problems with the application of ISCO-88. To produce a minor revision, not a totally different classification.

Analysing occupational information24 Problems with ISCO-88 (1) Unlike its predecessor (ISCO-68), ISCO-88 is primarily skill oriented. However, in practice the major group differentiation does not closely correspond to major ISCED (education) levels. ISCO-68 was more sensitive to employment status (self-employment) and industry.

Analysing occupational information25 Problems with ISCO-88 (2) Despite its stated principles, it is hard to pay tribute to skill level differentiation in manual work. ISCO-88 differentiates between (7000) Craft workers, and (8000) Machine Operators, which is similar, but not the same as Skilled versus Semi-skilled Manual Workers. In addition, many occupations occur both in the 7000 and 8000 categories.

Analysing occupational information26 Problems with ISCO-88 (3) ISCO-88 argued that occupation and employment status are different things and need to be measured separately. As a consequences some employers became classified with their employees, in particular there is no distinction between managing proprietors and managers, and not between working proprietors and their employees.

Analysing occupational information27 Problems with ISCO-88 (4) Managers were organized into three levels: –Corporate managers –Department managers [Production, Support] –General [Small enterprise] managers. The primary distinction here is the number of managers in an organisation, which is not often available in data. It is somewhat hard to classify work supervisors [Foremen] in ISCO-88.

Analysing occupational information28 Problems with ISCO-88 (5) Farmers are hard to classify in ISCO-88, because they appear in 5 places: –Operations Department Manager (1211) –Small Establishment Manager (1311) –Skilled Agricultural Worker (6100) –Subsistence Farmer (6200) –Farm Laborer (9200) None of this corresponds closely to distinctions made in farm work in national classifications.

Analysing occupational information29 Problems with ISCO-88 (6) ISCO-88 is overly broad in (5000) Service and Sales Occupations. In particular (5200) Sales Workers is very undifferentiated.

Analysing occupational information30 Problems with ISCO-88 (7) It is hard to find fitting codes for ‘crude’ occupations: factory worker, skilled worker, foreman, semi-skilled worker, apprentice. However, in some instances, there is no problem if one used major and sub-major groups codes: e.g. (9000) for Unskilled Worker.

Analysing occupational information31 ISCO-08 versus ISCO-88 ISCO-08 groups 10 major 34 sub-major 120 minor 403 unit Total: 567 groups ISCO-88 groups 10 major 28 sub-major 115 minor 363 unit Total 516 groups

Analysing occupational information32 Mergers and Splits Mergers: Many-to-one recodes. Splits: One-to-one recodes. Mergers & splits: Many-to-many recodes. All of these occur when comparing ISCO88 to ISCO08. When we crosswalk from 88 to 08 (and have no further information), only mergers are relevant. When we have ISCO88 and further information (like original verbatim info of original source classification), we also need to consider splits.

Analysing occupational information33 Mergers

Analysing occupational information34 Splits

Analysing occupational information35 Major groups 10 major groups: Essentially unchanged, with minor changes of titles. However: If minor groups have been moved between major groups (see below), this de facto changes major groups too! The major group that is likely most affected by such shifts is (5000) and in particular (5200) Sales Workers, that now contains a number of Elementary Sales Occupations.

Analysing occupational information36 Sub-major groups (2 digits) 34 sub-major groups: expanded from 28 major groups. Truly NEW: –(0100, 0200, 0300) Army ranks (3x) –(9400) Food Preparation Workers Other ‘new’ major groups are ‘upgraded’ or ‘merged’ minor groups. Roughly speaking, about half of the sub-major groups has remained the same, the other half has a different composition than in 1988.

Analysing occupational information37 ICT occupations Altogether, ISCO-08 distinguishes ca. 20 ICT occupations, that occur at several levels: –(2500) ICT Professionals (11x) –(3500) ICT Technicians (5x) –(1330) ICT Service Manager (1x) –(2356) ICT Teachers (1x) –(2434) ICT Sales Professionals (1x) Neither (2500) nor (3500) are new – actually both existed already in ISCO-68!

Analysing occupational information38 Problem 1: Imperfect skill orientation Some ambiguities between (7000) Craft Workers, and (8000) Machine Operators have been removed. An NEW feature is the distinction between (8100) Stationary Machine Operators, and (3130) Process Control Technicians, which probably refers to the complexity of the process / machine controlled / operated.

Analysing occupational information39 Problem 2: Employment status Although somewhat indirect, ISCO-08 has better fitting codes for Large Entrepreneurs and Foreman. There is an ambiguous distinction between (1420) Retail and Wholesale Trade Managers, and (5221) Shop Keepers.

Analysing occupational information40 Problem 3: Managers The implicit reference to firm size (i.e. number of departments) has disappeared, the same things are now referred to by main activity. At the sub-major group level Corporate Managers are now longer grouped with department managers, but with (high) Government Officials. Major changes occur at the 3-digit and 4 digit level. –(1330) ICT Services Managers –(1340) Professional Services Managers (9x)

Analysing occupational information41 Problem 4: Farmers Self-employed farmers can still be coded in as (1310) Managers in Agriculture etc. However, it also remains possible to code them with (6100) Market-oriented Skilled Agricultural Workers. Interestingly, a NEW feature is that (6200) Subsistence Farmers has now four minor groups.

Analysing occupational information42 Problem 5: Crude Sales / Service Sales salespersons are split: –(5221) Shop Keepers –(5222) Shop Supervisors –(5223) Shop Sales Assistants This is an improvement. Also, more levels and locations of sales (market, stall, cashiers) have been regrouped in the sub- major group (5200). This has made the sub-major group even more heterogeneous than it was.

Analysing occupational information43 Interesting.. Cooks are now split up into –(3434) Chef [a “Culinary Associate Professional”] –(5120) Cooks –(9400) Food Preparation Workers (9411) Fast Food Preparers (9412) Kitchen Helper I am very happy with this...

Analysing occupational information44 Problem 6: Crude occupations Some of the new features mend this problem: –“Foreman” can now be classified as (3120) Production Supervisor. –“Shop keeper” can go in two places. –“Skilled Worked” can be more conveniently coded as (7000).

Analysing occupational information45 Interesting... Specialized Secretaries and Office Managers are now in (3000) Associate Professionals. Some new occupations: –(2230) Traditional and Complementary Health Professional –(5245) Service Station Attendant –(7234) Bicycle Repairman –(9334) Shelf Filler –(9412) Kitchen Helper Disappeared: –(2121) Mathematician, Statistician –(6142) Charcoal Burner

Analysing occupational information46 How can we reclassify existing data? A simple conversions of ISCO-88 into ISCO-08 is not possible. Conversion tool will become available, that will do two things at the same time: –Straight recode of ISCO-88 into ISCO-08 (‘best fit’). Truncate trailing decimals, if this is the only thing that you want or can do. –Trailing decimals suggest the amount of alternatives (splits). You will have to consult a separate document to list these options. For this to be usefull you will need original strings or classifications.