Presentation is loading. Please wait.

Presentation is loading. Please wait.

NCRM, Session 27, 1 July 20081 Handling data on occupations, educational qualifications, and ethnicity Paul Lambert & Vernon Gayle, Univ. Stirling Talk.

Similar presentations


Presentation on theme: "NCRM, Session 27, 1 July 20081 Handling data on occupations, educational qualifications, and ethnicity Paul Lambert & Vernon Gayle, Univ. Stirling Talk."— Presentation transcript:

1 NCRM, Session 27, 1 July 20081 Handling data on occupations, educational qualifications, and ethnicity Paul Lambert & Vernon Gayle, Univ. Stirling Talk to the workshop ‘Resources for Data Management and Handling Social Science Data’ ESRC Research Methods Festival, Oxford, 1 July 2008

2 NCRM, Session 27, 1 July 20082 Handling variables DAMES project (www.dames.org.uk) - specialist data services on three major social science topics (occupations, education, ethnicity)www.dames.org.uk ‘GE*DE’ – ‘Grid Enabled Specialist Data Environments’ From: www.geode.stir.ac.ukwww.geode.stir.ac.uk

3 NCRM, Session 27, 1 July 20083 Handing social science variables – general themes Common v’s best practice –Recording the derivation/variable construction process –Reviewing alternative measures Comparability (between contexts - countries, times) –Input or output harmonisation? –Measurement or functional equivalence? –See esp. ‘Variable constructions in longitudinal research’, http://www.longitudinal.stir.ac.uk/variables/ http://www.longitudinal.stir.ac.uk/variables/ –Existing standards of National Statistics Institutes and international bodies (during data collection)

4 NCRM, Session 27, 1 July 20084 Handling variables – general themes, ctd. The unit of analysis –Individual, spouse, household, etc. –Current time; career summary, etc. Concept and measures –Variety of academic preferences –NSI standard measures

5 NCRM, Session 27, 1 July 20085 Key variables: concepts and measures VariableConceptSomething useful OccupationClass; stratification; unemployment www.geode.stir.ac.uk EducationCredentials; Ability; Meritwww.equalsoc.org/8 Ethnic groupEthnicity; race; religion; national origins [Bosveld et al 2006] AgeAge; life course stage; cohort Abbott, A. (2006) ‘Mobility: What, when, how?’, in Morgan et al., Mobility & Inequality, Stanford UP. GenderGender; household / family context www.genet.ac.uk IncomeIncome; wealth; poverty;www.data-archive.ac.ukwww.data-archive.ac.uk [SN 3909]

6 NCRM, Session 27, 1 July 20086 Key variables: comments & speculation (from www.longitudinal.stir.ac.uk/variables/Coefficients.html )www.longitudinal.stir.ac.uk/variables/Coefficients.html a) Data manipulation skills and inertia I would speculate that around 80% of applications using key variables don’t consult literature and evaluate alternative measures, but choose the first convenient and/or accessible variable in the dataset  Data supply decisions (‘what is on the archive version’) are critical Much of the explanation lies with lack of confidence in data manipulation / linking data Too many under-used resources – cf. www.esds.ac.ukwww.esds.ac.uk

7 NCRM, Session 27, 1 July 20087 b) Software and key variables – a personal view Stata is the superior package for secondary survey data analysis: Advanced data management and data analysis functionality Supports easy evaluation of alternative measures (e.g. est store) Culture of transparency of programming/data manipulation Problems with Stata Not available to all users {Slow estimation times}

8 NCRM, Session 27, 1 July 20088 c) Endogeneity and key variables ‘everything depends on everything else’ [Crouchley and Fligelstone 2004] We know a lot about simple properties of key variables –Key variables often change the main effects of other variables –Simple decisions about contrast categories can influence interpretations –Interaction terms are often significant and influential We have only scratched the surface of understanding key variables in multivariate context and interpretation –Key variables are often endogenous (because they are ‘key’!) –Work on standards / techniques for multi-process systems and/or comparing structural breaks involving key variables is attractive

9 NCRM, Session 27, 1 July 20089 d) Social science variables and functional form Functional form = the way in which measures are arithmetically incorporated in quantitative analysis  With occupations, education, ethnicity, and elsewhere, we tend to be too willing to make simplifying categorisations  An alternative - scaling and relative positions – is better suited for complex analytical procedures

10 NCRM, Session 27, 1 July 200810 1. Data and research on occupations In the social sciences, occupation is seen as one of the most important things to know about a person  Direct indicator of economic circumstances  Proxy Indicator of ‘social class’ or ‘stratification’ GEODE – how social scientists use data on occupations DAMES – extending GEODE resources Expanding range Improving usability

11 Stage 1 - Collecting Occupational Data (and making a mess) Example 1: BHPS Occ descriptionEmployment statusSOC-2000EMPST Miner (coal)Employee81227 Police officer (Serg.)Supervisor33126 Electrical engineerEmployee21237 Retail dealer (cars)Self-employed w/e12342 Example 2: European Social Survey, parent’s data Occ descriptionSOC-2000EMPST Miner?8122?6/7 Police officer?3312?6/7 Engineer?? Self employed businessman???1/2

12 NCRM, Session 27, 1 July 200812 www.geode.stir.ac.uk/ougs.html

13 13 Occupations: we agree on what we should do: Preserve two levels of data  Source data: Occupational unit groups, employment status  Social classifications and other outputs Use transparent (published) methods [i.e. OIR’s]  for classifying index units  for translating index units into social classifications for instance..  Bechhofer, F. 1969. 'Occupations' in Stacey, M. (ed.) Comparability in Social Research. London: Heinemann.  Jacoby, A. 1986. 'The Measurement of Social Class' Proceedings from the Social Research Association seminar on "Measuring Employment Status and Social Class". London: Social Research Association.  Lambert, P.S. 2002. 'Handling Occupational Information'. Building Research Capacity 4: 9-12.  Rose, D. and Pevalin, D.J. 2003. 'A Researcher's Guide to the National Statistics Socio- economic Classification'. London: Sage.

14 14 …in practice we don’t keep to this... Inconsistent preservation of source data Alternative OUG schemes SOC-90; SOC-2000; ISCO; SOC-90 (my special version) Inconsistencies in other index factors ‘employment status’; supervisory status; number of employees Individual or household; current job or career Inconsistent exploitation of Occupational Information Numerous alternative occupational information files (time; country; format) Substantive choices over social classifications Inconsistent translations to social classifications – ‘by file or by fiat’ Dynamic updates to occupational information resources Strict security constraints on users’ micro-social survey data Low uptake of existing occupational information resources

15 NCRM, Session 27, 1 July 200815 GEODE provides services to help social scientists deal with occupational information resources 1)disseminate, and access other, Occupational Information Resources 2)Link together their (secure) micro-data with OIR’s External user (micro-social data) Occ info (index file) (aggregate) User’s output (micro-social data) idougsex.ougCS-MCS-FEGPidougCS 11101. 6058I 111060. 23201. 6971II 232069. 33202.8743951VIIa 332071. 48741. 4 39. 58742. 5 51.

16 NCRM, Session 27, 1 July 200816 Occupational information resources: small electronic files about OUGs… Index units# distinct files (average size kb) Updates? CAMSIS, www.camsis.stir.ac.uk Local OUG*(e.s.) 200 (100)y CAMSIS value labels www.camsis.stir.ac.uk Local OUG50 (50)n ISEI tools, home.fsw.vu.nl/~ganzeboo m Int. OUG20 (50)y E-Sec matrices www.iser.essex.ac.uk/esec Int. OUG*(e.s.) 20 (200)n Hakim gender seg codes (Hakim 1998) Local OUG2 (paper)n

17 NCRM, Session 27, 1 July 200817 For example: ISCO-88 Skill levels classification

18 NCRM, Session 27, 1 July 200818 and: UK 1980 CAMSIS scales and CAMCON classes

19 NCRM, Session 27, 1 July 200819 Summary on occupations and data management Extensive debate about occupation-based social classifications Document your procedures....as you may be asked to do something different.. If you need to choose between occupation-based measures… –They all measure, mostly, the same things –Don’t assume concepts measure measures Lambert, P. S., & Bihagen, E. (2007). Concepts and Measures: Empirical evidence on the interpretation of ESeC and other occupation-based social classifications. Paper presented at the ISA RC28 conference, Montreal (14- 17 August), www.camsis.stir.ac.uk/stratif/archive/lambert_bihagen_2007_version1.pdf. www.camsis.stir.ac.uk/stratif/archive/lambert_bihagen_2007_version1.pdf

20 NCRM, Session 27, 1 July 200820

21 NCRM, Session 27, 1 July 200821

22 NCRM, Session 27, 1 July 200822 July 2008: Existing resources on occupations Popular websites: http://www2.warwick.ac.uk/fac/soc/ier/publications/software/cascot/ http://home.fsw.vu.nl/~ganzeboom/pisa/ www.iser.essex.ac.uk/esec/ www.camsis.stir.ac.uk/occunits/distribution.html Emerging resource: http://www.geode.stir.ac.uk/http://www.geode.stir.ac.uk/ Some papers: –Chan, T. W., & Goldthorpe, J. H. (2007). Class and Status: The Conceptual Distinction and its Empirical Relevance. American Sociological Review, 72, 512- 532. –Rose, D., & Harrison, E. (2007). The European Socio-economic Classification: A New Social Class Scheme for Comparative European Research. European Societies, 9(3), 459-490. –Lambert, P. S., Tan, K. L. L., Gayle, V., Prandy, K., & Bergman, M. M. (2008). The importance of specificity in occupation-based social classifications. International Journal of Sociology and Social Policy, 28(5/6), 179-192.

23 NCRM, Session 27, 1 July 200823 Using data on occupations – further speculation Growing interest in longitudinal analysis and use of longitudinal summary data on occupations Intuitive measures (e.g. ever in Class I)  Lampard, R. (2007). Is Social Mobility an Echo of Educational Mobility? Sociological Research Online, 12(5). Empirical career trajectories / sequences  Halpin, B., & Chan, T. W. (1998). Class Careers as Sequences. European Sociological Review, 14(2), 111-130. Growing cross-national comparisons –Ganzeboom, H. B. G. (2005). On the Cost of Being Crude: A Comparison of Detailed and Coarse Occupational Coding. In J. H. P. Hoffmeyer-Zlotnick & J. Harkness (Eds.), Methodological Aspects in Cross-National Research (pp. 241- 257). Mannheim: ZUMA, Nachrichten Spezial. Treatment of the non-working populations Seldom adequate to treat non-working as a category ‘Selection modelling’ approaches expanding

24 NCRM, Session 27, 1 July 200824 2. Data and research on education Although there have been standardisation attempts, data on an individual’s level of education is notoriously difficult to collect and compare between studies Between countries Between regions Between time periods Even between short time periods (Example of the UK Youth Cohort Study)

25 NCRM, Session 27, 1 July 200825 In international research.. There are two leading standards ISCED www.unesco.org/education/information/nfsunesco/doc/isced_1997.htm CASMIN education http://www.equalsoc.org/publications/show/40 –But not all researchers adopt them, or are satisfied with them when they do

26 NCRM, Session 27, 1 July 200826 In UK research.. There are some recommended standard data collection schemes… Simplified measure (‘other primary standard’) at: www.statistics.gov.uk/about/data/harmonisation/ www.statistics.gov.uk/about/data/harmonisation/..but many studies build up unstandardised data on highest levels of qualifications Often hundreds of unique qualification titles Little standardisation on relative levels Many surveys collect multiple response data (multiple qualifications held by an individual)

27 NCRM, Session 27, 1 July 200827 BHPS example

28 NCRM, Session 27, 1 July 200828 Family and Working Lives Survey (54 vars per educ record)

29 29 Data on education levels cf. occupations Underlying qualification units There are few obvious ‘educational unit groups’ There are many publicly defined alternative schemes Manipulation of educational data Few published ‘educational information resources’ Many open-access sources of data about educational qualifications –e.g. national statistics website reports There has been less previous recognition of value of standardisation –Though this is emerging in comparative research Educational data is dynamic and rapidly expanding

30 NCRM, Session 27, 1 July 200830 Educational data and cohort change A critical consideration concerns cohort change in educational qualifications and distributions Appreciating relative value of education level given context Multivariate analytical procedures Mean benefit of education within cohort?

31 NCRM, Session 27, 1 July 200831 Summary on education and data management We should document measures because.. Some way away from agreeing on preferred measures Dynamic nature of educational distributions Debate between categorisers and scorers… Some useful resources: Schneider, Silke L. (ed.) (2008), The International Standard Classification of Education (ISCED-97). An Evaluation of Content and Criterion Validity for 15 European Countries. Mannheim: MZES. ISBN 978-3-00-024388-2The International Standard Classification of Education (ISCED-97). An Evaluation of Content and Criterion Validity for 15 European Countries. ISMF educational databases and recodes: http://home.fsw.vu.nl/hbg.ganzeboom/ISMF/ismf.htm http://home.fsw.vu.nl/hbg.ganzeboom/ISMF/ismf.htm

32 NCRM, Session 27, 1 July 200832 3. Data and research on ethnicity Rapid growth in social science interest, and data, on ‘ethnic minority groups’, ‘immigration’, ‘immigrants’ Data includes: –Generic & specialist studies collecting ethnic ‘referents’  ‘ethnic identity’; ‘nationality’, parents’ nationality; country of birth; language spoken; religion; ‘race’ National research and data management: –Most countries have evolving standard definitions of ethnic groups International research and data management –Seen as highly problematic in many fields except immigration data –Lambert, P.S. (2005). Ethnicity and the Comparative Analysis of Contemporary Survey Data. In J. H. P. Hoffmeyer-Zlotnick & J. Harkness (Eds.), Methodological Aspects in Cross-National Research (pp. 259-277). Manheim: ZUMA-Nachrichten Spezial 11.

33 NCRM, Session 27, 1 July 200833

34 NCRM, Session 27, 1 July 200834

35 NCRM, Session 27, 1 July 200835 UK: ONS & ESDS data guides Input harmonisation within decades Output harmonisation between decades Bosveld, K., Connolly, H., & Rendall, M. S. (2006). A guide to comparing 1991 and 2001 Census ethnic group data. London: Office for National Statistics. –Academic strategies – ad hoc ‘black’ group, etc –Addition of extra categories over time –Mixed ethnicities, marriages… UK Focus on ‘ethnic identity’, lack of attention to alternative referents

36 NCRM, Session 27, 1 July 200836 Comparative research solutions? Measurement equivalence might be achieved by: Survey data collection Connecting related groups Longitudinal linkage Functional equivalence for categories: Simplified categorical distinctions Immigrant cohorts Scaling ethnic categories

37 NCRM, Session 27, 1 July 200837 Ethnicity and the DAMES project Hard subject to collate information on  Few recognisable ‘ethnic unit groups’  Limited previous ‘data management’ reflection  Very few published databases on ethnicity  Important question of sparse distributions  Dynamic, & rapidly expanding Likely role is to give new guidance on emerging strategies for analysing and exploiting data

38 NCRM, Session 27, 1 July 200838 Concluding summary: Handling data on occupations, educational qualifications and ethnicity Principles for data management: 1)Keep clear records –Recodes and transformations 2)Use existing standards 3)Do something, not nothing –Distributional differences by cohorts 4)Learn how to match files −Exploiting wider resources / other research


Download ppt "NCRM, Session 27, 1 July 20081 Handling data on occupations, educational qualifications, and ethnicity Paul Lambert & Vernon Gayle, Univ. Stirling Talk."

Similar presentations


Ads by Google