GEODE / SSSN, 23 Jan 2008 Handling Occupational Information GEODE – www.geode.stir.ac.ukwww.geode.stir.ac.uk Presentation to Scottish Social Survey Network,

Slides:



Advertisements
Similar presentations
HIS-CAM - RC28 Spring Testing the universality of historical occupational stratification structures across time and space
Advertisements

The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
GEODE - NeSC workshop, Oct 2006 GEODE: Grid Enabled Occupational Data Environment Paul Lambert and Larry Tan University of Stirling
For the e-Stat meeting of 27 Sept 2010 Paul Lambert / DAMES Node inputs.
For the e-Stat meeting of 6-7 April 2011 Paul Lambert / DAMES Node inputs 1)Updates on DAMES 2)Bringing DAMES inputs to e-Stat 3)Misc. feedback - Stat-JR.
DAMES - Data Management through e-Social Science 1 DAMES: Data Management through e-Social Science NCeSS Research Node University of Stirling / University.
S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Why, what were the idea ? 1.Create a data infrastructure, 2.Data + the knowledge products that are produced on the basis of data a) Efficiant access to.
CHARMCATS: Harmonisation demands for source metadata and output management CESSDA Expert Seminar: Towards the CESSDA- ERIC common Metadata Model and DDI3.
DEVELOPMENT OF CASCOT 5.0 (a multi-language text coding tool) Presentation to the DASISH project meeting, Gothenburg, November 2014 Peter Elias Margaret.
International Standard Classification of Occupations (ISCO 2008) and the measurement of cultural employment UIS Interagency Meeting on Cultural Employment.
GEODE Workshop 16 th January 2007 Issues in e-Science Richard Sinnott University of Glasgow Ken Turner University of Stirling.
GEODE Project introduction and summary, 12/12/05 GEODE: Grid Enabled Occupational Data Environment GEODE Project introduction and summary, 12/12/05 Motivation.
A Data Curation Application Using DDI: The DAMES Data Curation Tool for Organising Specialist Social Science Data Resources Simon Jones*, Guy Warner*,
NCRM, Session 27, 1 July Handling data on occupations, educational qualifications, and ethnicity Paul Lambert & Vernon Gayle, Univ. Stirling Talk.
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
LDA, 11th May Variable constructions in Longitudinal Research: Ethnicity Dr Paul Lambert, University of Stirling Session 2 of the ESRC Research Methods.
GEODE, March 2007 Handling Occupational Information and Introduction to GEODE GEODE – Grid Enabled Occupational.
ESRC - NCRM - Apr Concepts and Measures in occupation-based social classifications Presentation to: ‘Interpreting results from statistical modelling.
Understanding Trends in Occupational Sex Segregation By Daniel Guinea-Martin Advanced Centre for Scientific Research, Spain (formerly at the Office for.
GEODE, 16 Jan 2007 Occupational Analysis – Issues and Examples Grid Enabled Occupational Data Environment GEODE Project workshop, 16 th January 2007 Vernon.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
Assignment 2 1. Don’t forget the Flickr assignment #2 (due end of day today) 2. Don’t forget the Work Practice Diary (to be used for assignment 2) 3. Assignment.
Scientific Data Infrastructure: activities in the Capacities Programme of FP7 Presentation at euroCRIS Workshop, Brussels 15 September 2009 "The views.
GEODE, 16 Jan 2007 Curating Occupational Information GEODE – Grid Enabled Occupational Data Environment Session.
GEODE, 16 Jan 2007 Handling Occupational Information and Introduction to GEODE GEODE – Grid Enabled Occupational.
GEODE - eSS Manchester, June 2006 Development of a Grid Enabled Occupational Data Environment GEODE – Paper presented.
Population Census carried out in Armenia in 2011 as an example of the Generic Statistical Business Process Model Anahit Safyan Member of the State Council.
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
Task Group on development of e-Government indicators (TGEG) 2008 Global Event on Measuring the Information Society Report on e-Government indicators 2008.
EurOccupations Developing a detailed 7-country occupations database for comparative socio-economic research in the European Union Project period: May 2006-May.
1 Occupational Stratification Measures in Harmonised European Surveys Talk prepared for ISA RC28 Spring Meeting, Neuchatel, 7-9 May 2004 Paul Lambert Ken.
Some comments on using research data in the social sciences Paul Lambert, School of Applied Social Science, University of Stirling, 25 March 2013.
GEODE - Glasgow DCC, Nov 2006 Data curation standards and the messy world of social science occupational information resources Paper presented to the 2nd.
1 of 27 How to invest in Information for Development An Introduction Introduction This question is the focus of our examination of the information management.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
1 The Importance of Specificity in Occupation-based Social Classifications Paper presented to the Cambridge Stratification Seminar, September 2006.
Linking by Translation: the key to comparable codesets Ben Hickman Local Government Analysis & Research 19th March 2007.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Project? Microdata? Say what? TRY Conference May 5, 2008 Suzette Giles, Ryerson University Laine Ruus, University of Toronto.
Key variables1 Key Variables: Social Science Measurement and Functional Form Presentation to: ‘ Interpreting results from statistical modelling – A seminar.
GEODE - Durban ISA RC33, July 2006 Utilising a Grid Enabled Occupational Data Environment GEODE – Paper presented.
Economic Research and Policy Analysis Branch May 6, 2010 Access to Business Micro-Data to Support Economic Research and Policy Analysis: Where Do We Go.
The future of Statistical Production CSPA. 50 task team members 7 task teams CSPA 2015 project.
12-1 Links Gateway Vision Jeff Clovis ISI 4 Oct
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Organising social science data – computer science perspectives Simon Jones Computing Science and Mathematics University of Stirling, Stirling, Scotland,
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
Conference on Data Quality for International Organizations Newport, Wales, United Kingdom, 27–28 April 2006 Session 3: Collection, management and dissemination.
GEODE – Sharing Occupational Data Through The Grid Dr. Paul Lambert, Dr. Vernon Gayle, Prof. Ken Prandy, Prof. Richard Sinnott, Prof. Ken Turner, Koon.
Hosted by the University of Regina Library December 1999 DLI Training Workshop Chuck Humphrey.
The European Socio-economic Classification: A Programme of Statistical Co-operation and Harmonisation Workshop on Application of ESeC Bled, June
13-Jul-07 State of the art of the ISCO-08 implementation.
: LSS1 Longitudinal Studies Seminars: Longitudinal Analyses Using STATA Stirling University, Data and Variable Management Paul Lambert.
11 September 2008 Expert group meeting on the scope and content of Social Statistics 1 The Development of Social Statistics in the European Statistical.
State of play and plans by variable Occupation. 2 Policy needs for comparable data on occupations  Indicators on gender segregation used in the follow.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
Tools of data analysis Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 2 on.
Secondary survey data Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 1 on ‘Dealing.
Linking data resources Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 3 on.
SIMD and the flaws of area- based socio-economic profiles Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership.
GEODE, March 2007 Occupational Analysis – the examples of: - the Youth Cohort Study of England & Wales - ‘By Slow Degrees’ - social mobility research Grid.
Occupational data Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 3 on ‘Dealing.
European Commission - DG Eurostat
“CareerGuide for Schools”
WORKSHOP ON THE DATA COLLECTION OF OCCUPATIONAL DATA Luxembourg, 28 November 2008 Occupation as a core variable in social surveys Sylvain Jouhette
Mapping Data Production Processes to the GSBPM
Measuring the very long, fuzzy tail in the occupational distribution
Presentation transcript:

GEODE / SSSN, 23 Jan 2008 Handling Occupational Information GEODE – Presentation to Scottish Social Survey Network, Master Class on ‘Data Analysis using Stata’, 23 rd Jan 2008 [This talk is a minor adaptation of a paper given to the GEODE Project workshop, 16 th Jan 2007] Paul Lambert, Larry Tan, Ken Turner, & Vernon GayleUniversity of Stirling Ken PrandyCardiff University Richard SinnottUniversity of Glasgow

GEODE / SSSN, 23 Jan 2008 Grid Enabled Occupational Data Environment Handling Occupational Information  some principles and problems GEODE activities and illustrations: 1. Occupational Information Depository 2. Access to occupational information

GEODE / SSSN, 23 Jan 2008 Why occupational analyses? (Quotes as reproduced in Coxon and Jones 1978; Crompton 1998) “A man’s work is as good a clue as any to the course of his life and to his social being and identity” (Hughes, 1958) “The backbone of the class structure, and indeed of the entire reward system of modern Western society, is the occupational order” (Parkin, 1972) “Nothing stamps a man as much as his occupation. Daily work determines the mode of life.. It constrains our ideas, feelings and tastes” (Goblot, 1961)

GEODE / SSSN, 23 Jan 2008 Context Occupational information crucial to social science investigation –Social class and social classifications –Employment statistics –Occupations and economics Most nations have facilities for collecting micro- data with occupational codes: –www2.warwick.ac.uk/fac/soc/ier/publications/software/cascot/ We lack accessible and standardised facilities for dealing with occupational micro-data

GEODE / SSSN, 23 Jan 2008 CASCOT (University of Warwick)

GEODE / SSSN, 23 Jan 2008 Occupational information resources: small electronic files… Index units# distinct files (average size kb) Updates? CAMSIS, Local OUG*(e.s.) 200 (100)y CAMSIS value labels Local OUG50 (50)n ISEI tools, home.fsw.vu.nl/~ganzeboom Int. OUG20 (50)y E-Sec matrices Int. OUG*(e.s.) 20 (200)n Hakim gender seg codes (Hakim 1998) Local OUG2 (paper)n

GEODE / SSSN, 23 Jan 2008 For example: ISCO-88 Skill levels classification

GEODE / SSSN, 23 Jan 2008 and: UK 1980 CAMSIS scales and CAMCOM classes

GEODE / SSSN, 23 Jan 2008 Social scientists want to: 1) Produce and disseminate, and access other, Occupational Information Resources 2) Link together their (secure) micro-data with OIR’s External user (micro-social data) Occ info (index file) (aggregate) User’s output (micro-social data) idougsex.ougCS-MCS-FEGPidougCS I II VIIa

GEODE / SSSN, 23 Jan 2008 We are agreed on how to do this: Preservation of two levels of data  Index units: Occupational Unit groups, employment status  Social classifications and other outputs Use of transparent (published) methods [i.e. OIR’s]  for classifying index units  for translating index units into social classifications for instance..  Bechhofer, F 'Occupations' in Stacey, M. (ed.) Comparability in Social Research. London: Heinemann.  Jacoby, A 'The Measurement of Social Class' Proceedings from the Social Research Association seminar on "Measuring Employment Status and Social Class". London: Social Research Association.  Lambert, P.S 'Handling Occupational Information'. Building Research Capacity 4:  Rose, D. and Pevalin, D.J 'A Researcher's Guide to the National Statistics Socio- economic Classification'. London: Sage.

GEODE / SSSN, 23 Jan 2008 …but here come the buts... Inconsistent preservation of source data Alternative OUG schemes SOC-90; SOC-2000; ISCO; SOC-90 (my special version) Inconsistencies in other index factors ‘employment status’; supervisory status; number of employees Individual or household; current job or career Inconsistent exploitation of Occupational Information Resources Numerous alternative occupational information files (time; country; format) Substantive choices over social classifications Inconsistent translations to social classifications – ‘by file or by fiat’ Dynamic updates to occupational information resources Strict security constraints on users’ micro-social survey data Low uptake of existing occupational information resources

GEODE / SSSN, 23 Jan 2008 Stata and handling occupational data Stata users have been much more consistent in occupational coding than other researchers.. ISKO: Stata module to recode 4-digit ISCO-68 occupational codes Stata is fairly well suited to manual occupational coding: Succinct file matching syntax “merge soc using “use “ Proprietary software is problematic: Many existing resources are SPSS format Stata format files don’t share well with other users Stata is too new for some occupational information resources

GEODE / SSSN, 23 Jan 2008 Two reactions and a proposed solution 1. Enforce common standards –In data collection and classification –E.g. Bechhofer 1969; Ganzeboom; Eurostat; ONS …on academic researchers..??!! 2. Give up –No attempt at engaging with published standards  Support plural occupational information resources in an accessible and consistent manner:  Internet facility coordinating OIR’s  GEODE – Grid Enabled Occupational Data Environment

GEODE / SSSN, 23 Jan 2008 GEODE: Grid Enabled Occupational Data Environment Objectives:  Create an international Virtual Organization for occupational data community Sharing, indexing, & curating diverse occupational data  Operate as a user-friendly portal Facilitate non-specialist user’s access to occupational information −Search for and download occupational information −Support linkage from user’s micro-data to OIR’s …and do this by exploiting ‘e-Science’ technologies..

GEODE / SSSN, 23 Jan 2008 DAMES, GEODE and ‘The Grid’ ‘The Grid’ and ‘eScience’: 1. Online Coordination of electronic resources and collaborations  (Distributed computing)  Large scale  Collaborative  Heterogeneous 2. Standard protocols / information management systems UK eSocial Science: 1) Investment in assessing / implementing technology 2) Computationally demanding data analysis 3) Qualitative and quantitative data collection technologies 4) **Data sharing, processing and access** DAMES: project on Data Management through e-Social Science

GEODE / SSSN, 23 Jan 2008 Approaches to analysing occupations - methodologies During data collection:  Efforts in input harmonisation in data collection [e.g. Hoffman 2000; van Leeuwen et al 2003]  Most data models are output harmonisation [e.g. ONS unit linkages; IPUMS; van Deth 2003] During Data analysis: Model of measurement equivalence Same codings from the same index units [Ganzeboom and Treiman 2003] Same codings for different index units [E-SEC; RGSC; EGP] Functional equivalence is rarely reviewed cf. CAMSIS,

GEODE / SSSN, 23 Jan 2008 Rant: The importance of specificity in occupation-based social classifications [Lambert et al 2008] “Occupations are ranked in the same order in most nations and over time...Hout referred to the pattern of invariance as the “Treiman constant”...the Treiman constant may be the only universal sociologists have discovered.” (Hout and DiPrete, 2006:2-3) “the idea of indexing a person’s origin and destination by occupation is weakened if the meaning of being, say, a manual worker is not the same at origin and destination. Historical comparisons become unreliable” (Payne, 1992: 220, cited in Bottero, 2005:65)

GEODE / SSSN, 23 Jan 2008 In practical terms.. Specificity is very challenging: Different occupational information for different countries, time periods, genders Changing occupational information during a project  It is very rare to see social science publications which use a specific approach to occupational data  This is mostly due to computing / data management hurdles…

GEODE / SSSN, 23 Jan 2008 GEODE (1): Occupational information depository Storing occupational information resources Strategy: 1)‘Uncurated’ entry form, suits all formats, completed online 2)Curated entry (performed manually or automatically):  Translation to csv index file  Modify GEODE-M record for index file  Storage: OGSA-DAI framework to link index files

GEODE / SSSN, 23 Jan 2008 n Picture – uploading data file

GEODE / SSSN, 23 Jan 2008

n Picture – searching / downloading – two types of resource

GEODE / SSSN, 23 Jan compare with current practices..

GEODE / SSSN, 23 Jan 2008 GEODE (2): Portal for accessing & linking occupational data Searching and retrieving data GEODE ‘search’ and ‘browse’ facilities Abstracts / descriptions Time periods / countries / occupational units Further developments.. –Improved search/browse algorithms –evaluative information ↔ GEODE data depositor’s VO?

GEODE / SSSN, 23 Jan 2008 Searching – uncurated resources

GEODE / SSSN, 23 Jan 2008 Searching – curated resources

GEODE / SSSN, 23 Jan 2008 GEODE portal access File linkage mechanisms Multiple occupational variables on (A) Strict security constraints on (A) Inconsistent OUG formats on (A)  JAVA application launched on users machine  Simple file matching procedure  Works on resources located at any URI  Continuing development Currently requires plain text input Multiple occ. variables require repeated matching exercises (e.g. husband’s occ.; wife’s occ.) Micro-social data (A) ↔ Occupational information resources (B)

GEODE / SSSN, 23 Jan 2008 Java portal n picture

GEODE / SSSN, 23 Jan 2008 Summary – Handling Occupational Data (1) Text records → OUG data Currently: Text coding software (e.g. CASCOT) Manual look-up GEODE: Linkage to existing resources Further facilities possible but not planned (users typically have adequate resources) (2) OUG data → summary indicators Currently: Numerous aggregate occupational information resources **Bespoke data programming requirements** GEODE: Core provision: management and access of these data resources Service to large volumes of users

GEODE / SSSN, 23 Jan 2008 References: Occupations n Bechhofer, F 'Occupations' in Stacey, M. (ed.) Comparability in Social Research. London: Heinemann (in association with British Sociological Association / Social Science Research Council). n Ganzeboom, H.B.G 'On the Cost of Being Crude: A Comparison of Detailed and Coarse Occupational Coding' in Hoffmeyer-Zlotnick, J.H.P. and Harkness, J. (eds.) Methodological Aspects in Cross-National Research. Mannheim: ZUMA, Nachrichten Spezial. n Ganzeboom, H.B.G. and Treiman, D.J 'Three internationally standarised measures for comparative research on occupational status' in Hoffmeyer-Zlotnick, J.H.P. and Wolf, C. (eds.) Advances in Cross-National Comparison. A European Working Book for Demographic and Socio-Economic Variables. New York: Kluwer Academic Press. n Hoffman, E International statistical comparisons of occupations and social structures: problems, possibilities and the role of ISCO-88. Geneva: International Labour Office. n Hout, M. and DiPrete, T.A 'What we have learned: RC28s contributions to knowledge about social stratification' Research into Social Stratification and Mobility. n Lambert, P.S., Zijdeman, R.L., Maas, I., Prandy, K. and Van Leeuwen, M 'Testing the universality of historical occupational stratifcation structures across time and space' ISA RC-28 on Social Stratification and Mobility, Spring meeting. Nijmegen, Netherlands. n Lambert, P.S., Prandy, K. and Bottero, W 'By Slow Degrees: Two Centuries of Social Reproduction and Mobility in Britain'. Sociological Research Online 12. n Lambert, P.S., Tan, K.L.T., Gayle, V., Prandy, K. and Bergman, M.M forthcoming. 'The importance of specificity in occupation-based social classifications'. International Journal of Sociology and Social Policy. n Marsh, C 'Occupationally Based Measures' in Jacoby, A. (ed.) The Measurement of Social Class. London: Social Research Association. n Payne, G 'Competing views on contemporary social mobility and social divisions' in Burrows, R. and Marsh, C. (eds.) Consumption and Class. Basingstoke: Falmer Press. n Rose, D. and Pevalin, D.J 'A Researcher's Guide to the National Statistics Socio-economic Classification'. London: Sage. n Stewart, A., Prandy, K. and Blackburn, R.M Social Stratification and Occupations. London: MacMillan. n van Leeuwen, M.H.D., Maas, I. and Miles, A HISCO: Historical International Standard Classification of Occupations. Leuven: Leuven University Press.