GEODE, March 2007 Handling Occupational Information and Introduction to GEODE GEODE – www.geode.stir.ac.ukwww.geode.stir.ac.uk Grid Enabled Occupational.

Slides:



Advertisements
Similar presentations
The Economic and Social Data Service (ESDS) Kevin Schürer ESDS/UKDA ESDS Awareness Day 5 December 2003.
Advertisements

The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
GEODE - NeSC workshop, Oct 2006 GEODE: Grid Enabled Occupational Data Environment Paul Lambert and Larry Tan University of Stirling
For the e-Stat meeting of 27 Sept 2010 Paul Lambert / DAMES Node inputs.
For the e-Stat meeting of 6-7 April 2011 Paul Lambert / DAMES Node inputs 1)Updates on DAMES 2)Bringing DAMES inputs to e-Stat 3)Misc. feedback - Stat-JR.
Obesity e-Lab Enabling obesity research using the Health Surveys for England: The Obesity e-Lab project Dexter Canoy The University of Manchester
DAMES - Data Management through e-Social Science 1 DAMES: Data Management through e-Social Science NCeSS Research Node University of Stirling / University.
S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
An O*NET Academy Briefing: Tools and Technology for In- Demand Occupations Presented by Dr. Janet Wall Sr. Trainer, O*NET Academy.
Why, what were the idea ? 1.Create a data infrastructure, 2.Data + the knowledge products that are produced on the basis of data a) Efficiant access to.
SYSTEM PROGRAMMING & SYSTEM ADMINISTRATION
International Standard Classification of Occupations (ISCO 2008) and the measurement of cultural employment UIS Interagency Meeting on Cultural Employment.
Classifications and CASCOT Ritva Ellison Institute for Employment Research University of Warwick.
GEODE Workshop 16 th January 2007 Issues in e-Science Richard Sinnott University of Glasgow Ken Turner University of Stirling.
GEODE Project introduction and summary, 12/12/05 GEODE: Grid Enabled Occupational Data Environment GEODE Project introduction and summary, 12/12/05 Motivation.
A Data Curation Application Using DDI: The DAMES Data Curation Tool for Organising Specialist Social Science Data Resources Simon Jones*, Guy Warner*,
Shirley Crompton Source: Rob Allan. Institutional Repository Subject Repository Data Producer Repository share resources solve bigger problems integrate.
Embedding NVivo in postgraduate social research training Howard Davis & Anne Krayer 6 th ESRC Research Methods Festival 8-10 July 2014.
NCRM, Session 27, 1 July Handling data on occupations, educational qualifications, and ethnicity Paul Lambert & Vernon Gayle, Univ. Stirling Talk.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
Project Proposal: Academic Job Market and Application Tracker Website Project designed by: Cengiz Gunay Client: Cengiz Gunay Audience: PhD candidates and.
LEMMA: Learning Environment in Multilevel Modelling and Applications Fiona Steele London School of Economics & Political Science Director NCRM LEMMA node,
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
ESRC - NCRM - Apr Concepts and Measures in occupation-based social classifications Presentation to: ‘Interpreting results from statistical modelling.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
GEODE, 16 Jan 2007 Curating Occupational Information GEODE – Grid Enabled Occupational Data Environment Session.
GEODE, 16 Jan 2007 Handling Occupational Information and Introduction to GEODE GEODE – Grid Enabled Occupational.
GEODE - eSS Manchester, June 2006 Development of a Grid Enabled Occupational Data Environment GEODE – Paper presented.
Usability Issues Documentation J. Apostolakis for Geant4 16 January 2009.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
CRM Prep Workshop Part 3 Records Systems, Storage and Retrieval.
GEODE / SSSN, 23 Jan 2008 Handling Occupational Information GEODE – Presentation to Scottish Social Survey Network,
Innovations in Data Dissemination Thomas L. Mesenbourg, Jr. Acting Director U.S. Census Bureau United Nations Seminar on Innovations in Official Statistics.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
Some comments on using research data in the social sciences Paul Lambert, School of Applied Social Science, University of Stirling, 25 March 2013.
GEODE - Glasgow DCC, Nov 2006 Data curation standards and the messy world of social science occupational information resources Paper presented to the 2nd.
South Africa Case Study Update Matile Malimabe Executive Manager: Standards Acting Executive Manager: Data Management & Technology.
Session 8: Statistical Infrastructure Joseph Ilboudo UNECA/ACS Workshop Review of RRSF Implementation.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
1 The Importance of Specificity in Occupation-based Social Classifications Paper presented to the Cambridge Stratification Seminar, September 2006.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Solutions using Microsoft Content Management Server 2002 Connector for SharePoint Technologies Sue Corke Mark Harrison Microsoft UK.
GEODE - Durban ISA RC33, July 2006 Utilising a Grid Enabled Occupational Data Environment GEODE – Paper presented.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
Economic Research and Policy Analysis Branch May 6, 2010 Access to Business Micro-Data to Support Economic Research and Policy Analysis: Where Do We Go.
The future of Statistical Production CSPA. 50 task team members 7 task teams CSPA 2015 project.
12-1 Links Gateway Vision Jeff Clovis ISI 4 Oct
Organising social science data – computer science perspectives Simon Jones Computing Science and Mathematics University of Stirling, Stirling, Scotland,
Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
GEODE – Sharing Occupational Data Through The Grid Dr. Paul Lambert, Dr. Vernon Gayle, Prof. Ken Prandy, Prof. Richard Sinnott, Prof. Ken Turner, Koon.
Hosted by the University of Regina Library December 1999 DLI Training Workshop Chuck Humphrey.
13-Jul-07 State of the art of the ISCO-08 implementation.
Developing GRID Applications GRACE Project
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
Tools of data analysis Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 2 on.
Linking data resources Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 3 on.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
GEODE, March 2007 Occupational Analysis – the examples of: - the Youth Cohort Study of England & Wales - ‘By Slow Degrees’ - social mobility research Grid.
Occupational data Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 3 on ‘Dealing.
New features in KE EMu 3.1 and beyond
Karen Dennison Collections Development Manager
Ahmet Fatih Mustacoglu
Mapping Data Production Processes to the GSBPM
Presentation transcript:

GEODE, March 2007 Handling Occupational Information and Introduction to GEODE GEODE – Grid Enabled Occupational Data Environment [Session 1 of GEODE Project workshop, 16 th January 2007] Paul Lambert, Larry Tan, Ken Turner, & Vernon GayleUniversity of Stirling Ken PrandyCardiff University Richard SinnottUniversity of Glasgow

GEODE, March 2007 Grid Enabled Occupational Data Environment 1. Handling Occupational Information  some principles and problems GEODE activities and illustrations: 2. Occupational Information Depository 3. Access to occupational information

GEODE, March 2007 Why occupational analyses? (Quotes as reproduced in Coxon and Jones 1978; Crompton 1998) “A man’s work is as good a clue as any to the course of his life and to his social being and identity” (Hughes, 1958) “The backbone of the class structure, and indeed of the entire reward system of modern Western society, is the occupational order” (Parkin, 1972) “Nothing stamps a man as much as his occupation. Daily work determines the mode of life.. It constrains our ideas, feelings and tastes” (Goblot, 1961)

GEODE, March 2007 Context Occupational information crucial to social science investigation –Social class and social classifications –Employment statistics –Occupations and economics Most nations have facilities for collecting micro- data with occupational codes: –www2.warwick.ac.uk/fac/soc/ier/publications/software/cascot/ We lack accessible and standardised facilities for dealing with occupational micro-data

GEODE, March 2007 Occupational information resources: small electronic files… Index units# distinct files (average size kb) Updates? CAMSIS, Local OUG*(e.s.) 200 (100)y CAMSIS value labels Local OUG50 (50)n ISEI tools, home.fsw.vu.nl/~ganzeboom Int. OUG20 (50)y E-Sec matrices Int. OUG*(e.s.) 20 (200)n Hakim gender seg codes (Hakim 1998) Local OUG2 (paper)n

GEODE, March 2007 For example: ISCO-88 Skill levels classification

GEODE, March 2007 and: UK 1980 CAMSIS scales and CAMCOM classes

GEODE, March 2007 Social scientists want to: 1) Produce and disseminate, and access other, Occupational Information Resources 2) Link together their (secure) micro-data with OIR’s External user (micro-social data) Occ info (index file) (aggregate) User’s output (micro-social data) idougsex.ougCS-MCS-FEGPidougCS I II VIIa

GEODE, March 2007 We are agreed on how to do this: Preservation of two levels of data  Index units: Occupational Unit groups, employment status  Social classifications and other outputs Use of transparent (published) methods [i.e. OIR’s]  for classifying index units  for translating index units into social classifications for instance..  Bechhofer, F 'Occupations' in Stacey, M. (ed.) Comparability in Social Research. London: Heinemann.  Jacoby, A 'The Measurement of Social Class' Proceedings from the Social Research Association seminar on "Measuring Employment Status and Social Class". London: Social Research Association.  Lambert, P.S 'Handling Occupational Information'. Building Research Capacity 4:  Rose, D. and Pevalin, D.J 'A Researcher's Guide to the National Statistics Socio- economic Classification'. London: Sage.

GEODE, March 2007 …but here come the buts... Inconsistent preservation of source data Alternative OUG schemes SOC-90; SOC-2000; ISCO; SOC-90 (my special version) Inconsistencies in other index factors ‘employment status’; supervisory status; number of employees Individual or household; current job or career Inconsistent exploitation of Occupational Information Resources Numerous alternative occupational information files (time; country; format) Substantive choices over social classifications Inconsistent translations to social classifications – ‘by file or by fiat’ Dynamic updates to occupational information resources Strict security constraints on users’ micro-social survey data Low uptake of existing occupational information resources

GEODE, March 2007 Two reactions and a proposed solution 1. Enforce common standards –In data collection and classification –E.g. Bechhofer 1969; Ganzeboom; Eurostat; ONS …on academic researchers..??!! 2. Give up –No attempt at engaging with published standards  Support plural occupational information resources in an accessible and consistent manner  Internet facility coordinating OIR’s  GEODE – Grid Enabled Occupational Data Environment

GEODE, March 2007 GEODE: Grid Enabled Occupational Data Environment Objectives:  Create an international Virtual Organization for occupational data community Sharing, indexing, & curating diverse occupational data  Operate as a user-friendly portal Facilitate non-specialist user’s access to occupational information −Search for and download occupational information −Support linkage from user’s micro-data to OIR’s …and do this by exploiting ‘e-Science’ technologies..

GEODE, March 2007 ‘The Grid’ as a new technology ‘The Grid’ and ‘eScience’: 1. Online Coordination of electronic resources and collaborations  (Distributed computing)  Large scale  Collaborative  Heterogeneous 2. Standard protocols / information management systems UK eSocial Science: 1) Investment in assessing / implementing technology 2) Computationally demanding data analysis 3) Qualitative and quantitative data collection technologies 4) **Data sharing, processing and access**

GEODE, March 2007 GEODE, eScience and eSocial Science Some tentative comparisons... Similar toDiverges from GEODE-M metadataDDI; Data Web; UKDA[IDEAS]; [CAMSIS]; Data depository (OGSA-DAI) ConvertGrid[CASCOT]; [CAMSIS]; [ESDS]; Data Chronicles Data matching service (OGSA-DAI + MDS) [BRIDGES]GEMEDA; Madiera User engagementNesstar; ConvertGridCQeSS

GEODE, March 2007 GEODE - architecture

GEODE, March ) Occupational information depository Storing occupational information resources Considerations: All data stored at GEODE v’s Linkage to external data Proprietary software (plain text / SPSS / STATA) Rectangular index files v’s other formats (e.g. pdf) ‘index file’ format favoured Finite number of occ info. files / model of plurality of supply International community of data providers Negligible security restrictions (free online resources) Strategy: 1)‘Uncurated’ entry form, suits all formats, completed online 2)Curated entry (performed manually or automatically):  Translation to csv index file  Modify GEODE-M record for index file  Storage: OGSA-DAI framework to link index files

GEODE, March 2007 n Picture – uploading data file

GEODE, March 2007

n Picture – searching / downloading – two types of resource

GEODE, March compare with current practices..

GEODE, March ) Accessing and linking Occupational Information

GEODE, March 2007 GEODE portal access 3.1) Searching and retrieving data GEODE ‘search’ and ‘browse’ facilities Abstracts / descriptions Time periods / countries / occupational units Further developments.. –Improved search/browse algorithms –evaluative information ↔ GEODE data depositor’s VO?

GEODE, March 2007 Searching – uncurated resources

GEODE, March 2007 Searching – curated resources

GEODE, March 2007 GEODE portal access 3.2) File linkage mechanisms Multiple occupational variables on (A) Strict security constraints on (A) Inconsistent OUG formats on (A)  JAVA application launched on users machine  Simple file matching procedure  Works on resources located at any URI  Continuing development Currently requires plain text input Multiple occ. variables require repeated matching exercises (e.g. husband’s occ.; wife’s occ.) Micro-social data (A) ↔ Occupational information resources (B)

GEODE, March 2007 Java portal n picture

GEODE, March ) Example: NS-SEC to SOC90 2 sources of original translation known to GEODE: –(a) ONS (2002), NS-SEC translation matrices, –(b) Prandy (2001) CAMSIS scores for UK,  GEODE curation –(a) added to GEODE in March 2007 –(b) added to GEODE in December 2006  Access and linkage –Search facility – locate (a) and (b) –File matching facility

GEODE, March 2007 Original data (a)

GEODE, March 2007 Original data (a)

GEODE, March 2007 Original data (b)

GEODE, March 2007 Original data (b)

GEODE, March 2007 Searching on GEODE – ‘uncurated data’

GEODE, March 2007 Searching on GEODE – ‘uncurated data’

GEODE, March 2007 Searching on GEODE – ‘curated data’

GEODE, March 2007 Searching on GEODE – ‘curated data’

GEODE, March 2007 Example – curated data resource as used by GEODE

GEODE, March 2007 GEODE file matching – preparing to link data

GEODE, March 2007 Matching data – JAVA application, CAMSIS data

GEODE, March 2007 Ukempst?

GEODE, March 2007 Matching data – JAVA application, ONS data

GEODE, March 2007 Plain text files…

GEODE, March 2007 Some results….

GEODE, March ) Summary – Handling Occupational Data (1) Text records → OUG data Currently: Text coding software (e.g. CASCOT) Manual look-up GEODE: Linkage to existing resources Further facilities possible but not planned (users typically have adequate resources) (2) OUG data → summary indicators Currently: Numerous aggregate occupational information resources Bespoke data programming requirements GEODE: Core provision: management and access of these data resources Service to large volumes of users

GEODE, March 2007 GEODE Strategy - Conclusions and prospects Occupational Information Depository OGSA-DAI implementations Index-files annotated through ‘GEODE-M’ Some ongoing manual support requirements File linkage mechanisms JAVA application – still some user actions needed Generic data service –Hinges on numeric OUG index [cf. CASCOT]CASCOT –other application areas – e.g. Education, Geography Why use the Grid? –E-Science project –Data linkage procedures –Data depository mechanisms (OGSA-DAI) –Extent of manual input?

GEODE, March 2007 GEODE – user uptake High potential demand Numerous queries on occupational data management Numerous researchers wishing to distribute occupational data First GEODE services not yet user-friendly Carrots –High demands for easier access and review  Sticks –Poor standards of many previous research which neglects good review of occupational information  Hurdles –Change research cultures in social science disciplines(?)