GEODE - eSS Manchester, June 2006 Development of a Grid Enabled Occupational Data Environment GEODE – www.geode.stir.ac.ukwww.geode.stir.ac.uk Paper presented.

Slides:



Advertisements
Similar presentations
GEODE - NeSC workshop, Oct 2006 GEODE: Grid Enabled Occupational Data Environment Paul Lambert and Larry Tan University of Stirling
Advertisements

For the e-Stat meeting of 27 Sept 2010 Paul Lambert / DAMES Node inputs.
For the e-Stat meeting of 6-7 April 2011 Paul Lambert / DAMES Node inputs 1)Updates on DAMES 2)Bringing DAMES inputs to e-Stat 3)Misc. feedback - Stat-JR.
DAMES - Data Management through e-Social Science 1 DAMES: Data Management through e-Social Science NCeSS Research Node University of Stirling / University.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
C. Grimme, A. Papaspyrou Scheduling in C3-Grid AstroGrid-D Workshop Project: C3-Grid Collaborative Climate Community Data and Processing Grid Scheduling.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Resource wrappers, web services, grid services Jaspreet Singh School of Computer.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
GEODE Project introduction and summary, 12/12/05 GEODE: Grid Enabled Occupational Data Environment GEODE Project introduction and summary, 12/12/05 Motivation.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
A Data Curation Application Using DDI: The DAMES Data Curation Tool for Organising Specialist Social Science Data Resources Simon Jones*, Guy Warner*,
Globus Computing Infrustructure Software Globus Toolkit 11-2.
ÆKOS: A new paradigm for discovery and access to complex ecological data David Turner, Paul Chinnick, Andrew Graham, Matt Schneider, Craig Walker Logos.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
GEODE, March 2007 Handling Occupational Information and Introduction to GEODE GEODE – Grid Enabled Occupational.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
ADC Meeting ICEO Standards Working Group Steven F. Browdy, Co-Chair ADC Workshop Washington, D.C. September, 2007.
Long Term Ecological Research Network Information System LTER Grid Pilot Study LTER Information Manager’s Meeting Montreal, Canada 4-7 August 2005 Mark.
GEODE, 16 Jan 2007 Curating Occupational Information GEODE – Grid Enabled Occupational Data Environment Session.
GEODE, 16 Jan 2007 Handling Occupational Information and Introduction to GEODE GEODE – Grid Enabled Occupational.
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
GEODE / SSSN, 23 Jan 2008 Handling Occupational Information GEODE – Presentation to Scottish Social Survey Network,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Event-Based Hybrid Consistency Framework (EBHCF) for Distributed Annotation Records Ahmet Fatih Mustacoglu Advisor: Prof. Geoffrey.
OGSA-DAI in OMII-Europe Neil Chue Hong EPCC, University of Edinburgh.
Some comments on using research data in the social sciences Paul Lambert, School of Applied Social Science, University of Stirling, 25 March 2013.
GEODE - Glasgow DCC, Nov 2006 Data curation standards and the messy world of social science occupational information resources Paper presented to the 2nd.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
1 The Importance of Specificity in Occupation-based Social Classifications Paper presented to the Cambridge Stratification Seminar, September 2006.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
Metadata Mòrag Burgon-Lyon University of Glasgow.
interactive logbook Paul Kiddie, Mike Sharples et al. The Development of an Application to Enhance.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
GEODE - Durban ISA RC33, July 2006 Utilising a Grid Enabled Occupational Data Environment GEODE – Paper presented.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Organising social science data – computer science perspectives Simon Jones Computing Science and Mathematics University of Stirling, Stirling, Scotland,
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
Improving User Access to Metadata for Public and Restricted Use US Federal Statistical Files William C. Block Jeremy Williams Lars Vilhuber Carl Lagoze.
GEODE – Sharing Occupational Data Through The Grid Dr. Paul Lambert, Dr. Vernon Gayle, Prof. Ken Prandy, Prof. Richard Sinnott, Prof. Ken Turner, Koon.
The National Grid Service Mike Mineter.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
Open Ag Data : Landscape Analysis ●Who is involved in collecting data on agricultural investments, and from whom? ●How is data publicly shared? Which.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Statistical Information Systems Introducing SIS tool .Stat
DataNet Collaboration
An Overview of Data-PASS Shared Catalog
Ahmet Fatih Mustacoglu
Google Sky.
Presentation transcript:

GEODE - eSS Manchester, June 2006 Development of a Grid Enabled Occupational Data Environment GEODE – Paper presented to the Second International Conference on e-Social Science, Manchester, June 2006 Paul Lambert, Larry Tan, Ken Turner, & Vernon GayleUniversity of Stirling Ken PrandyCardiff University Richard SinnottUniversity of Glasgow

GEODE - eSS Manchester, June 2006 Development of a Grid Enabled Occupational Data Environment 1. Introduction: Occupational Information Activities in two areas: 2. Occupational Information Depository 3. Access to occupational information 4. Conclusions and prospects

GEODE - eSS Manchester, June 2006 What’s the problem? Indexed mainly by Occupational Unit Group (OUG). But… Numerous alternative occupational data files (time; country; format) Alternative OUG schemes; other index factors (‘employment status’) Inconsistent translations to social classifications – ‘by file or by fiat’ Dynamic updates to occupational data resources Low uptake of existing occupational information resources Strict security constraints on users’ micro-social survey data External user (micro-social data) Occ info (index file) (aggregate) User’s output (micro-social data) idougsex.ougCS-MCS-FEGPidougCS I II VIIa

GEODE - eSS Manchester, June 2006 Some illustrative occupational information resources Index units# distinct files (average size kb) Updates? CAMSIS, Local OUG*(e.s.) 200 (100)y CAMSIS value labels Local OUG50 (50)n ISEI tools, home.fsw.vu.nl/~ganzeboom Int. OUG20 (50)y E-Sec matrices Int. OUG*(e.s.) 20 (200)n Hakim gender seg codes (Hakim 1998) Local OUG2 (paper)n

GEODE - eSS Manchester, June 2006 GEODE: Grid Enabled Occupational Data Environment Objectives: 1)Operate as a portal Facilitate linking occupational information to users’ datasets –(initial focus on CAMSIS occupational information resources) GEODE data resources – occupational information data curated as data service in Stirling, accessed by users via portal 2)Create an international Virtual Organization for occupational data community Sharing, indexing, & curating diverse occupational data Other analytical functions on occupational data?

GEODE - eSS Manchester, June 2006 GEODE – Building blocks Globus Toolkit 4 (WSRF implementation) To build grid application services GridSphere (portal framework – JSR 168) OGSA-DAI (data access grid middleware) DDI (social science metadata in XML) Development environment:  Jakarta Tomcat 5.x  Axis SOAP Engine  Java

GEODE - eSS Manchester, June ) Occupational Information Depository Grid as a system that… (e.g. Foster et al 2001) : coordinates resources that are not subject to centralized control uses standard, open, general-purpose protocols and interfaces {delivers non-trivial qualities of service} Use with occupational information depository:  Create a community where members have abstract access to heterogeneous resources securely, and achieve wider collaboration

GEODE - eSS Manchester, June 2006 GEODE - architecture

GEODE - eSS Manchester, June 2006 GEODE: Occupational Information Depository Data Index Service – uses DDI and OGSA-DAI User Requirements / Evaluations Three elements: 1)Semantic data curation 2)Data storage 3)Data indexing / access

GEODE - eSS Manchester, June 2006 Occupational information depository 2.1) Semantic curation of occupational information  Establish a ‘GEODE-M’ meta- data subset (.xml) Founded on Michigan Data Documentation Initiative Minimise curation requirements to suit occ. information resource providers (pilots)  Web proforma entry [via Portal using Gridsphere] Release date Country Time period Author Format Missing data Data extensions OUG variable Other identifier variables Output variables

GEODE - eSS Manchester, June 2006 Occupational information depository 2.2) Storing occupational information resources Considerations: All data stored at GEODE v’s Linkage to external data Proprietary software (plain text / SPSS / STATA) Rectangular index files v’s other formats (e.g. pdf) ‘index file’ format is easy and aids data storage / indexing Finite number of occ info. files / model of plurality of supply International community of data providers Negligible security restrictions (free online resources) Strategy: 1)GEODE-M proforma, suits all formats, completed online 2)Translation to csv index file 3)Modify GEODE-M record for index file  (2) & (3) performed automatically or manually 4)Storage: OGSA-DAI framework to link index files

GEODE - eSS Manchester, June 2006 Occupational information depository OGSA-DAI implementations on prototype service: –Testing dynamic deployment of selected data resources (CAMSIS) Registration with index service (pilot tests) Searchable via portal service –OGSA-DAI evaluations Foundations suited to collation of diverse occ data resources Also facilitates data access functions (see 3) Accepts GEODE-curated resources; externally curated resources; and potential connections with other Grid data services {issues in support for alterative security levels to allow modification of initially deposited resources}

GEODE - eSS Manchester, June 2006 Occupational information depository 2.3) Virtual Organisation for Occupational Information Depository MDS (via GT4) to manage VO access to and distribution of occupational information resources International virtual community Dynamic data supply

GEODE - eSS Manchester, June ) Access to Occupational Information

GEODE - eSS Manchester, June 2006 GEODE portal access 3.1) File linkage mechanisms Multiple occupational variables on (A) Strict security constraints on (A) Inconsistent OUG formats on (A)  Prototype linkages (e.g. CAMSIS) require full access to (A)  Cater to limited access to (A):  Investigate digital certification (X.509) to allow restricted data transfer A_[OUGs] + A_[context]  Requirements analysis Minimal user certification process Avoid application installation by users Users’ complex survey data (e.g. multiple occupational records) Micro-social data (A) ↔ Occupational information resources (B)

GEODE - eSS Manchester, June 2006 GEODE portal access 3.2) Analytical queries Process analytical tasks on aggregate occupational information resources  Summary data –Coverage searches –Summary statistics ? Consider more complex analyses? –CAMSIS derivations –Involve interactive data management tasks –[cf. Nesstar / Data Web]

GEODE - eSS Manchester, June ) Conclusions and prospects Occupational Information Depository OGSA-DAI implementations Index-files annotated through ‘GEODE-M’ Some ongoing manual support requirements Portal framework Accessible GT4 / GSI structures… Curation of occupational data –Contribution: widely used international resources –Semantics: data annotation (DDI) Generic data service –Hinges on numeric OUG index [cf. CASCOT]CASCOT –other application areas – e.g. Education, Geography

GEODE - eSS Manchester, June 2006 GEODE, eScience and eSocial Science Some tentative comparisons... Similar toDiverges from GEODE-M metadataDDI; Data Web; UKDA[IDEAS]; [CAMSIS]; Data depository (OGSA-DAI) ConvertGrid[CASCOT]; [CAMSIS]; [ESDS]; Data Chronicles Data matching service (OGSA-DAI + MDS) [BRIDGES]GEMEDA; Madiera User engagementNesstar; ConvertGridCQeSS