DAMES, 31/JAN/2012, T6 Opportunities and prospects in social research Paul Lambert, 31st January 2012 Talk to the seminar Data management in the social.

Presentation transcript:

DAMES, 31/JAN/2012, T6
Opportunities and prospects in social research
Paul Lambert, 31st January 2012
Talk to the seminar 'Data management in the social sciences and the contribution of the DAMES Node', a session organised as part of the Data Management through e-Social Science ESRC research Node

Start by thinking big…
Landes (1969) analysis
  Knowledge-based revolutions
  Importance of standardising technology for cooperation (not just creating it)
  Importance of access to underlying materials – coal, cotton, etc.
  Uneven development (nationally)
Landes, D.S. (1969). The Unbound Prometheus: Technological Change and Industrial Development in Western Europe from 1750 to the Present. Cambridge: Cambridge University Press.
Emergent uses of computing and the internet, such as in e-Science traditions, arguably share similar characteristics
  Standardisation, communication, vast volumes of resources
Social research data, e.g. large scale surveys and other large quantitative resources, exemplify these opportunities

E-Social Science / Digital Social Research
ESRC & JISC initiatives as major UK investment in e-social science technology (see
e-Science broadly involves using emergent computer technologies with enhanced capacities for communication/collaboration & data processing
Handling and displaying large volumes of complex data
  E.g. GeoVue; LifeGuide; DReSS; Obesity e-lab
Resources for computationally demanding analyses
  CQeSS; MoSeS; eStat; NeISS
Standards setting in collaboration, data preparation, data and research support – DRS; MeRC; OeSS; DAMES

Example: Understanding New Forms of Digital Records (DReSS)
[Figure labels: transcribed talk; audio; video; digital records; system logs; location; transcript; code tree; video; system log]

..more examples.. (strategies for social scientists to tap into the e-Infrastructure)
National e-Infrastructure for Social Simulation
  Expert led simulation demonstrations
  Combining data resources
  Workflows for the simulation analysis
  Modify and re-specify existing simulation templates
StatJR: a tool to specify complex statistical models in generic / visual terms
  Multilevel models
  Multiple data permutations and analytical alternatives
  Ready access to a suite of complex modelling tools

e-Science, data management, and research revolutions (!)
Data management through e-Social Science
DAMES ( ) – developing services / resources using e-Science approaches which will help social scientists in undertaking data management tasks
  Information / data retrieval (e.g. GESDE systems)
  Storage and processing of data and metadata (e.g. secure portals and curation and fusion tools)
…Data management is at the centre of transformations in the exploitation of information resources…
  Collaboration / standardisation in constructing empirical results
  Facility to host and distribute new forms of data
  Facility to discriminate between the masses of data

Prospects in social research
The changing terrain of social research and three exciting developments/frontiers:
1) Data access
2) Data management and analysis
3) Log books
Some thoughts on the trajectory of social research developments

1) Access to data..
Example: Accessing surveys via UK Data Archive
  Shibboleth authentication
  Download and analyse in Stata, SPSS, etc

Supplementary (digital) data
E.g. Occupational information resources (OIRs) = data files with information on occupations, which can be usefully linked to micro-data about occupations
  e.g. GEODE acts as a library of OIRs
Such resources are often not widely known about, but have the ability to enhance analysis
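
The kind of linkage described on this slide can be sketched in a few lines. This is only an illustration under stated assumptions: the occupation codes, titles, scores and field names below are invented, and nothing here reflects the actual GEODE file formats or services. It shows a left join of survey micro-data to an occupational information resource on the occupation code, flagging respondents whose code is not found.

```python
# Hypothetical occupational information resource (OIR):
# occupation code -> occupation-level information.
# Codes, titles and scores are invented for illustration only.
OIR = {
    "A01": {"title": "Example occupation A", "scale_score": 62.5},
    "B02": {"title": "Example occupation B", "scale_score": 41.0},
}

# Hypothetical survey micro-data: one record per respondent.
MICRODATA = [
    {"person_id": 1, "occ_code": "A01"},
    {"person_id": 2, "occ_code": "B02"},
    {"person_id": 3, "occ_code": "Z99"},  # code absent from the OIR
]


def link_occupations(records, oir):
    """Left-join micro-data to the OIR on occupation code.

    Every respondent is kept; unmatched codes get a score of None
    and matched=False so they can be inspected rather than lost.
    """
    linked = []
    for rec in records:
        info = oir.get(rec["occ_code"])
        row = dict(rec)  # copy so the input records are not modified
        row["scale_score"] = info["scale_score"] if info else None
        row["matched"] = info is not None
        linked.append(row)
    return linked


linked = link_occupations(MICRODATA, OIR)
unmatched = [row["person_id"] for row in linked if not row["matched"]]
```

Keeping unmatched respondents in the output, rather than dropping them silently, is a deliberate choice in this sketch: unmatched codes are often the first sign that the micro-data and the resource use different versions of a classification.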

Steady accumulation of options / permutations / approaches in…
2a) Data Management
  Pre-analysis (and re-analysis) routines
  Sensitivity analysis
  Standardisation, harmonisation
2b) Data Analysis
  Descriptive tools
  Ongoing development of complex analytical models
  GLLMMs for structural data features, multi-process systems, etc
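
As a rough illustration of what a sensitivity analysis over alternative harmonisation choices might look like, the sketch below recomputes one summary statistic under two different recodes of the same variable. The education categories, the two coding schemes and their names are all hypothetical and are not taken from the talk or from any DAMES tool.

```python
# Hypothetical raw education responses from a survey (invented data).
respondents = ["degree", "diploma", "school", "degree", "diploma", "none"]

# Two alternative harmonisation rules for "higher education".
# Both schemes are assumptions made up for this example.
CODINGS = {
    "narrow": {"degree"},
    "broad": {"degree", "diploma"},
}


def share_high(values, high_categories):
    """Proportion of responses that fall in the 'higher education' set."""
    return sum(value in high_categories for value in values) / len(values)


# Re-run the same summary under each coding to see how much it moves.
results = {name: share_high(respondents, cats) for name, cats in CODINGS.items()}

for name, share in results.items():
    print(f"{name}: {share:.2f}")
```

The point of the pattern is that the recode is a parameter of the analysis rather than a one-off edit to the data, so the permutations can be compared side by side.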

E-Stat ebooks (image from doc in prep., Browne et al. 2011) (Links to product from StatJR)

3) Log books
Software tools for logging work are increasingly well developed
  See our workshops on documentation/replication
Other initiatives in sharing records of work
  E-Stat: Electronic workbooks for the data and model building process
  MyExperiment: Depository for project files
These haven't yet been extensively exploited in survey research – but they should be!
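
To make the idea of a log book concrete, here is a minimal sketch in which each data management step records itself as it runs. It is not how E-Stat workbooks or MyExperiment work; the decorator, step names and sample records are hypothetical, and a real project would probably write the log to a file alongside the data.

```python
import functools

# In-memory log book; each entry describes one data management step.
LOG_BOOK = []


def logged(step_name):
    """Decorator that records a step's name and row counts in LOG_BOOK."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(records):
            result = func(records)
            LOG_BOOK.append({
                "step": step_name,
                "rows_in": len(records),
                "rows_out": len(result),
            })
            return result
        return wrapper
    return decorator


@logged("drop cases with missing age")
def drop_missing_age(records):
    return [r for r in records if r.get("age") is not None]


@logged("recode age into bands")
def recode_age(records):
    return [
        dict(r, age_band="under 40" if r["age"] < 40 else "40 plus")
        for r in records
    ]


# Hypothetical survey records (invented for illustration).
survey = [{"id": 1, "age": 34}, {"id": 2, "age": None}, {"id": 3, "age": 58}]
clean = recode_age(drop_missing_age(survey))

for entry in LOG_BOOK:
    print(entry)
```

Because the log is produced by the same code that transforms the data, it cannot drift out of step with what was actually done, which is the main attraction over a hand-written record.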

The idea of workflows
Workflow modelling has an exciting future..
Workflow documentation
  o MyExperiment [
  o Social survey analysis: Long, J.S. (2009) Workflow of Data Analysis using Stata. CRC press
At present…
  Tool development in process
  Depositing workflows might impose constraints/burdens
DAMES, 31/JAN/2012, T1

Example of using MS Excel for workflow documentation in survey research

Who will take the initiative?
Long, J. S. (2009). The Workflow of Data Analysis Using Stata. Boca Raton: CRC Press.
  1-5: Programming in Stata; 6: Cleaning your data; 7: Analysing data and presenting results; 8: Protecting your work
"Because claims in published papers that additional materials are available from author usually prove false, at least after a few months, the California Center for Population Research at UCLA recently implemented a mechanism by which additional materials, for example, -do- and -log- files, can be attached to papers posted in its Population Working Paper archive. Other research centers are to be encouraged to do the same" (p404 of Treiman (2009) Quantitative Data Analysis. NY: Jossey Bass)
Bespoke solutions or the generic/dynamic approaches of e-Science?

Well-known challenges in survey research
We're data rich, but analysts poor
  UK Data Forum (2007); Wiles et al (2009) (
Under-use of suitably complex statistical models
Coordination and communication on data processing
  Recodes / Standardisation / harmonisation / documentation
  Lack of generic/accessible representation of tasks
Limited disciplinary/project/researcher cross-over when dealing with data
  Specific software orientations
These are not generally problems of scale, but of organisation

Managed solutions?
Data handling/analysis capacity-building
  ESRC programmes (NCRM, RDI, RMP); training workshops/materials; P/G funds; strategic research grant investment
Documentation/replication policies
Software for data access and analysis
  NESSTAR – UK Data Archive data/metadata browser
  Long (2009) on the Stata software
Remote access to data (e.g. SDS)

..train and/or constrain the analysts..
Train them ->

..constrain the analysis..

Social solutions?
Tools and infrastructure for better standards are built up from within (aided by collaborative technologies)
E.g. GESDE, P-ADLS, MethodBox,

Summary
e-Science would often be seen as about enabling effective research in conditions of abundant resources
In practical terms, for social researchers, this means navigating through the vast array of data and analytical resources, and undertaking robust and replicable work
Likely continuation of mix of generic and specific, managed and social, approaches