INFuture2015 Zagreb, 11-13 November 2015 Long-term Preservation of Longitudinal Statistical Surveys in Psycholinguistic Research Hrvoje Stančić Faculty.

Slides:



Advertisements
Similar presentations
1 Statistics Norway Information Architecture – some challenges ODaF meeting, Colchester April 2008 Rune Gløersen Director Department for IT and.
Advertisements

13 September 2012 SDMX Technical Working Group1 Report of the SDMX Technical Standards Working Group SDMX Expert Group Meeting, Paris, September 2012.
Federal Department of Home Affairs FDHA Federal Statistical Office FSO Meeting of the OECD Expert Group on SDMX September, OECD, Paris Centralized.
Business microdata dissemination at Istat Daniela Ichim Luisa Franconi
Meta Dater Metadata Management and Production System for surveys in Empirical Socio-economic Research A Project funded by EU under the 5 th Framework Programme.
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
Mogens Grosen Nielsen Statistics Denmark
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
Manual on Disability Statistics Central Statistics Office Ministry of Statistics & PI Government of India New Delhi.
The European Statistical System Vision Infrastructure Programme Daniel Defays, Director Directorate B, Eurostat Eurostat Workshop on the Modernisation.
by Ha Do Statistical Standard Methodology and ITC Department
Producing and managing metadata Workshop on Writing Metadata for Development Indicators Lusaka, Zambia 30 July – 1 August 2012 Writing Metadata for Development.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
The use and convergence of quality assurance frameworks for international and supranational organisations compiling statistics The European Conference.
Background Data validation, a critical issue for the E.S.S.
WP.5 - DDI-SDMX Integration
Priorities in the Study of Information Sciences Faculty of Humanities and Social Sciences, University of Zagreb, Croatia Ph.D. Sanja Seljan, associate.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
NSI 1 Collect Process AnalyseDisseminate Survey A Survey B Historically statistical organisations have produced specialised business processes and IT.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
M ETADATA OF NATIONAL STATISTICAL OFFICES B ELARUS, R USSIA AND K AZAKHSTAN Miroslava Brchanova, Moscow, October, 2014.
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
Terminology and Standards Dan Gillman US Bureau of Labor Statistics.
4 April 2007METIS Work Session1 Metadata Standards and Their Support of Data Management Needs Daniel W. Gillman Bureau of Labor Statistics Paul Johanis.
Implementation of Eurostat Quality Declarations with Cost- Effective Use of Standards Q European conference on quality in statistics Vienna 2-5 June.
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
« 8-11 July 2008 « Metadata Life Cycle « STATISTICS PORTUGAL.
Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.
BUILDING ON COMMON GROUND: EXPLORING THE INTERSECTION OF ARCHIVES AND DATA CURATION Lizzy Rolando & Wendy Hagenmaier 6/3/2015IASSIST 2015.
Introduction ESDS Qualidata John Southall ESDS Creating and delivering re-usable qualitative data 24 June 2004.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division Introduction to Steven Vale UNECE
Marco Oksman SDMX Transformation Component Applying CSPA.
Slide 1 Eurostat Unit B3 – Statistical Information Technologies CoRD Meeting – 4 June 2007 Agenda Item 8 Preliminary ideas for a 2011 census hub Giuseppe.
Statistical Metadata Strategy and GSIM Implementation in Canada Statistics Canada.
1 1 Developing a framework for standardisation High-Level Seminar on Streamlining Statistical production Zlatibor, Serbia 6-7 July 2011 Rune Gløersen IT.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Record Authenticity as a Measure of Trust: A View Across Records Professions, Sectors, and Legal Systems Corinne Rogers University of British Columbia.
SDMX IT Tools Introduction
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Data for secondary analysis: the experience of the UK Data Archive Hilary Beedham UK Data Archive.
Digitally Signed Records – Friend or Foe? Boris Herceg Hrvoje Brzica Financial Agency – FINA Hrvoje Stančić.
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Open Science and Research – Services for Research Data Management © 2014 OKM ATT 2014–2017 initiative Licenced under.
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
>> Metadata What is it, and what could it be? EU Twinning Project Activity E.2 26 May 2013.
Introduction to Statistics Estonia Study visit of the State Statistical Service of Ukraine on Dissemination of Statistical Information and related themes.
GT1 - MODELOS, FRAMEWORKS E ARQUITETURAS APRESENTAÇÃO DA NORMA – GT4 ISO TS 21547:2010 “Health informatics — Security requirements for archiving of electronic.
© 2016 Chapter 6 Data Management Health Information Management Technology: An Applied Approach.
statistiska_centralbyran_scbwww.linkedin.com/company/scb Panel Session A: Integrating Location in.
Quality declarations Study visit from Ukraine 19. March 2015
Contents Introducing the GSBPM Links to other standards
Interoperable data formats: SDMX
Generic Statistical Business Process Model (GSBPM)
GSBPM, GSIM, and CSPA.
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Social Research Methodology and Supplementary Documentation John Kallas University of the Aegean, Department of Sociology.
Presentation to SISAI Luxembourg, 12 June 2012
The Role of Metadata in Census Data Dissemination
Generic Statistical Information Model (GSIM)
Introducing the Data Documentation Initiative
Presentation of Project Joint meeting of the ESS.VIP.BUS ICT Project
Technical Coordination Group, Zagreb, Croatia, 26 January 2018
Data Architecture project
Palestinian Central Bureau of Statistics
Presentation transcript:

INFuture2015 Zagreb, November 2015 Long-term Preservation of Longitudinal Statistical Surveys in Psycholinguistic Research Hrvoje Stančić Faculty of Humanities and Social Sciences, Zagreb, Croatia Martina Poljičak Central Bureau of Statistics, Zagreb, Croatia Anabela Lendić Faculty of Humanities and Social Sciences, Zagreb, Croatia

Introduction Psycholinguistics ◦ Interdisciplinary nature of the field ◦ Different types of evidence and obtained data ◦ Language-specific research data Aphasia (speech-language pathology) ◦ Aphasic subjects taking part in clinical therapy Aphasia research ◦ Standard informed consent form ◦ Personal information 2

Introduction … Psycholinguistic research  Access to sensitive personal data and its protection in different research phases: ◦ collecting data ◦ PROCESSING data ◦ preserving data (for secondary use) What about ◦ long-term preservation and managment issues concerning data, standards, etc. (?) 3

RESEARCH QUESTION(S) / MOTIVATION Official Statistics (OS) has developed a sophisticated ecosystem of models used by OS organizations for collection, processing, and dissemination of statistics. Could OS models or OS concepts be used in collecting, processing, and preserving health- related digital records? … In aphasia research? 4

DATA COLLECTION  Statistical Classifications (in medicine  ICD, ICF, ICHI, NUTS for territory units, and many others...)  Thesaurus  Nomenclatures  The Neuchâtel Terminology Model (NTM) ◦ provides the framework for the development of a classification database ◦ semantic and conceptual sphere of metadata ◦ not related to technical aspects of a classification database 5

DATA PROCESSING Two categories ◦ restricted ◦ unrestricted Data classified as -confidential data -internal/private data -public data Data (variables) classified as -identifier -quasi-identifier -sensitive attributes -non-sensitive attributes 6

Interoperability and Shareable Artefact Catalogues Interoperability ◦ set of common principles and standards within and between statistical organisations  GSBPM – define business processes in OS  GSIM – conceptual model  set of standardized information objects Global Catalogues ◦ reusable processes, information objects and statistical services ◦ Common Statistical Production Architecture (CSPA) 7

LONG-TERM DATA PRESERVATION Data (and records) should stay at all times: ◦ authentic ◦ reliable ◦ usable, and ◦ its integrity should stay preserved 8

Standards in OS Metadata standard Description Data Documentati on Initiative (DDI) A metadata specification for the social and behavioral sciences created by the Data Documentation Initiative. Used to document data through its lifecycle and to enhance dataset interoperability. Statistical Data and Metadata Exchange (SDMX) A self-describing data format that provides both metadata and a method of data transmission. It is primarily used in "the world of official statistics", such as the EU, WHO, UNESCO, World Bank, and US Reserve Banks. 9

DDI-Lifecycle Model 10

Recommendations (I) Preserve the raw data, but remove variables such as name, social security number, and home address Use Data Disclosure Control Methods ◦ most basic methods for maintaining privacy ◦ include limitation of details, top/bottom coding, suppression, rounding and addition of noise Management system to handle data sensitivity levels and access rights 11

Recommendations (II) A system with functionalities similar to those of Statistical Metadata System (SMS) could be used to manage sensitive health-related data! Access to data objects according to users’ and user groups’ rights! 12

Recommendations (III) Use: ◦ standardized and ◦ globally accepted file formats Assure accessibility according to retention policies! Always have Data/Records Management Plan! 13

CONCLUSION Interdependence between the needs of psycholinguistic research and available models and standards in official statistics Solutions considered here include the ones for ◦ data collection ◦ statistical survey processing, and ◦ records management Knowledge and solutions from official statistics and modern archival science could be combined! 14

THANK YOU! Long-term Preservation of Longitudinal Statistical Surveys in Psycholinguistic Research Hrvoje Stančić Faculty of Humanities and Social Sciences, Zagreb, Croatia Martina Poljičak Central Bureau of Statistics, Zagreb, Croatia Anabela Lendić Faculty of Humanities and Social Sciences, Zagreb, Croatia