Object Description by Data Records DDI Moving Forward - IASSIST Sprint, May 2014 Joachim Wackerow Larry Hoyle.

Slides:



Advertisements
Similar presentations
Foundational Objects. Areas of coverage Technical objects Foundational objects Lessons learned from review of Use Case content Simple Study Simple Questionnaire.
Advertisements

Research Methods in Crime and Justice Chapter 4 Classifying Research.
DDI3 Uniform Resource Names: Locating and Providing the Related DDI3 Objects Part of Session: DDI 3 Tools: Possibilities for Implementers IASSIST Conference,
DDI URN Enabling identification and reuse of DDI metadata IDSC of IZA/GESIS/RatSWD Workshop: Persistent Identifiers for the Social Sciences Joachim Wackerow.
DDI 3.0 Conceptual Model Chris Nelson. Why Have a Model Non syntactic representation of the business domain Useful for identifying common constructs –Identification,
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
Research Methods in MIS
Codebook Centric to Life-Cycle Centric In the beginning….
Managing the Metadata Lifecycle The Future of DDI at GESIS and ICPSR Peter Granda, ICPSR Meinhard Moschner, GESIS Mary Vardigan, ICPSR Joachim Wackerow,
Reducing Metadata Objects Dan Gillman November 14, 2014.
 Name and organization  Have you worked with DDI before? (2 or 3)  If not, are you familiar with XML?  What kind of CAI systems do you use?  Goals.
Data Management: Documentation & Metadata Types of Documentation.
The education variables in the European Social Survey: Advantages in using the DDI for documentation Hilde Orten and Hege Midtsæter Norwegian Social Science.
Department of Computer Science 1 CSS 496 Business Process Re-engineering for BS(CS)
Modernizing the Data Documentation Initiative (DDI-4) Dan Gillman, Bureau of Labor Statistics Arofan Gregory, Open Data Foundation WICS, 5-7 May 2015.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Data Documentation Initiative (DDI): Goals and Benefits Mary Vardigan Director, DDI Alliance.
Data Mining Techniques
Implementing Digital Object Identifiers at the GESIS Data Archive for the Social Sciences Workshop “Persistent Identifiers for the Social Sciences” Bonn,
Locating objects identified by DDI3 Uniform Resource Names Part of Session: Concurrent B2: Reports and Updates on DDI activities 2nd Annual European DDI.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
NSI 1 Collect Process AnalyseDisseminate Survey A Survey B Historically statistical organisations have produced specialised business processes and IT.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
DDI Profiles to Support Software Development as well as Data Exchange and Analysis NADDI , Vancouver (Canada) David Schiller, IAB Ingo Barkow,
DDI Lifecycle: Moving Forward Outcome of the Recent Workshop in Dagstuhl Joachim Wackerow.
Experiences with the Design and Development Process of DDI Requirements for Future Work Ideas for Improvement Joachim Wackerow.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
State Term Contract & State Purchasing Agreement Website Innovative Ideas towards Improving Your Buying Experience DMS State Purchasing IT Team.
Research Data Management At the Smithsonian Using SIdora Nano Tech Working Group May 15, 2014.
3 rd Annual European DDI Users Group Meeting, 5-6 December 2011 The Ongoing Work for a Technical Vocabulary of DDI and SDMX Terms Marco Pellegrino Eurostat.
IMS Proof of Concept for Data Capture using Metadata Bryan Fitzpatrick Rapanea Consulting Limited June 2014.
PHDD – An RDF Vocabulary for the Physical Data Description Work in Progress Joachim Wackerow and Thomas Bosch (both GESIS – Leibniz Institute for the Social.
DDI-RDF Discovery Vocabulary A Metadata Vocabulary for Documenting Research and Survey Data Linked Data on the Web (LDOW 2013) Thomas Bosch.
Leveraging the DDI Model for Linked Statistical Data in the Social, Behavioural, and Economic Sciences DC Thomas Bosch GESIS – Leibniz.
1 Chapter 1 Research Methods When sociologists do quantitative research, they generally use either surveys or precollected data.quantitative research Qualitative.
DDI-RDF Leveraging the DDI Model for the Linked Data Web.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
© 2012 The MITRE Corporation. All rights reserved. For internal MITRE use 13 June 2013 Meeting #3 hData Record Format Taskforce 1 © 2012 The MITRE Corporation.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Study Discovery in Support of the Data Without Boundaries Initiative, the NIH Data Documentation Index and Infonomics Jay Greenfield Booz Allen Hamilton.
United Nations Regional Seminar on Census Data Archiving for Africa, Addis Ababa, Ethiopia, September, 2011 Documentation and Cataloguing in Data.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
UML-1 8. Capturing Requirements and Use Case Model.
Colectica: A Platform for DDI 3 based Metadata Management Design. Collect. Share.
Some Thoughts on DDI4 and Qualitative Data Larry Hoyle revised 8/13/2015DDI4 for Qualitative, Hoyle 1.
DDI and the Lifecycle of Longitudinal Surveys Larry Hoyle, IPSR, Univ. of Kansas Joachim Wackerow, GESIS - Leibniz Institute for the Social Sciences.
Introduction to the Semantic Web and Linked Data
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
COLECTICA FOR EXCEL: USING DDI LIFECYCLE WITH SPREADSHEETS NADDI 2013.
ABS Issues for DDI Futures Bryan Fitzpatrick October 2012.
Qualitative Data in DDI. What is “Qualitative Data” Text, mixed mode documents Images Video Physical Objects 10/5/2015Qualitative data in DDI - Hoyle.
Improving User Access to Metadata for Public and Restricted Use US Federal Statistical Files William C. Block Jeremy Williams Lars Vilhuber Carl Lagoze.
DDI Moving Forward: Sprint 1 Schloss Dagstuhl October 28-November 1, 2013.
DDI Lifecycle and Qualitative Data: Development of a Formal Model Arofan Gregory Joachim Wackerow.
Research Methodology II Term review. Theoretical framework  What is meant by a theory? It is a set of interrelated constructs, definitions and propositions.
Marion Wittenberg – DANS Merja Karjalainen – SND.
Software Packaging for Reuse James Marshall (INNOVIM), Code 614.5, NASA GSFC The Software Packaging for Reuse document (version 1.0), developed and recently.
Publishing DDI-Related Topics Advantages and Challenges of Creating Publications Joachim Wackerow EDDI16 - 8th Annual European DDI User Conference Cologne,
Documenting, Maintaining, and Sharing
DDI Extensions for Qualitative Data NADDI 2013
Data Management: Documentation & Metadata
Alternatives for Representing Coding of Qualitative Data in DDI
Session 2: Metadata and Catalogues
Question Banks, Reusability, and DDI 3.2 (Use Parameters)
in the data production process
Science: Learning Experience wested
Questasy: Documenting and Disseminating Longitudinal Data Online with DDI 3 Edwin de Vet 5/21/2019.
Presentation transcript:

Object Description by Data Records DDI Moving Forward - IASSIST Sprint, May 2014 Joachim Wackerow Larry Hoyle

Excursion: Data Record for Survey Data Record unit is a person Variables (like edctn ) describe a record unit

Record Types Record units: person, household, family – I.e. person id necessary Multiple records per unit – Unit id necessary Event data: unit documents spans of time ending in an event – Person id necessary

„Community Profiles“ Demographic variables (Social science) classifications like ISCO, ISCED Unit identifiers for describing relationships (i.e. person/household)

Description of Object Segments by Data Records Larry Hoyle had this idea while a stay at GESIS in November 2013 Larry presented at NADDI 2014: „Alternatives for Representing Coding of Qualitative Data in DDI“Alternatives for Representing Coding of Qualitative Data in DDI

Codes and Memos “Dinner site”“Presentations” Segments can also be marked in text. Like in this text example. “Marking”“Text reference” Memo: This is marked here for an example in a PowerPoint Presentation Hoyle, Alternatives for Representing Coding of Qualitative Data, NADDI2014

Example Records SegmentIDSegmentNameCampusBuildingEveningVenueYearBuiltAddressCluster1MiningVar 1AlumniCenter Oread Ave StudentUnion JayHawk Blvd.10.8 Dataset Hoyle, Alternatives for Representing Coding of Qualitative Data, NADDI2014

Example Record - Codes Hoyle, Alternatives for Representing Coding of Qualitative Data, NADDI2014 SegmentIDSegmentNameCampusBuildingEveningVenueYearBuiltAddressCluster1MiningVar 1AlumniCenter Oread Ave StudentUnion Jayhawk Blvd.10.8 CodesCodes are handled like “Select all that apply” questions. Could have Categories and Memo here as well 1=has code 0=does not

Example Record – Other Variables SegmentIDSegmentNameCampusBuildingEveningVenueYearBuiltAddressCluster1MiningVar 1AlumniCenter Oread Ave StudentUnion Jayhawk Blvd.10.8 Codes Quantitative Variables Text Mining Variables Hoyle, Alternatives for Representing Coding of Qualitative Data, NADDI2014

Data or Metadata? SegmentIDSegmentNameCampusBuildingEveningVenueYearBuiltAddressCluster1MiningVar 1AlumniCenter Oread Ave StudentUnion Jayhawk Blvd.10.8 Depends on use, doesn’t it? (c.f. NSA use of phone “metadata”) YearBuilt might be metadata when searching for image segments or text descriptions of old buildings It might be data if we used the text mining variables to predict building age Metadata? Data? Hoyle, Alternatives for Representing Coding of Qualitative Data, NADDI2014

Possible Usage of Description by Data Records in Future „Any“ object as record unit: – Segment („qualitative“ data) – New data sources – New instruments – New methods Approach most powerful together with process description (interplay with process model)

Object Description by Data Records Summary I Another way of a model „extension“ Description of any objects where no wide agreement exists Description of objects which are unknown at design time of model No advanced modelling skills required, less work than a formal model extension – Entity–relationship modelling useful for more complex usage Can be driven by user communities – Best practices and specific profiles per domain are useful – Data record descriptions in resource package enable reuse and sharing

Object Description by Data Records Summary II Easy data entry (spreadsheet) Only software built for a particular community can understand the data records Proof of concept (theoretical): description of a questionnaire Similar approach as in ROLAP (Relational Online Analytical Processing) Could be later transformed to a model extension when semantics are understood and widely agreed

Limitations of Model Extensions Can only describe current understanding of a domain Should rely on widely accepted understanding Requires modelling skills and work for integration in existing model DDI cannot describe all possible data sources, instruments, methods, and data objects – One possible solution: DDI can rely on metadata standards of other domains. This is not always the case.

Ideas on Future Work Required machine-actionable additions for DDI 4: – Purpose of „logical product“ – Type/role of logical record – Type/role of variable – Description of relationship between variables – Hooks for process description? Qualitative: identification of purely qualitative objects, possible candidates for DDI 4