Introducing da|raSearchNet The integrated data portal for the social sciences Tanja Friedrich, Brigitte Hausstein, Daniel Hienert, GESIS
What’s inside? Context; da|raSearchNet: idea and goals; Development of da|raSearchNet; Outlook
Culture of data sharing Influence of data-driven research on scientific discovery: The Fourth Paradigm of research has produced a data deluge. Hey et al. (2009) use “The Fourth Paradigm” to describe the influence of data-intensive research on scientific discovery. The term originated in a research group around computer scientist Jim Gray (Lynch 2009, p. 177). The term “data deluge” describes “the imminent flood of scientific data” in the context of collaborative e-science (Hey/Trefethen 2003, p. 809). It has been taken up by numerous authors to illustrate the challenge of dealing with growing research data output.
Culture of data sharing Increasingly, data producers are obliged to publish their data for purposes of results validation and reuse. In its “Principles and Guidelines for Access to Research Data from Public Funding”, the OECD states as its first aim and objective to “Promote a culture of openness and sharing of research data among the public research communities within member countries and beyond” (OECD 2007). Funding agencies around the world at least oblige researchers to archive the data they produce for a defined time span (the German Research Foundation requires a minimum of 10 years, cf. Deutsche Forschungsgemeinschaft 1998, p. 12) and support data sharing (e.g. the German Research Foundation alongside other German research bodies, cf. Alliance of German Science Organisations 2010). Other agencies, such as the US National Science Foundation, expect grantees to share their primary data, among other information, with other researchers (National Science Foundation 2012, chapter VI.D.4) or, as in the case of the British Economic and Social Research Council, make data sharing mandatory “to the maximum extent possible” (Economic and Social Research Council 2010, p. 2).
Data sharing infrastructure Data sharing depends on infrastructure: data repositories, data catalogues, … For data producers as well as data reusers. Deposit safe by hugovk is licensed under CC BY-NC-SA 2.0
Data sharing infrastructure Examples: DataCite, EUDAT, figshare, … Never comprehensive, not cross-linked
Data sharing infrastructure Even within disciplinary boundaries, the landscape of data sharing infrastructure is scattered and diversified …
Rationale Present situation when searching for social science data: knocking on many doors Vác Gates & Doors - The "Portrait" Collection by istvan is licensed under CC BY-NC-ND 2.0
Rationale Need for one comprehensive service for secondary researchers looking for social science data: da|raSearchNet. The Arched Entrance by highlights6 is licensed under CC BY-NC-ND 2.0
Point of departure da|ra registration agency … a DOI-service primarily aimed at making data citable:
Point of departure da|ra is a service for data producers rather than data reusers
Point of departure Metadata search in the da|ra database is the nucleus for da|raSearchNet
da|ra – where we stand Comprehensive, discipline-specific metadata for citation of datasets; basic search functionality; focus on German and US data (GESIS and ICPSR)
da|raSearchNet – what we aim at Acquire more publication agents who register their data via da|ra
da|raSearchNet – what we aim at Include more entries for data that are not being registered via da|ra (automation supported)
da|raSearchNet – what we aim at Integration of metadata from other services
da|raSearchNet – what we aim at Improved search with value-added services
How to develop da|raSearchNet International networking and cooperation; metadata standardization; metadata exchange; search engine technology
International networking and cooperation Incorporation of further international programmes; inclusion of metadata from international sources; cooperation within RDA
Metadata standardization Further development of the da|ra 3.0 metadata scheme; RDA WG: interoperability of da|ra and Data-PASS, cross-platform discovery based on the DDI model
Metadata exchange Several other data repositories and portals offer free access to their metadata via OAI-PMH or Web API. exchange_place by _SiD_ is licensed under CC BY-NC-ND 2.0
Metadata exchange da|raSearchNet will feature an OAI harvester for harvesting metadata and an API for exchanging metadata with other services
Metadata exchange Researchers who are looking for data can search all these metadata in one interface: da|raSearchNet
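To make the harvesting idea concrete, here is a minimal sketch of the two core steps of an OAI-PMH harvest: building a ListRecords request and reading Dublin Core titles from the response. It is not the da|raSearchNet implementation. The function names, the base URL and the sample response are invented for illustration; only the protocol verb, the argument names and the XML namespaces come from the OAI-PMH 2.0 specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL (hypothetical helper)."""
    if resumption_token:
        # resumptionToken is an exclusive argument: send it without metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (records, next_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        identifier = rec.findtext(
            "oai:header/oai:identifier", default="", namespaces=NS
        )
        titles = [t.text for t in rec.iterfind(".//dc:title", NS) if t.text]
        records.append({"identifier": identifier, "titles": titles})
    token = root.findtext(".//oai:resumptionToken", default="", namespaces=NS)
    # An empty or missing token means the list is complete.
    return records, (token.strip() or None)


# Invented sample response; a real harvester would fetch this over HTTP.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example survey dataset</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

records, next_token = parse_list_records(SAMPLE)
```

A full harvester would loop: request a page, store the records, and repeat with the returned resumption token until none is left.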
Search engine technology Improvement of the existing da|ra search; adapting the search to an increased metadata pool; simple and advanced search, faceted search
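As an illustration of what faceted search means for a metadata pool, the following sketch filters a small in-memory list of records and counts the values of each facet among the hits. It is a toy model rather than the planned search engine. The function name, the field names (`title`, `publisher`, `language`) and the sample records are assumptions made for this example.

```python
from collections import Counter


def faceted_search(records, query="", filters=None,
                   facet_fields=("publisher", "language")):
    """Return matching records plus value counts per facet (hypothetical helper)."""
    filters = filters or {}
    q = query.lower()
    hits = [
        r for r in records
        # Simple substring match on the title; an empty query matches everything.
        if q in r.get("title", "").lower()
        # Every selected facet value must match exactly.
        and all(r.get(field) == value for field, value in filters.items())
    ]
    # Count facet values over the hits only, so the counts reflect the current result set.
    facets = {f: Counter(r[f] for r in hits if f in r) for f in facet_fields}
    return hits, facets


# Invented sample records for illustration.
RECORDS = [
    {"title": "Election Study 2009", "publisher": "GESIS", "language": "de"},
    {"title": "Election Panel", "publisher": "ICPSR", "language": "en"},
    {"title": "Health Survey", "publisher": "GESIS", "language": "en"},
]

hits, facets = faceted_search(RECORDS, "election")
```

Selecting a facet value then narrows the result, for example `faceted_search(RECORDS, filters={"publisher": "GESIS"})`. A production system would typically delegate this to a search engine with built-in facet support.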
Search engine technology Added value for data seekers: personalized services (search history, watch list, RSS feed, …), search term recommender, content syndication
Outlook Cooperation within RDA has started. The search interface is already being improved. Further development still depends on approval of the grant proposal.
Thank you for your attention! Tanja Friedrich (tanja.friedrich@gesis.org), Brigitte Hausstein (brigitte.hausstein@gesis.org), Daniel Hienert (daniel.hienert@gesis.org)