Megan Force Editor, Data Citation Index

Slides:



Advertisements
Similar presentations
VO Sandpit, November 2009 Data Citation, Principles and Practice Sarah DataCite Annual Conference, 2014.
Advertisements

Lorrie Apple Johnson Lead Librarian, Information Analysis & Services Office of Scientific and Technical Information (OSTI) National Academy of Sciences.
Web of Science Search and Navigation in the Web of Knowledge
THE DATA CITATION INDEX & DATACITE NIGEL ROBINSON 26 AUGUST 2014.
Versioning Requirements and Proposed Solutions CM Jones, JE Brace, PL Cave & DR Puplett OR nd April
Data citation from the perspective of a scholarly publisher Lyubomir Penev TDWG Data Citation Workshop, New Orleans, Oct 2011 ViBRANT.
IDENTIFIERS & THE DATA CITATION INDEX DISCOVERY, ACCESS, AND CITATION OF PUBLISHED RESEARCH DATA NIGEL ROBINSON 17 OCTOBER 2013.
INCITES PLATFORM NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION (NOAA)
Data Citation Index Todd King PDS/PPI UCLA Megan Force Digital Research Analyst - Physical Science Thomson Reuters.
JRC's Open Access (OA) Policy G. P. Tartaglia, A. Annoni, G. Merlo, F
Data Publishing Workflows: Strategies and Standards
Release 4 of the COUNTER Code of Practice for e- Resources and new usage- based measures of impact Peter Shepherd COUNTER May 2014.
T H O M S O N S C I E N T I F I C Editorial Development James Testa, Director.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
THE DATA CITATION INDEX AN INNOVATIVE SOLUTION TO EASE THE DISCOVERY, USE AND ATTRIBUTION OF RESEARCH DATA MEGAN FORCE 22 FEBRUARY 2014.
Guillaume Rivalle APRIL 2014 MEASURE YOUR RESEARCH PERFORMANCE WITH INCITES.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
IL Step 1: Sources of Information Information Literacy 1.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
Bibliometrics toolkit: ISI products Website: Last edited: 11 Mar 2011 Thomson Reuters ISI product set is the market leader for.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Web of Science® Krzysztof Szymanski October 13, 2010.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
1. 2 Rewards are real … but few (yet) 3 The citation benefit intensified over time... ...with publications from 2004 and 2005 cited 30 per cent more.
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
What is data citation & why do we care? What’s been happening here and overseas? How ready are you for data citation? 1 Welcome! Image:
Date, location Open Access policy guidelines for research institutions Name Logo area.
Data Citation Implementation Pilot Workshop
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
ICSU-WDS & RDA Data Publication Services WG. 2 Linking Research Data and the Literature: why? Why link? 1.Increase visibility & discoverability of research.
The Thomson Reuters Journal Selection Policy – Building Great Journals - Adding Value to Web of Science Maintaining and Growing Web of Science Regional.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
INTRODUCTION TO BIBLIOMETRICS 1. History Terminology Uses 2.
19th international symposium on Theses and Dissertations Data and Dissertations July 2016, Lille, France Dr. Jamal Alsalmi Sultan Qaboos University.
NRF Open Access Statement
Open Research Data and Open Access publications: How do they sit in the Web of Science? Guillaume Rivalle, Manager, Europe solution specialists
Demonstrating Scholarly Impact: Metrics, Tools and Trends
Measuring Scholarly and Public Impact: Let’s Talk Metrics
Bibliometrics toolkit: Thomson Reuters products
Peter Shepherd COUNTER March 2012
Research software best practices: Transparency, credit, and citation
Open access as a means to produce high quality data Anja Gassner Head Research Method Group Sentinel Landscape Coordinator FTA World Agroforestry Centre.
ACS 2016 Moving research forward with persistent identifiers
Outstanding Metadata Issues Affecting Data Citation Accuracy
Changing Practices… Changing Values
Working with your archive organization Broadening your user community
Linking persistent identifiers at the British Library
Emerging Sources Citation Index
CNI Spring 2010 Membership Meeting
Open Access to your Research Papers and Data
THE OFFICE FOR SCHOLARLY COMMUNICATION/ Responsible Metrics at Kent
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
Standards For Collection Management ALCTS Webinar – October 9, 2014
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Research Data Management
COUNTER Update February 2006.
Bird of Feather Session
WISER: Citiation searching
Donna Haraway: ‘Cyborg Manifesto’
Developing Institutional Data Repositories
Isid.research.ac.ir
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Knowledge Domains & Communities of Practice
Presentation transcript:

Megan Force Editor, Data Citation Index Outstanding Metadata Issues Affecting Data Citation Accuracy Megan Force Editor, Data Citation Index May 2018

Introduction - Clarivate Analytics We have a 60-year legacy of curating the most authoritative knowledgebase, including the Web of Science, a custodian of 100 years’ worth of research Over 7,000 leading research institutions and scholarly publishers use our products

Journal Citation Reports Specialist Literature Web of Science platform covers nearly 33,000 journals and other scholarly output including books, conference proceedings, datasets, and patents Web of Science – the most accurate and comprehensive resource for the world The Web of Science provides access to an unrivalled breadth of global research literature linked to a rigorously selected core of world class journals, ensuring a unique combination of discovery through meticulously captured metadata and citation connections, coupled with quality, impact and neutrality. Web of Science Journal Citation Reports 32,840 Total journals in Web of Science* ~11,500 journals in over 230 science and social sciences disciplines Regional Collections Total Cites Journal Impact Factor (JIF) Five-Year Journal Impact Factor Immediacy Index Cited Half-Life Citable Items JIF Percentile Eigenfactor® Metrics 18,245 Core Collection* 8,888 SCIE 3,256 SSCI 1,785 AHCI 5,369 ESCI 9,950 Specialist Literature* 5,358 BIOSIS Citation Index 4,941 Zoological Record 1,038 FSTA 4,338 INSPEC 5,533 Medline 7,945 CABI 4,646 Regional Collections* Specialist Literature Core Collection 18,245 journals from recognized globally significant journal lists and emerging sources Core Collection Databases: SCIE – Science Citation Index Expanded (included in JCR) SSCI – Social Sciences Citation Index (included in JCR) AHCI – Arts & Humanities Citation Index ESCI – Emerging Sources Citation Index * Unique journal count across databases Conferences 191K+ Books 81K+ Datasets 7M+ Patent Records 70M+

Data Data repository Facts collected for reference or analysis Definitions Data Facts collected for reference or analysis Non traditional scholarly output of scientific research often analysed in traditional research publications. May include numerical, textual, image, video or software information Data repository An online resource where data are deposited and stored for preservation and access

Data Citation Indexing: Transparency , Reuse, Credit • Enables research conclusions to be verified and validated • Makes reproducibility of premises and results possible • Exposes data findings and their value to a wider audience • Ensures a mechanism for receiving credit for scholarly work and an opportunity for tracking/ translating such attribution into rewards

Much work to do! Typical Data citation Introduction – data Lack of consensus across data repositories Varying quality across data repositories Evidence Data Citation index content from 350 data repositories 7M data records 6.5M citations Differences between disciplines Lack of curation Much work to do! Typical Data citation Armenteras, Dolors; Gibbes, Cerian; Anaya, Jesus; Davalos, Liliana (2017): R script to compare models to forest loss alert system. Dryad. http://dx.doi.org/10.5061/DRYAD.1925K/5

Data Citation Index: Example Record

Data Citation Metadata Data Citation Metadata Elements Data Citation Metadata Following established journal citation practices only gets us so far, as data and other non-traditional scholarly output feature unique and subtle differences with respect to identification and interpretation Various efforts are underway with respect to specific implementation of guidelines, yet there remain some gaps

Data citations: dates before digitization

Citation dates from periods before digitization Dates Before Digitization Citation dates from periods before digitization Datasets that were not ‘born digital’ May be the original publication date of a dataset which is now found online Data may be highly valuable (e.g. 19th century glacier data contributing to studies of climate change effects)

Pre-digitization citation dates: DataCite recommendations Dates Before Digitization Pre-digitization citation dates: DataCite recommendations

Dates before digitization: questions Do older dates in this context cause confusion? Do they make sense for certain disciplines? When does a previously non-digital publication become a data publication?

Data citations: determination of authorship, publishing entity

Author entity identification Data Citation Authorship Author entity identification Identifying the proper author entity for citation purposes may require significant negotiation with data repository: custodians, curators, etc, listed, but who is the author? Fundamental question of credit; vital for metrics/analytics

Author identification: questions Data Citation Authorship Author identification: questions How is ‘data author‘ defined for citation purposes? What can be done to ensure that data author becomes a more recognized, even mandatory concept/element?

Publishing entity identification Data Publication Publishing entity identification DataCite, https://schema.datacite.org/meta/kernel-3.1/doc/DataCite-MetadataKernel_v3.1.pdf Publisher element may be broadly defined; necessary to identify primary curator in order to avoid confusion/duplication of records

How is ‘data publisher‘ defined for citation purposes? Data Publication Publisher: questions How is ‘data publisher‘ defined for citation purposes?

Data citations: versioning practices

Undertaken after observing wide variation in versioning practices DCI versioning study Undertaken after observing wide variation in versioning practices Key metadata element for reproducibility 72% of all data repositories in DCI were found to have no version information for datasets

Where is version information found? Versioning Practices Where is version information found? A significant amount of version information is not readily available through a dedicated metadata tag Version information may be concatenated with dataset title, or may be found in the dataset identifier (accession number, DOI, etc)

Versioning Practices Versions in citations? Only 26% of repositories which employ versioning include version in a recommended data citation Versions are included in formal data citations for a greater share of these repositories

Versioning Practices Takeaways Repository policies with respect to versioning are not displayed on database websites Metadata vs. data versioning is generally unclear/difficult to determine Versioning practices may vary significantly even within the datasets of a single data repository While versioning practices are being adopted by repositories in the interest of correct citation and reproducibility, little or no guidance exists regarding best practice at a cross-disciplinary or discipline-specific level

Versioning practices: questions Should a single data item with multiple versions have only one citable record (reproducibility based on version information included in the citation blurb), or should a new DOI be issued for each version? Does this depend on format/discipline?

Data citations: dates after the current date

Citation dates after the current date Future Citation Dates Citation dates after the current date Metadata for datasets that are not yet accessible is being made available for harvest by outside groups; included in metadata feeds which primarily describe already-published datasets ‘Published dates’ for these datasets are listed as in the future Sometimes this data is specified as being under embargo until a specific date; other times no embargo is specified but there is a statement to the effect that the data will be made available within the next 6 months, year, etc. Intersection of publisher/repository/funder policies for data availability

Citation dates after the current date: questions Future Citation Dates Citation dates after the current date: questions Is there friction between stakeholder policies for data availability (timing of article publication vs. data object publication, etc.)? Should dataset metadata be provided to indexers, etc, for data that has not yet been published?

Wrap up: consensus and next steps Data Citation Metadata Elements Wrap up: consensus and next steps “Technical mechanisms for citation are only surface characteristics of the knowledge infrastructures in which they are embedded” - Christine Borgman Big Data, Little Data, No Data: Scholarship in the Networked World (2015)

Megan Force, Editor, Data Citation Index | (215) 823-6194 | megan Megan Force, Editor, Data Citation Index | (215) 823-6194 | megan.force@clarivate.com | clarivate.com