Linking Data from ScienceDirect Articles Presented by: IJsbrand Jan Aalbersberg Hannover, DataCite Meeting Date: June 8, 2010.

Presentation transcript:

Linking to & from Data from & to ScienceDirect Articles
Presented by: IJsbrand Jan Aalbersberg
Hannover, DataCite Meeting
Date: June 8, 2010

Linking Data in ScienceDirect
- The Past
  - Supplementary data
  - Entity links to databases
- The Present
  - Some considerations
  - PANGAEA-type linking
- A Future
  - Getting even more closely connected

The Past (supplementary data)
- Raw research data delivered as supplementary data
- Available for a limited number of data set types / formats
- Data distributed over multiple articles and publishers
- Format frozen in time, not maintained for preservation
- Only available for smaller data sets (at most a few tens of MB)
- Limited access due to use of existing publishing platforms
- Data and article remain nicely coupled / packaged
- Supplementary data always being peer-reviewed

The Past (entity linking - manual)
- Authors manually identify (and tag) entities that are mentioned in articles and for which associated data is present (or registered) in databases such as GenBank, MINT, UniProt, PDB, CCDC, ...
- Very accurate and unambiguous
- However, requires author effort
- Publisher takes care of actual linking
- Reciprocal linking usually taken care of

The Past/Present (entity linking - automatic)
- Sometimes automatically (e.g., NextBio and Reflect)
- Easily extendable to new / other entities
- Works retrospectively on older content
- Does create recall / precision errors
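The recall / precision trade-off of automatic tagging can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the pattern (one uppercase letter plus five digits), the sample sentence, and the identifiers are invented, and this is not how NextBio or Reflect work. A pattern-based tagger is compared against what an author would have tagged by hand.

```python
import re

# Assumed, deliberately simplistic pattern: one uppercase letter + five digits.
ACCESSION_LIKE = re.compile(r"\b[A-Z]\d{5}\b")


def tag_entities(text):
    """Return the set of tokens the automatic tagger would turn into links."""
    return set(ACCESSION_LIKE.findall(text))


def precision_recall(predicted, actual):
    """Precision: share of tagged tokens that are real entities.
    Recall: share of real entities that were tagged."""
    true_positives = len(predicted & actual)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Invented sample text and identifiers (placeholders, not real records).
text = ("Sequences U12345 and AB123456 were deposited; "
        "samples were stored in freezer F00017.")
actual = {"U12345", "AB123456"}      # what manual author tagging would yield
predicted = tag_entities(text)       # what the pattern finds

precision, recall = precision_recall(predicted, actual)
print(sorted(predicted), precision, recall)
```

In this toy case the tagger wrongly links the freezer label (a precision error) and misses the identifier with a two-letter prefix (a recall error), which is the kind of mistake manual tagging avoids at the cost of author effort.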

The Present (some considerations)
STM, Brussels Declaration, June 2006: "... believe that, as a general principle, data sets, raw data outputs of research, and sets or subsets of that data should wherever possible be made freely accessible ..."
- Data sets should be freely accessible – at publisher?
  - Scientists prefer independent data repositories
  - Need for single domain-specific coordination
  - Huge costs for maintenance and preservation
- Proper deposit mechanism needed
  - Through publisher? Extra overhead vs. ease of use
  - Enforcing deposit prior to publication
  - If community-supported, surely a possibility
- Data set standardization is needed for optimal use

The Present (more considerations)
- Scientist needs the combination of formal publication record and the raw data sets
- To get optimal interoperability, close collaboration between publisher and data set repositories is needed
- Publisher should enable and support raw data sets
  - Submission: enforce if supported by community
  - Discoverability: interconnect article with data sets
  - Reciprocal linking at deepest level possible
  - PANGAEA-type linking
- Data feeds from publisher to repositories?
- Managing a large number of data set repositories?
  - DataCite as single discussion partner

The Present (PANGAEA linking)
1. Author submits article to publisher
2. Author submits data set to repository
3. At article publication, repository links article DOI to associated data set DOI, creating actual connection
4. User sees link to ScienceDirect from PANGAEA
5. User sees link to PANGAEA from ScienceDirect
[Diagram labels: SD Server, USER, SD Article, PANGAEA Server, link, articles, data + associations]
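The five steps above can be sketched as a small in-memory registry. This is only an illustration under stated assumptions: the class `LinkRegistry`, its methods, and the DOIs are hypothetical placeholders, not the actual PANGAEA or ScienceDirect implementation. The point it shows is that a single article DOI to data set DOI association, recorded once in step 3, is enough to serve links in both directions in steps 4 and 5.

```python
from collections import defaultdict


class LinkRegistry:
    """Hypothetical store of article DOI <-> data set DOI associations."""

    def __init__(self):
        self._datasets_by_article = defaultdict(set)
        self._articles_by_dataset = defaultdict(set)

    def link(self, article_doi, dataset_doi):
        # Step 3: the repository records the association at publication time.
        self._datasets_by_article[article_doi].add(dataset_doi)
        self._articles_by_dataset[dataset_doi].add(article_doi)

    def articles_for(self, dataset_doi):
        # Step 4: links shown on the data set page, pointing to articles.
        return sorted(self._articles_by_dataset.get(dataset_doi, ()))

    def datasets_for(self, article_doi):
        # Step 5: links shown on the article page, pointing to data sets.
        return sorted(self._datasets_by_article.get(article_doi, ()))


def doi_url(doi):
    """Turn a DOI into a resolvable link via the public DOI resolver."""
    return "https://doi.org/" + doi


# Placeholder DOIs, invented for this example.
ARTICLE = "10.5555/example.article.1"
DATASET = "10.5555/example.dataset.1"

registry = LinkRegistry()
registry.link(ARTICLE, DATASET)

print([doi_url(d) for d in registry.datasets_for(ARTICLE)])
print([doi_url(a) for a in registry.articles_for(DATASET)])
```

Keeping the association on the repository side, as in the steps above, means the article page only needs to ask for the data sets linked to its own DOI.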

PANGAEA links to ScienceDirect

ScienceDirect links to PANGAEA

A Future (tighter interoperability)
- Not just a link to / from data and journal article
- But provide integrated experience for scientist
- Single page (environment) with data and article
[Diagram labels: SD Server, USER, SD Article, Supplementary Data Server, articles, data sets]
- Some users prefer it the other way around; so also offer:
[Diagram labels: Data Set Server, USER, Data Set, Article Server, data sets, articles]

A Future (inline supplementary data)
- Structures submitted as supplementary data files (MOL files)
- Displayed inline through Reaxys application / service

Creating the best User Experience by integrating Data with Articles requires close collaboration between data set repositories and publishers