Towards Data Attribution & Citation in the Life Sciences Philip E. Bourne UCSD 8/22/11Data Attribution and Citation.

Slides:



Advertisements
Similar presentations
EBSCO Discovery Service
Advertisements

Linking Data from ScienceDirect Articles Presented by: IJsbrand Jan Aalbersberg Hannover, DataCite Meeting Date: June 8, 2010.
In the Format section, we have activated the Bibliographic style drop down menu. From this page, you can choose a specific journal or format (e.g. BMC.
Project E: Citation Understanding the problem space Progress so far How you can contribute : afternoon session Lessons learned and challenges ahead Acknowledgements:
A Guide to PMCID numbers Anca Geana, MBA, CRA – May 2012.
NIH Public Access Compliance Cleveland Health Sciences Library Case Western Reserve University Kathleen C. Blazar.
CrossRef Linking and Library Users “The vast majority of scholarly journals are now online, and there have been a number of studies of what features scholars.
How to write my paper and have it published in a computational biology journal? Phil Bourne University of California San Diego
Interoperability scenarios between UKPMC and OpenAIRE Jo McEntyre, Wolfram Horstmann.
Drinking from a Fire Hose: Keeping up with the Professional Literature Angela Murrell Kresge Library, TSRI
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
New Features Update ISI Web of Knowledge. Copyright 2006 Thomson Corporation 2 New features added Mozilla Firefox web browser is now supported New access.
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2004.
Data citation from the perspective of a scholarly publisher Lyubomir Penev TDWG Data Citation Workshop, New Orleans, Oct 2011 ViBRANT.
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
 BioMed Central is an STM (Science, Technology and Medicine) database. All articles are reviewed before publishing.  It offers full texts, citations,
SCIENTIFIC SOLUTIONS Thomson ResearchSoft Paul Torpey April 8, 2005.
New Modes of Scholarly Communication and Learning Philip E. Bourne University of California San Diego 1WSU December 2, 2008.
Introducing Symposia : “ The digital repository that thinks like a librarian”
WEB OF SCIENCE now including the CONFERENCE PROCEEDINGS CITATION INDEXES.
Machine Learning in the New World of Scholarly Communication Philip E. Bourne University of California San Diego
Digital Library Architecture and Technology
Biological Science Database Proquest WEDAD AL-HUSAINAN ISD/NSTIC Kuwait Institute for Scientific Research November/2012.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
1 Chuck Koscher, CrossRef New Developments Relating to Linking Metadata Metadata Practices on the Cutting Edge May 20, 2004 Chuck Koscher Technology Director,
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
DATAVERSE FOR JOURNALS Mercè Crosas, Ph.D. Director of Data Science IQSS, Harvard Society for Scholarly Publishing 37 th Meeting,
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
Self-archiving The term usually refers to the self-archiving of peer reviewed research journal and conference articles as well as theses, deposited in.
Some Thoughts on Scholarly Communication and the Role of Bio-ontologies Philip E. Bourne University of California San Diego
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
1 Ed Pentz, CrossRef CrossRef and DOIs: New Developments 32 nd LIBER Annual General Conference Extending the Network: libraries and their partners 18 June.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
GeNii New Contents Services of NII
Open Data Driving Scholarly Communications in 2020 Philip E. Bourne UCSD 7th Int. Data Curation Conference Bristol UK Dec. 7,
The Promise of Open Access Philip E. Bourne PhD University of California San Diego Open Access Day October 14, 2008
OARE Module 5A: Scopus (Elsevier). Table of Contents About Scopus (Elsevier) Using Scopus Search Page Results/Refine Search Pages Download, PDF, Export,
Navigating An Introductory Guide for Librarians Brought to you by:
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
8 October 2009Microbial Research Commons1 Toward a biomedical research commons: A view from NLM-NIH Jerry Sheehan Assistant Director for Policy Development.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
I am not a PDBid I am a Biological Macromolecule Philip E. Bourne University of California San Diego
Open Science One Person’s View and What We Are Doing About It Philip E. Bourne University of California San Diego 1PSB Open Science Workshop.
1 Annual Meeting 2004 CrossRef Publishers International Linking Association, Inc Charles Hotel, Cambridge, MA November 9 th, 2004.
Now launched! Visit nature.com/scientificdata Honorary Academic Editor Susanna-Assunta Sansone Advisory.
Philip E. Bourne Professional Development Lecture 7 Understanding and Working the Publishing Process.
Data Integration and Management A PDB Perspective.
The Long Tail of Sample-based Data in the Next Decade FROM DARKNESS TO LIGHT Kerstin Lehnert
Chuck Koscher Director of Technology CrossRef ICSTI General Assembly TACC Workshop Tokyo October 19, 2014 crossref: mainstay of the scholarly communication.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
Real World Experiences in Operating a Collaboratory: The Protein Data Bank Helen M. Berman Board of Governors Professor of Chemistry.
Databases, Ontologies and Text mining Session Introduction Part 2 Carole Goble, University of Manchester, UK Dietrich Rebholz-Schuhmann, EBI, UK Philip.
Internet Documentation and Integration of Metadata (IDIOM) Presented by Ahmet E. Topcu Advisor: Prof. Geoffrey C. Fox 1/14/2009.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Telling Research Stories Through SciVee Philip E. Bourne University of California San Diego AAAS February 21, 2010.
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
Data Citation Implementation Pilot Workshop
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Using Content Presented by Karen Andrews Physical Sciences & Engineering Librarian, U.C. Davis Tuesday, September 13, :30-9:30 ASIDIC Fall 2005 Meeting.
Databases, Ontologies and Text mining Session Introduction Part 2
Next Generation Preprint Service
Jay Bhatt Drexel University Libraries
The Role of the ADS in Software Discovery and Citation
Philip Bourne University of California San Diego
New Features Update Web of Knowledge : Discovery Starts Here
Presentation transcript:

Towards Data Attribution & Citation in the Life Sciences Philip E. Bourne UCSD 8/22/11Data Attribution and Citation

Life Science Data Repositories  NLM is the elephant in the room.. However..  There are thousands on community maintained efforts – all want an NAR publication  The ability to cite and attribute the data are highly variable: –DOIs assigned in some cases, but not used –Attribution is through the metadata in most cases –Citation is typically by the associated literature reference if it exists, and/or a database identifier –The use of data repositories such as Dryad is compelling for the long tail problem –Data journals are on the horizon 8/22/11Data Attribution and Citation

Consider the PDB as a Use Case  Oldest data resource in biology?  A resource used by ~ 200,000 individuals per month – increasing number of school kids!  A resource distributing worldwide the equivalent to ¼ the National Library of Congress each month  A bicoastal/worldwide resource  1TB 8/22/11Data Attribution and Citation

Number of released entries Year PDB Typical Growth Curve – But the Complexity! 8/22/11

People are doing more with the data  Number of visits and page views is growing faster than number of unique visitors

The Data May Save Lives? * Jan. 2008Jan. 2009Jan. 2010Jul. 2009Jul. 2008Jul RUZ: 1918 H1 Hemagglutinin Structure Summary page activity for H1N1 Influenza related structures * 3B7E: Neuraminidase of A/Brevig Mission/1/1918 H1N1 strain in complex with zanamivir

PDB Data Attribution and Citation  About 25% of our budget has been spent on data remediation – multiple versions supported – the copy of record (as defined by the publication) is always available  Cant publish unless data are deposited – motivated by the community - very good data to publication correspondence  Data objects are discreet and we assign DOIs – but they are not used – database identifiers preferred 8/22/11Data Attribution and Citation

Ah yes.. But the CD4 Story…

1. A link brings up figures from the paper 0. Full text of PLoS papers stored in a database 2. Clicking the paper figure retrieves data from the PDB which is analyzed 3. A composite view of journal and database content results Literature/Data Integration 1.User clicks on content 2.Metadata and webservices to data provide an interactive view that can be annotated 3.Selecting features provides a data/knowledge mashup 4.Analysis leads to new content I can share 4. The composite view has links to pertinent blocks of literature text and back to the PDB The Knowledge and Data Cycle PLoS Comp. Biol (3) e34 8/22/11

Example of Interoperability: The Database View BMC Bioinformatics :220

Example of Interoperability – The Literature View From Anita de Waard, Elsevier

Acknowledgements Funding Agencies: NSF, NIGMS, DOE, NLM, NCI, NCRR, NIBIB, NINDS, NIDDK 128/22/11Data Attribution and Citation