Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.

Slides:



Advertisements
Similar presentations
Introduction to DataCite Adam Farquhar PhD Head of Digital Library Technology, The British Library President, DataCite June 2010.
Advertisements

The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
CHORUS Implementation Webinar May 16, 2014 Mark Martin Assistant Director, Office of Scientific and Technical Information Office of Science U.S. Department.
US DOE’s Public Access Plan: A vision reaching fruition Ms. Deborah Cutler Alt. US INIS Liaison Officer Office of Scientific and Technical Information.
Giri Palanisamy Oak Ridge National Laboratory & Lorrie Apple Johnson U.S. Department of Energy October 16, 2013.
Lorrie Apple Johnson Lead Librarian, Information Analysis & Services Office of Scientific and Technical Information (OSTI) National Academy of Sciences.
The Knowledge Bank Project at the Ohio State University Presented at the American Accounting Association Meeting – Chicago 8/6/07 Charles J. Popovich Head.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Digital Object Identifiers for EOSDIS data HDF Workshop April 17, 2012 John Moses, ESDIS
Digital Object Identifiers for EOSDIS data ESDSWG TIWG November 2, 2011 John Moses, ESDIS
Steve Yip Head of Reference and Research Services HKUST Library Research Support Provided by HKUST Library and other JULAC Libraries in HK 1 Date : March.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
THE DATA CITATION INDEX AN INNOVATIVE SOLUTION TO EASE THE DISCOVERY, USE AND ATTRIBUTION OF RESEARCH DATA MEGAN FORCE 22 FEBRUARY 2014.
Presented by DOI Create: TERN as a use-case Siddeswara Guru
Digital Object Identifiers for EOSDIS data ESIP Winter Meeting Jan 6, 2011 John Moses, ESDIS
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
Agenda: DMWG SM policy status ESIP meeting recap Reminder - DM Webinar Series New and updated web pages on DM website Metadata Training Sessions CDI meeting.
UPSpace An institutional research repository for the University of Pretoria Presented by Ina Smith to the School of Public Management and Administration.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
ICPSR’s Approach to Data Citation and Persistent Identifiers Mary Vardigan Assistant Director, ICPSR Workshop on Persistent Identifiers in the Social Sciences.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
DataCite Canada Cyndie Found, CISTI Background : Who is CISTI, Definition of Data Research Data Management(RDM) – Benefits, Challenges Addressing.
Five Years InterLab ’07 Los Alamos, New Mexico October 1–3, 2007 Valerie S. Allen, MSLIS U.S. Department of Energy Office of Scientific and.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
The Department of Energy’s Public Access Solution Giving Voice to Energy and Science R&D Results Jeffrey Salmon Deputy Director for Resource Management.
Walt Warnick, Ph.D. Director, Office of Scientific and Technical Information U.S. Department of Energy n Accelerating the Spread of Knowledge About Science.
1 Update to the Board of Research Data on Information CENDI Federal STI Managers’ Group CENDI Federal STI Managers’ Group January 31, 2012 Lisa Weber,
CRISP WP17 2/2 Data Continuum Achievements & Perspectives 18th March 2013Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting1.
S YCAMORE S CHOLARS ISU Institutional Repository.
Enhancing Digital Repository of Scholarly Publications at Indian Institute of Technology Bombay by Mr. Mahendra N. Jadhav Assistant Librarian Central Library.
1 Federated Search (Emphasizing WorldWideScience.org) as a Transformational Technology Enabling Knowledge Discovery InterLending and Document Supply Conference.
Data Citation & Digital Object Identifiers DOIs. 2 DOIs for articles mints DOIs for Journal articles and some datasets.
Data Management and Accessibility S.M. Kaye PPPL Research Seminar 12/16/2013.
Libraries and data – the DataCite consortium Jan Brase, DataCite February 2nd, 2011 Workshop: Persistent Identifiers for the Social Sciences Bonn, Germany.
1 ETEC Meeting December 7, 2007 Dr. Walter L. Warnick Director DOE Office of Scientific and Technical Information OSTI—Advancing Science Accelerating Discovery.
BRIAN A. HITSON ASSOCIATE DIRECTOR OFFICE OF SCIENTIFIC AND TECHNICAL INFORMATION OFFICE OF SCIENCE U.S. DEPARTMENT OF ENERGY JANUARY 29, 2013 Improving.
Speeding Nano Progress Using Information Diffusion Walt Warnick, Ph.D. Director, Office of Scientific and Technical Information U.S. Department of Energy.
Sharon M. Jordan Assistant Director for Program Integration U.S. DOE Office of Scientific & Technical Information Vantage Point: Government R&D Results.
VIVO and Scholarly Repositories: Synergistic Opportunities.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
1 OSTI - Accelerating Science Information Dr. Walter L. Warnick Director U.S. Department of Energy Office of Scientific and Technical Information Federal.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
Data Citation & Digital Object Identifiers DOIs. 2 Digital Object Identifiers 101 Persistent identifier Identifies intellectual property in the digital.
1 Not So Strange Bedfellows: Information Standards For Librarians AND Publishers November 6, 2015.
Publishing & Citing Research Data Arun Prakash. Agenda  Introduction  Why is Data publishing important ?  Ongoing Work  Role of Semantics.
Walter L. Warnick, Ph.D. Director, U.S. Department of Energy (DOE) Office of Scientific and Technical Information (OSTI) ETEC, October 19, 2012.
Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
OMICS international Contact us at: OMICS International through its Open Access Initiative is committed to make genuine and.
Advancing Science: OSTI’s Current and Future Search Strategies Jeff Given IT Operations Manager Computer Protection Program Manager Office of Scientific.
Breakout Session 2.2: A sustainable GEO Information System of Systems Chair: Lorenzo Bigagli Rapporteur: Greg Yetman.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
Speeding Nano Progress by Accelerating the Spread of Knowledge Walt Warnick, Ph.D. Director, Office of Scientific and Technical Information U.S. Department.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Data Citation Implementation Pilot Workshop
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Department of Energy Office of Scientific and Technical Information STIP Meeting April 20-21, 2005 Dr. Walter Warnick Director, OSTI The Washington Perspective.
Capturing from the start: managing grey literature in a brand new research University Mohamed Ba-essa J. K. Vijayakumar.
Nuclear Engineering 590 Navigating the Research Universe Angela Davis Engineering Librarian
YOUR TITLE HERE Courtney Matthews, Digital Repository Librarian Web Advisory Committee April 20, 2016 uwspace.uwaterloo.ca Library Scholarly Communications.
Redesigning the DOE Data Explorer to embed dataset relationships at the point of search and to reflect landing page organization Sara Studwell Department.
Promoting and Preserving FIU Research and Scholarship
ACS 2016 Moving research forward with persistent identifiers
CNI Spring 2010 Membership Meeting
Access  Discovery  Compliance  Identification  Preservation
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Presentation transcript:

Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy

What This Presentation Is About 2  What is OSTI  History of OSTI’s data citation program  ARM Data Archive  Our role in the AIP project

Office of Scientific and Technical Information (OSTI) Mission: Advance science and sustain technological creativity by making R&D findings available and useful to the Department of Energy researchers and the public. Premise: 3 Science advances only if knowledge is shared. Corollary: Accelerating the sharing of knowledge speeds the advancement of science (discovery).

DOE STI Program  OSTI manages agency-wide program to ensure access and delivery of research results.  DOE R&D results are:  collected from DOE offices, labs, and facilities;  preserved for re-use; and  made accessible via multiple web outlets. 4

Importance of Data Research output = technical reports and journal articles, but also commonly includes large amounts of associated data. DOE Order 241.1B  Updated and released December of  The first time this directive officially stated that data from funded research could be identified/announced to OSTI. 5

Why Cite Data? 6 We believe that you should cite data in just the same way that you can cite other sources of information, such as articles and books.  enables easy reuse and verification of data,  allows the impact of data to be tracked, and  creates a scholarly structure that recognizes and rewards data producers. Data citation is important because:

Citing Datasets Noted in Technical Reports – A Pilot Project Initial Research (2008/2009) Used data from the Atmospheric Radiation Measurement (ARM) Archive, maintained at the Oak Ridge National Laboratory. Selected Digital Object Identifiers (DOIs) as the preferred persistent locator. Acquired an account with the German National Library of Science and Technology (TIB) as the DOI Registration Agency (RA) for this initial pilot. 7

Citing Datasets Noted in Technical Reports – A Pilot Project Demonstrated the ability to locate the digital objects associated with a sample of DOE reports. Created the associated metadata for the digital objects. Assigned a DOI to the objects, and successfully registered the DOIs with the TIB. Updated reports with live links to newly registered data DOIs. 8

Meanwhile…DataCite TIB teamed with an international consortium in December of 2009 to create the DataCite DOI Registration Agency. Consortium was composed of 11 institutions focused on improving the scholarly infrastructure around datasets and other non-textual information. Created services to support assignment of Digital Object Identifiers (DOIs) to datasets. Validates, maintains, and resolves DOIs and the associated metadata. 9

OSTI and DataCite 10  OSTI joined DataCite in January of  There were two other U.S. members, the California Digital Library and Purdue University Libraries.  DOE OSTI was and still is the only U.S. federal agency.  OSTI minted first DOI and registered it with DataCite on August 10, 2011.

OSTI’s Data ID Service Announcement Notice Collects the metadata needed to identify/announce datasets resulting from work funded by DOE.  Two options:  An individual may manually submit metadata via E-Link using Announcement Notice  Organizations may use OSTI’s automated web service for volume submissions.  Information submitted via AN allows OSTI to assign DOIs to datasets.  OSTI then registers these DOIs with DataCite as a service to researchers. 11

12  Dataset Type  Dataset Title  Creator(s)/Principal Investigator(s)  Dataset Product Number(s)  DOE Contract Number(s)  Originating Research Organization  Publication/Issue Date  Language  Country of Origin/Publication  Sponsoring Organization(s)  Site URL (landing page for dataset)  Contact Information (will not be displayed publicly) Required Metadata

Dissemination of Data-Related Information to DOE/OSTI Databases To SciTech Connect: Semantically searchable database containing all DOE records, including technical reports, journal articles, conference literature, multimedia, and datasets. To DOE Data Explorer: Inventory of DOE data collections wherever they reside. It also provides access to individual dataset records as they are submitted via the Data ID Service. Currently over 1050 data collections and datasets/datastreams in DDE. 13

DDE Data Collection Citation 14 Numeric Data Figures/Data Plots Specialized Mix Genome/Genetics Data Interactive Data Maps Animations/Simulations Multimedia

 SciTech Connect records, including dataset citations, are picked up and indexed by Google.  Dataset citations also flow to major interagency resource, Science.gov. Dissemination… to Major Search Engines and Beyond 15

OSTI’s Data ID Service Customers  The ARM Data Archive graduated from a pilot project to OSTI’s first data customer.  First DOI for a dataset was assigned by OSTI and registered with DataCite on 8/10/2011.  580 ARM datasets are now registered. 16

ARM Data Archive The Challenges:  There are millions of data files from over 3,000 data products.  Many continuous datastreams are created from around-the-clock monitoring of environment by multiple instruments. Temporal and geographic information becomes very important.  There is a large user community (climate change model community).  Data are also published via other portals. 17

DDE Citation for ARM Datastream 18

19 “Landing Page” for the DOI ( / ) assigned to this ARM datastream

OSTI’s Data ID Service Current Status Data Clients in Production  Atmospheric Radiation Management Program (ARM Archive at ORNL)  Irradiance and Meteorological Data, Renewable Resource Data Center (RReDC at NREL)  Coherent X-ray Imaging Data Bank (CXIDB at LBNL)  Next Generation Ecosystems Experiment – Arctic (NGEE-Arctic at ORNL) Data Clients in Testing  Oak Ridge Leadership Computing Facility (OLCF at the National Center for Computational Sciences, ORNL) Data Clients Committed and Planning  National Nuclear Data Center (NNDC at BNL)  DOE Geothermal Data Repository 20

21  History of collaboration between AIP and OSTI  Experience with dataset citation  Allocating agent for DOIs – i.e. DataCite membership AIP Pilot – Physics of Plasmas

Questions? 22 Mark Martin