Data Provenance and Attribution for Published Datasets The Challenge and the reality check April 9-10, 2009 National Academy of Sciences, Woods Hole, MA.

Slides:



Advertisements
Similar presentations
David Shotton Image BioInformatics Research Group Department of Zoology University of Oxford, UK The Dryad-UK vision © David Shotton,
Advertisements

Better Data, Better Science! [ Better Science through Better Data Management ] Todd D. OBrien NOAA – NMFS - COPEPOD.
Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Sami Borg and Helena Laaksonen : Acquisition policies for a new data archive IASSIST2005 Edinburgh, May 2005
Previous Work CMPE 185. Goals for this project To practice in-depth library research on a specific subject, and present a paper incorporating that research.
Data citation from the perspective of a scholarly publisher Lyubomir Penev TDWG Data Citation Workshop, New Orleans, Oct 2011 ViBRANT.
IDENTIFIERS & THE DATA CITATION INDEX DISCOVERY, ACCESS, AND CITATION OF PUBLISHED RESEARCH DATA NIGEL ROBINSON 17 OCTOBER 2013.
Converging parallel universes Library services as building blocks of digital humanities research 42nd LIBER Annual Conference Munich June 2013 Gregor Horstkemper.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Research Integrity: Collaborative Research Michelle Stickler, DEd Office for Research Protections
Talking to our faculty about open access and authors’ rights Joyner Library Forum October 23, 2008.
Biological and Chemical Oceanography Data Management Office 1 of 12 An Introduction to the Biological and Chemical Oceanography Data Management Office.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
FISH 521 Peer review. Peer review Mechanics Advantages Challenges Solutions.
Journal Impact Factors and H index
THE DATA CITATION INDEX AN INNOVATIVE SOLUTION TO EASE THE DISCOVERY, USE AND ATTRIBUTION OF RESEARCH DATA MEGAN FORCE 22 FEBRUARY 2014.
Top Ten Ways to Get Published (in a scholarly journal) with apologies to David Letterman Jim Levin Education Studies University of California, San Diego.
5. Presentation of experimental results 5.5. Original contribution (paper) - the main outcome of scientific activities - together with patents, they can.
Writing to Publish Navigating the Academic Journal Review Process.
The impact of the development of institutional repositories on “Kiyo” or institutional research journals in Japan Hiroya Takeuchi and Syun Tutiya Chiba.
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
Update on the VERSIONS Project for SHERPA-LEAP SHERPA Liaison Meeting UCL, 29 March 2006.
Merging the National Library and the National Archives LIBER General Annual Conference, Tartu, June 2012 Els van Eijck van Heslinga, Head Finance and Corporate.
Advanced Technical Writing
Rajesh Singh Deputy Librarian University of Delhi Measuring Research Output.
Research evaluation requirements José Manuel Barrueco Universitat de València (SPAIN) Servei de Biblioteques i Documentació May, 2011.
Publishing Your Work Not a Question, But rather an Execution Who? Why? When? Where? How? รัตติกร ยิ้มนิรัญ สาขาวิชาฟิสิกส์ สำนักวิชา วิทยาศาสตร์ มหาวิทยาลัยเทคโนโลยีสุรนารี
Chapter 6 Researching Your Subject. In academic research, your goal is to find information that will help you answer a scholarly question. In workplace.
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Skills Building Workshop: PUBLISH OR PERISH. Journal of the International AIDS Society Workshop Outline Journal of the International.
ISC Journal Citation Reprots تقارير استنادية للمجلات Mohammad Reza – Ghane Assistant Prof. in Library and Information Science & Director of Research Department.
Writing a Research Manuscript GradWRITE! Presentation Student Development Services Writing Support Centre University of Western Ontario.
PLoS Enlivening Scientific Culture Dr Chris Surridge Managing Editor, PLoS ONE Public Library of Science.
Shruthi(s) II M.Sc(CS) msccomputerscience.com. Introduction Digital Libraries have become the source of information sharing across the globe for education,
Presentation to Legal and Policy Issues Cluster JISC DRP Programme Meeting 28 March 2006.
Research Cycle When and How Information Gets Published.
VERTIGO data OCB database status update Cyndy Chandler Ocean Carbon and Biogeochemistry Data Management Office Cyndy Chandler Ocean Carbon and Biogeochemistry.
Can sharing research data raise your research profile and impact? Gerry Ryder Charles Darwin University, September 2015.
Reconstituting the Ocean: a tale from U.S. JGOFS Cyndy Chandler (MCG, WHOI) U.S. JGOFS Data Management Office and Ocean Carbon and Biogeochemistry Coordination.
Biological and Chemical Oceanography Data Management Office slide 1 of 19 CAMEO Data Management Bob Groman Biological and Chemical Oceanography Data Management.
Scientific Papers Chemical Literature Prepared by Dr. Q. Wang.
5.5. Original contribution (paper) - the main outcome of scientific activities - together with patents, they can not be combined together at one time -
1 Judy Hewitt, PhD On Detail to Office of Extramural Research National Institutes of Health May 18, 2015 Center for Scientific Review Advisory Council.
AuthorAID Workshop on Proposal Writing Rwanda June 2011.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Datasealofapproval.org13/12/2015 DANS is an institute of KNAW and NWO 1 Identifying and removing barriers for sharing scientific data Laurents Sesink
DEFENSE THREAT REDUCTION AGENCY JOINT SCIENCE AND TECHNOLOGY OFFICE CHEMICAL AND BIOLOGICAL DEFENSE create collaborate communicate Click to add title of.
MARE 103 MOP Proposal Lecture.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Scientific Writing Scientific Papers – Original Research Articles “A scientific paper is a written and published report describing original research.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
Biological and Chemical Oceanography Data Management Office slide 1 of 22 Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological.
Using Library Resources Making the Library Work for You Kate Wise Spring 2008.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 The Biological and Chemical Oceanography Data Management Office (BCO-DMO) Cyndy.
Updating image To update the background image: Go to ‘View’ Select ‘Slide Master’ Select the page with the image Right click on the image and select ‘Change.
Development and Management of e-Repositories April 2013 IODE Project Office Oostende, Belgium Future Repository Trends: Repositories and Published.
Acknowledgments Funding provided by the Jewett Foundation Introduction Data collected in ocean sciences, whether generated from research or operational.
Dr.V.Jaiganesh Professor
Digitizing ODINs Partnership
Linked Data for Field Deployments
Compilation of SCOAP supported papers
Outline Goals: Searching scientific journal articles
Linking persistent identifiers at the British Library
Post-publication evaluation through tags
Scientific Publishing in the Digital Age
5. Presenting a scientific work
5. Presenting a scientific work
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Presentation transcript:

Data Provenance and Attribution for Published Datasets The Challenge and the reality check April 9-10, 2009 National Academy of Sciences, Woods Hole, MA Cyndy Chandler Biological and Chemical Oceanography Data Management Office Woods Hole Oceanographic Institution

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution2 of 18 What is the goal? to establish best practice guidelines for metadata capture and recording to support data provenance and attribution of published datasets to establish best practice guidelines for metadata capture and recording to support data provenance and attribution of published datasets this talk will focus on oceanographic data this talk will focus on oceanographic data

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution3 of 18 What is the problem? Why arent we doing this already? Why arent we doing this already? provenance tracking and attribution systems have been in use for a long time provenance tracking and attribution systems have been in use for a long time works of art works of art works of literature works of literature

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution4 of 18 Why arent we doing this already? What is so difficult about associating source data with a journal publication? What is so difficult about associating source data with a journal publication? data acquisition data publication journal publication

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution5 of 18 Why arent we doing this already? What are the challenges? Technical Technical Cultural Cultural Usual Usual

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution6 of 18 Why arent we doing this already? Technical reasons … data are not published data are not published what is the definition of a published dataset? what is the definition of a published dataset? and if the data are published and if the data are published its not clear how to cite them its not clear how to cite them they lack sufficient metadata they lack sufficient metadata metadata are non-standard metadata are non-standard or they lack a persistent identifier or they lack a persistent identifier

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution7 of 18 Why arent we doing this already? Technical reasons … data sets used to be smaller and were often published on paper (in a journal article or a data report, and they fit in Table 1) data sets used to be smaller and were often published on paper (in a journal article or a data report, and they fit in Table 1) data were published as a tangible thing data were published as a tangible thing as data acquisition becomes automated, rate of acquisition and volume increases as data acquisition becomes automated, rate of acquisition and volume increases but metadata acquisition (data documentation) is not being automated at the same rate but metadata acquisition (data documentation) is not being automated at the same rate

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution8 of 18 Why arent we doing this already? Cultural reasons … little incentive for researchers to publish their data little incentive for researchers to publish their data often augmented by the perception that the data are the property of the originating investigator, and might be stolen often augmented by the perception that the data are the property of the originating investigator, and might be stolen Conventional wisdom is still that publish or perish applies predominantly to journal publications, not data publication. (Funding agency program managers are beginning to effect change in this area.)

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution9 of 18 Why arent we doing this already? Usual reasons … lack of resources lack of resources Funding Funding Expertise Expertise Time Time

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution10 of 18 remember where these data come from … … this is the office ! Think Ill go record some metadata. Whos recording the metadata?

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution11 of 18 Why arent we doing this already? What is so difficult about associating source data with a journal publication? What is so difficult about associating source data with a journal publication? data acquisition data publication journal publication

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution12 of 18 data acquisition data publication journal publication a relatively simple case a relatively simple case Many of the VERTIGO project cruise data sets are available online from BCO-DMO Many of the VERTIGO project cruise data sets are available online from BCO-DMO and theyre tagged with metadata. and theyre tagged with metadata. The introductory paper refers to the online data server. The introductory paper refers to the online data server. Source data are available online for this special volume. Source data are available online for this special volume.

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution13 of 18 Why arent we doing this already? Lets assume this effort is fully funded ~ so all the usual reasons are no longer an issue ~ funding, expertise, time ~ no longer a challenge ! Combined cultural and technical challenges … The simplest system for data publication and attribution involves at least one representative from each of these three communities: The simplest system for data publication and attribution involves at least one representative from each of these three communities: Oceanographer ( research discipline )Oceanographer ( research discipline ) Data manager ( information science )Data manager ( information science ) Editor ( publishing community )Editor ( publishing community )

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution14 of 18 Why arent we doing this already? Combined cultural and technical challenges … The successful system for data publication and attribution more likely involves six communities The successful system for data publication and attribution more likely involves six communities Oceanographer (research discipline )Oceanographer (research discipline ) Data manager (information science )Data manager (information science ) Library scienceLibrary science Information technology expertise from these fieldsInformation technology expertise from these fields Social scienceSocial science Editor ( publishing community )Editor ( publishing community ) and effective communication between those communities

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution15 of 18 Additional Challenges What if all the whining from the previous slides could be addressed somehow? What if all the whining from the previous slides could be addressed somehow? Education Education Cultural changes Cultural changes Standards development and implementation Standards development and implementation Funding sources Funding sources Communication Communication challenges

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution16 of 18 Additional Challenges micro attribution – what level is required to support scientific inquiry? micro attribution – what level is required to support scientific inquiry? what are the identifiable entities within a publication that require data attribution what are the identifiable entities within a publication that require data attribution the entire article?the entire article? each table? each figure?each table? each figure? publications often have many source data setspublications often have many source data sets who does all that work? The author(s) ? who does all that work? The author(s) ?

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution17 of 18 It is important to figure this out. Data are difficult and expensive to collect, and can not be recollected. Data are difficult and expensive to collect, and can not be recollected. We want to maximize data reuse. We want to maximize data reuse.

09 April 2009Cyndy Chandler ~ Woods Hole Oceanographic Institution18 of 18 thank you