Preliminary Findings: Baseline Assessment of Scientists’ Data Sharing Practices
Carol Tenopir, University of Tennessee

NSF DataNet program will build new types of organizations that will…
• integrate library and archival sciences, cyberinfrastructure, computer & information sciences, and domain science expertise to:
• provide reliable digital preservation, access, integration, and analysis capabilities for science and/or engineering data over a decades-long timeline

DataONE (Data Observation Network for Earth)
P.I.: Bill Michener, University Libraries, Univ. New Mexico

Interdisciplinary challenges
• Environmental science challenges
• Cyberinfrastructure challenges
• DataONE: A solution
• Building on existing CI
• Creating new CI
• Changing science culture and institutions

… engaging diverse partners.
• Libraries & digital libraries
• Academic institutions
• Research networks
• NSF- and government-funded synthesis & supercomputer centers/networks
• Governmental organizations
• International organizations
• Data and metadata archives
• Professional societies
• NGOs
• Commercial sector

Baseline of Scientists
To measure the current state of data needs, practices, knowledge of standards, and motivations regarding data collection, access, and preservation.
[Figure axes: good practices, time]

Assessment: stakeholders
• Scientists
• Computer – IT Personnel
• Public Officials
• Citizen-scientists
• Students & Teachers
• Libraries, Librarians

Baseline Assessment of Scientists: distribution and responses
• Scientists in various work sectors
• Via champions
• As of June 2010: N=1000
• Preliminary results: N=923
Preliminary results based on data collected from October 27, 2009 to April 30, 2010

Demographics
[Charts: N=909 and N=917]

Age groups N=827

Primary discipline N=917

Lessons learned
1. Data management practices vary.
2. Many scientists are interested in sharing data.
3. There are many barriers to sharing data.
4. There are some differences in data management practices.

Lesson one: Data management practices vary.

What metadata do you currently use to describe your data, if any (check all that apply)?
[Chart categories: DwC, DC, ISO, Open GIS, FGDC, EML, My Lab, None]

Approximately three-quarters agree that:
• Data may be misinterpreted due to complexity of the data. (75%, N=899)
• Data may be misinterpreted due to poor quality of the data. (71%, N=899)
• Data may be used in other ways than intended. (74%, N=896)

If some or all of your data are available to others, these data are available: [chart]

Lesson two: Many scientists are interested in sharing data.

Interested in data sharing, with some restrictions
• I would use other researchers' datasets if their datasets were easily accessible. (84%, N=902)
• I would be willing to share data across a broad group of researchers who use data in different ways. (83%, N=893)
• I would be willing to place at least some of my data into a central data repository with no restrictions. (79%, N=901)
• It is appropriate to create new datasets from shared data. (77%, N=902)
• I would be willing to place all of my data into a central data repository with no restrictions. (44%, N=894)
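To make the percentages above easier to compare, the short sketch below converts each reported percentage and N into an approximate respondent count and a rough 95% margin of error. It is an illustration only, not part of the presentation: the function names are hypothetical, the counts are approximate because the slide percentages are rounded, and the margin of error assumes a simple random sample, which a survey distributed via champions may not be.

```python
import math

# (statement, percent agreeing, respondents) copied from the slide above
RESPONSES = [
    ("Would use others' datasets if easily accessible", 84, 902),
    ("Willing to share across a broad group of researchers", 83, 893),
    ("Willing to deposit at least some data, no restrictions", 79, 901),
    ("Appropriate to create new datasets from shared data", 77, 902),
    ("Willing to deposit all data, no restrictions", 44, 894),
]


def approx_count(percent, n):
    """Approximate number of agreeing respondents (slide percentages are rounded)."""
    return round(percent / 100 * n)


def margin_of_error(percent, n, z=1.96):
    """Normal-approximation margin of error in percentage points.

    Assumption: simple random sampling, which may not hold for this survey.
    """
    p = percent / 100
    return 100 * z * math.sqrt(p * (1 - p) / n)


def summarize(responses=RESPONSES):
    """Return (statement, approximate count, margin of error rounded to 0.1)."""
    return [
        (label, approx_count(pct, n), round(margin_of_error(pct, n), 1))
        for label, pct, n in responses
    ]


if __name__ == "__main__":
    for label, count, moe in summarize():
        print(f"{label}: about {count} respondents, +/- {moe} points")
```

Under these assumptions every margin is only a few percentage points, so the drop from 79% (some data) to 44% (all data) is far larger than sampling noise alone would explain.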

Conditions on sharing data
Condition                      My Data   Others’ Data
Acknowledge provider/funder    94%       94%
Formally cite provider/funder  94%       95%
Opportunity to collaborate     81%       82%
Reciprocal sharing agreement   71%       71%
Reprints of articles           70%       71%
Complete list of products      70%       69%

Lesson three: There are many barriers to data sharing.

If your data are not available electronically to others, why not (check all that apply)?
• Insufficient time (54%)
• Lack of funding (41%)
• No place to put data (23%)
• Don't have the rights to make the data public (22%)

Other barriers (N=923)
• Training on best practices (23%)
• Organization provides funds for long-term data management (24%)
• Organization provides funds for data management during project (31%)
• Others can access my data easily (38%)

DCC Survey (2009) Preliminary Findings also identified barriers
Barriers for sharing research data (N=1270)
• Legal issues: 41%
• Misuse of data: 41%
• Incompatible data types: 33%
• Lack of technical infrastructure: 28%
• Lack of financial resources: 27%
• “Fear to lose” financial edge: 27%
• Restricted access to data archive: 21%
• No problems foreseen: 16%
• Other: 10%

Lesson four: There are some differences in data management practices.

Atmospheric scientists
• Share data with others (78%)
• Others can access my data easily (50%)
• Org provides necessary tools during the project (58%)
• Org has process to manage data during the project (56%)
• Org provides storage beyond the project (54%)

Differences by sector (academic, government)
• High satisfaction with data collection (82%, 73%)
• Data available on organization site (54%, 76%)
• Moderate satisfaction integrating data (43%, 44%)
• Tools to manage data during project (45%, 48%)
• Tools to store data beyond the project (38%, 54%)
• Low satisfaction with tools to prepare metadata (27%, 19%)
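The sector comparison above can be read more quickly as percentage-point gaps. The sketch below is illustrative only: it assumes each pair is ordered as in the slide heading (academic first, government second), and the function name is hypothetical. The slide does not give per-sector N, so no significance test is attempted.

```python
# (measure, academic %, government %) as listed on the slide above
SECTOR_RESULTS = [
    ("High satisfaction with data collection", 82, 73),
    ("Data available on organization site", 54, 76),
    ("Moderate satisfaction integrating data", 43, 44),
    ("Tools to manage data during project", 45, 48),
    ("Tools to store data beyond the project", 38, 54),
    ("Low satisfaction with tools to prepare metadata", 27, 19),
]


def sector_gaps(results=SECTOR_RESULTS):
    """Return (measure, government minus academic) sorted by absolute gap, largest first."""
    gaps = [(measure, government - academic) for measure, academic, government in results]
    return sorted(gaps, key=lambda item: abs(item[1]), reverse=True)


if __name__ == "__main__":
    for measure, gap in sector_gaps():
        print(f"{gap:+d} points (government vs. academic): {measure}")
```

On these figures the largest gaps concern where data live: availability on the organization's site and storage beyond the project.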

It is fair exchange for use of data when legal permission is obtained.
Age Range    My Data   Others’ Data
30 & Under   60%       62%
[…]          […]       39%
[…]          […]       42%
[…]          […]       35%
Over 60      32%       32%

At least part of the costs of data acquisition, retrieval, or provision must be recovered.
Age Range    My Data   Others’ Data
30 & Under   39%       42%
[…]          […]       25%
[…]          […]       33%
[…]          […]       26%
Over 60      30%       31%

Where do we go from here?
• Data management plans
• Identified many areas where D1 could learn from scientific communities
• Survey closes July 31, 2010
• Report in fall 2010