Michael Witt Interdisciplinary Research Librarian & Assistant Professor Purdue Libraries & Distributed Data Curation Center (D2C2) Eliciting.

Presentation transcript:

Eliciting Faculty Requirements for Research Data Repositories
Michael Witt, Interdisciplinary Research Librarian & Assistant Professor
Purdue Libraries & Distributed Data Curation Center (D2C2)
Open Repositories 2009, Georgia Tech: May 18, 2009

Ten Questions to Begin a Conversation With Your Faculty About Data Curation
1. What is the story of your data?
2. What form and format are the data in?
3. What is the expected lifespan of your data?
4. How could your data be used, reused, and repurposed?
5. How large is your dataset, and what is its rate of growth?
6. Who are potential audiences for your data?
7. Who owns the data?
8. Does the dataset include any sensitive information?
9. What publications or discoveries have resulted from the data?
10. How should the data be made accessible?
Witt, M. & Carlson, J. (2007). Conducting a data interview.

Investigating Data Curation Profiles Across Multiple Research Disciplines
Investigators in the Distributed Data Curation Center in the Purdue University Libraries and at the University of Illinois at Urbana-Champaign will address the question “which researchers are willing to share data, when, with whom, and under what conditions?” The team will produce:
case studies of researcher data/metadata workflows
data curation profiles describing policies for archiving and making research data available
a matrix to compare parameters across disciplines
system requirements for managing data in a repository
recommendations for implementing the results under diverse systems
The project will also describe the roles of librarians and identify the skill sets they need to facilitate scholarly communication and data sharing.
Supported by IMLS LG

Investigators
D. Scott Brandt (PI) – Purdue University
Jacob Carlson – Purdue University
Melissa Cragin – University of Illinois
P. Bryan Heidorn – University of Illinois
Carole Palmer – University of Illinois
Sarah Shreeves – University of Illinois
Michael Witt – Purdue University
Two-year research project began on 11/15/2007.

Project Activities
Two interviews each with 20 faculty who produce data in a variety of research domains
Transcription, coding, and analysis (NVivo)
Creation of “data curation profiles” and wiki
Developing two case studies in Agronomy and Geology
Two focus groups with subject-specialist librarians who acted as liaisons
Distinguish and map needs expressed by faculty to repository functionality
Assess current capabilities of repository systems and related technologies
Experiment using institutional repositories for data curation in practical terms

Subjects
Biology, Horticulture, Civil Engineering, Electrical & Computer Engineering, Biochemistry, Food Science, Earth & Atmospheric Science, Agronomy, Kinesiology, Atmospheric Sciences, Speech & Hearing, Soil Science, Anthropology, Geology
Purdue / Illinois

Caveat audiens
Preliminary findings (project is not yet complete)
Convenience sample, not statistical
Exploratory, qualitative study
The subjects provide much more context and information in the interview transcripts, which are still being coded

Dataset static or dynamic?

Data bound by confidentiality or privacy concerns?

Is your manner of organization/description sufficient for another person with similar expertise to be able to understand and properly use the data?

How long to preserve your data?

With whom would you share your data immediately after the data were generated?

With whom would you share your data after the data were normalized and/or corrected?

With whom would you share your data after the data have been processed for analysis?

With whom would you share your data after the data have been analyzed?

With whom would you share your data immediately before publication of findings?

With whom would you share your data immediately after publication of findings?

Embargo?
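
The sharing questions above follow the data through six stages, from generation to just after publication. A minimal sketch of how one respondent's answers might be recorded is shown here. The stage names come from the slides; the audience categories, the function name, and the sample answers are assumptions made up for illustration and are not findings from the study.

```python
from enum import IntEnum


class Stage(IntEnum):
    """Stages named in the sharing questions, in the order they were asked."""
    GENERATED = 1
    NORMALIZED = 2
    PROCESSED = 3
    ANALYZED = 4
    BEFORE_PUBLICATION = 5
    AFTER_PUBLICATION = 6


# Hypothetical audience categories, ordered from narrowest to widest.
# The slides do not list the actual response options.
AUDIENCES = ["nobody", "collaborators", "discipline", "public"]


def earliest_stage(sharing, audience):
    """Return the first stage at which data are shared with `audience` or wider.

    `sharing` maps each Stage to one of the AUDIENCES values.
    Returns None if that audience is never reached.
    """
    threshold = AUDIENCES.index(audience)
    for stage in sorted(sharing):
        if AUDIENCES.index(sharing[stage]) >= threshold:
            return stage
    return None


# Invented answers for a single respondent, for illustration only.
example = {
    Stage.GENERATED: "nobody",
    Stage.NORMALIZED: "collaborators",
    Stage.PROCESSED: "collaborators",
    Stage.ANALYZED: "collaborators",
    Stage.BEFORE_PUBLICATION: "discipline",
    Stage.AFTER_PUBLICATION: "public",
}

first_public = earliest_stage(example, "public")
```

A record like this would let an embargo be expressed as the gap between the stage at which data exist and the stage at which a wider audience is first allowed.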

Prioritize your needs for the following types of services
The ability to audit this dataset to ensure its integrity over time?
The ability to migrate datasets into new formats over time?
A secondary storage site for the dataset?
A secondary storage site for the dataset at a different geographic location?
Documentation of any changes that were made to the dataset over time?

Prioritize your needs for the following types of services
The ability to cite this dataset in my publications
The ability for researchers within my discipline to easily find this dataset
The ability for researchers outside of my discipline to easily find this dataset
The ability for people to easily discover this dataset using Google

The ability for me to submit this dataset to a repository myself
The process of submitting this dataset to a repository is automated
The ability to make these data accessible in multiple formats
The ability of the repository to provide version control for the data

Prioritize your needs for the following types of services
The ability to apply standardized metadata from your discipline to the dataset
The ability to see usage statistics of how many people accessed your data
The ability to access the data at a mirror site if the main repository is “offline”
The ability of others to comment on or annotate the dataset

Prioritize your needs for the following types of services
The ability to connect the dataset to visualization or analytical tools
The ability to support the use of web services APIs
The ability to restrict access to datasets to authorized individuals
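
Responses to the prioritization slides could feed the cross-discipline comparison matrix the project describes. The sketch below is one possible way to tally them and is not the project's actual method. The rating labels ("high", "low"), the function name, and the sample responses are assumptions invented for illustration; the slides do not state the rating scale used.

```python
from collections import Counter, defaultdict


def priority_matrix(responses):
    """Tally ratings per service and discipline.

    `responses` is an iterable of (discipline, service, rating) tuples.
    Returns a nested mapping: service -> discipline -> Counter of ratings.
    """
    matrix = defaultdict(lambda: defaultdict(Counter))
    for discipline, service, rating in responses:
        matrix[service][discipline][rating] += 1
    return matrix


# Invented responses for illustration only; these are not study results.
example_responses = [
    ("Agronomy", "cite dataset in publications", "high"),
    ("Agronomy", "cite dataset in publications", "high"),
    ("Geology", "cite dataset in publications", "low"),
    ("Geology", "restrict access to authorized individuals", "high"),
]

matrix = priority_matrix(example_responses)
```

Keeping the counts per discipline, rather than averaging them, fits a small convenience sample where individual answers matter more than summary statistics.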

Summary Used from