The Texas Data Repository Texas State University Ray Uzwyshyn, Director, Collections and Digital Services, Texas State University Libraries.

Slides:



Advertisements
Similar presentations
The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Advertisements

OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
Rutgers University Libraries What is RUcore? o An institutional repository, to preserve, manage and make accessible the research and publications of the.
Texas State University Libraries Faculty Digitization Services Overview Ray Uzwyshyn, Ph.D. MBA MLIS Director, Collections and Digital Services.
Research Data Service at the IT Pro Forum HEIDI IMKER, DIRECTOR.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
Data Management Plans For Universities and Institutes of Higher Education and Research Ray Uzwyshyn, Ph.D. MBA MLIS Director, Collections and Digital Services,
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
TDL Forum WEDNESDAY, APRIL 16, Agenda - Updates & Announcements ◦TCDL 2014 (Kristi) ◦Vireo Users Group Meeting (Kristi) ◦Staffing (Ryan) ◦SHARE.
Social Science Data and ETDs: Issues and Challenges Joan Cheverie Georgetown University Myron Gutmann ICPSR – University of Michigan Austin McLean ProQuest.
Engineering a New Home EMILY STENBERG DIGITAL PUBLISHING & PRESERVATION LIBRARIAN LAUREN TODD ENGINEERING SUBJECT LIBRARIAN WASHINGTON UNIVERSITY IN ST.
Presenter: Karla Strieb Assistant Executive Director Transforming Research Libraries June 3, 2010 Supporting E-science: Progress at Research Institutions.
R utgers C ommunity R epository RU CORE 1 Research Data and Context  Presentation Goals  The challenge of context  Metadata design to support context.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
PURR: A RESEARCH DATA CURATION SERVICE MODEL USING HUBZERO Courtney Earl Matthews Digital Data Repository Specialist HUBBUB 2012 Purdue University.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
A brief overview… “The Obama Administration is committed to the proposition that citizens deserve easy access to the results of scientific research their.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Texas Digital Library CENTRAL TEXAS AND SAN ANTONIO-AREA REGIONAL MEETING SEPTEMBER 5, 2013.
Collections and Digital Services Upcoming Initiatives, Continuing Projects and Larger Trajectories Ray Uzwyshyn, Ph.D. MBA MLIS Director, Collections and.
GPO’s Federal Digital System August 17, 2010 U.S. Government Printing Office.
TDL Forum WEDNESDAY, SEPTEMBER 16, 2015 Kristi Park Executive Director This work is licensed under a Creative Commons Attribution 4.0 International License.Creative.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
Stacy Nowicki, Library Director Michigan Academic Library Council Meeting Davenport University, Grand Rapids, MI 18 March 2011 Dspace at Kalamazoo College.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
+ Building a Community of Practice for Research Data Services Experience of CLIR/DLF E-Research Peer Network & Mentoring Group Presentation for DLF Forum.
Redefining the Library’s Role through an Institutional Repository Sharon Mader, Dean Jeanne Pavy, Scholarly Communications Librarian Earl K. Long Library.
Discover ScholarSphere A repository service collaboration between the University Libraries and ITS.
YOUR TITLE HERE Courtney Matthews, Digital Repository Librarian Web Advisory Committee April 20, 2016 uwspace.uwaterloo.ca Library Scholarly Communications.
A Consortial Model for Research Data Services Using Dataverse Kristi Park, Director, Texas Digital Library Ryan Steans, Assistant Director, Texas Digital.
A Consortial Model for Research Data Services Using Dataverse Kristi Park, Director, Texas Digital Library Ryan Steans, Assistant Director, Texas Digital.
A Consortial Approach to Research Data Repository Services Laura Waugh, Texas Digital Library On behalf of the TDL Dataverse Implementation Working Group.
Developing and Implementing An Online Research Data Repository for Your University or College Campus Ray Uzwyshyn, Ph.D. MBA MLIS Director, Collections.
ICPSR Data Fair November 8, 2010 Katherine McNeill, MIT Libraries
NRF Open Access Statement
Jeff Moon Data Librarian &
Kathy Weimer Coordinator of Map and GIS Services and Collections
Emphasize “scholarly” and “universities” to distinguish TDL from other efforts. A digital infrastructure for the scholarly activities of Texas universities.
New Library Digital Initiatives for Research Faculty
Vision... “… a network of learning environments and resources for Science, Mathematics, Engineering and Technology education, will ultimately meet the.
Texas State University Libraries
Division of Collections and Digital Services Services for Faculty
TDL Forum WEDNESDAY, November 30, 2016 Kristi Park Executive Director
Easy Ways to Support Campus Data Needs
DataNet Collaboration
? What is Institutional Repository for Rutgers University
An Overview of Data-PASS Shared Catalog
Summit 2017 Breakout Group 2: Data Management (DM)
Data Management and Open Access Requirements for Funded Research
CFI John R Evans Leaders Fund Digital Data Management
Jay Bhatt Drexel University Libraries
Introduction and Use of Dataverse at Boston College
The Institute of Quantitative Social Science
Managing ETDs with Associated Complex Digital Objects
Project TIER is supported by the Alfred P. Sloan Foundation
Opening Access: Increasing Scholarly Impact with
Research Data Management
Repository Platforms for Research Data Interest Group: Requirements, Gaps, Capabilities, and Progress Robert R. Downs1, 1 NASA.
Research Infrastructures: Ensuring trust and quality of data
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
Purdue University The PURR campus data repository service: institutional effort looking towards international engagement Michael Witt, associate.
OPEN ACCESS POLICY Larshan Naicker Rhodes University Library
Bird of Feather Session
Dataverse for citing and sharing research data
Research data lifecycle²
Presentation transcript:

The Texas Data Repository Texas State University Ray Uzwyshyn, Director, Collections and Digital Services, Texas State University Libraries

Online Data Repositories (Background) Online Way to Manage a Researcher’s Data/Metadata Long Term Data Archiving, Preservation, Sharing Strategy (data, paratextual material, field notes, docs, multimedia and programs) Permalinking Strategy for Online Data Citation/Access (DOI: Digital Object Identifier, UNF:University Numerical Fingerprint, Linked Data, Interoperability)

Why are Data Management Repositories Required? Most major Federal grant agencies require data management plans as mandatory part of the grant proposal/oversite process. ( NIH 2003; NSF 2011; NEH, 2013 USDA)

Research Data Repository Software May be hosted or installed on a university’s server Each software contains different ranges of management, collaborative options Open source and proprietary options Ingestion of Various Data Types (from Excel to SPSS to more esoteric disciplinary specific formats)

State of Texas Data Repository Group Formed in 2014 (Texas Digital Library) Charge: To Determine a Suitable Data repository infrastructure and management model at a consortial (State Level) Evaluate software models Develop Needs Assessment Make service recommendations Document findings

TDL is a Texas Consortium of 22 universities across Texas leveraging technological cooperation among academic libraries Coordinated through Texas Digital Library at UT Austin, connected with other bodies (TACC, Texas Advanced Center Computing, DPN, Digital Preservation Network, Duracloud) Texas Digital Library (TDL)

Conclusion: The group recommends that TDL adopt Harvard’s Dataverse to facilitate the discovery of research data. Dataverse provides the best : system performance Robustness Usability platform availability an active open source community 2014: Data Repository Working Group Formed Working Group Report (August 28, 2015)

Dataverse Harvard Provides a Software framework that enables institutions to host research data repositories Digital Preservation and archival Infrastructure: allows sharing, control, persistent data citation, data publishing and management

Dataverse Details ●Relatively simple ingest for researchers ●Ability to share with trusted research groups prior to publication ●Ability to version datasets ●Supports data citations (e.g. DOIs, recognition and credit) ●Allows for control over branding (customization) ●Helps researchers fulfill Data Management Plan requirements ●Sustainable platform with growing open source community led by Harvard

Texas Data Repository Network Architecture Why the Dataverse Network? (silent video overview) Open Journal Systems Dataverse Integration Open Journal Systems Dataverse Integration (2014) Research Study Data Data Set Files Metadata ( Data Describing the data) Paratextual Research Material (Methodology, Field Notes, Multimedia Graphs, Programs etc.) TDL Consortium Institutions (i.e. Texas State)

Dataverse Metadata Example

Dataverse Metadata Example (From the Simple to Very Complex)

TDL Dataverse Implementation Working Group (August 2015 – December 2016) Charge : Pilot test, assess, and launch a consortial repository for research data archiving and management. Committees Working Group members Texas Universities Texas State University Part of Main, Policy, Governance and Technology Implementation Groups Main Working Group & Subcommittees: Policy and Governance Workflows and Outreach Budget/Business Models Technology

Working Group Areas Many Planning Aspects of Data Research Repositories August 2015 – April 2016 The Research Data Repository Lifecycle

Consortial Repositories, Prototype and Pilot Study, May-August 2016

UX (Usability Focus)

Texas Data Repository Initiative Sept December 2016 Current Dates Soft launch, September – November 2016 Online Research Data Symposium (Baylor, November 15-16) Official Launch of Data Repository (December 1 st, 2016) Inaugural Year/Local Infrastructure (January – December, 2017)

Texas State University Library Infrastructures & Federal Mandates For Public Access to Research Publication repositories (D- Space) Data repositories, Texas Data Repository Human Resource Infrastructure Data Repository Librarian Subject Liaisons (Outreach) Publication Repository Librarian Workflows, standards, & policies public-access-results-federally-funded-research The Library Supports:

Texas Data Repository Accommodates Most Sizes of Data Projects Normal to Mid-Range, 90% Files/Data Fit on Server/Cloud, may be uploaded, Dataverse, 2GB File size max currently, unlimited number of files/faculty/dataverse) Huge, Global Scale Projects, 10% (Data may require specialized university IT Support, i.e. terabyte/petabyte online storage, consortial possibilities, Chronopolis, Texas Advanced Computer Center, DEEPN, Duracloud) Borgman, C Big Data, Little Data, No Data: Scholarship in the Networked Age

Types of Data Repositories Institutional/Consortial Repository (Texas State University and/or or consortial) Project/Discipline specific (usually large single faculty/faculty team projects, i.e. Academic Specialization, Purdue Nanohub, Engineering etc. )

DMP Policy Tool Integration Overview Video Overview Video Customizable Plan Outline Tool Resource Links Supports All Major Funders Texas Data Repository Template Boilerplate California Digital Library

Electronic Thesis and Dissertations (ETD) Repository (D-Space) Connections Co-publish data sets in ETD (D- SPACE) and Data Repository, Links in metadata in D-SPACE and DATA REPOSITORY Future Possible ETD (D-Space), VIREO, DATA REPOSITORY CONNECTIONS

Research Data Repository Adoption Lifecycle (2016) Research Universities &

Comments/Questions

Add Data Share, publish, and archive Find Data Search across disciplines Cite Data Obtain a citation and unique identifier

Use Cases: Make Research Data Publicly Available and/or Sharing Data Primary Actors: PIs of federally funded research Researchers working on unfunded research/ funded research with no retention requirements Graduate students working on theses, dissertations, or other data-generating projects. Geographically Dispersed Researchers and Project Teams Wishing to Collaborate

Use Case: Seek Data to (Re)Use Primary Actors: Researcher is interested in conducting a meta study reusing data developed in earlier studies Public using data for personal needs Organizations seeking data for their needs.

Pilot Study Responses Perceived Benefits of Data Repository Fulfill federal mandates for sharing publications and research data Make research data more widely available Statistics on downloads and citations of my data Make my data citeable through the assignment of a DOI (digital object identifier) Saving various versions of the dataset (data lifecycle) Collecting all my data in one place

Further Links/References ARL NSF Data Sharing Policy and Resource Links, ARL (White House Directives and Funded Research Data ) Borgman, C Big Data, Little Data, No Data. Scholarship in the Networked Age. California Digital Library DMT Tool: Chronopolis: Dataverse. Dataverse (Data Science Site). DPN (Digital Preservation Network) Duracloud: Purr. (Purdue Institutional Data Repository). Hubzero. Figshare. ICPSR Data Management & Curation. Research Data Management. Principles, Practices, and Prospects (November 2013). Council on Library and Information Resources. Cox, A. and Pinfield, S. Research Data Management and Libraries. Jounral of Librarianship and Information Science. June Fearon, D & Sallans, A. C. (January 2014). Institutional Research Data Management: Policies, Planning, Services and Surveys. Coalition for Networked Information. (video presentation) Data Management for Libraries: (LITA Guide) NMC Horizon Report: 2014 Library Edition. “Research Data Management”. pp. 6-7 and pp 24 – 45. Holden, J. Memorandum for Heads of Executive Departments and Agencies: Increasing Access to the Results of Federally Funded Research (2013). Green, A. Macdonald, S and Rice, R. Policy-making for Research Data in Repositories: A Guide. DISC-UK. Research Data Management in the Arts and Humanities (2013). University of Oxford. forum-rdmf/rdmf10-research-data-management-arts-and-humanities (Conference Presentations) forum-rdmf/rdmf10-research-data-management-arts-and-humanities

Why Are Data Management Plans Required Leverage and make available faculty, departmental and institutional research Allow publication of negative data (less research replication) Wordle of the National Science Foundation’s Award and Administration Guide. Chapter VI.D.4, Mandatory 2011

Data Management Plans Part of Evolving Science, Social Science and Humanities Research Process (Accuracy, efficiency, sharing) Wordle of the data management policy of the Office of Digital Humanities, National Endowment for the Humanities, 2013

ARL Libraries 2015 Online Data Management Plan Implementation Fearon, D & Sallans, A. C. (January 2014) Institutional Research Data Management: Policies, Planning, Services and Surveys. Coalition for Networked Information. (54 ARL Libraries currently offer data management services_)

Current DMP Platforms (2015) Fearon, D & Sallans, A. C. (January 2014) Institutional Research Data Management: Policies, Planning, Services and Surveys. Coalition for Networked Information. (54 ARL Libraries currently offer data management services_)

Data Sharing Currently, 80% of researchers do not share their data Andreoli-Versbach, P., Mueller-Langer, F. (November 2014). Open access to data: An ideal professed but not practiced. Research Policy.,

Collaboration Across Institutions Jones et al. (2008). Science 322: