Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Organising and Documenting Data Stuart Macdonald EDINA & Data Library DIY Research Data Management Training Kit for Librarians.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
University of Southampton, U.K.
Metadata: An Introduction By Wendy Duff October 13, 2001 ECURE.
The KnowledgeBank: Powered by DSpace Laura Tull Systems Librarian Ohio State University Libraries WiLSWorld July 27, 2004.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Elements of a Data Management Plan Alison Boyer Environmental Sciences Division Oak Ridge National Laboratory.
Elements of a Data Management Plan
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional research repository for the University of Pretoria.
Introduction to Geospatial Metadata – FGDC CSDGM National Coastal Data Development Center A division of the National Oceanographic Data Center Please .
GRAD 521, Research Data Management Winter 2014 – Lecture 2 Amanda L. Whitmire, Asst. Professor.
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
Preserving the Scientific Record: Establishing Relationships with Archives Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review.
GRAD 521, Research Data Management Winter 2014 – Lecture 9 Amanda L. Whitmire, Asst. Professor.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
Elements of a Data Management Plan Bill Michener University Libraries University of New Mexico Data Management Practices for.
Data curation in an existing infrastructure: Stellenbosch University 1 st African Digital Curation Conference 12 – 13 February 2008 Wouter Klapwijk Senior.
Research Data Management System project: Best Practices in Research Data Management* *Adaptation of the NECDMC.
An Introduction to Metadata Tammy Walker Beaty Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN Data Management.
UVa Library Research Data Services
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Data Management: Documentation & Metadata Sherry Lake, Senior Data Consultant Bill Corey, Data Consultant Jeremy Bartczak, Intellectual Access & Metadata.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
P. Schirmbacher Humboldt-Universität zu Berlin The Changing Process of Scholarly Publishing or the Necessity of a New Culture of Electronic.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Data Management 101 for Earth Scientists Data Management Plans Robert Cook Environmental Sciences Division Oak Ridge National Laboratory.
Elements of a Data Management Plan: Roles and Responsibilities Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
Managing the Impacts of Change on Archiving Research Data A Presentation for “International Workshop on Strategies for Preservation of and Open Access.
Introduction to metadata
DataONE: Preserving Data and Enabling Data-Intensive Biological and Environmental Research Bob Cook Environmental Sciences Division Oak Ridge National.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Data Management & the Library. FACT #1 Research is increasingly digital and produces digital data.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
Elements of a Data Management Plan Bill Michener University of New Mexico
OAIS, Designated Communities & Metadata Jerome McDonough Graduate School of Library & Information Science University of Illinois Urbana-Champaign
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Archiving microdata Standards and good practices United Nations Statistics Commission New York, February 26, 2009 Olivier Dupriez World Bank, Development.
Basic Encoded Archival Description METRO New York Library Council Workshop Presented by Lara Nicosia December 9, 2011 New York, NY.
Primer on Data Management Data Management Plans Robert Cook Environmental Sciences Division Oak Ridge National Laboratory American Meteorological Society.
DOE Data Management Plan Requirements
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Digital Stewardship Lee Dotson Digital Initiatives Librarian University of Central Florida John C. Hitt Library Presentation available at
Data Management and Digital Preservation Carly Dearborn, MSIS Digital Preservation & Electronic Records Archivist
OAI metadata: why and how Jenn Riley Metadata Librarian Indiana University.
R2R ↔ NODC Steve Rutz NODC Observing Systems Team Leader May 12, 2011 Presented by L. Pikula, IODE OceanTeacher Course Data Management for Information.
Navigating NECDMC Andrew Creamer, MAEd, MSLIS
Open Exeter Project Team
Metadata - what works, what doesn’t?
Introduction to Metadata
Data Management: Documentation & Metadata
Digital Project Lifecycle Curating Across the Curriculum
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
Developing Institutional Data Repositories
Research data lifecycle²
Research Data Dr Aoife Coffey, Research Data Coordinator
Presentation transcript:

Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012

Why is a Librarian asking? We are curious. We manage information. Data are a kind of information.

TAKING CARE OF YOUR DATA What’s your plan?

GOAL: Achievable habits for implementing data management best practices into your workflow

“…the recorded factual material commonly accepted in the scientific community as necessary to validate validate research findings.” Research data is: U.S. Office of Management and Budget, Circular A-110

long-term “…management activities required to maintain research data long-term such that it is available for reusepreservation reuse and preservation.” Data curation is: Wikipedia CURATION ≠ ARCHIVAL

available “It is obvious that making data widely available is an essential element of scientific research.” Science editorial, “Making Data Maximally Available,” 11 Feb 2011

The case for data management stewardship curation etc. $

Common missteps “Why can’t I open this WordPerfect document?” “I think those data are on a ZipDisk somewhere…” “Oh, that dataset is on our group server…” “I never actually gave my advisor the final dataset…” “My laptop got stolen, so I lost the data…” “It was so long ago, I can’t remember …”

Research data lifecycle New research question posed Research planning & design Data collection & description Data processing & analysis Dissemination & publication of findings Data archiving Accessible data located Data transformed / repurposed Research Cycle

How can we help? New research question posed Research planning & design Data collection & description Data processing & analysis Dissemination & publication of findings Data archiving Accessible data located Data transformed / repurposed Research Cycle

Where to start? How much data? Resources needed Roles & responsibilities Metadata Data formats Data storage Ethics & consent Copyright (open data) Sharing Make a plan. Consider:

A few tidbits

Data storage & curation Anticipate: Volume/File type(s) Raw data vs. processed/analyzed data File Naming Conventions Privacy Concerns Storage practice Backup plans (LOCKSS, checksums)

File naming conventions 1. Be consistent Have conventions for naming: (1) Directory structure (2) Folder names (3) File names Always include the same information (e.g. date and time) Retain the order of information (e.g. YYYYMMDD, not MMDDYYY ) 2. Be descriptive Try to keep file and folder names under 32 characters example: Project_instrument_location_YYYYMMDDhhmmss_extra.ext SG157_ _001.raw (raw data)  SG157_ _001.mat (working data)  ESPOMZ_SG157_ _001.txt (shareable)

Legal and ethical considerations Intellectual property Office for Commercialization & Corporate Development (OCCD) Copyright Licensing Charging for data? Data attribution & citation Human subjects?  Informed consent & anonymization prior to publishing OSU: Office of Research Integrity, Institutional Review Board (IRB) Responsible Conduct of Research (RCR) Program

Archiving and preservation Policies Preservation options Types of repositories Costs and benefits

University of Southampton School of Electronics & Computer Science Southampton, UK, 2005 A word about backups…

Metadata “The metadata accompanying your data should be written for a user 20 years into the future -- what does that person need to know to use your data properly? Prepare the metadata for a user who is unfamiliar with your project, methods, or observations.” Oak Ridge National Laboratory Distributed Active Archive Center for Biogeochemical Dynamics (ORNL DAAC)

What is Metadata? Metadata is “data about data” WHO created the data? WHAT is the content of the data? WHEN were the data created? WHERE is it geographically? HOW were the data developed? WHY were the data developed?

Metadata schemes Dublin Core (DC), Darwin Core (DwC), EML, DDI, NBII, FGDC/CSDGM, ISO 19139, ISO 19115, DIF, LDIF, e-GMS, AGLS, METS, MODS, PREMIS, OAI-PMH, MARC, CDWA, CIDOC/CRM, DACS, DIG35, GILS, GML, ISBD, LCSH, KML, MARCXML, MEI, MODS, MIX, OAIS, ANSI/NISO Z39.88, PB Core, PRISM, QDC, RDF, SGML, VSO, XML, XMP X

Metadata schemes “Metadata schemes are like toothbrushes – everybody agrees that you should use one, but nobody wants to use someone else’s.”

You already use metadata…

Metadata in use StateCityLocationDateTimeTemperature (F) AlaskaAnchorageCity Hall2/12/ FloridaMiami Weather Center 2/12/ New York Empire State Building 2/12/

Metadata in real life You use it all the time…

Darwin Core | biological diversity, taxonomy Dublin Core | general DDI (Data Documentation Initiative) | social and behavioral sciences data DIF (Directory Interchange Format) | environmental sciences EML (Ecological Metadata Language) | ecology FGDC/CSDGM (Federal Geographic Data Committee/Content Standard for Digital Geospatial Metadata) | geographic data NBII (National Biological Information Infrastructure) | biology Major metadata standards

Metadata activity! Take it away, Maura…

Let’s Describe this Dataset Bright orange Garibaldi fish Hypsypops rubicundus California, USA Ornate Butterfly fish Chaetodon ornatissimus Indo-Pacific

Scenario 1 Research for preschoolers to see if they learn colors and patterns better from real life examples

Scenario 2 Research on what fish are local to a particular area. The photos are the data

Scenario 3 Research into specific details of specific types of fish

File/Folder Organization You have monitors attached to 18 athletes (6 tennis players, 6 golfers, 6 rowers) for 7 days. Each day you get 2 readouts for each athlete, 1 for heart rate and 1 for body temperature. You transfer the data to Excel. Name and organize the files for this experiment.

Think about your own data –What types of data need to be described? –What are the relationships between them? –What descriptive metadata can you find? –What metadata is being captured automatically? –What other descriptive metadata do you need to help users find your data? –What metadata do you need to help other scientists reproduce your data or use it for comparison? –What events has/will the data undergo? –For how long do you want to retain the data? –How intensive are your preservation needs? –How diverse is your user base? Does this influence your preservation needs?

Data Management Plans

The types of data Data & metadata standards | format and content Policies for access and sharing Policies and provisions for re-use Plans for archiving data {Budget} $ $ $

Use available resources ata-management- planning

Contact information Amanda Whitmire | Data Management Specialist Maura Valentino | Metadata Librarian

fin