Jane Greenberg SILS/Metadata Research Center School of Info. & Library Science Univ. of North Carolina at Chapel Hill The DRYAD Repository.

Slides:



Advertisements
Similar presentations
“Classifying Scientific Data Objects with Bibliographic Relationship
Advertisements

The Role of the Librarian in an Open Access World Ellen Finnie Duranceau Scholarly Publishing & Licensing Consultant MIT Libraries BioMed Central Consultation.
Access Strategies for Digital Video and Digital Rights Management Grace Agnew, Georgia Institute of Technology Mairéad Martin, University of Tennessee.
EPrints - Introducing EPrints 3 Software William J Nixon Digital Library Development Manager, University of Glasgow With many thanks to Les Carr and the.
Introduction The field of evolutionary biology draws from ecology, paleontology, population genetics, physiology, systematics, and new biological sub disciplines.
Theories of Evolution and Cultural Diffusion: The Dryad Repository Case Study for Understanding Changes in Organizing Information Practices ~~~~~~ ~~~~~~
Building Support for a Discipline-Based Data Repository Ryan Scherle 1, Sarah Carrier 2, Jane Greenberg 2, Hilmar Lapp 1, Abbey Thompson 2, Todd Vision.
Ryan Scherle and Jane Greenberg. A Repository of Data Underlying Journal Articles.
Toward a Data Repository for Evolutionary Biology: Toward a Data Repository for Evolutionary Biology: Jane Greenberg, Associate Professor, Director SILS/Metadata.
The Dryad Data Repository Ryan Scherle National Evolutionary Synthesis Center.
Evolutionary biology Population genetics Systematics Paleontology Botany and Zoology Genomics Ecology Medicine Agriculture Anthropology Bioinformatics.
The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
Scholarly Communications in Flux Michael Jubb Director, Research Information Network Bloomsbury Conference on E-Publishing and E-Publications 29 June 2007.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
Social Sciences Collections & Research: a new content-based team Gillian Ridgley, Ian Cooke, Jerry Jenkins.
Data and Publication Discovery Brian Matthews, Information Management Group, STFC Rutherford Appleton Laboratory CLADDIER workshop, Chilworth, Southampton,
A centre of expertise in digital information management UKOLN is supported by: UK Perspectives on the Curation and Preservation of Scientific.
Can We Talk? MICHAEL Conference London May 23, 2008Joyce Ray.
The Discovery Landscape in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK – eBank UK project A centre.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
Marine information management in UNESCO-IOC-ODINAFRICA Paul Nieuwenhuysen Vrije Universiteit Brussel, and University of Antwerp, Belgium Presented at.
Jane Greenberg, Professor and Director, Metadata Research Center School of Information And Library Science University of North Carolina at Chapel Hill.
The Dublin Core Collection Description Application Profile (DC CD AP) Pete Johnston, UKOLN, University of Bath Chair, DC Collection Description Working.
Current status Todd Vision (overview) Elena Feinstein (curation) Ryan Scherle (demo) 7/23/12Dryad Board of Directors1.
Good practice in Research Data Management Module 5: Deposit and long-term preservation.
OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
Data archiving in evolutionary biology Michael Whitlock.
Helping Helping Interdisciplinary Vocabulary Engineering Ryan Scherle – National Evolutionary Synthesis Center Jose Aguera – University of North Carolina.
Educating Librarians in the Middle East: Building Bridges for the 21st Century Educating Librarians in the Middle East: Building Bridges for the 21st Century.
University of Southampton, U.K.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
The Subject Librarian's Role in Building Digital Collections: Where Information Management and Subject Expertise Meet Ruth Vondracek Oregon State University.
1 Digital Libraries and Evidence in the Developing World Context Dr. Jon Ferguson Senior Health Database Scientist IMMPACT Project University of Aberdeen.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Using Metadata Skills for a Course Inventory Lee Richardson Health Sciences Library University of North Carolina at Chapel Hill ALA Annual Conference June.
Ask A Librarian and QuestionPoint: Integrating Collaborative Digital Reference in the Real World (and in a really big library) Linda J. White Digital Project.
Evolving Roles in Scholarly Communications Susan Reilly, APA, Frascati, 7th Nov, 2012.
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
A Metadata Application Profile for the DRIADE Project Sarah Carrier, Jed Dube, Jane Greenberg March 13, 2007 _____________________.
Supporting scientific communities by publishing data Dryad Digital Repository Peggy Schaeffer OpenAIRE/LIBER Workshop May 28, 2013 Ghent, Belgium.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
Requirements & Challenges: Status and Open Questions Hilmar Lapp National Evolutionary Synthesis Center DRIADE Workshop, Dec 5, 2006 Hilmar Lapp National.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
January 22, 2014 Questions? PIM lecture In Class Activities Readings for next week.
HIVE: Enabling Common Language and Interdisciplinarity EPA-NIEHS Advancing Environmental Health Data Sharing and Analysis: Finding a Common Language June.
Managing End-User Development of Digital Library Resources to Support User Communities Robert R. Downs Center for International Earth Science Information.
Data archiving and curation Ryan Scherle Data Repository Architect Dryad Digital Repository CurateGear January 8, 2014 You may reuse any of the original.
GEO Work Plan Symposium 2012 ID-03: Science and Technology in GEOSS ID-03-C1: Engaging the Science and Technology (S&T) Community in GEOSS Implementation.
Dryad A digital data repository A typical published data package Contains data that belongs in a specialized repository (e.g. Genbank, Treebase, Morphbank,
Digesting the Genome Glut Promoting the Use and Extension of GMOD To Emerging Model Organisms David Clements 1 Brian Osborne 2 Hilmar Lapp 1 Xianhua Liu.
ST-09-01: Catalyzing Research and Development (R&D) Funding for GEOSS Florence Béroud, EC Jérome Bequignon, ESA Kathy Fontaine, US ST Kick-off Meeting.
Jane Greenberg, Associative Professor and Director, SILS Metadata Research Center, School of Information and Library Science, University of North Carolina.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
/Greenberg/NDS DataDryad.org and the interoperability continuum. Repositories and Interoperability 2nd National Data Service Consortium Workshop.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Metadata Issues Underlying the Development of a Data Repository for Evolutionary Biology Sarah Carrier, SILS, Master’s Student Jackson Dube, Visiting Scholar,
Jane Greenberg & the Dryad Team The DRYAD Repository ~~~~~~ INLS 720 visit to NESCent November 17, 2008.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
+ Building a Community of Practice for Research Data Services Experience of CLIR/DLF E-Research Peer Network & Mentoring Group Presentation for DLF Forum.
Chelcie Rowell Jane Greenberg Metadata Research Center UNC-Chapel Hill CONTROLLED VOCABULARY STATUS & POTENTIAL IN DATA REPOSITORIES Authority Control.
Data sharing and exchange: Experiences within the
Utility of an OAI Service Provider Search Portal
NRF Knowledge Management Corporate
Bird of Feather Session
Presentation transcript:

Jane Greenberg SILS/Metadata Research Center School of Info. & Library Science Univ. of North Carolina at Chapel Hill The DRYAD Repository ~~~~~~ Librarians and e-Science: Focusing Toward 20/20 CIC 2008: May 12

Overview DRYAD Formerly: DRIADE – (Digital Repository of Information and Data for Evolution) NESCent / SILS Metadata Research Center collaboration Research CIC context Conclusions /

Motivation for Dryad Small science repositories (SSR) Knowledge Network for Biocomplexity (KNB), Marine Metadata Initiative (MMI) Evolutionary biology Publication process Supplementary data (Evolution, American Naturalists) not Author, deposition date, not subject species, geo. locator Data deposition (Genbank, TreeBase, Morphbank) NESCent & SILS/Metadata Research Center ecology, paleontology, population genetics, physiology, systematics + genomics

Dryads Goals 1. One-stop deposition and shopping for data objects supporting published research… 108 data objects, 23 pubs. American Naturalist, Evolution, 2. Support the acquisition, preservation, resource discovery, and reuse of heterogeneous digital datasets 3. Balance a need for low barriers, with higher-level … data synthesis Dryad Team NESCent PI: Todd Vision, Director of Informatics and Associate Professor, Biology, UNC Hilmar Lapp, Assistant Director of Informatics Ryan Scherle, Data Repository Architect UNC/SILS/MRC PI: Jane Greenberg, Associate Professor, SILS and MRC Sarah Carrier, Research Assistant Abbey Thompson, DRIADE R.A./SILS Masters Student Hollie White, Doctoral Fellow Amy Bouck, Biology, Post doc

Dryad Depositor/s Specialized Repositories -Genbank -TreeBase -Morphbank -PaleoDB Journals & journal repositories Dryad -Data objects supporting published research Researcher/s One stop deposition One stop shopping an option

Research and Development

R & D: Accomplishments and Activities Functional requirements and model Workshops: Stakeholders (Dec. 06), SSR (May 07) Repository analysis (Dube, et al. JCDL, 2007) - OAIS (Open Archival Information System), DSpace Metadata architecture Level one application profile Namespace schemas: 1.Dublin Core 2.Data Documentation Initiative (DDI) 3.Ecological Metadata Language (EML) 4.PREMIS 5.Darwin Core Modular scheme: 1.Journal citation 2.Data objects (Carrier, et al., 2007)

R & D: Accomplishments and Activities Vocabulary analysis NBII Thesaurus, LCSH, the Gettys TGN 600 keywords, Dryad partner journals Facets: taxon, geographic name, time period, topic W3C SKOS (Simple Knowledge Organisation Systems) Instantiation study Bibliographic relationships for life-cycle management (Coleman, 2002; Smiraglia, 1999, 2000, 2001, 2002, etc.; Tillett; FRBR, DCAM)

Data object relationships Equivalence Derivative Whole-partSequential A (=same data set on paper) A (=data set in Excel) A (=same data set in SAS) A1 (=part 1 of a data set) C (=data set A revised) B (=data set A annotated) A (=data set) A (=data set) A 1 (=a subset of A) A2 (=part 2 of a data set)

Instantiation Scenario: Sherry collects data on the survival and growth of the plant Borrichia frutescens (the bushy seaside tansy)… back at the lab she enters the exact same data into an excel spreadsheet and saves it on her hard drive. Question: What is the relationship between Sherrys paper data sheet and her excel spreadsheet? Answer: Equivalent | Derivative | Whole-part | Sequential (circle one) Findings (20 participants) In general, more seasoned scientists better grasp Sequential data presented the most difficulty (less seasoned sci.) Unanimous support: very extremely important

R & D: Accomplishments and Activities Use-case study Intensive interviews with evolutionary biologists about data sharing ~ show KNB, ask about metadata creation, interface issues to help w/input Survey International survey, launched via evoldir, ~ 300 respondents ~ included questions on labeling practices, understanding of metadata User perceptions and behaviors re: data sharing

=

About the collaboration… Pros, Benefits Challenges Synergy between implementation and research Broader familiarity with contacts & related projects (collective knowledge) Broader range of expertise for problem solving MRC: Contributing to a project that will benefit science and society A live lab, new research opportunities Alignment of research and implementation goals (most useful may not be the most interesting, vice/versa) priorities Language barriers Funding models: Gap research and implementation Understanding: Trust, Task assignment Not having everyone in the same building

Concluding remarks… CIC What is eScience and why does it matter to libraries and librarians?... Matters to LIS researcher and educators too, to help advance practice and train information professionals What are the needs of scientists who are using large data sets? … Small science has needs too, similar and perhaps distinct What are new ways that librarians can collaborate with and support science researchers? Dryad offers an exciting model What are the skills needed by librarians to work successfully in this arena? Bias: Research and evaluate implementations

A final quote… A revolution is taking place in the scientific method….Hypothesize, design, and run experiment is being replaced by hypothesize, look up answer in database. (Towards 2020 Science, MS Research, 2006;Lesk, M. 2004)

Dryad repository: Wiki: Jane Greenberg, Director, SILS Metadata Research Center