The world’s libraries. Connected. The Challenges of Digging Data: A Study of Context in Archaeological Data Reuse Joint Conference on Digital Libraries.

Presentation transcript:

The Challenges of Digging Data: A Study of Context in Archaeological Data Reuse
Joint Conference on Digital Libraries (JCDL), July 22-25, 2013, Indianapolis, Indiana
Elizabeth Yakel, Ph.D., University of Michigan
Ixchel M. Faniel, Ph.D., OCLC Research
Eric Kansa, Ph.D., Open Context and University of California, Berkeley
Sarah Whitcher Kansa, Ph.D., The Alexandria Archive Institute
Julianna Barrera-Gomez, OCLC Research

An Institute of Museum and Library Services (IMLS) funded project led by Dr. Ixchel Faniel and Dr. Elizabeth Yakel.
The project studies data reuse in three academic disciplines to identify how contextual information about the data that supports reuse can best be created and preserved.
It focuses on research data produced and used by quantitative social scientists, archaeologists, and zoologists.
The intended audiences are researchers who use secondary data and the digital curators, digital repository managers, data center staff, and others who collect, manage, and store digital information.
For more information, please visit

DIPIR Project: The Research Team
- Ixchel Faniel, OCLC Research (PI)
- Elizabeth Yakel, University of Michigan (Co-PI)
- Nancy McGovern, ICPSR/MIT
- Eric Kansa, Open Context
- William Fink, UM Museum of Zoology

Methods Overview
ICPSR | Open Context | UMMZ
Phase 1: Project start up
- Interviews, staff: 10 Winter Winter Spring 2011
Phase 2: Collecting and analyzing user data
- Interviews, data consumers: 43 Winter Winter Fall 2012
- Survey, data consumers: 2000 Summer 2012
- Web analytics, data consumers: Server logs Ongoing
- Observations, data consumers: 10 Ongoing
Phase 3: Mapping significant properties as representation information

Motivation
- Social and economic forces are pushing toward digital archaeological data publication
- No robust set of standards exists for field archaeology
- Data reuse studies can inform standards development, but there are few outside of science and engineering disciplines

The Study
Research Questions
1. How does contextual information serve to preserve the meaning of and trust in archaeological field research over time?
2. How can existing cultural heritage standards be extended to incorporate these contextual elements?
Data Collection
- 22 interviews with archaeologists
Data Analysis
- Code set developed and expanded from interview protocol

Findings
- The lack of context was a persistent problem.
- Data collection procedures were highly sought during data reuse.
- Additional context also played a role during data reuse.

Findings: The lack of context was a persistent problem during data reuse.
MUSEUM COLLECTIONS
“…There was less concern about provenance information or context information. So objects are treated as objects and not as objects within their contextual world…” (CCU20).
EARLY FIELD STUDIES
“So we did not have access to critical information, such as archaeological contexts, excavation methods, sampling methods, even identification methods. We didn't know if the analysts actually used comparative collections or just published manuals to identify specimens or how did she sample... She didn't mention or detail those things.” (CCU16).
CONTEMPORARY FIELD STUDIES
“You need to do a lot of cleaning and translating to make things work. But the concepts in the archaeological ontologies that are being used to describe are still professionally the same, but they’re recorded in various scales. They may use different terminologies, different data types” (CCU12).

Findings: Data collection procedures were highly sought during data reuse.
Accounting for Interpretations of Context Made in the Field
“We make a sort of series of interlocking assumptions about the certificate of a finding and the material that I’m processing...” (CCU18).
Accounting for Context Destroyed in the Field
“Just knowing an object is there is nothing. You have to know all about it. You need to know where it comes from, how it was acquired, how it was excavated. Everything we know has to be tied to that object, otherwise, it’s useless” (CCU11).
Accounting for Different Approaches in the Field
“We have to look at their field methods and that's, for example, did they walk with spacing close enough so that they were picking up…They'll hit a site, but they'll walk by little tiny sherd scattered things…So you kind of need to know that. I've heard of things like shoulder surveys, where they literally walk side by side and pick those little things, but then, again, you've only, you're doing a very narrow tract. So there are procedures” (CCU01).

Findings: Additional context also played a role during data reuse.
DATA RECORDING PROCEDURES
“If somebody was writing about, say, a loci that they were digging and they were talking about some of the major finds before they were talking about the dirt, the matrix, and kind of its relationship to the other squares around it, I was more wary...” (CCU10).
REPUTATION OF THE DATA REPOSITORY
“They're very keen on producing the comprehensive metadata. And it's not that I trust each research [study]... but I trust that the metadata is there for me to go back and check out each file on my own. I don't give [the repository] a sort of blanket trust that all the data in there is correct, but...I sort of trust going there because I know that I can find the information I need to validate it” (CCU02).
REPUTATION AND SCHOLARLY AFFILIATION
“there are individuals that I have a lot of respect for, and I really respect their training. If it's somebody whose training I don't know about, I'm going to be less likely to use their dataset because I'm not sure how reliable it is” (CCU06).

Implications: Documenting Context is Challenging
- What: typology & description of finds
- Who: institutional, personal (training, reputation)
- Where & When: stratigraphic / positional, chronology
- How: methods, sampling strategies, identification procedures, instruments, etc.
- Why: research, preservation, and documentation goals

Implications: Documenting Context is Challenging
CIDOC-CRM: ontology for “cultural heritage” (mainly museum) data, recently extended for archaeology
- Complex (dozens of classes & properties)
- Abstract (models historical “events” relating people, places, things, and actions)
Needs to be used in conjunction with controlled vocabularies

Implications: Documenting Context is Challenging
Can use general controlled vocabularies & thesauri (British Museum, EOL, UBERON & others)
But!
- Expertise required (“Data Editors” in Open Context case)
- Specific classification can be controversial / disputed (research / interpretive goal)

Implications: Documenting Context is Challenging
Name authorities, researcher identity systems (VIAF, ORCID)

Implications: Documenting Context is Challenging
Standards either under-developed or not widely applied and understood.
Challenges:
(1) Interpretive (chronology is a research outcome, not a given)
(2) Multidisciplinary breadth (zoology, soil science, chemistry, geology, botany, genetics...)
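
The What / Who / Where & When / How / Why grouping above can be read as a checklist for a dataset's documentation. A minimal sketch in Python, assuming a hypothetical `ContextRecord` structure: the class, its field names, and the `missing_context` helper are illustrative only and are not part of the project, Open Context, or CIDOC-CRM.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class ContextRecord:
    """Hypothetical record; one free-text field per context dimension."""
    what: Optional[str] = None        # typology & description of finds
    who: Optional[str] = None         # institution or person (training, reputation)
    where_when: Optional[str] = None  # stratigraphic / positional context, chronology
    how: Optional[str] = None         # methods, sampling, identification procedures
    why: Optional[str] = None         # research, preservation, documentation goals


def missing_context(record: ContextRecord) -> List[str]:
    """Return the names of the context dimensions left undocumented."""
    return [f.name for f in fields(record) if not getattr(record, f.name)]


# Illustrative values only, not data from the study.
record = ContextRecord(
    what="faunal specimen, identified to species",
    how="identified against a comparative collection",
)
print(missing_context(record))  # ['who', 'where_when', 'why']
```

A flat checklist like this only flags which dimensions are absent. It does not capture the interpretive and multidisciplinary challenges noted above, which is where controlled vocabularies and expert editing come in.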

Conclusions
- Researchers have an interest in the entire data life-cycle (data collection preparation through repository)
- Need more studies involving data integration and reuse to help guide standards development (CIDOC-CRM not sufficient)

One does not simply share usable data…

Acknowledgements
Institute of Museum and Library Services, LG
Our co-authors: Sarah Whitcher Kansa, Ph.D., Julianna Barrera-Gomez, M.S.I., Elizabeth Yakel, Ph.D.
Partners: Nancy McGovern, Ph.D. (MIT), Eric Kansa, Ph.D. (Open Context), William Fink, Ph.D. (University of Michigan Museum of Zoology)
Students: Morgan Daniels, Rebecca Frank, Adam Kriesberg, Jessica Schaengold, Gavin Strassel, Michele DeLia, Kathleen Fear, Mallory Hood, Molly Haig, Annelise Doll, Monique Lowe

Questions?
Ixchel M. Faniel
Eric Kansa