CBRC Digital Repository: Storing and viewing 3D objects, for science!
James Halliday, Programmer/Analyst, Library Technologies
Juliet L. Hardesty, Metadata Analyst, Metadata Resources & Systems
Indiana University Libraries
Digital Library Brown Bag, October 7, 2015
#dlbb

CBRC: Center for Biological Research Collections
Specimen collections
– Herbarium - 2D plants
– Paleontology and Zooarchaeology – 3D bones
– Tens of thousands of items
Specify Software
– Specimen management and cataloging
– Darwin Core, geospatial info

flickr Bones Bones & Plants #dlbb
Bones & Plants
Various marmot bones from Mineral Hill Cave, US Dept. of Interior Bureau of Land Management
CBRC Member Collection, Indiana University Herbarium

3D Bones

CBRC Needs
Cataloging and inventory management covered
No place to store/preserve images
No way to provide public access to digitized specimens

What IU Libraries will provide
Workflow for multiple file types (.tif, .stl, .zip)
Possibility of a mostly out-of-the-box system that lets collection managers actually manage collections in Fedora
– Type of collection (specialized metadata)
– 3D images
– New things we haven’t managed previously

Darwin Core metadata
All CBRC collections use the Darwin Core metadata standard
Extension of Dublin Core to add metadata specific to biological collections

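To make the "extension of Dublin Core" idea concrete, here is a minimal sketch of one specimen described with a mix of Darwin Core and Dublin Core terms, then expanded into subject/predicate/object triples. The term names and namespace URIs come from the published vocabularies. The record values, the `urn:example:` identifier, and the helper names `expand` and `to_triples` are placeholders invented for illustration, not real CBRC data or project code.

```python
# Namespace prefixes for the two vocabularies mentioned on the slide.
NAMESPACES = {
    "dwc": "http://rs.tdwg.org/dwc/terms/",
    "dcterms": "http://purl.org/dc/terms/",
}

# Hypothetical record: every value here is a placeholder, not real CBRC data.
record = {
    "dwc:scientificName": "Canis familiaris",
    "dwc:basisOfRecord": "PreservedSpecimen",
    "dwc:catalogNumber": "EXAMPLE-0001",
    "dwc:decimalLatitude": "39.17",
    "dwc:decimalLongitude": "-86.52",
    "dcterms:title": "Canis familiaris mandible",
}


def expand(prefixed_term):
    """Turn a prefixed term such as 'dwc:scientificName' into a full URI."""
    prefix, local_name = prefixed_term.split(":", 1)
    return NAMESPACES[prefix] + local_name


def to_triples(subject, rec):
    """Flatten a record into (subject, predicate URI, value) triples."""
    return [(subject, expand(term), value) for term, value in rec.items()]


triples = to_triples("urn:example:specimen-0001", record)
for s, p, o in triples:
    print(s, p, o)
```

The Dublin Core term (`dcterms:title`) and the Darwin Core terms sit side by side in the same record, which is the sense in which Darwin Core extends Dublin Core for biological collections.
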
3D viewer integration
3D scans can currently be viewed using solutions such as
– Sketchfab
– Adobe Acrobat PDF viewer
We hope to instead integrate a viewer solution with Sufia using technology being developed at IU’s Advanced Visualization Lab

3D printing at IU
Equipment, 3D Printing Studio, Indiana University
New 3-D printing lab in Wells Library open to IU employees, students, Inside IU, Sept. 30, 2015

What is Sufia?
Hydra-based institutional repository software
Based on Hydra infrastructure (Fedora backend, Blacklight-based Rails frontend)
Primarily based around self-deposit
Originated as ScholarSphere at Penn State
One of the most polished and well-developed Hydra projects

ScholarSphere

Sufia, Hydra and Fedora 4
As a Hydra project, Sufia uses Fedora as a backend
Fedora 4 is a drastic restructuring of Fedora, making it cleaner and leaner
Fedora 4 uses primarily RDF metadata
Sufia 6 uses Fedora 4, and is one of the first Hydra projects to fully move to Fedora 4

CBRC – why Sufia 6?
Migrations from Fedora 3 to Fedora 4 are tricky, due to drastic differences between the versions
CBRC data is all new, allowing us to create a Fedora 4 repository from scratch without worrying about migration
Sufia meets most of the needs of the CBRC project out of the box
Some features such as batch ingest still need to be added by local developers

The Future of Hydra and Fedora at IU
For now, our Sufia repository is specific to the CBRC collections
It also serves as a pilot project for Fedora 4
Eventually, we might use a single Sufia instance to hold a wider variety of content
Existing IU projects that currently use Fedora 3 will eventually be migrated to Fedora 4
Potentially, non-Fedora projects (such as our DSpace installations) might be moved to Hydra/Sufia

Sample Item – Canis familiaris mandible
Sample item from one CBRC collection
– 3D images created using FARO ScanArm equipment
Each item will be digitized and stored in Sufia / Fedora along with associated metadata
Metadata will also be stored in Specify, an open-source Java-based platform for storing information about biological research collections

Sample Item in Specify
Specify Taxonomy Tree
Sample Item in Sufia
Sample Item in Fedora

Specify and Sufia work together
All metadata stored in both Specify and Sufia
Specify provides data entry and analysis tools that would be hard to replicate
Data synchronization is an issue
Sufia serves as the primary archival location for both data files and metadata

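Because the same metadata lives in two systems, one simple way to spot drift is to compare the two copies of a record field by field. The sketch below assumes each system can export a record as a flat dictionary. That export step, the field names, the sample values, and the function name `diff_records` are all hypothetical and are not part of Specify or Sufia.

```python
def diff_records(specify_record, sufia_record):
    """Return {field: (specify_value, sufia_value)} for fields that disagree.

    A field that is missing on one side is reported with None on that side.
    """
    differences = {}
    for field in sorted(set(specify_record) | set(sufia_record)):
        left = specify_record.get(field)
        right = sufia_record.get(field)
        if left != right:
            differences[field] = (left, right)
    return differences


# Placeholder exports of the same specimen from each system.
specify_record = {
    "scientificName": "Canis familiaris",
    "catalogNumber": "EXAMPLE-0001",
    "preparations": "mandible",
}
sufia_record = {
    "scientificName": "Canis lupus familiaris",
    "catalogNumber": "EXAMPLE-0001",
}

mismatches = diff_records(specify_record, sufia_record)
for field, (in_specify, in_sufia) in mismatches.items():
    print(f"{field}: Specify={in_specify!r} Sufia={in_sufia!r}")
```

A report like this only detects disagreement. Deciding which side wins is a policy question the slide leaves open.
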
Batch ingest and workflow
We need automated scripts between the digitizers’ workstations, Sufia and Specify
We will leverage existing infrastructure for image processing and backup as much as possible
2D image creation will follow the usual process, but 3D image creation is new

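As a rough illustration of what one such script might do first, the sketch below sorts incoming files into workflows by extension. The mapping of `.tif` to a 2D workflow and `.stl`/`.zip` to 3D workflows is an assumption based on the file types named earlier in the talk. The workflow labels, file names, and the function `plan_ingest` are hypothetical.

```python
from pathlib import PurePath

# Assumed mapping from file extension to workflow label (not a confirmed design).
WORKFLOW_BY_EXTENSION = {
    ".tif": "2d-image",
    ".stl": "3d-model",
    ".zip": "3d-bundle",
}


def plan_ingest(filenames):
    """Group file names by the workflow they would be routed to.

    Files with an unrecognized extension are set aside for manual review
    instead of being dropped silently.
    """
    plan = {}
    for name in filenames:
        suffix = PurePath(name).suffix.lower()
        workflow = WORKFLOW_BY_EXTENSION.get(suffix, "needs-review")
        plan.setdefault(workflow, []).append(name)
    return plan


# Placeholder file names standing in for a digitizer's output folder.
plan = plan_ingest([
    "herbarium_0001.tif",
    "mandible_0001.stl",
    "mandible_0001.zip",
    "notes.txt",
])
for workflow, files in plan.items():
    print(workflow, files)
```

Keeping the routing table as plain data means a new 3D format could be added without touching the routing logic.
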
3D file issues
No existing standards for 3D preservation and access
3D imaging produces multiple files per object
Archival files are quite large, but a smaller ‘derivative’ file can also be generated
Might need to allow more than one 3D scan per object
Web access to 3D object might be accomplished through a plug-in

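The points above suggest a small data model: one object, possibly several scans, and each scan carrying several archival files plus an optional derivative. The following is a minimal sketch of that shape. The class names, fields, and file names are invented for illustration and do not reflect the actual Sufia or Fedora object model.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Scan:
    """One 3D scanning session: large archival files, optional small derivative."""
    archival_files: List[str]
    derivative_file: Optional[str] = None


@dataclass
class SpecimenObject:
    """A physical specimen that may have been scanned more than once."""
    identifier: str
    scans: List[Scan] = field(default_factory=list)

    def access_files(self):
        """Derivatives suitable for web viewing, skipping scans without one."""
        return [s.derivative_file for s in self.scans if s.derivative_file]

    def file_count(self):
        """Total number of files to store, archival and derivative together."""
        return sum(
            len(s.archival_files) + (1 if s.derivative_file else 0)
            for s in self.scans
        )


# Placeholder specimen with two scans; only the first has a derivative.
mandible = SpecimenObject(
    identifier="EXAMPLE-0001",
    scans=[
        Scan(["scan_a_raw.zip", "scan_a_full.stl"], "scan_a_small.stl"),
        Scan(["scan_b_full.stl"]),
    ],
)
print(mandible.access_files(), mandible.file_count())
```

Separating archival files from the derivative keeps the preservation copy and the web-access copy distinct, which matters if the viewer only ever loads the smaller file.
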
Thanks! Questions? Comments?
GitHub repo
Jim Halliday
Julie