Download presentation
Presentation is loading. Please wait.
Published byTobias Robbins Modified over 9 years ago
1
NCSU Libraries Digital Repository Projects at the North Carolina State University Libraries James Jackson Sanborn Jim Tuttle Open Repositories/DSpace User Group ‘07
2
NCSU Libraries Early Repository Planning Digital Repository Planning Committee What it wouldn’t be (at least to start) –Distributed community structure –Open submission –‘Institutional’ Repository What it would be (at least to start) –Library-managed collections –Building block for campus partnership –Learning opportunity
3
NCSU Libraries Repository Building Blocks NCSU Electronic Theses and Dissertations –Started 1997 –Mandatory since 2002 –Virginia Tech’s ETDdb –~3,000 ETDs NCSU Authors Database –Started 1995 –Access Database/Cold Fusion front-end –~22,000 citations
4
NCSU Libraries Repository Building Blocks (cont’d) Technical Reports Print Collection –Campus Institutes and Departments –Massive fall-off in print distribution Special Collections Resource Center –Digitized texts and photographs –Campus Newsletters GIS Data –Library managed/acquired data collection –Homegrown data layer database/discovery tools
5
NCSU Libraries Repository Plan Target ‘Research’ collections first –Technical Reports –ETDs –Faculty Publications/Citations Treat each collection as its own project Actively pursue common technological solutions
6
NCSU Libraries Technical Reports DSpace Application Lightly Customized Library Harvested –Local Cataloging/Metadata database –Scripted Ingest Object Creation –Batch Ingest Mix of ongoing submission by institute/departmental personnel and Library capture.
7
NCSU Libraries Tech Rep Screenshot
8
NCSU Libraries Technical Reports Item Detail
9
NCSU Libraries Electronic Theses & Dissertations Partnership with Graduate School Hybrid System: DSpace and ETD-db –ETD-db submission/approval/management –Direct database extract for DSpace Ingest Object creation –Scheduled Batch Ingest process DSpace Considerations/Alterations –Metadata Mapping –Author Browse (exclude contributor.advisor) –Various interface changes
10
NCSU Libraries ETD-DB screenshot
11
NCSU Libraries ETD DSpace screenshot
12
NCSU Libraries Faculty Publications Built on Existing Author Database –Rebuilt Authors DB from Access/ColdFusion to Oracle/PHP Re-modeled data Added Functionality –OpenURL –‘Vita-like’ citation display –Full-text or submission links –Full-text stored in DSpace Citation metadata and file exported by script DSpace Identifier currently manually entered
13
NCSU Libraries Faculty Publications Schematic Scholar Oracle Faculty Publications DB (citations) Web interface (php) DSpace Java/JSP (full-text only) Cataloging and Coll. Mgt. Access DSpace Item DisplayWeb Submission Form ISI Ann. Reps Etc. View full-text S+R Citations Add/Edit data Handle IDs Submit Citations and/or Text File System (files) PostgreSQL (metadata)
14
NCSU Libraries FacPubs Search Screen
15
NCSU Libraries FacPubs result screenshot
16
NCSU Libraries FacPubs Item screenshot
17
NCSU Libraries Repository Governance Internal –Digital Repository Planning Committee –Data Repository Architect External –Faculty Repository Advisory Committee –Partnerships with departments and institutes
18
NCSU Libraries NCGDAP: Overview NDIIPP: National Digital Information Infrastructure and Preservation Program Collaboration with Library of Congress 1 of 8 three year projects to study long-term (50+ years) digital preservation Objective: engage existing state/federal geospatial data infrastructures in preservation Project approaches: Technical and Social
19
NCSU Libraries Repository Requirements Dim archive with possible future access –minimal IR/access component Minimal repository imprint on data –repository agnostic ingest and export Simple digital curation functions –Periodic MD5 checksum validation –Structured metadata index Expected archived-data exchange Leverage existing investments Free Software with active community
20
NCSU Libraries Automation: Threat and format analysis, validation Python wrappers for the following: Anti-virus – ClamAV Compressed files (tar, zip, gzip, bzip) At-risk formats Executable files (magic numbers) Jhove validation
21
NCSU Libraries Automation: Archive package organization ESRI ArcGIS toolbar for selected formats
22
NCSU Libraries Automation: Archive package organization Rule-based python logic –filestem –extension relationships ( multi- file format validation) –directory structure Manual intervention NOID assignment
23
NCSU Libraries Metadata: Seed file form 'Transfer set' metadata capture in 'Seed file' –communicates with DSpace backend, generates xml used to inform later scripts
24
NCSU Libraries Metadata: Communities and Collections Search by type for 100+ communities Facilitates creation and reduces errors
25
NCSU Libraries Curation Processing At-risk format migration, original retained Agency-specific XML templates in ArcCatalog with synchronization flags Provenance and curation metadata scripted
26
NCSU Libraries Source Metadata Translation Repository agnostic approach Spokes for each transformation Facilitates export from Dspace into other repositories Generate Dspace QDC, METS; populate Workflow database
27
NCSU Libraries Extra-repository AIP management Workflow Management Database (WMD) populated as a spoke on the metadata/ingest hub External tracking of NOID, Handle, ISO keywords, other metadata for interaction with other systems Integrates with existing GIS Lookup tool
28
NCSU Libraries Repository Architecture Overview PostgreSQL repository tomcat instance Faculty Publications PHP/DSpace hybrid Tomcat DSpace Internal NDIIPP (DSpace) SCRC (DSpace) Asset Store/ ATABeast (sub-directory for each DSpace app) One shared username. Separate database for each app Repository (DSpace) Technical Reports ETDs Collections (DSpace) SCRC --Course Catalogs --Green ‘N’ Growing
29
NCSU Libraries Upcoming Repository Related Projects Enhancements to current system –XTF search interface –Inter-archive exchange Digital Collections Repository –Special Collections Research Center –Other non-faculty collections Data Repository –Scientific data –Statistical resources
30
NCSU Libraries For More Information: James Jackson Sanborn –james_sanborn@ncsu.edu Jim Tuttle –jim_tuttle@ncsu.edu
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.