Download presentation
Presentation is loading. Please wait.
Published byCaitlin Cannon Modified over 9 years ago
1
An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009
2
Outline Organizational overview and background o Claire Stewart, Head, Digital Collections Winterton Collection project o Karen Miller, Monographic and Digital Projects Cataloger, Bibliographic Services Iterative approach o Bill Parod, Repository Architect, Enterprise Systems
3
Parallel committee/department structure
5
may 2008 Repository Implementation Group project schedule
6
may 2009
7
EAD to images to EAD+images
8
Winterton Collection cataloging Full cataloging for each of the 76 original collections and at the container level (album, envelope, etc.) for collections of more than one container. Individual photographs are not (generally) cataloged fully: o Title o Note (optional) o Publisher or Creator (if available)
9
Full cataloging included o Title o Dates of coverage o Abstract o Scope and contents description o Biographical or historical note o Physical description (size of album, how many pages, photos, etc.) o Subject headings
10
Providing cataloging at the album level means that Many individual photographs will not be described concisely by the subject headings assigned. Some subject headings may not apply at all to some photographs.
11
Transcribing only the photograph titles results in such problems as these when keyword searching: Non-English words are not translated People referred to in captions by their initials, not names Animals referred to by given name, not by species Non-descriptive captions
14
A.E.B. and his well-identified crowd
15
"Enmei and his rhino"
31
Repository Development Strategy 1.Implement models and services for ingest, preservation, and access of core content. 2. Provide tools for staff to ingest and manage repository content. 3. Facilitate integration of repository materials with end-user tools and services. 4. Iterate…
32
Draw Detailed Requirements from Project Commitments: A) OAI-ORE Annotation of OCA texts B) Cross Collection Search Project C) Winterton Photography Collection D) Kirtas Mounting Books Project E) EAD Initiative F) Hesler Photography Collection G) Chemical Bulletin H) Fava Masks I) Curator-driven Digitization Project J) Charlotte Moorman / Prgm. African Studies Audio
33
Inventory Content Types 1) EAD encoded finding aids 2) TEI encoded text transcriptions 3) High resolution images 4) Virtual crops of high resolution images 5) Page imaged books 6) 3D objects 7) Aggregations: full text, fielded, and faceted search 8) Audio 9) Video
34
Project / Content Type Matrix
35
Services by Content Type Text Service Image Service Metadata Conversion Service Discovery Service
36
Text Service EAD Objects EAD Disseminator Methods: getEADHeader getComponentAsHTML(unitid) getComponentStructure getChildComponents(unitid) getComponents getComponentStructure(unitid) getAncestorComponents(unitid) getComponentChildrenAsJSON(unitid) getComponentAsEmbeddedHTML(unitid) getComponent (unitid) getElementById (xml:id) getArchDescNoComponents getElementsByName(element_name) getDigest(unitid) getComponentAsDC(unitid) getComponentAsMODS(unitid) reindex Datastreams: DC MODS EAD EAD to DC XSL EAD to MODS XSL EAD to HTML XSL EAD to HTML Frag XSL EAD Children to JSON XSL RELS-EXT TEI Objects TEI Disseminator Methods: getTOC getImageTextTOC getStructuredTextTOC getHeader(xml:id) getHeading getChunk(xml:id) getPageByNumber(pageOrdinal) getPageByID(xml:id) reindex Datastreams: DC MARCXML DejaVu Book ORE REM Page Image ORE REM TEI RELS-EXT
37
EAD Objects EAD Service methods: getEADHeader getComponentAsHTML(unitid) getComponentStructure getChildComponents(unitid) getComponents getComponentStructure(unitid) getAncestorComponents(unitid) getComponentChildrenAsJSON(unitid) getComponentAsEmbeddedHTML(unitid) getComponent (unitid) getElementById (xml:id) getArchDescNoComponents getElementsByName(element_name) getDigest(unitid) getComponentAsDC(unitid) getComponentAsMODS(unitid) reindex Datastreams: DC MODS EAD EAD to DC XSL EAD to MODS XSL EAD to HTML XSL EAD to HTML Frag XSL EAD Children to JSON XSL RELS-EXT
38
Fedora Text Disseminator getComponent: unitid getComponentAsHTML: unitid getComponentAsDC: unitid getComponentAsMODS: unitid.... reindex SGREPServlet Encapsulate query syntax XSLT optional on query result SGREP : Executable program on service host Add Fedora Disseminator Methods Add Fedora Disseminators Add/Modify XSLT Processing on Retrieval Add/Modify SGREP Queries Replace Retrieval Software Text Service StackEnhancement Options Examples: EAD “Digest”- C0n + title/id of children and ancestors JSON support for EXT-JS HTML design iteration EAD to MODS conversion maturation
39
Image Service: Source Image
40
Cropped Image Single Image File Referenced By Crop Information:
41
Cropped Photo Single Image File Referenced By Crop Information: <svg:use xlink:href="http://repository.library.northwestern.edu/fedora/get/inu:inu-wint-22-30/DELIV-OPS">
42
Image and Crop Objects Image Service methods (supported by both image and crop objects): getWithWidth(width) getWithLongSide(length) getWithHeight(height) getCropWithWidth(x,y,width, height,destwidth) getCropWithHeight(x,y,width, height,destheight) getCropWithSize(x,y,width,height, destwidth, destheight) getWithSize(destwidth, destheight) Image Object Datastreams: DC MODS PREMIS SVG TIFF EXIF JP2 MIX_TIFF MIX_JP2 RELS-EXT Crop Object Datastreams: DC MODS PREMIS SVG RELS-EXT
43
http:/.../fedora/get/inu:inu-wint-22-30-2/inu:sdef-addimage/getWithLongSide?length=150
44
Fedora Image Disseminator getWithWidth(width) getWithLongSide(length) getWithHeight(height) getCropWithSize(x, y, width, height, destwid…) Image Servlet Encapsulate rendering parameters Object specific rendering parameters (SVG) User request rendering parameters Rendering service parameters and location Rendering Service : Aware, DJatoka Add Fedora Disseminator Methods Add Fedora Disseminators Add/Modify Rendering Options Add/Modify Rendering Service Parameters Replace Rendering Software Image Service StackEnhancement Options Examples: Added getLongSide(length) Added rotation Optimized rendering parameters Rendering features - vector overlay Object reference chaining Djatoka experimentation
45
EAD Objects EAD Service methods: getEADHeader getComponentAsHTML( unitid ) getComponentStructure getChildComponents( unitid ) getComponents getComponentStructure( unitid ) getAncestorComponents( unitid ) getComponentChildrenAsJSON( unitid ) getComponentAsEmbeddedHTML( unitid ) getComponent ( unitid ) getElementById ( xml:id ) getArchDescNoComponents getElementsByName( element_name ) getDigest( unitid ) getComponentAsDC( unitid ) getComponentAsMODS( unitid ) reindex Datastreams: DC MODS EAD EAD to DC XSL EAD to MODS XSL EAD to HTML XSL EAD to HTML Frag XSL EAD Children to JSON XSL RELS-EXT Image/Crop Objects Image Service methods: getWithWidth( width ) getWithLongSide( length ) getWithHeight( height ) getCropWithWidth( x,y,width, height,destwidth ) getCropWithHeight( x,y,width, height,destheight ) getCropWithSize( x,y,width,height, destwidth, destheight ) getWithSize( destwidth, destheight ) Datastreams: DC MODS PREMIS SVG TIFF EXIF JP2 MIX_TIFF MIX_JP2 RELS-EXT
46
Searching SOLR MODS described collections Metadata conversion services Faceting “ Searchable ” Interface o MODS Collection Datastream o Facet list o Field List
49
Project Checklist A) OAI-ORE Annotation of OCA texts B) Cross Collection Search Project C) Winterton Photography Collection D) Kirtas Mounting Books Project E) EAD Initiative F) Hesler Photography Collection G) Chemical Bulletin H) Fava Masks I) Curator-driven Digitization Project J) Charlotte Moorman / Prgm. African Studies Audio
51
Next Step: Collection Management Tools EAD Ingest Processing
52
Image Ingest Processing
53
Heterogeneous Ingest Processing
54
Ingest Management Tools Curator-selected Ingest (Ad-hoc collections) Digital Image Library (Art Slide Library +) Kirtas Book Scanning Mounting Books Project - OR09 Tuesday, 3:00: Session 8B
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.