Presentation is loading. Please wait.

Presentation is loading. Please wait.

An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.

Similar presentations


Presentation on theme: "An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009."— Presentation transcript:

1 An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009

2 Outline Organizational overview and background o Claire Stewart, Head, Digital Collections Winterton Collection project o Karen Miller, Monographic and Digital Projects Cataloger, Bibliographic Services Iterative approach o Bill Parod, Repository Architect, Enterprise Systems

3 Parallel committee/department structure

4

5 may 2008 Repository Implementation Group project schedule

6 may 2009

7 EAD to images to EAD+images

8 Winterton Collection cataloging Full cataloging for each of the 76 original collections and at the container level (album, envelope, etc.) for collections of more than one container. Individual photographs are not (generally) cataloged fully: o Title o Note (optional) o Publisher or Creator (if available)

9 Full cataloging included o Title o Dates of coverage o Abstract o Scope and contents description o Biographical or historical note o Physical description (size of album, how many pages, photos, etc.) o Subject headings

10 Providing cataloging at the album level means that Many individual photographs will not be described concisely by the subject headings assigned. Some subject headings may not apply at all to some photographs.

11 Transcribing only the photograph titles results in such problems as these when keyword searching: Non-English words are not translated People referred to in captions by their initials, not names Animals referred to by given name, not by species Non-descriptive captions

12

13

14 A.E.B. and his well-identified crowd

15 "Enmei and his rhino"

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31 Repository Development Strategy 1.Implement models and services for ingest, preservation, and access of core content. 2. Provide tools for staff to ingest and manage repository content. 3. Facilitate integration of repository materials with end-user tools and services. 4. Iterate…

32 Draw Detailed Requirements from Project Commitments: A) OAI-ORE Annotation of OCA texts B) Cross Collection Search Project C) Winterton Photography Collection D) Kirtas Mounting Books Project E) EAD Initiative F) Hesler Photography Collection G) Chemical Bulletin H) Fava Masks I) Curator-driven Digitization Project J) Charlotte Moorman / Prgm. African Studies Audio

33 Inventory Content Types 1) EAD encoded finding aids 2) TEI encoded text transcriptions 3) High resolution images 4) Virtual crops of high resolution images 5) Page imaged books 6) 3D objects 7) Aggregations: full text, fielded, and faceted search 8) Audio 9) Video

34 Project / Content Type Matrix

35 Services by Content Type Text Service Image Service Metadata Conversion Service Discovery Service

36 Text Service EAD Objects EAD Disseminator Methods: getEADHeader getComponentAsHTML(unitid) getComponentStructure getChildComponents(unitid) getComponents getComponentStructure(unitid) getAncestorComponents(unitid) getComponentChildrenAsJSON(unitid) getComponentAsEmbeddedHTML(unitid) getComponent (unitid) getElementById (xml:id) getArchDescNoComponents getElementsByName(element_name) getDigest(unitid) getComponentAsDC(unitid) getComponentAsMODS(unitid) reindex Datastreams: DC MODS EAD EAD to DC XSL EAD to MODS XSL EAD to HTML XSL EAD to HTML Frag XSL EAD Children to JSON XSL RELS-EXT TEI Objects TEI Disseminator Methods: getTOC getImageTextTOC getStructuredTextTOC getHeader(xml:id) getHeading getChunk(xml:id) getPageByNumber(pageOrdinal) getPageByID(xml:id) reindex Datastreams: DC MARCXML DejaVu Book ORE REM Page Image ORE REM TEI RELS-EXT

37 EAD Objects EAD Service methods: getEADHeader getComponentAsHTML(unitid) getComponentStructure getChildComponents(unitid) getComponents getComponentStructure(unitid) getAncestorComponents(unitid) getComponentChildrenAsJSON(unitid) getComponentAsEmbeddedHTML(unitid) getComponent (unitid) getElementById (xml:id) getArchDescNoComponents getElementsByName(element_name) getDigest(unitid) getComponentAsDC(unitid) getComponentAsMODS(unitid) reindex Datastreams: DC MODS EAD EAD to DC XSL EAD to MODS XSL EAD to HTML XSL EAD to HTML Frag XSL EAD Children to JSON XSL RELS-EXT

38 Fedora Text Disseminator getComponent: unitid getComponentAsHTML: unitid getComponentAsDC: unitid getComponentAsMODS: unitid.... reindex SGREPServlet Encapsulate query syntax XSLT optional on query result SGREP : Executable program on service host Add Fedora Disseminator Methods Add Fedora Disseminators Add/Modify XSLT Processing on Retrieval Add/Modify SGREP Queries Replace Retrieval Software Text Service StackEnhancement Options Examples: EAD “Digest”- C0n + title/id of children and ancestors JSON support for EXT-JS HTML design iteration EAD to MODS conversion maturation

39 Image Service: Source Image

40 Cropped Image Single Image File Referenced By Crop Information:

41 Cropped Photo Single Image File Referenced By Crop Information: <svg:use xlink:href="http://repository.library.northwestern.edu/fedora/get/inu:inu-wint-22-30/DELIV-OPS">

42 Image and Crop Objects Image Service methods (supported by both image and crop objects): getWithWidth(width) getWithLongSide(length) getWithHeight(height) getCropWithWidth(x,y,width, height,destwidth) getCropWithHeight(x,y,width, height,destheight) getCropWithSize(x,y,width,height, destwidth, destheight) getWithSize(destwidth, destheight) Image Object Datastreams: DC MODS PREMIS SVG TIFF EXIF JP2 MIX_TIFF MIX_JP2 RELS-EXT Crop Object Datastreams: DC MODS PREMIS SVG RELS-EXT

43 http:/.../fedora/get/inu:inu-wint-22-30-2/inu:sdef-addimage/getWithLongSide?length=150

44 Fedora Image Disseminator getWithWidth(width) getWithLongSide(length) getWithHeight(height) getCropWithSize(x, y, width, height, destwid…) Image Servlet Encapsulate rendering parameters Object specific rendering parameters (SVG) User request rendering parameters Rendering service parameters and location Rendering Service : Aware, DJatoka Add Fedora Disseminator Methods Add Fedora Disseminators Add/Modify Rendering Options Add/Modify Rendering Service Parameters Replace Rendering Software Image Service StackEnhancement Options Examples: Added getLongSide(length) Added rotation Optimized rendering parameters Rendering features - vector overlay Object reference chaining Djatoka experimentation

45 EAD Objects EAD Service methods: getEADHeader getComponentAsHTML( unitid ) getComponentStructure getChildComponents( unitid ) getComponents getComponentStructure( unitid ) getAncestorComponents( unitid ) getComponentChildrenAsJSON( unitid ) getComponentAsEmbeddedHTML( unitid ) getComponent ( unitid ) getElementById ( xml:id ) getArchDescNoComponents getElementsByName( element_name ) getDigest( unitid ) getComponentAsDC( unitid ) getComponentAsMODS( unitid ) reindex Datastreams: DC MODS EAD EAD to DC XSL EAD to MODS XSL EAD to HTML XSL EAD to HTML Frag XSL EAD Children to JSON XSL RELS-EXT Image/Crop Objects Image Service methods: getWithWidth( width ) getWithLongSide( length ) getWithHeight( height ) getCropWithWidth( x,y,width, height,destwidth ) getCropWithHeight( x,y,width, height,destheight ) getCropWithSize( x,y,width,height, destwidth, destheight ) getWithSize( destwidth, destheight ) Datastreams: DC MODS PREMIS SVG TIFF EXIF JP2 MIX_TIFF MIX_JP2 RELS-EXT

46 Searching SOLR MODS described collections Metadata conversion services Faceting “ Searchable ” Interface o MODS Collection Datastream o Facet list o Field List

47

48

49 Project Checklist A) OAI-ORE Annotation of OCA texts B) Cross Collection Search Project C) Winterton Photography Collection D) Kirtas Mounting Books Project E) EAD Initiative F) Hesler Photography Collection G) Chemical Bulletin H) Fava Masks I) Curator-driven Digitization Project J) Charlotte Moorman / Prgm. African Studies Audio

50

51 Next Step: Collection Management Tools EAD Ingest Processing

52 Image Ingest Processing

53 Heterogeneous Ingest Processing

54 Ingest Management Tools Curator-selected Ingest (Ad-hoc collections) Digital Image Library (Art Slide Library +) Kirtas Book Scanning Mounting Books Project - OR09 Tuesday, 3:00: Session 8B


Download ppt "An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009."

Similar presentations


Ads by Google