Presentation is loading. Please wait.

Presentation is loading. Please wait.

FEDORA at Northwestern University Bill Parod Academic Technologies Northwestern University

Similar presentations


Presentation on theme: "FEDORA at Northwestern University Bill Parod Academic Technologies Northwestern University"— Presentation transcript:

1 FEDORA at Northwestern University Bill Parod Academic Technologies Northwestern University bill-parod@northwestern.edu

2 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University General Background Academic Technologies Faculty projects Library partnerships Institutional partnerships Diverse clientele Diverse content “One-off” projects

3 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Current FEDORA Projects Block Museum of Art The Last Expression Art Collection Introduction to Asian Art History BBC Spoken Word Archive Paris Map Collection Encyclopedia of Chicago WordHoard Text Analysis Project

4 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Diversity of Content

5 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Wavelet Image Servers Vector Image Processors Streaming Media Servers RDBMS XML Databases XSLT Processors GIS Servlet Engines Diversity of Systems

6 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Abstract Image Models

7 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University 4 Image Behavior Classes Core behavior – getCoverpage – getThumbnail Basic image (UVa) – getThumbnail – getMedium – getHigh – getVeryHigh Addressable image – getRegion(rgn,size) – getViewer Layered image – getRegion(,,layers) – getViewer(layers) Geographic image – getRegion(,,, coords) – getViewer(, coords)

8 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University 4 Image Content Models Core behavior – XML Metadata – HTML XSLT script – Thumbnail Image Basic image (UVa) – Thumbnail jpeg – Medium Res jpeg – High Res jpeg – Very High Res jpg Addressable image – Image metadata – Viewer XSLT script Layered image – Layer metadata Geographic image – World file for projection

9 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Images SimpleZoomLayers Core getThumbnail getCoverpage Basic getThumbnail get Medium getHigh getVeryHigh Addr getRegion getViewer Layered getRegion getViewer

10 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University BDEF Interface Definition

11 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University BMECH Description Method bindings to implementation HTTP URL templates to image servlet Accepts image server metadata stream Accepts specific user parameters Provides implementation flexibility Currently using TrueSpectra/Scene7 image server

12 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University getCoverPage() for simple image – Block Museum Collection

13 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University getCoverPage() for zoomable image – History of Asian Art class

14 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Ingesting Images Imaging person deposits master TIFF images in WebDAV enabled file stor Image server configured with “virtual path” to WebDAV stor for master image tiff. TIFF master is converted to FlashPix and cached in image server Image server handles request for FEDORA dissemination

15 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Metadata in ExcelMETSFEDORA Tiffs in Xythos TrueSpectra Image Server Dissemination Requests Catalog in Excel converted to METS for FEDORA ingest Tiff Masters deposited in collection’s Xythos directory Access to Xythos directory enabled for TrueSpectra virtual paths METS/FEDORA record includes link to TrueSpectra image Access to image is through FEDORA image behaviors DepartmentAcademic Technologies Data flow Requests Users Image Workflow: FEDORA – TrueSpectra – Xythos

16 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Auto-ingesterFEDORAFiles in Xythos TrueSpectra Streaming Server Search Dissemination Requests Faculty or SupportAcademic Technologies Data flow Requests Users Physical Collection Management Scenario: FEDORA – Content Service – Xythos Integration Metadata update FEDORA collection object attached to Xythos directory Xythos notifies collection object of changes in the directory File added – collection creates new member item File updated – item accepts new version for file stream File removed – item is set dormant in FEDORA Metadata added/updated online or batch

17 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Basic Collection Object Collection behavior – getSearchForm – performSearch() – getItem() – getItems() – addItem() – deleteItem() – reindex() – displayItem() Core behavior – getCoverpage – getThumbnail Block Museum of Art The Last Expression Vesalius Figures BBC Audio History of Asian Art

18 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Collection Content Model Search Form XSLT for search results Index Header/footer XML for result stream Member PIDs

19 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Search Implementation FEDORA METS files currently indexed offline Plan to integrate update notification and indexing Search Engine – Have 3 implementations: FEDORA native search Sgrep OpenText Investigating SRW/CQL Search results passed through XSLT Easy to provide search capability to collections

20 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University FEDORA Dissemination Requests External Services Cache data Data Request Dissemination FEDORA – External Service Image Server Search Engine BMECH

21 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University

22 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University link

23 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Virtual Collections Collection maintenance – Topical galleries Ad-hoc or dynamic collections – For classes... – personal collections… – special exhibits…

24 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University

25 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Database Integration SQL/XQuery for object “data streams” SQL/XQuery for object disseminations

26 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Encyclopedia of Chicago In active development Metadata continually updated by research staff in Microsoft Access New content continually added to MS Access and file stor Varied entry types All have dynamic “See Also”s

27 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University SQL Datastreams “See Also” and “Content” datastreams – Cocoon urls that perform SQL queries on dynamic research data and convert to XML. – Dynamic updates during development – When project finished will consider moving to more robust database or “freeze” streams in the repository as “managed”.

28 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University FEDORA Dissemination Requests External Services Cache data Data Request Dissemination FEDORA – External Service RDBMS Search Engine BMECH Image Server Data stream

29 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University WordHoard Text Analysis Large TEI XML Etext corpora Word level grammatical and frequency data Text requests via Xquery Word level lexical queries via SQL

30 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Basic Text Behavior BMECH Backed by eXist database

31 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Viewer Object Presentation uncoupled from data object

32 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Example Book Model

33 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University TEXT TOC Service Request for TOC keyed by text PID TOC XML requested from text TOC DOM cached in service User requests with “open nodes” parameter Pruned DOM styled with XSLT from Viewer content model

34 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Abstract Text Model

35 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Text Methods Structured text (UVa) – getHeading – getTOC(level) – getChunk(idref) – getPage(idref) Core behavior – getCoverpage – getThumbnail

36 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Time-based Media Model

37 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Time-based Media Behaviors Core behavior – getCoverpage – getThumbnail Time-based media – Play – playSection()

38 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Behaviors by Type ImageMapA/VBookNewsEText Core Image Hi-Res Layered Geo Time Text

39 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Next Steps Implement more object types – Event, video, tabular data Transactions – Ad-hoc groupings of repository objects – Asset management, Annotation – Access control for user editing Interoperability – Search protocols and repository interactions Consider application models Specialized clients

40 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Specialized Clients

41 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Viewer Object

42 Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Summary Code reuse through object abstraction Flexible implementation binding Comprehensible APIs for applications Stable APIs for Content reuse

43 Thank You Bill Parod Academic Technologies Northwestern University bill-parod@northwestern.edu


Download ppt "FEDORA at Northwestern University Bill Parod Academic Technologies Northwestern University"

Similar presentations


Ads by Google