Download presentation
Presentation is loading. Please wait.
Published byJeffery Moody Modified over 9 years ago
1
FEDORA at Northwestern University Bill Parod Academic Technologies Northwestern University bill-parod@northwestern.edu
2
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University General Background Academic Technologies Faculty projects Library partnerships Institutional partnerships Diverse clientele Diverse content “One-off” projects
3
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Current FEDORA Projects Block Museum of Art The Last Expression Art Collection Introduction to Asian Art History BBC Spoken Word Archive Paris Map Collection Encyclopedia of Chicago WordHoard Text Analysis Project
4
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Diversity of Content
5
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Wavelet Image Servers Vector Image Processors Streaming Media Servers RDBMS XML Databases XSLT Processors GIS Servlet Engines Diversity of Systems
6
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Abstract Image Models
7
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University 4 Image Behavior Classes Core behavior – getCoverpage – getThumbnail Basic image (UVa) – getThumbnail – getMedium – getHigh – getVeryHigh Addressable image – getRegion(rgn,size) – getViewer Layered image – getRegion(,,layers) – getViewer(layers) Geographic image – getRegion(,,, coords) – getViewer(, coords)
8
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University 4 Image Content Models Core behavior – XML Metadata – HTML XSLT script – Thumbnail Image Basic image (UVa) – Thumbnail jpeg – Medium Res jpeg – High Res jpeg – Very High Res jpg Addressable image – Image metadata – Viewer XSLT script Layered image – Layer metadata Geographic image – World file for projection
9
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Images SimpleZoomLayers Core getThumbnail getCoverpage Basic getThumbnail get Medium getHigh getVeryHigh Addr getRegion getViewer Layered getRegion getViewer
10
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University BDEF Interface Definition
11
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University BMECH Description Method bindings to implementation HTTP URL templates to image servlet Accepts image server metadata stream Accepts specific user parameters Provides implementation flexibility Currently using TrueSpectra/Scene7 image server
12
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University getCoverPage() for simple image – Block Museum Collection
13
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University getCoverPage() for zoomable image – History of Asian Art class
14
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Ingesting Images Imaging person deposits master TIFF images in WebDAV enabled file stor Image server configured with “virtual path” to WebDAV stor for master image tiff. TIFF master is converted to FlashPix and cached in image server Image server handles request for FEDORA dissemination
15
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Metadata in ExcelMETSFEDORA Tiffs in Xythos TrueSpectra Image Server Dissemination Requests Catalog in Excel converted to METS for FEDORA ingest Tiff Masters deposited in collection’s Xythos directory Access to Xythos directory enabled for TrueSpectra virtual paths METS/FEDORA record includes link to TrueSpectra image Access to image is through FEDORA image behaviors DepartmentAcademic Technologies Data flow Requests Users Image Workflow: FEDORA – TrueSpectra – Xythos
16
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Auto-ingesterFEDORAFiles in Xythos TrueSpectra Streaming Server Search Dissemination Requests Faculty or SupportAcademic Technologies Data flow Requests Users Physical Collection Management Scenario: FEDORA – Content Service – Xythos Integration Metadata update FEDORA collection object attached to Xythos directory Xythos notifies collection object of changes in the directory File added – collection creates new member item File updated – item accepts new version for file stream File removed – item is set dormant in FEDORA Metadata added/updated online or batch
17
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Basic Collection Object Collection behavior – getSearchForm – performSearch() – getItem() – getItems() – addItem() – deleteItem() – reindex() – displayItem() Core behavior – getCoverpage – getThumbnail Block Museum of Art The Last Expression Vesalius Figures BBC Audio History of Asian Art
18
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Collection Content Model Search Form XSLT for search results Index Header/footer XML for result stream Member PIDs
19
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Search Implementation FEDORA METS files currently indexed offline Plan to integrate update notification and indexing Search Engine – Have 3 implementations: FEDORA native search Sgrep OpenText Investigating SRW/CQL Search results passed through XSLT Easy to provide search capability to collections
20
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University FEDORA Dissemination Requests External Services Cache data Data Request Dissemination FEDORA – External Service Image Server Search Engine BMECH
21
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University
22
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University link
23
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Virtual Collections Collection maintenance – Topical galleries Ad-hoc or dynamic collections – For classes... – personal collections… – special exhibits…
24
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University
25
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Database Integration SQL/XQuery for object “data streams” SQL/XQuery for object disseminations
26
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Encyclopedia of Chicago In active development Metadata continually updated by research staff in Microsoft Access New content continually added to MS Access and file stor Varied entry types All have dynamic “See Also”s
27
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University SQL Datastreams “See Also” and “Content” datastreams – Cocoon urls that perform SQL queries on dynamic research data and convert to XML. – Dynamic updates during development – When project finished will consider moving to more robust database or “freeze” streams in the repository as “managed”.
28
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University FEDORA Dissemination Requests External Services Cache data Data Request Dissemination FEDORA – External Service RDBMS Search Engine BMECH Image Server Data stream
29
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University WordHoard Text Analysis Large TEI XML Etext corpora Word level grammatical and frequency data Text requests via Xquery Word level lexical queries via SQL
30
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Basic Text Behavior BMECH Backed by eXist database
31
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Viewer Object Presentation uncoupled from data object
32
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Example Book Model
33
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University TEXT TOC Service Request for TOC keyed by text PID TOC XML requested from text TOC DOM cached in service User requests with “open nodes” parameter Pruned DOM styled with XSLT from Viewer content model
34
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Abstract Text Model
35
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Text Methods Structured text (UVa) – getHeading – getTOC(level) – getChunk(idref) – getPage(idref) Core behavior – getCoverpage – getThumbnail
36
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Time-based Media Model
37
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Time-based Media Behaviors Core behavior – getCoverpage – getThumbnail Time-based media – Play – playSection()
38
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Behaviors by Type ImageMapA/VBookNewsEText Core Image Hi-Res Layered Geo Time Text
39
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Next Steps Implement more object types – Event, video, tabular data Transactions – Ad-hoc groupings of repository objects – Asset management, Annotation – Access control for user editing Interoperability – Search protocols and repository interactions Consider application models Specialized clients
40
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Specialized Clients
41
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Viewer Object
42
Bill Parod Academic Technologies CNI April 15, 2004 Northwestern University Summary Code reuse through object abstraction Flexible implementation binding Comprehensible APIs for applications Stable APIs for Content reuse
43
Thank You Bill Parod Academic Technologies Northwestern University bill-parod@northwestern.edu
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.