Producción de Sistemas de Información Agosto-Diciembre 2007 Sesión # 8
Basic Concepts Data Database Collection, Library, Archive Digital Collection Digital Repository Digital Library
Digital Collection: Examples Digital Archives UM DRUM Sterling Family Papers The Jim Henson Works Etc… Image collections
Image Collections Fine Arts Museum of San Francisco Art Museum Image Consortium SILS Art Image Browser
Image Collections: Issues Image users Image collection Image indexing Image retrieval
Image Collections: Issues Artists as information users Fine Artists: producing art Art Historians: studying art
Fine Artists and Art Historians Information needs: A strong need to access information in several formats and several languages. Sometimes they need to find objects and images similar to one they have already found.
Fine Artists and Art Historians Information seeking: Tend to search information by themselves. Tend to browse and explore image collections without any specific idea of their information needs. They tend to make extensive use of images in digital format.
Fine Artists and Art Historians Information seeking: When they know the specific work of art they are looking for, the main criteria used to search for images are text-based attributes, such as: Title Author Period Country of origin Type of art object
Fine Artists and Art Historians Information use: The relevance criteria they generally use to assess the relevance of visual materials are: Topicality Image quality Image accessibility Image novelty Image suggestiveness
Image Collections: Issues Design of new system features: Browsing several images at a time Exploring diverse collections at a time Adaptive search strategies: Concept-based search (ofness & aboutness) Content-based search (bolbworlds: texture, shape & color)
Image Collections: Issues Design of new standards and guidelines for image classification/indexing for digital collections. Examples: CDWA: Categories for the Descriptions of Works of Art VRA: Core Categories for Visual Art Best Practice Guidelines for Digital Collections at UMD (See PDF file)
Image Collections: Issues Design of new standard vocabularies, authorities, and term lists used for classifying works of art. Examples: Library of Congress Subject Headings (LCSH) Art and Architecture Thesaurus (AAT) ICONCLASS Union List of Artist Names (ULAN) Thesaurus of Geographic Names (TGN) LC Thesaurus for Graphic Materials I (TGM I) Others
Digital Repository: Example Dspace: An open-source digital archiving system designed by MIT Libraries and Hewlett Packard to capture, manage, and share research in digital formats. Over 200 organizations have adopted Dspace as the basic architecture to develop their institutional repositories.
Basic Concepts Digital Library: A collection of information, sources, and services in digital format, organized for use, and maintained by a public body, an institution, or a private individual.
Digital Libraries: Examples ICDL MALACH PDLib
ICDL Demo ICDL: International Children Digital Library HCILab at the UMD KidsTeam: a design methodology in which children of diverse backgrounds participate as design partners URL:
ICDL Research Areas How do children from diverse cultures search books: In print format? in digital format? How do children interact with digital content? How do children build communities of digital library users across cultures?
ICDL Research Areas How do children learn when searching and interacting with information in digital environments? What are the personal information seeking patterns of children, as opposed to those identified for imposed information seeking? How do children select and evaluate information?
MALACH Project MALACH (“angel”, in Hebrew) Private video collection 52,000 interviews 32 languages 3,000 interviews manually indexed 49,000 interviews waiting… (See PDF file)
MALACH Project 5 year project: design and develop an automated video indexing system several research groups several disciplines System architecture speech recognition algorithms cross-lingual retrieval algorithms system interface
MALACH Project Main features and information services: Automated translation and indexing of digital video interviews Access to interview content by: Browsing Using key words Query extension and refinement using a Thesaurus Online help
MALACH Project Research Funcionality: How to increase the performance of speech- recognition algorithms? How to integrate cross-lingual retrieval algorithms? Usability: How do users search and interact with digital video environments? What kind of user interface better supports information discovery and exploration?
PDLib Project PDLib: Portable Digital Library An open source DL architecture where users can store all kinds of documents, and then retrieve them from desktop or mobile devices.
PDLib: Client/Server Architecture
PDLib Project Basic features: Store your own documents Share documents with other users Feed your library through Internet searches Search documents: with metadata attributes using full text within multiple collections Retrieve your documents anytime, anywhere, from your cell phone or PDA
PDLib Research Areas Funcionality: How to increase the performance of full- text search algorithms? How to integrate cross-lingual retrieval algorithms? Usability: How do PDLib users search and interact with information in digital environments? Is it possible to develop an audio-based interface for mobile devices?