Rethinking Assumptions with the Our Americas Archive Partnership (OAAP) Geneva Henry Rice University 6 April 2009 CNI Spring 2009 Task Force Meeting, Minneapolis,

Slides:



Advertisements
Similar presentations
International Children’s Digital Library (ICDL)
Advertisements

Business Development Suit Presented by Thomas Mathews.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
EXtensible Catalog David Lindahl University of Rochester.
Data Curation for the Humanities Geneva Henry, George Washington University CNI Fall Meeting - December 8, 2014.
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
ENCompass at Cornell Prepared by Karen Calhoun for the ARL Meeting on Portal Implementations ALA, Atlanta GA June 14, 2002 ~Cornell University Library.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Open Discovery: Collaborative Approaches to Metadata 26 August 2011 Kira B. Homo Electronic Records Archivist.
Welcome to a guided tour of Oxford African American Studies Center. Please click the forward arrows to advance to the next section or click on a topic.
NOBLE Digital Library. How does it work? The NOBLE Digital Library uses the DSpace platform. Image files and metadata are imported into DSpace using.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Shared October 13, 2010 Shelf Michael Roy, Dean of Library and Information Services, Middlebury College A Networked Image Platform Jeremy Stynes, Head.
Variations On Video project update DLF Fall Forum 2010 Jon Dunn, Indiana University Claire Stewart, Northwestern University November 2, 2010.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
Digital Library Architecture and Technology
1 Open Library Environment Designing technology for the way libraries really work December 8, 2008 ~ CNI, Washington DC Lynne O’Brien Director, Academic.
The attic & the parlor CHM collections & exhibitions overview May 5, 2006 Kirsten Tashev VP Collections & Exhibitions.
HUBZERO AT INDIANA UNIVERSITY: THE INDIANA CTSI HUB Bill Barnett EDUCAUSE October 14, 2010.
Making the SHiFt: Using Sufia with Hydra/Fedora for collection management and access James Halliday Programmer/Analyst, Library Technologies Juliet L.
SobekCM’s Community Ecosystems & Socio-Technical Practices Presented by Mark V. Sullivan June 10 th, 2014 Sobek image created by Jeff Dahl and is shared.
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Creating Access to Europe’s Television Heritage Prof. Dr. Sonja de Leeuw (project-coordinator, Utrecht University) Johan Oomen MA (technical director,
IMLS NLG Collection Registry & Item-Level Metadata Repository at the University of Illinois Timothy W. Cole Mathematics Librarian &
Describing Collections So Visitors Can Find Them: A sampling of ways to get materials on-line Amanda Focke, Rice University
Serenate1 Non-standard users: The Library Raf Dekeyser K.U.Leuven.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
Digitization of the Federal Depository Library Program Judith C. Russell Superintendent of Documents & Managing Director, Information Dissemination “Electronic.
Australian Partnership for Sustainable Repositories University of Sydney practices and test-bed projects, sustainability in a distributed.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
The Western Waters Digital Library: Building a Resource Through Multi- State Collaboration and Technology Dawn Paschal Assistant Dean, Digital Library.
DSpace. TM 2 Agenda  Introduction to DSpace  DSpace community  Institutional Repository  Easy to add/find content in DSpace  Building Online Communities.
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Creating and Operating a Digital Library for Information and Learning– the GROW Project Muniram Budhu Department of Civil Engineering & Engineering Mechanics.
The Portal to Texas History: Harnessing Technology to Enable Collaboration with Small Museums and Libraries CNI, December 6, 2005 Cathy Nelson Hartman.
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
© 2005 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice The China Digital Museum Project.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Imaging Pittsburgh: Creating a Shared Gateway to Digital Image Collections of the Pittsburgh Region IMLS 2002 National Leadership Grant Library & Museum.
Introduction to metadata
May 2, 2013 An introduction to DSpace. Module 1 – An Introduction By the end of this module, you will … Understand what DSpace is, and what it can be.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
WorldWideScience.org: An International Knowledge-Sharing Model Brian A. Hitson Office of Scientific & Technical Information U.S. Department of Energy.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Warwick Cathro Assistant Director-General Resource Sharing and Innovation National Library of Australia Trove – a service built on collaboration OCLC Asia.
DSpace - Digital Library Software
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
The Global Library Lisa Spiro Digital Media Center, Fondren Library November 2008 Lisa Spiro Digital Media Center, Fondren Library November 2008.
DSpace An Open Source Dynamic Digital Repository Xizi (Cecilia) Cai IS565 Spring 2013 DL Topic Presentation.
Primo at the British Library Mandy Stewart. 2 About the British Library The British Library is the National Library of the UK It is a world-class.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
PA Photos & Documents Exploring Pennsylvania’s digitized documents and photographs This project is made possible by an IMLS grant as administered by the.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
The world’s libraries. Connected. CONTENTdm ® Digital Collection Management Solutions Learn what to consider when outsourcing your library’s digitization.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Omeka Web-Publishing Platform
GeoNetwork OpenSource: Geographic data sharing for everyone
Markup of Educational Content
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
VI-SEEM Data Repository
AUC’s Role In Facilitating Access To Knowledge In The Arab World
Presentation transcript:

Rethinking Assumptions with the Our Americas Archive Partnership (OAAP) Geneva Henry Rice University 6 April 2009 CNI Spring 2009 Task Force Meeting, Minneapolis, MN

Presentation overview  About the Our Americas Archive Partnership project  Vision and goals  Approach we’re taking with the development  Building the collections  Assumption regarding growth  Assumptions regarding metadata  Challenges  About the Our Americas Archive Partnership project  Vision and goals  Approach we’re taking with the development  Building the collections  Assumption regarding growth  Assumptions regarding metadata  Challenges

Background  Our Americas Archive Partnership (OAAP) awarded to Rice by the Institute of Museum and Library Services -- IMLS  National Leadership Grant for digitization -Digitize selected items in Woodson’s Americas collection -Add Web 2.0 technologies to enable use of Rice collection and University of Maryland’s complimentary Early Americas Digital Archive collection  Our Americas Archive Partnership (OAAP) awarded to Rice by the Institute of Museum and Library Services -- IMLS  National Leadership Grant for digitization -Digitize selected items in Woodson’s Americas collection -Add Web 2.0 technologies to enable use of Rice collection and University of Maryland’s complimentary Early Americas Digital Archive collection

The Partnership  Rice University  Fondren Library  Humanities Research Center (HRC)  Digitization, transcriptions, translations, metadata, markup, research modules, scholarly introductions  University of Maryland  Maryland Institute for Technology in the Humanities (MITH)  Integration of collections, development of web 2.0 features including social tagging and a geospatial interface  Addition of Instituto Mora, Mexico City  rich collection of materials relating to the socioeconomic and historical conditions of Mexico  Not part of IMLS grant  Collaborative relationship with Rice’s HRC  Rice University  Fondren Library  Humanities Research Center (HRC)  Digitization, transcriptions, translations, metadata, markup, research modules, scholarly introductions  University of Maryland  Maryland Institute for Technology in the Humanities (MITH)  Integration of collections, development of web 2.0 features including social tagging and a geospatial interface  Addition of Instituto Mora, Mexico City  rich collection of materials relating to the socioeconomic and historical conditions of Mexico  Not part of IMLS grant  Collaborative relationship with Rice’s HRC

Description of Collections  Early Americas Digital Archive (EADA )  a collection of electronic texts of transcribed literary-historical narratives written in or about the Americas  Rice Americas Digital Archive ( )  includes approximately 25,000 pages of original letters, broadsides, pamphlets, printed materials and books documenting the political and cultural relationships between the United States, Mexico, Central and South America, Cuba, Spain, and Portugal  Instituto Mora Collection  7000 pages of additional archival items scanned, digitized, marked up, and fully integrated into the search tools  Scanning started June 2008 and will continue through summer 2009  Early Americas Digital Archive (EADA )  a collection of electronic texts of transcribed literary-historical narratives written in or about the Americas  Rice Americas Digital Archive ( )  includes approximately 25,000 pages of original letters, broadsides, pamphlets, printed materials and books documenting the political and cultural relationships between the United States, Mexico, Central and South America, Cuba, Spain, and Portugal  Instituto Mora Collection  7000 pages of additional archival items scanned, digitized, marked up, and fully integrated into the search tools  Scanning started June 2008 and will continue through summer 2009

Our Vision  Focus on Americas from a hemispheric perspective rather than the nation state, driven by scholars’ needs  Span of OAAP captures cultural transformation that spans the five hundred year period that saw the making of modern and colonial cultures in the Americas  OAAP will impact the study of American literary and cultural history by more easily allowing scholars to understand cross-cultural influence  Focus on Americas from a hemispheric perspective rather than the nation state, driven by scholars’ needs  Span of OAAP captures cultural transformation that spans the five hundred year period that saw the making of modern and colonial cultures in the Americas  OAAP will impact the study of American literary and cultural history by more easily allowing scholars to understand cross-cultural influence

Goals  Create unique new research and teaching opportunities  Make unique archival collection digitally available  Build common interface between partners’ repositories, enabling additional digital archives to be added  Address issues associated with the complexity of multilingual documents  Create unique new research and teaching opportunities  Make unique archival collection digitally available  Build common interface between partners’ repositories, enabling additional digital archives to be added  Address issues associated with the complexity of multilingual documents

Ubiquitous discovery opens new horizons  OAAP supports new scholarly inquiry into understanding the development of the Americas  Unrestricted access to scholarly resources that were previously only in nation-specific collections at a variety of institutions  Collaboration that crosses institutions, crosses countries, and will grow as scholars need it to grow  Power to the scholars  OAAP supports new scholarly inquiry into understanding the development of the Americas  Unrestricted access to scholarly resources that were previously only in nation-specific collections at a variety of institutions  Collaboration that crosses institutions, crosses countries, and will grow as scholars need it to grow  Power to the scholars

Federation Model  Provide a common interface to multiple repositories with different content management approaches  search page allowing for multifaceted browsing  MySQL database built from harvested content  Federated digital environment allowing institutional partners to share holdings while retaining individual identity  Extensible to allow for folksonomic tagging  Provide a common interface to multiple repositories with different content management approaches  search page allowing for multifaceted browsing  MySQL database built from harvested content  Federated digital environment allowing institutional partners to share holdings while retaining individual identity  Extensible to allow for folksonomic tagging

Technical approach  Technically diverse digital collections  Digital assets stored in separate repositories  Technical Approach  Capture meta data as Dublin Core  Convert TEI-marked documents in EADA to Dublin Core and harvest repositories  Texts encoded in TEI-Light  Social tagging by scholars using their vocabulary  Technically diverse digital collections  Digital assets stored in separate repositories  Technical Approach  Capture meta data as Dublin Core  Convert TEI-marked documents in EADA to Dublin Core and harvest repositories  Texts encoded in TEI-Light  Social tagging by scholars using their vocabulary DSpace Custom repository Metadata harvesting Develop common descriptors Common text display

DSpace Platform  DSpace is one of the leading open source software platforms for an institutional repository  Rice’s Digital Scholarship Archive uses DSpace, with some significant customizations  Provides permanent digital archive for materials  Fine-grained access controls  Metadata separate from actual objects allows for scalability of digital assets  DSpace is one of the leading open source software platforms for an institutional repository  Rice’s Digital Scholarship Archive uses DSpace, with some significant customizations  Provides permanent digital archive for materials  Fine-grained access controls  Metadata separate from actual objects allows for scalability of digital assets

Overview of DSpace Architecture  Web-based user interface  Runs on Unix-based OS; Rice’s is running on Apple Xserves  Production server for final collections  Development and Test servers for preparation  Uses PostGres database for managing content  Includes Lucene search engine  Support for full text search  Supports Dublin Core metadata standard  Metadata harvested by OAI harvesters  Storage demands are VERY high  Using Isilon clustered storage solution to facilitate multimedia  Web-based user interface  Runs on Unix-based OS; Rice’s is running on Apple Xserves  Production server for final collections  Development and Test servers for preparation  Uses PostGres database for managing content  Includes Lucene search engine  Support for full text search  Supports Dublin Core metadata standard  Metadata harvested by OAI harvesters  Storage demands are VERY high  Using Isilon clustered storage solution to facilitate multimedia

Connexions  Provide scholarly analysis of the archival documents or demonstrate their pedagogical uses in an on-line environment  Connexions is a set of tools for developing and freely distributing educational material  Provide scholarly analysis of the archival documents or demonstrate their pedagogical uses in an on-line environment  Connexions is a set of tools for developing and freely distributing educational material

Using archival materials Scanned images immediately provide visual cues as to the type of document  a letter versus a governmental document Scanned images immediately provide visual cues as to the type of document  a letter versus a governmental document

Multilingual documents Translations expand access to intellectual content of texts  By providing the content in language of the reader And  In a format that facilitates visual scanning of content and full text searching Translations expand access to intellectual content of texts  By providing the content in language of the reader And  In a format that facilitates visual scanning of content and full text searching

Enhancing Multilingual documents Digital Image > Transcription > Translation

Example Item Record  TEI file  Digital Image  Metadata  TEI file  Digital Image  Metadata

Rice Americas Archive Interface

EADA Browse Interface

OAAP Beta site interface

Geospatial view of results

Outcomes  Allow scholarly examination of American literature from a hemispheric perspective,  develop a collection of texts, curricular models and teaching materials that embody a hemispheric approach to the study of the early Americas  generate professional and intellectual exchanges among scholars from various fields  Support Scholars from outside the US and their contributions  Create digitized version of primary sources not previously available to wide range and physically dispersed audience  Support addition of other digital archives with minimal barrier to entry  Allow scholarly examination of American literature from a hemispheric perspective,  develop a collection of texts, curricular models and teaching materials that embody a hemispheric approach to the study of the early Americas  generate professional and intellectual exchanges among scholars from various fields  Support Scholars from outside the US and their contributions  Create digitized version of primary sources not previously available to wide range and physically dispersed audience  Support addition of other digital archives with minimal barrier to entry

Growth Assumptions  Architectural approach assumed new partners would host their own digital collections  Assumed familiarity with digitization practices  Sustainability of collection assumed to be responsibility of each partner  Assumed at least some level of processing (minimal) to be a contributing partner  Architectural approach assumed new partners would host their own digital collections  Assumed familiarity with digitization practices  Sustainability of collection assumed to be responsibility of each partner  Assumed at least some level of processing (minimal) to be a contributing partner

Assumptions regarding metadata  Dublin core was assumed acceptable for partnering  Following metadata best practices viewed as a good thing when project started  Markup of text documents seen as valuable enhancement  Geospatial information to support geospatial visualization of resources thought to be valuable to scholars  Dublin core was assumed acceptable for partnering  Following metadata best practices viewed as a good thing when project started  Markup of text documents seen as valuable enhancement  Geospatial information to support geospatial visualization of resources thought to be valuable to scholars

Collection Challenges  Latin American institutions have rich collections but limited experience and resources with digitization  Hosting collections presents issues of sustainability  Should hosted collections follow practices of collection at hosting institution?  Latin American institutions have rich collections but limited experience and resources with digitization  Hosting collections presents issues of sustainability  Should hosted collections follow practices of collection at hosting institution?

Metadata challenges  New scholarly approach to understanding historic documents relies on new descriptions  Cataloging/metadata best practices impose a previous organizational bias  Deciding what geographic information is relevant is not so straight-forward  Scholars interested in shifting borders; geospatial presentation of little value to them  Should minimal metadata with full text search be the new model for supporting digital scholarship?  New scholarly approach to understanding historic documents relies on new descriptions  Cataloging/metadata best practices impose a previous organizational bias  Deciding what geographic information is relevant is not so straight-forward  Scholars interested in shifting borders; geospatial presentation of little value to them  Should minimal metadata with full text search be the new model for supporting digital scholarship?

Project Website  Website:  Updates on project developments  Share team presentations to communities  Share scripts and code for future participants  Rice and Mora Americas collection at  EADA at  Website:  Updates on project developments  Share team presentations to communities  Share scripts and code for future participants  Rice and Mora Americas collection at  EADA at

Thank You and come visit us on the web Contacts: Geneva Henry, PI (Rice) Caroline Levander, Co-PI (Rice) Neil Fraistat, Co-PI (MITH) Contacts: Geneva Henry, PI (Rice) Caroline Levander, Co-PI (Rice) Neil Fraistat, Co-PI (MITH)