OAIster: What’s with the Weird Name? Kat Hagedorn UM Library Information Technology November 28, 2005.



What is OAIster?
• Is/was a means for UM to test the OAI protocol… (hence the name)
• A method for sharing metadata among institutions and groups of people
• A means of developing a search service for end-users worldwide

Basics of OAI

What does OAIster collect?
• Harvests all metadata from all OAI data providers (within reason)
• Only keeps metadata that points to digital objects, e.g., articles, photographs, datasets, etc. in digitized form
• All available via search service…
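The harvest-then-filter rule on this slide can be sketched roughly as follows. This is a minimal sketch, not OAIster's actual harvester: it assumes an OAI-PMH `ListRecords` response in simple Dublin Core is already in hand as a string, and it approximates "points to a digital object" by looking for an http(s) URL in `dc:identifier`. The function name `records_with_digital_objects` and the sample records are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and simple Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Made-up ListRecords response with one linked and one unlinked record.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Digitized photograph</dc:title>
          <dc:identifier>http://example.org/photo/1</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header><identifier>oai:example.org:2</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Paper finding aid</dc:title>
          <dc:identifier>Box 12, Folder 3</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def records_with_digital_objects(list_records_xml):
    """Return (OAI identifier, URL) pairs for records that link to an object.

    Assumption: a record "points to a digital object" when one of its
    dc:identifier values is an http(s) URL. A real service would need
    stricter checks.
    """
    root = ET.fromstring(list_records_xml)
    kept = []
    for record in root.iterfind(".//oai:record", NS):
        oai_id = record.findtext(
            "oai:header/oai:identifier", default="", namespaces=NS
        )
        for ident in record.iterfind(".//dc:identifier", NS):
            value = (ident.text or "").strip()
            if value.lower().startswith(("http://", "https://")):
                kept.append((oai_id, value))
                break  # one usable link is enough to keep the record
    return kept


print(records_with_digital_objects(SAMPLE))
```

Only the first sample record survives the filter, because the second one identifies a physical box and folder rather than an online object.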

Searching OAIster
• Time to show off OAIster…

A little history
• Service is now 3.5 years old
• Started with 66 data providers and a little over 200K records
• Now have 572 data providers and “a little” over 6 million records
• 37% US, 63% international

Visibility of OAI
• Surprising who hasn’t made their metadata shareable through OAI
  • Harvard, Yale, Stanford…the big ones
• Initially perplexing, but now clearer:
  • always done at the end
  • only recently thought of at initiation of projects
  • truthfully, many institutions not collaborative…

Examples of data providers
• Many data providers are huge, e.g.,
  • arXiv: physics preprint and postprint articles
  • pubmed: medical articles, although restricted
  • pictureaustralia: images from govt and academic institutions in Australia
  • lcoa: Library of Congress digital archives
  • usc: U Southern California census data

Examples of data providers
• Most are small, though
• Many around 100 records
• Value of making their records available
  • increased visibility
  • inclusion in bigger search service than theirs
  • incorporation in Yahoo! Search

Yahoo! Search
• Two years ago, collaborated with team at Yahoo! Search to send our metadata to them for indexing
  • e.g., “gardens at albury” in Yahoo! Search
  • know it’s not static html roboting
  • IspartOf Victorian Railways collection.
• Many, many more hits
• Also send metadata to Google

System design
[Diagram. Components shown: OAI-enabled DC records; non-OAI-enabled DC records; UM harvester; record storage; XSLT transformation tool; XSL stylesheets (per source type); BibClass indexes; search interface (XPAT)]

Transformation of metadata
• Most metadata needs to be brushed off
  • adding an to the front of URLs
• Or raked
  • removing instances of <![CDATA[
• Or wrung out
  • instead of “Where’s Waldo,” it’s “Where’s the incorrect UTF-8 character?”
• And should be normalized…
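The three kinds of cleanup on this slide might look something like the sketch below. It is not the XSLT-based tooling the talk describes. The function names are hypothetical, and the `http://` prefix in `ensure_scheme` is an assumption, since the slide text does not say what gets added to the front of URLs.

```python
import re

# Matches a complete CDATA section and captures the text inside it.
CDATA_RE = re.compile(r"<!\[CDATA\[(.*?)\]\]>", re.DOTALL)

# Matches a leading URL scheme such as "http://" or "ftp://".
SCHEME_RE = re.compile(r"^[A-Za-z][A-Za-z0-9+.-]*://")


def strip_cdata(text):
    """Remove CDATA wrappers but keep the text they enclose."""
    return CDATA_RE.sub(r"\1", text)


def first_invalid_utf8_offset(raw):
    """Return the byte offset of the first invalid UTF-8 sequence, or None."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start
    return None


def ensure_scheme(url, scheme="http://"):
    """Prefix a URL that lacks a scheme.

    Assumption: "http://" is the prefix to add; the slide does not say.
    """
    url = url.strip()
    if SCHEME_RE.match(url):
        return url
    return scheme + url


print(strip_cdata("<title><![CDATA[Maps and charts]]></title>"))
print(first_invalid_utf8_offset(b"caf\xe9 menu"))
print(ensure_scheme("www.example.org/item/1"))
```

Reporting the byte offset answers the “Where’s the incorrect UTF-8 character?” question directly, which is usually more useful than silently replacing the bad bytes. Note that `strip_cdata` only handles complete CDATA sections; a stray opening marker with no closing `]]>` is left alone.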

Why normalize?
• Sample date values
  <date>1822</date>
  <date>between 1827 and 1833</date>
  <date>18--?</date>
  <date>November 13, 1947</date>
  <date>SEP 1958</date>
  <date>235 bce</date>
  <date>Summer, 1948</date>
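To make the point concrete, here is one way such values could be reduced to a comparable form. This is an illustrative sketch, not the normalization OAIster performs: the target representation (an earliest and latest year, with BCE years negative) and the function name `normalize_date` are assumptions, and the patterns cover only the kinds of values shown on the slide.

```python
import re


def normalize_date(value):
    """Map a free-text date to an (earliest_year, latest_year) tuple.

    Returns None when no year can be recognized. BCE years are negative.
    """
    text = value.strip().lower()
    if not text:
        return None

    # "235 bce" -> (-235, -235)
    m = re.fullmatch(r"(\d{1,4})\s*bce?", text)
    if m:
        year = -int(m.group(1))
        return (year, year)

    # "between 1827 and 1833" -> (1827, 1833)
    m = re.fullmatch(r"between\s+(\d{4})\s+and\s+(\d{4})", text)
    if m:
        return (int(m.group(1)), int(m.group(2)))

    # "18--?" -> (1800, 1899): the dashes stand for unknown digits
    m = re.fullmatch(r"(\d{2})--\??", text)
    if m:
        century = int(m.group(1)) * 100
        return (century, century + 99)

    # Fallback: take the last four-digit number, which handles
    # "1822", "November 13, 1947", "SEP 1958" and "Summer, 1948".
    years = re.findall(r"\b(\d{4})\b", text)
    if years:
        year = int(years[-1])
        return (year, year)

    return None


for sample in ["1822", "between 1827 and 1833", "18--?",
               "November 13, 1947", "SEP 1958", "235 bce", "Summer, 1948"]:
    print(sample, "->", normalize_date(sample))
```

Reducing everything to a year range throws away month and season detail, but it lets a search interface sort and limit by date across providers that each wrote dates their own way.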

Why use a CV (controlled vocabulary)?
• Sample subject values
  <subject>30,51,52</subject>
  <subject>1852, Apr. 22. E[veritt] Judson, letter to Philuta [Judson].</subject>
  <subject>Slavery--United States--Controversial literature</subject>
  <subject>view of interior with John Henry sculpture</subject>
  <subject>Particles (Nuclear physics) -- Research.</subject>

Best practices
• Fixing more than half of the data providers is cumbersome
• Individuals at OAI-enabled institutions started a “Best Practices” group to inform data providers what they ought to do
  • bin/wiki.pl?TableOfContents

2nd phase OAI
• “Best Practices” group sponsored by the Digital Library Federation, which also…
• Sponsors our latest grant
  • Better and more easily calculated statistics
  • Search interface improvements
  • Clustering / classification techniques
  • Using richer metadata

Clustering / classification
• Using automated means to take a selection of metadata and determine “what it’s about”
• Working with Emory University (one of our grant partners) to test their tool
• Results will be integrated into search so can search in smaller group of OAIster records

Using richer metadata
• Data providers must use simple Dublin Core
• Very sparse schema for describing objects
  • dc:title must contain main title, sorted title and alternative titles
  • dc:subject doesn’t distinguish between geographical, hierarchical, temporal…

Using richer metadata
• Encouraging use of richer metadata, especially MODS (Metadata Object Description Schema) from LOC
• Developed testbed for grant deliverables
  • currently only shows MODS work…
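The loss of detail described in these two slides can be illustrated by flattening a small MODS record into simple Dublin Core. This is a sketch under stated assumptions: the sample record is made up, the function name `mods_to_simple_dc` is hypothetical, and the mapping covers only titles and subjects rather than any official MODS-to-DC crosswalk.

```python
import xml.etree.ElementTree as ET

MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# Made-up MODS record: titles and subjects each carry a type.
SAMPLE_MODS = """<mods xmlns="http://www.loc.gov/mods/v3">
  <titleInfo><title>Example photograph</title></titleInfo>
  <titleInfo type="alternative"><title>Sample image</title></titleInfo>
  <subject><geographic>Michigan</geographic></subject>
  <subject><temporal>1900-1910</temporal></subject>
  <subject><topic>Gardens</topic></subject>
</mods>"""


def mods_to_simple_dc(mods_xml):
    """Flatten a MODS record into simple DC (element name, value) pairs."""
    root = ET.fromstring(mods_xml)
    pairs = []
    # Main and alternative titles both end up as plain dc:title.
    for title in root.iterfind("mods:titleInfo/mods:title", MODS_NS):
        pairs.append(("dc:title", title.text))
    # Geographic, temporal and topical subjects all become dc:subject.
    for subject in root.iterfind("mods:subject", MODS_NS):
        for part in subject:
            pairs.append(("dc:subject", part.text))
    return pairs


for name, value in mods_to_simple_dc(SAMPLE_MODS):
    print(name, "=", value)
```

After flattening, nothing in the output says which title is the alternative one or which subject is a place rather than a time span, which is the sparseness the slides complain about.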

Other stuff
• Well, make it smaller somehow…
• Clean up Boolean interface
  • squinch fields together
  • include more normalization
• Make it available through federated search
• Proselytize sharing metadata
• Test, test, test

Contact me
• Kat Hagedorn
• UM Library Information Technology