Proxy Searching of Non-Searchable & Poorly Searchable Open Access Archives of Digital Scholarly Journals Dr. Péter Jacsó, Professor Department of Information and Computer Sciences University of Hawaii at Manoa Honolulu, Hawaii International Conference on Asian Digital Libraries December 8-12, 2003 Kuala Lumpur, Malaysia
High quality content in Web-born journals in all disciplines Multiple year archives – browsing by TOC, volumes and issues No search engine – no money for development/licensing Modest web-site search engines, like Jacsó No exact phrase, proximity, positional operators for FT archives (information industry = industry information) primitive relevance ranking
Non searchable archives Jacsó “made searchable in late 2003”
Poorly searchable Jacsó
283 items are reported with 90% or higher “relevance” Jacsó
Call in the best web-wide search engines as proxy searchers Jacsó
Searching archive at the sub-domain level - AllTheWeb Jacsó