Copyright © 1998 – 2003 MuseGlobal, Inc. All Rights Reserved

MetaSearch - Searching
What it involves
How to survive in a world without standards
Dr Peter Noerr, MuseGlobal, Inc.

The (non) Metasearch

Search 1
 - Find search engine
 - Logon
 - Compose search
 - Run search
 - Study results
 - Refine results
 - Find document
 - Get document

Search 2
 - Find search engine
 - Logon
 - Compose search
 - Run search
 - Study results
 - Refine results
 - Find document
 - Get document

Search 3

The Metasearch

Search 1
 - Find MetaSearch engine
 - Logon
 - Compose search
 - Select Sources
 - Run search (MetaSearch engine runs Searches 1a, 1b, 1c)
 - Study results
 - Refine results
 - Find document
 - Get document
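
The "Run search" step, where one user search fans out into Searches 1a, 1b and 1c, might look roughly like the sketch below. It is an illustration only, not MuseGlobal's implementation: the three source functions, the SOURCES table and the metasearch() helper are hypothetical stand-ins, and a real engine would speak each Source's own protocol instead of returning canned records.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for real Sources; each returns a list of record dicts.
def search_source_a(query):
    return [{"source": "A", "title": f"{query} in library catalogues"}]

def search_source_b(query):
    return [{"source": "B", "title": f"{query} and standards"}]

def search_source_c(query):
    # Simulates a Source that is down, so the failure path is exercised.
    raise ConnectionError("source C is not responding")

SOURCES = {"1a": search_source_a, "1b": search_source_b, "1c": search_source_c}

def metasearch(query, selected):
    """Run one query against every selected Source and merge what comes back."""
    results, failures = [], {}
    with ThreadPoolExecutor(max_workers=max(1, len(selected))) as pool:
        # Submit all searches first so they run concurrently.
        futures = {name: pool.submit(SOURCES[name], query) for name in selected}
        for name, future in futures.items():
            try:
                results.extend(future.result(timeout=10))
            except Exception as exc:
                # One failing Source should not sink the whole search.
                failures[name] = str(exc)
    return results, failures

results, failures = metasearch("metasearch", ["1a", "1b", "1c"])
```

The user composes and runs a single search; per-Source failures are collected separately so that the results from the Sources that did answer can still be shown.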

The MetaSearch - Benefits

 - About half the time
 - One logon (authentication)
 - One search syntax
 - Simple Source selection
 - Consistent results display
 - Consistent refinement tools (sort, dedupe …)
 - One click to get full text (or doc delivery)
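
As a rough illustration of the "consistent refinement tools" benefit, the sketch below applies one dedupe and one sort to records merged from several Sources. The record layout (title, year, source) and the matching rule (normalised title plus year) are assumptions made for this example; real deduplication rules are usually more elaborate.

```python
import re

def dedupe_key(record):
    """Build a match key from title and year; punctuation and case are ignored."""
    title = re.sub(r"[^a-z0-9]+", " ", record["title"].lower()).strip()
    return (title, record.get("year"))

def dedupe(records):
    """Keep the first record seen for each key, preserving arrival order."""
    seen, unique = set(), []
    for record in records:
        key = dedupe_key(record)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique

def sort_by_year(records, newest_first=True):
    """Sort on year; records without a year go last in either direction."""
    sign = -1 if newest_first else 1
    return sorted(
        records,
        key=lambda r: (r.get("year") is None, sign * (r.get("year") or 0)),
    )

# Illustrative merged result set from three hypothetical Sources.
merged = [
    {"source": "A", "title": "Metasearch: An Overview", "year": 2003},
    {"source": "B", "title": "metasearch - an overview", "year": 2003},
    {"source": "B", "title": "Z39.50 in Practice", "year": 1999},
    {"source": "C", "title": "Untitled report"},
]
unique = sort_by_year(dedupe(merged))
```

Because the same two functions run over every Source's records, the user sees one behaviour for sorting and duplicate removal regardless of where a record came from.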

The MetaSearch - Problems

 - Connection protocols
    – Multiple standards
       - Poor implementation
       - Patchy implementation
       - Local variations
    – Proprietary protocols
       - Changes with time (http/html mostly)
 - Semantics
 - Record formats
 - Inconsistent Source functionality
 - Authentication
 - Source selection

The MetaSearch – Work Flow

 - Find MetaSearch engine
 - Authenticate
 - Search
 - Results
 - Document(s)

Peter, Tamar, Janifer

Searching – Components for ONE Source

 - Authentication: Type (IP, ID/pwd, URL…), by Provider, by institution, Proxy use, values
 - Connection handling: Protocol, single/dialogue, search syntax, semantic mapping
 - Secondary processing: (Optional), In-Search processing, Secondary search
 - Results re-formatting: (Optional), field mapping, character encoding, record enrichment
 - Result Set handling: (Optional), Combine/not results, Canonical format, key generation
 - Session management: Stateful, stateless; priority; permissions; personalisation
 - Profile
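
The field mapping and character encoding items listed for a single Source can be pictured as a small per-Source profile that drives conversion into a canonical record. The sketch below is hypothetical throughout: the SourceProfile class, its attribute names, the "TI"/"AU" field tags and the canonical field names are invented for illustration and are not taken from any Muse product.

```python
from dataclasses import dataclass, field

@dataclass
class SourceProfile:
    """Hypothetical per-Source settings; the attribute names are illustrative."""
    name: str
    encoding: str = "utf-8"
    field_map: dict = field(default_factory=dict)  # source field -> canonical field

def to_canonical(raw, profile):
    """Re-format one raw record into the canonical field names."""
    record = {"source": profile.name}
    for source_field, value in raw.items():
        canonical = profile.field_map.get(source_field)
        if canonical is None:
            continue  # fields the profile does not map are dropped
        if isinstance(value, bytes):
            # Character encoding differs per Source, so decode using the profile.
            value = value.decode(profile.encoding)
        record[canonical] = value.strip()
    return record

catalogue = SourceProfile(
    name="catalogue",
    encoding="latin-1",
    field_map={"TI": "title", "AU": "author"},
)
raw = {"TI": b"Recherche f\xe9d\xe9r\xe9e ", "AU": "Smith, J.", "XX": "ignored"}
canonical = to_canonical(raw, catalogue)
```

Keeping the mapping in data rather than in code is one way to cope with a large number of Sources: adding a Source then means writing a new profile, not a new converter.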

Future Challenges

 - Protocols
    – Standards: Z39.50/SRW/SRU, XQuery, CQL
    – XML protocols, MetaSearch access
 - Semantics
    – Exposed ontologies (OWL)
 - Functionality
    – Search engine evolution
 - Data
    – Metadata standards (inc semantics of data)
 - Source Description
    – Standards: Explain, RDF, UDDI, WSDL
    – Functionality, access, syntax, semantics, formats

These items are being worked on by the NISO Metasearch Initiative committees. Participation is welcomed from all, from anywhere.

Conclusion

This is not a simple task: the technology is sophisticated and the numbers are daunting.

 - Content Providers (organisations): 4,000 worldwide (est.)
 - Sources (databases, search engines, etc.): 10,000 worldwide (est.)
 - Muse’s Global Source Library: 2,500 worldwide

Contact the author: Dr Peter Noerr, CTO, MuseGlobal