Federated & Meta Search

Slides:



Advertisements
Similar presentations
1 Use of Electronic Resources in Research Prof. Dr. Khalid Mahmood Department of Library & Information Science University of the Punjab.
Advertisements

Google and Beyond… Hatch Library Bay Path College / Spring 2010.
Aggregation Services & Library Consortia N V Sathyanarayana Informatics (India) Limited, Bangalore, India.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
Best Web Directories and Search Engines Order Out of Chaos on the World Wide Web.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
1 Pertemuan 20 Searching Mechanisms Matakuliah: M0284/Teknologi & Infrastruktur E-Business Tahun: 2005 Versi: >
Search engines. The number of Internet hosts exceeded in in in in in
Best Web Directories and Search Engines Order Out of Chaos on the World Wide Web.
WHAT HAVE WE DONE SO FAR?  Weeks 1 – 8 : various components of an information retrieval system  Now – look at various examples of information retrieval.
What’s The Difference??  Subject Directory  Search Engine  Deep Web Search.
Academic Research to Support Arguments.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
The Invisible Web Cynthia Rooley Computer Research.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Roy Tennant California Digital Library Is Metasearch Dead?
HELPING YOUR LIBRARY BE THE BEST PARTNER FOR RESEARCH.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
Web Scale Discovery Service Vs Federated Search NIKESH NARAYANAN
OpenURL Link Resolvers 101
7. Approaches to Models of Metadata Creation, Storage and Retrieval Metadata Standards and Applications.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Evaluating IR (Web) Systems Study of Information Seeking & IR Pragmatics of IR experimentation The dynamic Web Cataloging & understanding Web docs Web.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
CSM06 Information Retrieval Lecture 6: Visualising the Results Set Dr Andrew Salway
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
Productive Strategies for Internet Searches The content and images here are used for teaching purposes only and used under the “fair use” doctrine of U.S.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
Searching for NZ Information in the Virtual Library Alastair G Smith School of Information Management Victoria University of Wellington.
1 SEARCHING FOR TRUTH Locating Information on the WWW chapter 5.
Web Search Architecture & The Deep Web
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Taking the Library Back from Google Abe Lederman, President and CTO October 18-20, 2007.
Chapter 20 Asking Questions, Finding Sources. Characteristics of a Good Research Paper Poses an interesting question and significant problem Responds.
Learning how to search on the web “If all you ever do is all you’ve ever done, then all you’ll ever get is all you’ve ever got.” (author unknown)
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Information Retrieval in Practice
Education 499-R01 Search Basics.
Information Sources for Academic Work: Beyond Google and Wikipedia
Search Engine Architecture
Introduction to Library Research: CO 1003
CIW Lesson 6 Web Search Engines.
Wikis in Action: A Wiki as a Research Guide
Understand Internet Search Tools
Taxonomies, Lexicons and Organizing Knowledge
Search Engines & Subject Directories
Spicing Up Your Knowledge Management Strategy
Eric Sieverts University Library Utrecht Institute for Media &
WIRED Week 2 Syllabus Update Readings Overview.
Information Retrieval
Part Three SOURCES AND COLLECTION OF DATA
DIGITAL LIBRARY.
1.01- Understand Internet search tools and methods.
أدوات البحث عبر الانترنت
ثانيا :أدوات البحث عبر الانترنت
Introduction into Knowledge and information
1.01- Understand Internet search tools and methods.
Searching the Internet
Introduction to Information Retrieval
Search Engines & Subject Directories
Search Engines & Subject Directories
Introduction to metadata for IDAH fellows
CS/INFO 430 Information Retrieval
Christopher C. Brown Reference Librarian
Lesson 2: Gathering and Organizing Information Using ICT KEY QUESTION: HOW DO YOU GATHER AND ORGANIZE INFORMATION USING THE COMPUTER AND INTERNET?
Presentation transcript:

Federated & Meta Search What are they? Environment Library (institutional), Everywhere (Web) Content Web, Databases, Catalogs (books), (numerical) data Users Researchers, Students, Academics, Anyone How are they used? Comparing results Widest possible information set for retrieval

Do you use Metasearch? For research? When shopping? Research papers General information seeking When shopping? Trips Books Technical support (help)? What else?

What are Digital Libraries? What’s not a digital library? The Web, Lexis-Nexis, UTNetCAT, ACM DigLib, YouTube, Amazon.com, your laptop’s hard drive? Users think they’re content Librarians think they’re institutions & services Are they digital content only? An easier, digital way to find physical content or help? “Content, collections & communities” How do all of these fit together for Info Retrieval? Organizing everything for effective retrieval seems to be the key challenge Making everything (possible) searchable is the key feature for users. Metasearch is the key to Digital Libraries

Digital Library = Virtual Library? Freely available Web content is a pretty good digital library Your own content is a good library (for re-finding content) Databases & Indexes are traditional library content. Now more digital Should it matter where the content is? Costs? Findability? Scalability?

Federated Search Everything is accessible Legal issues & pricing is coordinated Clustering & redundant information is processed accordingly (cheapest first?) Query syntax is universal & transformed for each dataset Databases, catalogs & text Relevancy is weighted & precise Multiple vendors & open access sources A balance? How “deep” in the deep Web?

Web Dynamics & Metasearch Different documents have many different characteristics Web documents vs. other types of content Links, Metadata, Genre, Dynamically changing How well is the Web indexed? In terms of completeness? 60%? Metasearch is an index of the indices Parallel queries are not always the same Special purpose search engines a better idea? Google Scholar vs. Google Is Personalized (meta) search the answer? Special purpose is your purpose Relevance, ranking & importance Pricing, availability, locality

Categorizing Web search results The interface on metasearch may be more important to users than the content Understanding results over finding (all) content Show results in context - use categories Understanding searches Building a taxonomy for results Customized for each result set? Show when there aren’t any results When results don’t rank high enough Do we need more overviews for results? Visualization for clustering

Category Building for Search How deep, shallow, lean or rich should categories be? Should the content be the main criteria for categories? Host, links, user perspective, genre? What features of content should be used to cluster results? For a metasearch?

Fast-feature categorization Online lean techniques DNS, time visited, format, language, size, index date Online rich techniques Fit to existing categories such as ODP, Yahoo!, Music, Gov, Inventory Offline techniques Directory hierarchy Query probing Results, pages, words, (category) nodes, depth & type of hierarchy Understanding the content is critical

Yahoo! Cataloging the Web A non-automated, technique How do information professionals build an “index” of the Web? Cataloging applies to the Web Indexing with synonyms Browsing indexes vs searching them Comprehensive index not the goal Quality Information Density Yahoo’s own ontology – points to site for full info Subject Trees with aliases (@) to other locations “More like this” comparisons as checksums

Yahoo uses tools for indexing

More metasearch tools Scroogle Thumshots.org Ranking Jux2 Search Engine Relationship Chart