Meta Search Engines Taly Sharon. T.Sharon Search Engine Seminar2 Contents Search Engines (SEs) generations Meta Search Engine (MSE) Why use several SEs.

Slides:



Advertisements
Similar presentations
Database VS. Search Engine
Advertisements

Dogpile.com Metasearch. What is metasearch technology Webopedia (2012): A search engine that queries other search engines and then combines the results.
Crawling, Ranking and Indexing. Organizing the Web The Web is big. Really big. –Over 3 billion pages, just in the indexable Web The Web is dynamic Problems:
Project Proposal.
Search Engine – Metasearch Engine Comparison By Ali Can Akdemir.
Computer Information Technology – Section 3-2. The Internet Objectives: The Student will: 1. Understand Search Engines and how they work 2. Understand.
 How many pages does it search?  How does it access all those pages?  How does it give us an answer so quickly?  How does it give us such accurate.
Best Web Directories and Search Engines Order Out of Chaos on the World Wide Web.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan
T.Sharon-A.Frank 1 Internet Resources Discovery (IRD) Additional Aspects.
Internet Resources Discovery (IRD) Search Engines Quality.
(c) Maria Indrawan Distributed Information Retrieval.
Search Engine Usability Taly Sharon
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Best Web Directories and Search Engines Order Out of Chaos on the World Wide Web.
The Players The Majors Dead Search Engines International Search Engines Metasearch Engines.
Internet Resources Discovery (IRD) Meta-Search Engines (MSEs)
T.Sharon-A.Frank 1 Internet Resources Discovery (IRD) Intelligent IRD.
Unit 3 Web Search Engines. Can You Find the Answers? n Connect to Google Google n Search for items on Iran Records ________ n Combine Iran with nuclear.
Search Engine Usability Taly Sharon
 Search engines are programs that search documents for specified keywords and returns a list of the documents where the keywords were found.  A search.
Internet Searches The Basics for Effective Research Using the World Wide Web.
Mamma.com.
Web Document Clustering By Sang-Cheol Seok. 1.Introduction: Web document clustering? Why ? Two results for the same query ‘amazon’ Google : currently.
 Search Tools:  There are many type of search tools that you can use to locate information on the World Wide Web.  Various search tools are developed.
Promotion & Cataloguing AGCJ 407 Web Authoring in Agricultural Communications.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
SEARCH ENGINES Jaime Ma, Vancy Truong & Victoria Fry.
Fourth Edition Discovering the Internet Discovering the Internet Complete Concepts and Techniques, Second Edition Chapter 3 Searching the Web.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
Making the most of the World Wide Wonderland 22nd February 2005 An introduction to effective internet searching Lorraine Sperring Learning & Teaching Resources.
WISER Humanities: Quality Information on the Internet Johanneke Sytsema Linguistics Subject Consultant
Yr 12 OCR Nationals – LEVEL 3 Unit 2 – Getting Started.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Personalized Search Xiao Liu
Effective Search Strings Continued. Truncated Searches A special symbol (*) which allows you to search simultaneously for several words with the same.
Search engines are used to for looking for documents. They compile their databases by employing "spiders" or "robots" to crawl through web space from.
Choosing a Search Engine Taly Sharon Thanks to Ariel Frank, Bar-Ilan University
Where do I find it? Created by Connie CampbellConnie Campbell.
Search Engines June 20, 2005 LIBS100 Linda Galloway.
Web Index D irectory WEB Which kind to use? All Which kind to use? All S earch E ngine General SpecialtyGeneralSpecialty Meta-S earch.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Search Engines.
Stop Searching and Start FINDING: Strategies for Effective Web Research.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
Sharon M. Jordan Assistant Director for Program Integration U.S. DOE Office of Scientific & Technical Information Vantage Point: Government R&D Results.
Searching the World Wide Web: Meta Crawlers vs. Single Search Engines By: Voris Tejada.
Internet Research – Illustrated, Fourth Edition Unit B.
LIR 10: Week 10 Advanced WWW Topics. Class Announcements New features on Section 2904 Schedule Missing Homework Online Quiz due 11/16 Another WWW directory.
Unit B Constructing Complex Searches Internet Research Third Edition.
WISER Humanities: Quality Information on the Internet Johanneke Sytsema Linguistics Subject Consultant Judy Reading Reader.
Internet Power Searching Finding Pearls in a Zillion Grains of Sand.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
W orkshops in I nformation S kills and E lectronic R esources Oxford University Library Services – Information Skills Training Finding quality information.
Effective Internet Search Strategies: Search Engines & Directories Wendy E. Moore, M.S. in L.S. Acquisitions/Serials Librarian University of Georgia School.
Learning how to search on the web “If all you ever do is all you’ve ever done, then all you’ll ever get is all you’ve ever got.” (author unknown)
Third Edition Discovering the Internet Discovering the Internet Complete Concepts and Techniques, Second Edition Chapter 3 Searching the Web.
Lecture 4 Access Tools/Searching Tools. Learning Objectives To define access tools To identify various access tools To be able to formulate a search strategy.
Information Architecture
Search Engines and Search techniques
Federated & Meta Search
ITE 130 Web Searching.
Information Integration for Digital Libraries
IST 497E Information Retrieval and Organization
Top Search Engines.
Presentation transcript:

Meta Search Engines Taly Sharon

T.Sharon Search Engine Seminar2 Contents Search Engines (SEs) generations Meta Search Engine (MSE) Why use several SEs (Motivation)? Highlighted MSEs (Mamma, Dogpile, Vivisimo, Ixquick, KartOO) Hebrew MSEs MSE comparison When to use MSE – pros and cons How to choose MSE?

T.Sharon Search Engine Seminar3 Search Engines Generations 1st Generation - Basic SEs: 2nd Generation - Meta SEs: 3rd Generation - Popularity SEs:

T.Sharon Search Engine Seminar4 2nd Generation SEs - MetaSEs Using several SEs in parallel. The results are filtered, ranked and presented to the user as a uniformed list. The ranking is a combination of the number of sources each page appeared in, and the ranking in each source.

T.Sharon Search Engine Seminar5 Meta SE is a Meta-Service It doesn ’ t use an Index/database of its own. It uses other external search services that provide the information necessary to fulfill user queries.

T.Sharon Search Engine Seminar6 Meta Search Engine MetaCrawler YahooWeb CrawlerOpen TextLycosInfoSeekInktomiGalaxyExcite Google · Yahoo · Jeeves Ask About · LookSmart · Overture FindWhat

T.Sharon Search Engine Seminar7 Premises of a Meta SE No single search is sufficient. Problem in expressing the query. Low quality references can be detected.

T.Sharon Search Engine Seminar8 Why use Several SEs? Search Engines differ more than we think!

T.Sharon Search Engine Seminar9 Overlap between Google and Yahoo Source: Jux2 analysis of 500 top search terms, April

T.Sharon Search Engine Seminar10 Who Overlaps Whom?

T.Sharon Search Engine Seminar11 Try it jux2

T.Sharon Search Engine Seminar12 MSE - Motivation 1.The number and variety of SEs. 2.Each SE provides an incomplete snapshot of Web. 3.Users are forced to try and retry their queries across different SEs. 4.Each SE has its own interface. 5.Irrelevant, outdated or unavailable responses. 6.Each query is independent. 7.No individual customization. 8.The result is not homogenized.

T.Sharon Search Engine Seminar13 Problems of MSEs No advanced search options. Using the lowest common denominator. Sponsored results from the SEs are not highlighted.

T.Sharon Search Engine Seminar14 Highlighted MSEs

T.Sharon Search Engine Seminar15 Mamma

T.Sharon Search Engine Seminar16 rSort: Mamma ’ s Ranking Algorithm Each duplicate search result is considered a 'vote' for that result. Pages with the highest number of votes go at the top of our result set the method of voting we use is a simplified version of the "Condorcet Method", named after the mathematician Marquis de Condorcet who invented this voting procedure in the 18th century. One of the big advantages of this ranking method is the elimination of search engine spam. Spammers often have difficulty spamming more than one engine at the same time, as different spamming methods must be used for each search engine. Spam results will tend to receive fewer votes from multiple sources. A spammer may have top ranking on one search engine, but they won't achieve it on Mamma unless they're able to spam ALL of our sources, an insurmountable task for even the best spammer.

T.Sharon Search Engine Seminar17 Dogpile

T.Sharon Search Engine Seminar18 Dogpile Advanced

T.Sharon Search Engine Seminar19 Dogpile Advanced

T.Sharon Search Engine Seminar20 Dogpile Advanced

T.Sharon Search Engine Seminar21 Dogpile Advanced

T.Sharon Search Engine Seminar22 Dogpile Preferences

T.Sharon Search Engine Seminar23 Dogpile Preferences

T.Sharon Search Engine Seminar24 Vivisimo/Clusty Viv í simo supports the most advanced features of the major search engines using one Viv í simo syntax, which follows the most standard conventions. Viv í simo translates your query into the corresponding syntax of each underlying search engine. Also, Viv í simo only queries the search engines that support your chosen syntax.

T.Sharon Search Engine Seminar25 Clusty

T.Sharon Search Engine Seminar26 Vivisimo Advanced

T.Sharon Search Engine Seminar27 Vivisimo Advanced

T.Sharon Search Engine Seminar28 Ixquick

T.Sharon Search Engine Seminar29 Ixquick

T.Sharon Search Engine Seminar30 Ixquick

T.Sharon Search Engine Seminar31 KartOO – Visual MSE

T.Sharon Search Engine Seminar32 MetaSEs in Hebrew: Clusty Start

T.Sharon Search Engine Seminar33 Clusty

T.Sharon Search Engine Seminar34 Clusty

T.Sharon Search Engine Seminar35 When to use a MSE? When single Basic-SE fails to provide good results. One-stop shopping - prefer to search multiple SEs/sites at once to get blended ranked results (so as to save effort/time). Searching for multi-faceted topics. Want to get clustered results to focus search on the relevant keywords. Looking for current events/news.

T.Sharon Search Engine Seminar36 For quick and dirty searches. If you want an answer fast, you may have better luck querying multiple engines simultaneously. For broad and shallow searches. Meta searching is an excellent approach if the purpose of your search is to get an overview of a topic. To assess potential keywords for an unfamiliar subject. What better way to discover search terms than to see how they appear in a cross section of documents across the web? To see how different engines handle the same query. This is an excellent way to get to know the "personalities" of different search engines -- their strengths, weaknesses, and types of queries they handle best.

T.Sharon Search Engine Seminar37 MSE pros Useful when you want to retrieve a relatively small number of relevant results an excellent choice for obscure topics a good option when you are not having luck finding what you want when you search appropriate when you want to get an overall picture of what is available on the Web on your topic

T.Sharon Search Engine Seminar38 MSE cons use is limited primarily to simple queries little or no field searching is available most services return a limited number of results that do not represent the totality of results from any source engine Sponsored results may are not highlighted (even though probably not first)

T.Sharon Search Engine Seminar39 How to Choose your MetaSE Search engines used Operators supported Special features Speed Presentation

T.Sharon Search Engine Seminar40 Meta-SEs Features Chart Red – not working

T.Sharon Search Engine Seminar41 Vivisimo: link: Not supported?

T.Sharon Search Engine Seminar42

T.Sharon Search Engine Seminar43 Practical Recommendations  Use Ixquick for fast results and maximal syntax flexibility  Use Vivisimo/Clusty (start) for Clustering and/or Hebrew  Use Dogpile to include Google+Yahoo!, date range, or spelling corrections.  Use none for non-MSE tasks (see MSE cons) …

T.Sharon Search Engine Seminar44 Exercises Find a presentation by Mary Ellen Bates Learn about the litrature of Pablo Neruda from a research/educational point of view Hint , query: +domain:edu +literature +"pablo neruda “ Explore the different meanings of Jaguar

T.Sharon Search Engine Seminar45 Bibliography earch.html earch.html engines.htm engines.htm er.pdf er.pdf metacrawler.pdf metacrawler.pdf