Refining – Finding Words/expanding Taly Sharon
Contents Expanding/Learning terms Categorization/Clustering engines Google Suggest SurfWax FocusWords When you don’t know where to start
Make Longer Queries Yahoo Harvest Digital Average Search Terms per Query Overall Experienced
Adding Words Holocaust23,200,000Holocaust holocaust memorial 836,000holocaust memorial holocaust memorial budapest 42,300holocaust memorial budapest holocaust memorial budapest danube 4,910holocaust memorial budapest danube holocaust memorial budapest danube promenade 692holocaust memorial budapest danube promenade
Classification/Categorization Classification: the process of deciding the appropriate category for a given document. Examples: –deciding what newsgroup an article belongs to. –what folder an message should be directed to. –what is the general topic of an essay.
Clustering The process of automatically grouping documents.
Search Categorization/Clustering The result documents are ordered according to categories. The searcher can select the relevant category to display the related documents. Examples: –Vivisimo/Clusty –Excite –Teoma –Exalead
Clusty
Excite
Ask
Exalead
Google Suggest As you type – you get query suggestions and number of results per query.
Google Suggest (2)
Google Suggest
Yahoo Search Assist
SurfWax FocusWords SurfWax has an option “Focus” This option invokes the FocusWords mechanism You get suggestions to make your query: –Broader –Similar –Narrower
SurfWax FocusWords
When you don’t know how to start Reverse Dictionary Glossaries and Dictionaries Taxonomy/Folksonomy Pearl Culturing Analyzing pages Finding similar pages –Google’s related: –Alexa
Reverse Dictionary OneLook reverse dictionary: dictionary.shtml dictionary.shtml Example: “bird of prey” => raptor Example: economical measure of a nation’s wealth => Gross Domestic Product
Glossaries and Dictionaries Google search: – –glossary OR thesaurus OR dictionary OR taxonomy Example 1: agriculture glossary – – – Example 2: agriculture thesaurus – – (multilingual) – industry/agriculture.asphttp:// industry/agriculture.asp
Taxonomy/Folksonomy Taxonomies –Found via directory search (example DMOZ): bin/search?search=taxonomy bin/search?search=taxonomy – (paid) Folksonomy –Use tags in Technorati Delicious
del.icio.us
del.icio.us
Pearl Culturing What to do when you don’t have the category nor the right keywords? Find one good relevant website Look it up in directories You will find: –the category/main keywords –authoritative websites Useful search engine: ExaleadExalead
Analyze Pages Distilling: what is problematic in a bad page? –what is wrong? Is there an interfering keyword/term appearing. –Remove interfering terms (using “-”). Identifying clues and patterns in a good page. –Read the document, what are the clues? –Look for new keywords, word combinations and other things differentiating between it to non-authoritative documents. –Use frequency counter: s.htmlhttp:// s.html
Frequency Counter
Wordcounter
References s.htmlhttp:// s.html
Exercises 1.How are bad user interfaces called (hint: try Google suggest) 2.Reverse dictionary a.Find relevant keywords for chemistry b.What is the terminology for when menstruation stops? c.How was the separation between the west and the soviet union called? 3.What are the related terms to Competitive Intelligence? 4.Check suggestions from Google Suggest for a query starting with biofuel. 5.Using SurfWax, learn options to focus or broaden the query: biodiesel. 6.Identify the most relevant terms in the website: 7.Identify the most relevant terms in the Biofuel Wikipedia entry Search in Onelook reverse dictionary and in other glossaries terms: fuel, natural energy, geothermal, and other terms. Look at the results.