LIS618 lecture 9 Google Thomas Krichel 2011-11-21.

Slides:



Advertisements
Similar presentations
LIS651 lecture 3 taming PHP Thomas Krichel
Advertisements

LIS618 lecture 6 Thomas Krichel structure DIALOG –basic vs additional index –initial database file selection (files) Lexis/Nexis.
LIS651 lecture 3 functions & sessions Thomas Krichel
LIS650lecture 1 XHTML 1.0 strict Thomas Krichel
BY: Jamaican Hopscotch
Internet Search Methods 1.01 Understand Internet search tools and methods.
Standard 1.02 Investigate uses of the Internet and World Wide Web.
Search Techniques. It is imperative students use proper techniques when searching information on a computer system. It is imperative students use proper.
Google Chrome & Search C Chapter 18. Objectives 1.Use Google Chrome to navigate the Word Wide Web. 2.Manage bookmarks for web pages. 3.Perform basic keyword.
Advanced Google Becoming a Power Googler. (c) Thomas T. Kaun 2005 How Google Works PageRank: The number of pages link to any given page. “Importance”
Google for Genealogists. Google's mission statement “Organize the world's information and make it universally accessible and useful."
LIS618 lecture 9 Thomas Krichel Structure Google “theory”, see essay by Brin and Page fullpapers/1921/com1921.htm.
LIS618 lecture 9 Web retrieval Thomas Krichel
Google Search Using internet search engine as a tool to find information related to creativity & innovation.
Web Searching and PPT. Searching for Information on the Web  Goals: –Decrease number of search results –Increase number of relevant results  Method:
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Learn how to search for information the smart way Choose your own adventure!
Searching The Web Search Engines are computer programs (variously called robots, crawlers, spiders, worms) that automatically visit Web sites and, starting.
Chapter 5 Searching for Truth: Locating Information on the WWW.
Searching and Researching the World Wide: Emphasis on Christian Websites Developed from the book: Searching and Researching on the Internet and World Wide.
1 The World Wide Web. 2  Web Fundamentals  Pages are defined by the Hypertext Markup Language (HTML) and contain text, graphics, audio, video and software.
Cutting Through the Clutter Searching the Web. There is a wealth of information waiting for you on the internet, if you know the right tools to use and.
Net Search Engines The Which, Why and How Tim Landeck Handouts/PowerPoint available at:
Lesson 12 — The Internet and Research
Chapter 5 Searching for Truth: Locating Information on the WWW.
Internet Research Skills Workshop Presented By: Paul Chisholm Program Resource Teacher Gateway Education Centre March 6, 2007.
Searching Google Ms. Mary Condon Librarian Lowell Catholic High School.
Slide No. 1 Searching the Web H Search engines and directories H Locating these resources H Using these resources H Interpreting results H Locating specific.
LIS618 lecture 5 Thomas Krichel structure Google “theory”, mainly page rank Google query language Google special services and features –Images.
Introduction to HTML. What is a HTML File?  HTML stands for Hyper Text Markup Language  An HTML file is a text file containing small markup tags  The.
Beyond the Basics Steven Butzel, Nashua Public Library , Yahoo IM: nashuaref.
LIS618 lecture 10 Thomas Krichel Structure some repeats from last week other special syntaxes usenet news in google open directory project.
Google Analytics for Small Business Presented by: Keidra Chaney.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Search - on the Web and Locally Related directly to Web Search Engines: Part 1 and Part 2. IEEE Computer. June & August 2006.
LIS618 lecture 8 Credo and Gale Thomas Krichel
Chapter 8 Browsing and Searching the Web. 2Practical PC 5 th Edition Chapter 8 Getting Started In this Chapter, you will learn: − What is a Web page −
LIS618 lecture 5 Thomas Krichel Structure Theory on query languages Web information retrieval –Google “theory”, see essay by Brin and Page.
LIS618 lecture 3 Thomas Krichel Structure of talk Document Preprocessing Basic ingredients of query languages Retrieval performance evaluation.
Intro to PHP IST2101. Review: HTML & Tags 2IST210.
Link: link: restricts the results to those web pages that have links to the specified URL. There can be no space between link: and the URL. Source:
CPT 499 Internet Skills for Educators Session Three Class Notes.
陈贵梧 Chen Gui-wu Search. Outline l Google Overview l Basics of Google Search l Advanced Search Made Easy l Search Results Page l Google Tools l Questions.
Unit 1—Computer Basics Lesson 3 The Internet and Research.
Created by Branden Maglio and Flynn Castellanos Team BFMMA.
Lawrence Snyder University of Washington, Seattle © Lawrence Snyder 2004.
LIS618 lecture 8 Thomas Krichel Lexis/Nexis Lexis is a specialized legal research service Nexis is primarily a news services adds an important.
Selected Internet Search Engines Search Engine Database Advanced/ Boolean Other search options Miscellaneous Google Google google.co m Advanced Search.
Our MP3 Search Engine Crawler –Searching for Artist Name –Searching for Song Title Website Difficulties Looking Back.
A presentation by Patrick Douglas Crispen NetSquirrel.com Modified 2013 by Michael Wood.
©2003 Paula Matuszek GOOGLE API l Search requests: submit a query string and a set of parameters to the Google Web APIs service and receive in return a.
How to build a better Google? Adam Bak IST 497E November 21, 2002.
LIS654 lecture 4 more on omeka Thomas Krichel
IST 210: PHP Basics IST 210: Organization of Data IST2101.
1 UNIT 13 The World Wide Web. Introduction 2 Agenda The World Wide Web Search Engines Video Streaming 3.
 Every word matters. Generally, all the words you put in the query will be used.  Search is always case insensitive. A search for [ new york times ]
My Favorite Top 5 Free Keyword Research Tools –
To query or not to query! Review of search techniques, methods and …tricks Part of this presentation is adapted from:
Chapter 10: Web Basics.
Chapter 8 Browsing and Searching the Web
Understanding Search Engines
Browsing and Searching the Web
All About Google’s Search Features
1.01- Understand Internet search tools and methods.
ITE 130 Web Searching.
ZANZIBAR UNIVERSITY LIBRARY SERVICES Introduction
ZANZIBAR UNIVERSITY LIBRARY SERVICES Introduction
ZANZIBAR UNIVERSITY LIBRARY SERVICES Introduction
Searching effective on Google
Presentation transcript:

LIS618 lecture 9 Google Thomas Krichel

structure Google query language Google special services and features – official – officious

literature Calishain and Dornfest's “Google hacks”, O'Reilley 2003 Schneider & alii “How to do everything with Google”, McGraw Hill Osborne, 2004 Google web site ators.html

interfaces The simple interface has command driven features that make it more advanced than the advanced interface The advanced interface is a form interface to query language available on the simple interface. The Google toolbars for different browsers may be quite useful.

customizing Google These are available in the "" link on the Google home page. Preferences only stored as a cookie in the browser.

language preference You can set – preferences for finding pages in a certain language (set language to German and search for “Krichel”) – preferences for the language of the interface – both are impact search results. In fact the language preference is detected automatically by a http header that the browser usually sends. Thus you can set it in the browser too.

SafeSearch This is a tool for the automatic exclusion of explicit erotic material. This was a big controversy topic in the 90s, early 00s in libraries. There is a way to look the save search on a browser by logging in to a Google account. But it is still browser dependent and (o/s level) user account dependent, and requires the authorization of third party cookies.

Google Instant and results Google Instant is the feature that produces results while you type them. It differs by language and Google domain. – In google.com I am proposed “krichel” as a completion to “kric” – In google.ru, I am not ;-( If you change the number of results per page from the default, the showing of instant results is disabled.

location preference The location preference is normally extracted from the IP address of the browser’s computer. It may be set in the preference to some un- controlled string. But that will only take account of location inside the country domain. So Google ignores “moscow” as your location when searching google.es.

composing the query You should type your question as responded on the web site. E.g. to find the age of N.N. “N.N. born”. Sometimes Google will attempt a guess, e.g. “Drupal written in”. Try descriptive words, e.g. avoid “documentation” For queries that are obvious, like “thomas krichel” use the “I’m feeling lucky” button.

Boolean operators Default Boolean AND between terms Case insensitive Terms can be ORed with “ |”, or “OR” in all caps (!) e.g. “krichel | chen”. “or” is treated is a normal term Adjacent terms have to be put in double quotes. Boolean NOT can be expressed with the exclusion operator – e.g. “krichel –thomas”

wildcard and limit * is a wildcard for any word There is a limit of 10 words, but a * does not count towards the limit.

phrase searching Double quotes around more than one term will prefer the terms to be appearing next to each other. Double quotes around one term will require the term to be appears as such, without spelling variations or grammatical variation. Example “"ralf"” will not search for “ralph”.

similarity operator ~ is the similarity operator. It searches for synonyms. Synonyms are gleaned from the web, not from a thesaurus. This seem to include common spelling as well. examples – “~auto repair novosibirsk” vs “auto repair novosibirsk” – ~phillipp ~meyr gesis

query treatment Google prefers pages that have the search terms – in close proximity – in the same order as in the query Repeating a query term once adds weight to it repeating it twice has no further effect

spell checking Google makes spelling suggestions. They based on usage of query terms, not on a dictionary. – example: “untied stats” suggests “united states” – example: “beurocratic” But note that these suggestions depend on your interface language.

maps Most things that look like a location expression can be entered in the normal search interface and return a map.” Example “ 11 rue Boinod, Paris” Flights also have such a feature, if you give your flight number

map queries [n/a] you can compose map query as – point of interest near location – point of interest in location where point of interest is a destination of interest and location is a location. In the US, the name, state combination is sufficient. Examples “strip clubs near jackson heights” “shoe repair in Paris, France”

math I you can enter math in plain English words – example: “two times two” – example: “half of eleven” – example: “five megabytes in bytes” – example: “ten gallons in liters” also knows the standard operators “+”, “-”, “*”, “/”, ”^“ or “**”, “% of”

advanced math “!” factorial “choose”combination without replacement “sqrt”square root “log”logarithm (base 10) “ln”logarithm (base e) “lg”logarithm (base 2) “exp”e to the power of “mod”modulo (remainder) “sin”, “cos”, “tan” “csc”, “sec”, “ctn” “arcsin”, “arccos”, “arctan” “arccsc”, “arcsec”, “arcctn” “sinh”, “cosh”, “tanh”

site: This is an officially documented special syntax to limit searches to a domain. It breaks down if a path is included. It can not be used on its own, only with other query expressions. This can be used to build site-specific engines using Google. Such engines would use the local query and append the site limiter.

site: Example “"shit" krichel site:liu.edu -baseball - nyu” You could put longer site names “krichel site: You can use -site to exclude a site eg. “krichel - site:openlib.org -site:liu.edu”

filetype: This appears to find file with a certain type. But it only looks for the extension. Example – “copyright filetype:pptx”

related: related can be used with a site URL (without to find related pages. It can be combined with search terms. This is the same as the related link next to searches. The searches for – related:openlib.org/home/krichel – related: are very successful.

link: link: returns pages that link to a specific page. It is mentioned on the Google site. It does not seem to work properly. Example: link:wotan.liu.edu/home/krichel. Generally wotan is excluded in Google as a duplicate. Example: “link:openlib.org -site:openlib.org”, seems to lead to nonsense.

link: limitation You cannot combine a link: search with a regular keyword search. As soon as you do the "link" is interpreted as a normal search term Google does probably not return all the pages that match. Example: “link:openlib.org -site:openlib.org”

inofficial syntax There are special syntaxes that are not documented, but they appear to be working.

intitle: intitle: find in html only Example “intitle:lis618” find old powerpoints slides. Recent slides can be found with “intitle:powerpoint krichel lis618” because that is the title of the slides at this time. ;-(. It can be combined with other terms. Example: – “intitle:"Thomas Krichel"” – intitle:lis650 -krichel -thomas -site:wotan.liu.edu

intext: intext: find in text only. This will exclude occurrences of the search term in anchor or title data. example: “intext:"miserable failure"“ will not bring up George W. Bush's official biography, as “miserable failure” does, or did at one time.

allintext: This requires all the term that follow to be in the text. The order seems to depend on the position of the words. But the words are, supposedly, requiered to be there “allintext:krichel love I thomas” “allintext:I love thomas krichel” “allintext:krichel novosibirsk 1209”

inanchor: inanchor: This option requests pages, for which there is another page that links to them with the anchor text in the query. Example: – “inanchor:"courses information" krichel” finds it now. – “inanchor:"nsk 10 hourly"” – “inanchor:lis650” finds html

inanchor inanchor: can be used to restrict an otherwise popular term to the instances of something you want. Example – “inanchor:listserv library” – “inanchor:"list of my courses"” finds my courses page because it has a link with that text from an old version of my homepage.

allinanchor: This requires all the following terms to be in the anchor. Example: – allinanchor:Томас Крихель

cache: cache: pages that are in the google cache, useful if query result has nothing to do with the query terms Example: “cache:openlib.org/home/krichel” will show the cached version of the page. If you add further terms, they will be highlighted.

cache: This can be used to check if a page is indexed. Example: – “cache:3d.openlib.org” – “cache:openlib.org”

inurl: It finds in URL only. It can use star as a wildcard Examples – “inurl:list” – “inurl:krichel -thomas” – “inurl:*.openlib.org” – “inurl:wotan.liu.edu/omeka/ grandmother”

allinurl: Requires all the terms that follow it to be in the URL address of the page found. Example: – “allinurl:visitor krichel house_rules” – “allinurl:visitor krichel house_rules site:wotan.liu.edu” – “allinurl:visitor krichel house_rules site:openlib.org”

daterange: This limits the search to pages indexed between a range of dates. Changed pages are reindexed, unchanged pages are not reindexed when the crawler visits a page. Dates are expressed in the Julian period, i.e. number of days after :00 UTC of the Julian calendar. This date is used by astronomers. Find a converter with the Google search “julian converter site:nasa.gov” example: “daterange: krichel”

info: info: shows information about a page. The argument to info must be a real existing page that is in the Google index. Example: “info:openlib.org/home/krichel”

mixing special syntax expressions The link: syntax does not mix with others. Other bad ideas: – "site:openlib.org –inurl:openlib" – "site:edu site:com" Things that work well – intitle:search – Intitle:biology inurl:help

examples George Bush site:nytimes.com "Copyright * The New York Times" "George Bush" Intitle:"directory * * trees" Botany intitle:"directory of" site:edu "powered by blogger" or site:blogspot.com "classical music" (inurl:mailman | inurl:listserv) google special syntax –site:google.com

stocks on google stocks: ticker will look up a ticker symbol ticker at you can find ticker symbols there ticker symbols are useful to find financial information about publicly traded companies. example: “stocks:msft”

google images special syntax intitle: searches for images with a given string in the file name – example: “intitle:novosibirsk” inurl: searches for images in pages that have a certain url – example: “inurl:liu.edu” site: restricts the search to a certain site. It should be combined with a search term like – example “site:liu.edu koenig”

Please shutdown the computers when you are done. Thank you for your attention!