Published by Rafe Hines. Modified over 9 years ago.
1
Searching the Hidden Internet: When Search Engines Aren’t Enough
A Webcast Workshop
© 2000
2
Today’s Webcast Workshop
(sponsor, producer, and broadcaster logos not reproduced)
3
Today’s Presenters
Karen Hartman and Ernest Ackermann
Mary Washington College
4
Internet/Web Resources
Coauthored by Ernest Ackermann and Karen Hartman
5
Today’s Agenda
What is the Hidden Internet?
Why can’t popular search engines reach all the content on the Web?
What types of information are “hidden”?
How do we find this information?
6
Polling Interaction
What percentage of the publicly available Web is currently indexed by popular search engines?
A. About 2%
B. About 30%
C. Between 40-50%
D. More than 75%
7
Polling Interaction Answer
What percentage of the publicly available Web is currently indexed by popular search engines?
A. About 2%
B. About 30%
C. Between 40-50%
D. More than 75%
8
What is the Hidden Internet?
The Hidden, or Invisible, Web is the part of the Internet that is not indexed by the major search engines (e.g., AltaVista, Northern Light, Google, and Lycos).
This hidden content is usually contained in specialized databases that are linked to the Web.
9
Specialized Databases Provide Faster, More Reliable Results
Because these databases are smaller and contain selective information, the researcher can focus and limit a search by using particular criteria, such as date, language, and location.
Searching targeted content is more precise, so the results are more relevant.
Databases can provide very current information (often updated daily).
They tend to be efficient, reliable, and comprehensive sources for research.
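To illustrate what it means to focus and limit a search by criteria, here is a minimal Python sketch. The `Record` fields, the sample data, and the `search` function are all hypothetical; they do not describe the interface of any database named in this workshop.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Record:
    """One entry in a small, hypothetical specialized database."""
    title: str
    published: date
    language: str
    location: str


# Hypothetical sample data, invented for this illustration only.
SAMPLE = [
    Record("Health insurance coverage by state", date(2000, 3, 1), "en", "Virginia"),
    Record("Health insurance trends", date(1998, 5, 10), "en", "Virginia"),
    Record("Health insurance guide (Spanish edition)", date(2000, 6, 15), "es", "Virginia"),
    Record("Bankruptcy law overview", date(2000, 2, 2), "en", "Virginia"),
]


def search(records, keyword, since=None, language=None, location=None):
    """Return records whose title contains the keyword and that pass
    every optional limit (date, language, location) that was supplied."""
    keyword = keyword.lower()
    hits = []
    for record in records:
        if keyword not in record.title.lower():
            continue
        if since is not None and record.published < since:
            continue
        if language is not None and record.language != language:
            continue
        if location is not None and record.location != location:
            continue
        hits.append(record)
    return hits
```

A keyword-only call such as `search(SAMPLE, "insurance")` matches three of the sample records; adding `since=date(2000, 1, 1)` and `language="en"` narrows that to one, which is the kind of focusing the slide describes.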
10
Why popular search engines can’t reach all the content on the Web
Content in databases that display pages dynamically in response to queries usually isn’t indexed by search engines (e.g., a HotBot search does not reach the contents of the Medline database).
Search engines tend to avoid indexing URLs that have dynamic components in them (for example, the ? or “cgi”).
Depth of indexing: search engine spiders are sometimes unable to index all pages of a site.
Indexing frequency: it can sometimes take months for a page to be indexed by a search engine, whereas a specialized database may update its pages daily.
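The rule about dynamic URL components can be sketched as a simple heuristic. This is an assumption about how such a check might look, not the logic of any actual search engine spider; the function name `looks_dynamic` and the example.com URLs are made up for illustration.

```python
from urllib.parse import urlsplit


def looks_dynamic(url: str) -> bool:
    """Heuristic: treat a URL as dynamic if it contains a '?'
    or if its path contains the string 'cgi' (e.g. /cgi-bin/)."""
    has_question_mark = "?" in url
    has_cgi = "cgi" in urlsplit(url).path.lower()
    return has_question_mark or has_cgi


# Hypothetical URLs, used only to exercise the heuristic.
for candidate in (
    "http://www.example.com/cgi-bin/search?term=aspirin",
    "http://www.example.com/cgi-bin/lookup",
    "http://www.bartleby.com/99/",
):
    verdict = "skip (dynamic)" if looks_dynamic(candidate) else "index"
    print(candidate, "->", verdict)
```

A spider applying a rule like this would skip the first two URLs and index the third, which is one reason content behind query forms stays hidden.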
11
Message Interaction
What’s something you have tried to search for on the Internet, but couldn’t ever find?
Elvis or the Holy Grail don’t count!
12
What Types of Information May Be Hidden from Search Engines?
Scholarly journal citations and abstracts
Library catalogs
Company financial reports
Multimedia files
Dynamically generated Web pages and interactive tools
Content of Adobe PDF and other formatted files
Digital collections
Content in sites that require a login and registration process (including proprietary databases)
13
Message Interaction
What are some databases and catalogs that you or your students use for research?
14
Databases and Library Catalogs
Medline (contains abstracts of the world’s premier medical literature)
ERIC (Educational Resources Information Center)
Hoover’s Online (company information)
Library of Congress Online Catalog
15
Dynamically Generated Web Content
Ticketmaster.com: search for events and buy concert tickets
Anywho.com: search for a person’s telephone number or email address
http://www.bartleby.com/99/: Bartlett’s Familiar Quotations
Bankrate.com: calculate mortgage rates, CD rates, auto loans, etc.
16
Polling Interaction
Project Gutenberg is a project in which volunteers choose and type entire books, turning them into e-texts. Books that are still under copyright cannot be published. This means that you won’t find books published after what date?
A. 1890
B. 1800
C. 1923
D. 1905
17
Polling Interaction Answer
From Project Gutenberg: “We cannot publish any texts still in copyright. This generally means that our texts are taken from books published pre-1923. It’s more complicated than that, as our Copyright Page explains, but 1923 is a good first rule-of-thumb for the U.S.A.”
A. 1890
B. 1800
C. 1923
D. 1905
18
Digital Collections
American Memory Collection
UC Berkeley’s Sunsite
The University of Virginia’s E-Text Collection
Project Gutenberg
19
Sites Used to Search for Images, Audio, PDF Documents, and Newsgroup Archives
Images: AltaVista Image Search
Audio: FindSounds.com
PDF Documents: Search Adobe PDF Online
Newsgroup Archives: Deja.com
20
Log In and Register at These Sites
New York Times
Thomas Register
BioMedNet Journal Collection
Medical Matrix
FastWeb
21
Polling Interaction
If you lived in Richmond, Virginia and needed a lawyer who spoke Spanish, which two tools listed below would be most helpful in your search?
A. Librarian’s Index to the Internet
B. HotBot
C. Google
D. The InvisibleWeb
22
Polling Interaction Answer
If you lived in Richmond, Virginia and needed a lawyer who spoke Spanish, which two tools listed below would be most helpful in your search?
A. Librarian’s Index to the Internet
B. HotBot
C. Google
D. The InvisibleWeb
23
How to Find Tools that Search the Hidden Internet
CiteLine Professional
Direct Search
Intelliseek’s InvisibleWeb
The Internet Public Library
Librarian’s Index to the Internet
Library Spot
The Scout Report
Webdata.com
24
Here’s an example
You need a lawyer in Richmond, Virginia who specializes in bankruptcy and who also speaks Spanish.
Keywords to use: lawyer, Richmond, Virginia, bankruptcy, Spanish
25
Try a popular search engine: AltaVista
Type in … lawyer richmond virginia bankruptcy spanish
26
Searching AltaVista without search features = 3.3 million results
27
Try narrowing your search: use specific search features
Place quotes around phrases
Place + before necessary terms
Type the search expression like this: +lawyer +"richmond virginia" +bankruptcy +spanish
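The two rules above (quote phrases, prefix required terms with +) can be applied mechanically. The following Python sketch assumes every term is required; `build_query` is a hypothetical helper written for this illustration, not part of AltaVista or any search engine’s interface.

```python
def build_query(required_terms):
    """Build a search expression in which every term is required (+)
    and multi-word phrases are wrapped in double quotes."""
    parts = []
    for term in required_terms:
        term = term.strip()
        if " " in term:
            # A phrase: quote it so the words are matched together.
            term = f'"{term}"'
        parts.append("+" + term)
    return " ".join(parts)


print(build_query(["lawyer", "richmond virginia", "bankruptcy", "spanish"]))
# prints: +lawyer +"richmond virginia" +bankruptcy +spanish
```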
28
Searching AltaVista with search features = 32 results (none appear to be relevant)
29
Try using a virtual library
Use the Librarian’s Index to the Internet to increase your chance of finding relevant sources.
30
Librarian’s Index to the Internet
31
Martindale-Hubbell Lawyer Locator: www.martindale.com/locator
32
Martindale-Hubbell’s Search Form
33
Results = 2
34
Try another example
You need to find reliable statistics on how the states rank by the number of people who don’t have health insurance.
35
Here’s what we tried
Start with a virtual library (we used Library Spot)
Look for a statistics category
Go to a comprehensive statistics site (we chose the University of Michigan)
Find the health insurance category
Search the databases listed (we chose the PBS program “The Uninsured in America”)
36
Where we found the statistics
Click on Mapping It Out: State Statistics
37
State Health Insurance Data (you must place the cursor over the state you are interested in)
38
Today’s Agenda
What is the Hidden Internet?
Why can’t popular search engines reach all the content on the Web?
What types of information are “hidden”?
How do we find this information?
39
Final Thoughts
40
Thank you for your participation!
Textbook resources: www.webliminal.com/ernie
Franklin, Beedle & Associates: www.fbeedle.com
A Webcast Workshop