When Google Isn’t Enough! Finding Information on the Invisible Web Yaacov Taube


What is the Visible (Surface) Web? “It’s made up of HTML Web pages that the search engines have chosen to include in their indices. It’s no more complicated than that.” Sherman and Price.

What is the Visible (Surface) Web?
A collection of webpages
Searchable with “search engines”
What you and I think of as the “Internet” is actually only a small portion of the Internet

What is the Visible (Surface) Web?
High volume
Mass appeal
High value
Small percentage of web content
–Exception: Google Books and Google Scholar

What is the Invisible Web?
What search engines do not search
Searchable Databases
–Tens of Thousands
–Accessible and searchable via the Internet
–Results often dynamically generated in specific response to your request (eBay, MapQuest, etc.)

What is the Invisible Web?
Excluded Pages
–Excluded per search engine
–Excluded per webpage by the owner of the site
Typically databases
–Businesses
–Governments
–Schools
–Libraries
–Associations

What is the Invisible Web?
Academic
Never been indexed or linked
Uniquely generated pages
Proprietary
Confidential
Protected by username & password
Constitutes the majority of the webpages on the Internet

Visible vs. Invisible Web
The Invisible Web is about 550 times larger than the visible web and is growing much faster
The deep Web consists of about 91,000 terabytes
The surface Web is only about 167 terabytes
The Library of Congress contains about 11 terabytes
Quality content is 1,000 to 2,000 times greater than surface web
95% of the Deep Web is accessible to public (no fees or subscription required)
Based on extrapolations from a study done at University of California, Berkeley
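
As a rough check on the figures above, the size ratio can be recomputed from the two terabyte estimates quoted on the slide. This is only a sketch: the constant and variable names are illustrative, and the numbers are the slide's estimates, not new measurements.

```python
# Size estimates quoted on the slide, in terabytes.
DEEP_WEB_TB = 91_000
SURFACE_WEB_TB = 167
LIBRARY_OF_CONGRESS_TB = 11

# How many times larger the deep web is than each reference point.
deep_to_surface = DEEP_WEB_TB / SURFACE_WEB_TB
deep_to_loc = DEEP_WEB_TB / LIBRARY_OF_CONGRESS_TB

print(f"Deep web vs. surface web: about {deep_to_surface:.0f}x")
print(f"Deep web vs. Library of Congress: about {deep_to_loc:.0f}x")
```

The first ratio comes out near 545, which is consistent with the "about 550 times" claim on the slide.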

What is on the Invisible Web
Opaque Web
Private Web
Proprietary Web
Pay per click

Why can’t Google find it?
Requires payment
Requires registration
Dynamically generated
Very new
Website specifically stops spiders

Opaque Web
Fixed, or could be indexed, but is not
Deemed not important enough
Too new and therefore not linked
Never makes max results cutoff
No one ever linked or submitted URL

Private Web
Deliberately excluded
–Password
–Special coding in website stops spiders
Only for select individuals
–Employees
–Students
–Researchers
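
One common form of the "special coding" that stops spiders is a robots.txt file, which well-behaved crawlers read before fetching pages. The sketch below uses Python's standard `urllib.robotparser` to show how a crawler might interpret such a file. The robots.txt content, the `ExampleBot` name, and the example.org URLs are all made up for illustration; the slides do not say which mechanism any particular site uses.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: every crawler is asked to stay out of /members/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /members/
"""


def build_parser(robots_txt):
    """Parse robots.txt text without fetching anything from the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A page outside the excluded path may be crawled (surface web).
public_ok = parser.can_fetch("ExampleBot", "https://example.org/index.html")
# A page under /members/ may not be crawled, so it stays invisible.
private_ok = parser.can_fetch("ExampleBot", "https://example.org/members/roster.html")

print("index.html crawlable:", public_ok)
print("members/roster.html crawlable:", private_ok)
```

Note that robots.txt is only a request to crawlers. Password protection, the other item on the slide, is what actually restricts access.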

Proprietary Web
Protected
–Password
–Registration (N.Y. Times, eBay, banks, etc.)
–Terms of Use
Anyone can access if you
–Pay
–Register
–Agree to terms

Pay per click
Search Engine Marketing tools
Ex: overture.com, FindWhat.com

When do I use ….
Portal or Directory?
Search Engine?
Invisible Web?

Portal or Directory
You have a general topic
You know little about the subject
You do not know keywords
You want someone or something to have sorted out the junk
You need an exploratory overview

Search Engine
You are looking for something specific
You have keywords
You are pretty sure the information is
–advertised or
–otherwise generally disseminated

Tips for search engines
Use a toolbar
Determine the key words/phrases most likely to be in your document and nowhere else
Learn and use Boolean Operators
Scan results
Question the results
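
To make the Boolean operators tip concrete, here is a minimal sketch of how AND, OR, and NOT narrow or widen a result set. The three tiny "documents" and the `matching` helper are invented for illustration. Real search engines use their own query syntax and ranking, so treat this only as a model of the logic.

```python
# Hypothetical mini-collection standing in for indexed web pages.
DOCS = {
    "doc1": "deep web databases and search engines",
    "doc2": "search engines index the surface web",
    "doc3": "library databases require a subscription",
}


def matching(term):
    """Return the names of the documents that contain the exact word."""
    return {name for name, text in DOCS.items() if term in text.split()}


# databases AND search: both words must appear (narrows the results).
and_hits = matching("databases") & matching("search")
# databases OR surface: either word may appear (widens the results).
or_hits = matching("databases") | matching("surface")
# search NOT databases: the first word without the second (excludes results).
not_hits = matching("search") - matching("databases")

print("AND:", sorted(and_hits))
print("OR: ", sorted(or_hits))
print("NOT:", sorted(not_hits))
```

AND and NOT shrink the result list, while OR grows it, which is why combining them helps target words likely to be in your document and nowhere else.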

Invisible Web
You are pretty sure the information is in a specific database
Need something authoritative
Speed
The information is dynamically generated
You are familiar with the database
–Search techniques
–Protocols
–Access requirements

Searching the Invisible Web
Directories – subject guide compiled by human editors
Specialized Search Engines
Special Databases (Library of Congress, LookSmart’s Find Articles (over 900 publications), National Science Digital Library, Singing Fish – audio and video)

Special Databases
Library of Congress
LookSmart’s Find Articles (over 900 publications)
National Science Digital Library
Singing Fish – audio and video

Types of Databases
Information stored in tables (Access, Oracle, SQL Server, DB2) and accessible only by query.
Examples:
–Phone books, People finders, Patents, laws
–Items for sale in a Web store or Web-based auctions
–Digital exhibits
–Multimedia and graphical files
–Stock and bond prices
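
The sketch below illustrates why table-based content is reachable only by query: the result page does not exist until someone submits a search, so a crawler following links has nothing to find. It uses Python's built-in `sqlite3` as a stand-in for the database products named above. The `patents` table, its rows, and the `search_page` function are hypothetical.

```python
import sqlite3

# In-memory database standing in for a site's back-end tables.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE patents (number TEXT, title TEXT)")
conn.executemany(
    "INSERT INTO patents VALUES (?, ?)",
    [
        ("P-001", "Folding bicycle frame"),
        ("P-002", "Solar water heater"),
        ("P-003", "Bicycle gear shifter"),
    ],
)


def search_page(keyword):
    """Build a result page on demand for one visitor's query."""
    rows = conn.execute(
        "SELECT number, title FROM patents WHERE title LIKE ? ORDER BY number",
        (f"%{keyword}%",),
    ).fetchall()
    lines = [f"Results for '{keyword}':"]
    lines += [f"{number}: {title}" for number, title in rows]
    return "\n".join(lines)


# The page text is generated only when this query is made.
page = search_page("bicycle")
print(page)
```

Because the page is assembled per request, a search engine would have to guess and submit the query itself to see these records.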

Types of Hidden Info
Pages in searchable databases: medical (WebMD.com), patent, scientific, legal (Lexis and Westlaw), reference
Pages requiring login or registration: Social Sites, New York Times, web based applications, calendars, Google Docs, etc.
Government publications or databases: ERIC, usa.gov
Online databases: Gale Research
PDF files, audio, video, any new format

More hidden stuff
Dictionaries and thesauri
Sites that require forms to be filled out (ex: travel directions, job hunting)
Product catalogs and library catalogs
Newspaper and magazine archives
Dynamic web pages (ex: airline flight checkers, MapQuest)
Interactive tools (ex: calculators & measurement converters)

Access to invisible web is improving …
Google Books
Google Scholar

Maybe Consider …
Specialized Databases such as Dialog, LexisNexis, Factiva, etc. (not cheap)
Use an Information Professional

To Conclude …
Focus on what you do best and what you have been trained for, and let an Information Professional find the info you need. He is trained to do it faster and more effectively than you or one of your employees.