Web Information retrieval

Slides:



Advertisements
Similar presentations
Imagining the Future. WORLD WIDE WEB Tim Berners-Lee invented the World Wide Web.World Wide Web A graduate of Oxford University, England, in 1989, Tim.
Advertisements

The Internet and the Web
4.01 How Web Pages Work.
The Client-Server Model for the Web 1. A Web Client (usually in the form of a web browser) makes an HTTP request to a specific web server. 2. The Web Server.
 To publish information for global distribution, one needs a universally understood language, a kind of publishing mother tongue that all computers may.
Internet – Part II. What is the World Wide Web? The World Wide Web is a collection of host machines, which deliver documents, graphics and multi-media.
UNIFORM RESOURCE LOCATOR (URL)
1 Internet History Internet made up of thousands of networks worldwide No one in charge of Internet - No governing body Internet backbone owned by private.
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
HINARI/Basic Internet Concepts (module 1.1). Instructions - This part of the:  course is a PowerPoint demonstration intended to introduce you to Basic.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
Website Development. Internet Internet (Net) – Global network connecting millions of computers. Network – Group of computers and associated devices that.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
 The internet is the hardware that creates the massive worldwide network. Computers, cables, telephone wires, high-speed communication lines. The internet.
Networks QUME 185 Introduction to Computer Applications.
 Internet network of connected computer › No company owns the internet › Purpose is to share information  World Wide Web (Web) one of the ways information.
Lecture 7 World Wide Web CSCS100 – Fall 2009 – Forman Christian College Asher Imtiaz *Several of these slides have been adapted and modified from VU CS101.
HTML ~ Web Design.
Objective Understand concepts used to web-based digital media. Course Weight : 5%
MySQL and PHP Internet and WWW. Computer Basics A Single Computer.
Introduction to the Internet and World Wide Web. The Internet n A network of networks n Began in 1969 as ARPAnet (Advanced Research Projects Agency) n.
Introduction to the Internet and HTML. Objectives Students develop an understanding of the origins of the internet Students will be able to identify the.
Overview of HTML and XML. Contents n History n Usage n Examples n Advantages n Disadvantages.
1 MSCS 237 Overview of web technologies (A specific type of distributed systems)
Introduction to HTML. Today’s Discussion What is HTML ? What is HTML ? What is Web Page ? What is Web Page ? Web Server Web Server Web Browser Web Browser.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
OWL Representing Information Using the Web Ontology Language.
Internet Network of networks Mother of all networks
 A website, also written Web site, web site, or simply site, is a group of Web pages and related text, databases, graphics, audio, and video files that.
Web Design. What is the Internet? A worldwide collection of computer networks that links millions of computers by – Businesses (.com.net) – the government.
Internet Applications (Cont’d) Basic Internet Applications – World Wide Web (WWW) Browser Architecture Static Documents Dynamic Documents Active Documents.
The Internet is a Big Collection of Computers and Cables. -"interconnection of computer networks". Millions of personal, business, and governmental.
JavaScript and Ajax (Internet Background) Week 1 Web site:
Introduction to the World Wide Web & Internet CIS 101.
The Internet What is the Internet? The Internet is a lot of computers over the whole world connected together so that they can share information. It.
CSE541: Web Applications Special Thanks to M. Abdur Rahman.
 The web is referred to as a “massive collection of web pages stored on millions of computers across the world that are linked by the Internet” (Chowdhury,
World Wide Web. The World Wide Web is a system of interlinked hypertext documents accessed via the Internet The World Wide Web is a system of interlinked.
HTML PROJECT #1 Project 1 Introduction to HTML. HTML Project 1: Introduction to HTML 2 Project Objectives 1.Describe the Internet and its associated key.
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
4.01 How Web Pages Work.
The World Wide Web.
4.01 How Web Pages Work.
Web Programming Language
Introduction to the Internet and World Wide Web
Introduction to the Web
JavaScript and Ajax (Internet Background)
Chapter 1 Introduction to HTML.
CNIT 131 Internet Basics & Beginning HTML
Introduction to Web Pages
E-commerce | WWW World Wide Web - Concepts
Introduction to Computers
Project 1 Introduction to HTML.
E-commerce | WWW World Wide Web - Concepts
CASE STUDY -HTML,URLs,HTTP
Some Common Terms The Internet is a network of computers spanning the globe. It is also called the World Wide Web. World Wide Web It is a collection of.
A Brief History of the Internet
A Brief Introduction to the Internet
Who is Tim Berners-Lee? Born 8 June 1955 in London (age 59)
HTML History CS 101.
1 Introduction to the Internet.
The Internet and Electronic Mail
Introduction to the Internet and World Wide Web
All About the Internet.
Introduction to Web Application Design
Introduction to World Wide Web
An Introduction to HTML Pages
Unit-3.
4.01 How Web Pages Work.
4.01 How Web Pages Work.
Presentation transcript:

Web Information retrieval

Introduction The web is referred to as a “massive collection of web pages stored on millions of computers across the world that are linked by the Internet” (Chowdhury, 2010, p. 381) It was created in 1989 by Tim Berners-Lee and his team of scientists at the European Laboratory for Particle Physics in Geneva. The Hyper Text Transfer Protocol (HTTP) was created to standardize communication between clients and servers used by the web Mosaic was the first web browser created for the web in 1993 at the US National Center for Supercomputing Applications. This was followed by the Netscape Navigator and the Internet Explorer. Today there are several browsers , Firefox, Chrome, Safari etc. The web has grown exponentially from over 9 million websites in 2002 to over 1 billion in 2014. Today the no. Of indexed pages is 4.71 billion.

Differences between traditional and web retrieval Distributed nature of web- Web resources are distributed on millions of computers throughout the world with different architecture, software, and standards. Text retrieval systems deals with a set of documents, and specified set of standards such as hardware, software, and processing standards, (eg. MARC formats and OPAC). Size and growth of the web – The rapid growth of the web makes indexing and retrieval complex and difficult. Traditional text retrieval systems are amenable to research and testing for eventual handling of large volumes of data

Differences between traditional and web retrieval Deep versus the surface web – surface web is accessible by all, deep web is larger, inaccessible, and password protected, requires authorization or use of a specified program Type and format of documents - The web has a variety of data and documents, eg. Text and multimedia resources. Text retrieval deals with text only

Differences between traditional and web retrieval Quality of information- Quality of web information is uncertain since anyone can publish on the web. Text retrieval system comprise published information resources with definite quality control. Frequency of changes- Web information changes frequently. Contents of text retrieval systems are static and thus easy to track and retrieved by a retrieval system. Ownership – ownership of web resources varies, some are free, others require permission or access rights, posing a challenge to retrieval

Differences between traditional and web retrieval Distributed users – Unlike users of the web, text retrieval systems know the nature, characteristics, information needs and seeking behaviours of their users posing a challenge to the designer of a web information retrieval system Multiple languages- language of both information resources and users are diverse posing a challenge. An ideal IRS must be able to retrieve required information irrespective of language of the query or the source of information. Resource requirements- The astronomical size of the web makes it difficult for it to run effectively and efficiently, and also be funded by a single body although the world desires a good IRS to access the web information resources.

Assignment Make notes on issues and challenges of web information retrieval (Chowdhury, 2010, pp. 385-386.