Download presentation
Presentation is loading. Please wait.
Published byNoel Warner Modified over 9 years ago
5
A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of information in order to find what it is that you are looking for. “Search engine” is the popular term for an Information Retrieval (IR) system.
6
Search engines match queries against an index that they create. The index consists of the words in each document, plus pointers to their locations within the documents. This is called an inverted file.
7
A search engine or IR system comprises four essential modules: A document processor A query processor A search and matching function A ranking capability
9
Archie -> First search tool for the Internet Gopher -> Indexed plain text documents Jughead -> Searched the files stored in Gopher index Wandex -> First Web search engine systems
10
Alan Emtage created the first ever search engine called Archie in 1990. Archie collects information by crawling the internet. It matches the files it found with search queries and returned results from its database. Archie
11
It was introduced in 1990 It is the protocol system It allows text files to be hierarchically organized It is used to enable the simple sharing of documents with which could be located in any part of the world Gopher
12
Jughead It was developed by Rhett Jones in 1993 Jug head is the search engine system for the gopher protocol It searches the single server at a time It was officially released in 2006
13
Wandex It was the first search engine introduced in 1993 by Matthew Gray It was the first search engine that is usable for world wide web Wandex itself was based on the world wide web wanderer which merely listed websites
15
A document processor A query processor A search and matching function A ranking capability
16
The document processor prepares, processes, and inputs the documents, pages, or sites that users search against. The document processor performs some or all of the following steps: Document Processor
17
Normalizes the document stream to a predefined format. Breaks the document stream into desired retrievable units. Isolates and metatags subdocument pieces. Identifies potential index able elements in documents.
18
Deletes stop words. Stems terms. Extracts index entries. Computes weights. Creates and updates the main inverted file against which the search engine searches in order to match queries to documents.
19
Query Processor Query processing has seven possible steps. System can cut these steps short and proceed to match the query to the inverted file. Document processing shares many steps with query processing.
20
More steps and more documents make the process more expensive for processing Search system designers must choose what is most important to their users, time or quality. Publicly available search engines usually choose time over very high quality, having too many documents to search against.
21
Step 1: Tokenizing Step 2: Parsing Step 3: STOPLIST Step 4: Stemming Step 5: Creating the query Step 6: Query expansion Step 7: Query term weighting
22
Search and matching function Searching the inverted file for documents meeting the query requirements It is the simple binary search It follows the simple document representation, query representation and algorithm After computing the similarity of each document in the subset of documents, the system presents an ordered list to the user
24
Crawler - Based Search Engines Human - Powered Directories Hybrid Search Engines " Or Mixed Results
26
Spid ers Robo ts
32
It is easy and quick It is a quick access to information Save of time It help to organize internet and individual websites
34
The search system development or selection should be based primarily on the local needs Consider using freeware search engines, if your requirements are met by these. Consider if the web server you are using supports indexing and search, and if this is adequate for you
35
The IT Professionals should make an effort to keep themselves abreast of the current web technologies The features available within a tool should be made use of properly to get maximum benefits
37
Search engine plays important role in accessing the content over the internet, it fetches the pages requested by the user. It made the internet and accessing the information just a click away.
38
The need for better search engines only increases The search engine sites are among the most popular websites.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.