Enhancements in Query Evaluation and Page Summarization of The Thinking Algorithm

Presentation transcript:

1 Enhancements in Query Evaluation and Page Summarization of The Thinking Algorithm. M. Shoaib Jameel, Amar Akshat, Chingtham Tejbanta Singh. Department of Computer Science and Engineering, Sikkim Manipal Institute of Technology, India. ITSim’2008, Kuala Lumpur, Malaysia.

2 Discussion Flow: Introduction. Query Parsing. Page Summarization. URL Sorting. Results Evaluation Mechanism. Conclusion.

3 Introduction: The Thinking Algorithm. A web search engine algorithm. For every query, it tries to answer the question: “Sitting in a particular region, WHY have you entered such a query?”

4 Query Parsing: Determine the query context. Remove ambiguity. Determine the user’s competence from the query, e.g. [hacking] versus [how do I do hacking]. Compounded Uniqueness Level (C.U.L.): geo-location searching.
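The competence check on this slide can be illustrated with a minimal sketch. The slides do not say which signals the algorithm uses, so the cue phrases, the two labels, and the function name `estimate_competence` below are all assumptions made for illustration; the only grounded part is the contrast between the two example queries.

```python
import re

# Hypothetical cue phrases; the slides do not list the actual signals used.
NOVICE_CUES = ("how do i", "how to", "what is", "beginner", "tutorial")


def estimate_competence(query: str) -> str:
    """Guess the user's competence from the wording of the query alone.

    Assumption: a query phrased as a question suggests a less experienced
    user, while a terse keyword query suggests a more experienced one.
    """
    # Lowercase and collapse whitespace so the cue matching is stable.
    normalized = re.sub(r"\s+", " ", query.strip().lower())
    if any(cue in normalized for cue in NOVICE_CUES):
        return "novice"
    return "experienced"
```

Under these assumptions, [how do I do hacking] is labelled "novice" and [hacking] is labelled "experienced"; a real system would presumably combine many more signals.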

5 Page Summarization: PageTags and query expansion, e.g. {demo, evaluation} and {tutorials, courses}. Understanding the page format: amount of textual information, number of images. How rich a web page is in a particular context.
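As a rough illustration of the "page format" idea, the sketch below counts visible text characters and images in an HTML page using Python's standard `html.parser` module. The class name, the returned dictionary, and the choice to ignore script and style content are assumptions; the slides do not describe how the algorithm measures these quantities.

```python
from html.parser import HTMLParser


class PageFormatCounter(HTMLParser):
    """Count visible text characters and <img> tags (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.text_chars = 0
        self.image_count = 0
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            self.image_count += 1
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Only count text that a reader would actually see.
        if self._skip_depth == 0:
            self.text_chars += len(data.strip())


def summarize_format(html: str) -> dict:
    """Return the text and image counts for one HTML document."""
    counter = PageFormatCounter()
    counter.feed(html)
    counter.close()
    return {"text_chars": counter.text_chars, "image_count": counter.image_count}
```

Counts like these could feed a per-context richness score, but how the original algorithm weighs text against images is not stated in the slides.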

6 URL Page Sorting: Important feature: considers the user’s Internet connection speed. Queries targeted: [news], [download doom].
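One way to picture sorting by connection speed is to penalize each page's relevance by its estimated load time. This is only a sketch: the `Page` fields, the scoring formula, and the example URLs used in the test are hypothetical, since the slides state the feature but not how it is computed.

```python
from dataclasses import dataclass


@dataclass
class Page:
    url: str
    relevance: float  # hypothetical relevance score in [0, 1]
    size_bytes: int   # hypothetical total page weight


def load_seconds(page: Page, bandwidth_bps: float) -> float:
    """Estimated transfer time for the page at the given bandwidth."""
    return page.size_bytes * 8 / bandwidth_bps


def sort_for_connection(pages, bandwidth_bps: float):
    """Rank pages by relevance discounted by estimated load time.

    Assumed formula: score = relevance / (1 + load time in seconds).
    """
    def score(page: Page) -> float:
        return page.relevance / (1.0 + load_seconds(page, bandwidth_bps))

    return sorted(pages, key=score, reverse=True)
```

With this formula a slightly less relevant but much lighter page ranks first on a slow dial-up link, while the heavier, more relevant page ranks first on a fast link.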

7 Query Evaluation Mechanism: Query [download gcc]
Step 1: Parse the query.
Step 2: ‘download’ found, which conveys that the user wants to download something.
Step 3: Convert gcc to gcc.exe, gcc.tar.gz, gcc.zip, etc.
Step 4: Search the indexes of software download sites such as download.com, tucows.com, etc.
Step 5: Apply C.U.L.
Step 6: Sort pages according to the user’s Internet connection speed.
Step 7: Results.
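Steps 1 to 4 of this walkthrough can be sketched as a small query planner. The suffix list and the site list come from the slide; the function name `plan_query`, the dictionary it returns, and the whitespace-based parsing are assumptions. Steps 5 to 7 are left out because the slides do not describe them in enough detail.

```python
# File suffixes named in Step 3 of the slide.
ARCHIVE_SUFFIXES = (".exe", ".tar.gz", ".zip")
# Download sites named in Step 4 of the slide.
DOWNLOAD_SITES = ("download.com", "tucows.com")


def plan_query(query: str) -> dict:
    """Build a hypothetical search plan for a query.

    Step 1: split the query into terms.
    Step 2: detect the 'download' keyword.
    Step 3: expand the remaining terms into candidate file names.
    Step 4: restrict the search to software download sites.
    """
    terms = query.lower().split()
    if "download" not in terms:
        # No download intent detected; leave the query untouched.
        return {"intent": "general", "terms": terms, "sites": ()}

    targets = [term for term in terms if term != "download"]
    expanded = [target + suffix for target in targets for suffix in ARCHIVE_SUFFIXES]
    return {"intent": "download", "terms": expanded, "sites": DOWNLOAD_SITES}
```

For [download gcc] this plan carries the terms gcc.exe, gcc.tar.gz, and gcc.zip, restricted to the two listed sites; a query such as [news] falls through unchanged.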

8 Conclusion: A unique algorithm that considers human factors, especially user competency. Addresses some major issues in search, such as geo-location searches and the WHY factor. Future of search technology.

9 Thank You