CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2007.

Slides:



Advertisements
Similar presentations
The Inside Story Christine Reilly CSCI 6175 September 27, 2011.
Advertisements

Matrices, Digraphs, Markov Chains & Their Use by Google Leslie Hogben Iowa State University and American Institute of Mathematics Leslie Hogben Iowa State.
Information Retrieval Lecture 8 Introduction to Information Retrieval (Manning et al. 2007) Chapter 19 For the MSc Computer Science Programme Dell Zhang.
“ The Anatomy of a Large-Scale Hypertextual Web Search Engine ” Presented by Ahmed Khaled Al-Shantout ICS
 How many pages does it search?  How does it access all those pages?  How does it give us an answer so quickly?  How does it give us such accurate.
1 CS 430 / INFO 430: Information Retrieval Lecture 16 Web Search 2.
Architecture of the 1st Google Search Engine SEARCHER URL SERVER CRAWLERS STORE SERVER REPOSITORY INDEXER D UMP L EXICON SORTERS ANCHORS URL RESOLVER (CF.
Presentation of Anatomy of a Large-Scale Hypertextual Web Search Engine by Sergey Brin and Lawrence Page (1997) Presenter: Scott White.
The PageRank Citation Ranking “Bringing Order to the Web”
Anatomy of a Large-Scale Hypertextual Web Search Engine (e.g. Google)
Web Search – Summer Term 2006 III. Web Search - Introduction (Cont.) - Jeff Dean, Google's Systems Lab:
The Anatomy of a Large-Scale Hypertextual Web Search Engine Sergey Brin and Lawrence Page Distributed Systems - Presentation 6/3/2002 Nancy Alexopoulou.
Search engines fdm 20c introduction to digital media lecture warren sack / film & digital media department / university of california, santa.
Web Intelligence Search and Ranking. Today The anatomy of search engines (read it yourself) The key design goal(s) for search engines Why google is good:
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα
How Search Engines Work. Any ideas? Building an index Dan taylor Flickr Creative Commons.
CC P ROCESAMIENTO M ASIVO DE D ATOS O TOÑO 2015 Lecture 8: Information Retrieval II Aidan Hogan
The Search Engine Landscape: 2010 How Users Interact with Engines & How the Search Engines Crawl, Index & Rank Pages Rand Fishkin CEO & Co-Founder: SEOmoz.
Homework 4 Final homework Deadline: Sunday April 20, PM In this homework you have to write a short essay on how Google can handle new types of data.
Information Retrieval in Folksonomies Nikos Sarkas Social Information Systems Seminar DCS, University of Toronto, Winter 2007.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
The Business Model and Strategy of MBAA 609 R. Nakatsu.
1 University of Qom Information Retrieval Course Web Search (Link Analysis) Based on:
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Gregor Gisler-Merz How to hit in google The anatomy of a modern web search engine.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The College of Saint Rose CSC 460 / CIS 560 – Search and Information Retrieval David Goldschmidt, Ph.D. from Search Engines: Information Retrieval in Practice,
McLean HIGHER COMPUTER NETWORKING Lesson 7 Search engines Description of search engine methods.
Search Engine Optimization: A Survey of Current Best Practices Author - Niko Solihin Resource -Grand Valley State University April, 2013 Professor - Soe-Tsyr.
The Anatomy of a Large-Scale Hypertextual Web Search Engine Sergey Brin & Lawrence Page Presented by: Siddharth Sriram & Joseph Xavier Department of Electrical.
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2007.
The Anatomy of a Large-Scale Hypertextual Web Search Engine Kevin Mauricio Apaza Huaranca San Pablo Catholic University.
Web Search Algorithms By Matt Richard and Kyle Krueger.
The Business Model of Google MBAA 609 R. Nakatsu.
Search Engine Marketing SEM = Search Engine Marketing SEO = Search Engine Optimization optimizing (altering/changing) your page in order to get a higher.
Search Engines1 Searching the Web Web is vast. Information is scattered around and changing fast. Anyone can publish on the web. Two issues web users have.
Searching Tutorial By: Lola L. Introduction:  When you are using a topic, you might want to use “keyword topics.” Using this might help you find better.
Ranking CSCI 572: Information Retrieval and Search Engines Summer 2010.
Link Analysis Rong Jin. Web Structure  Web is a graph Each web site correspond to a node A link from one site to another site forms a directed edge 
Scribing Your responsibility to scribe at least one class (5 points of final grade!)
Ranking Link-based Ranking (2° generation) Reading 21.
Search Engine and SEO Presented by Yanni Li. Various Components of Search Engine.
Lawrence Snyder University of Washington, Seattle © Lawrence Snyder 2004.
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2007.
Information Retrieval and Web Search Link analysis Instructor: Rada Mihalcea (Note: This slide set was adapted from an IR course taught by Prof. Chris.
The World Wide Web: Information Resource. How a Search Engine works… How Search Works - YouTube
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2008.
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2008.
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2008.
The anatomy of a Large-Scale Hypertextual Web Search Engine.
WIRED Week 6 Syllabus Review Readings Overview Search Engine Optimization Assignment Overview & Scheduling Projects and/or Papers Discussion.
Search Engines Session 5 INST 301 Introduction to Information Science.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2008.
CS 440 Database Management Systems Web Data Management 1.
The Anatomy of a Large-Scale Hypertextual Web Search Engine (The creation of Google)
Traffic Source Tell a Friend Send SMS Social Network Group chat Banners Advertisement.
Presented By: Carlton Northern and Jeffrey Shipman The Anatomy of a Large-Scale Hyper-Textural Web Search Engine By Lawrence Page and Sergey Brin (1998)
1 CS 430 / INFO 430: Information Retrieval Lecture 20 Web Search 2.
DATA MINING Introductory and Advanced Topics Part III – Web Mining
Aidan Hogan CC Procesamiento Masivo de Datos Otoño 2017 Lecture 7: Information Retrieval II Aidan Hogan
Prepared by Rao Umar Anwar For Detail information Visit my blog:
Aidan Hogan CC Procesamiento Masivo de Datos Otoño 2018 Lecture 7 Information Retrieval: Ranking Aidan Hogan
The Anatomy of a Large-Scale Hypertextual Web Search Engine
Information Retrieval
CS 440 Database Management Systems
CPS 49S Google: The Computer Science Within and its Impact on Society
Information Retrieval and Web Design
Presentation transcript:

CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2007

Discussion Format Talk for minutes –Give an overview Give an outline the discussion points that you have come up with Need a scribe –Volunteer?

Note Make it a habit to check the course web page daily for: –Updated notes (presentation, discussion report, and scribe notes) –Current and future schedule –Announcements

Introduction Let us look at some numbers –From the paper –From searchenginewatch.com

Introduction (contd.) Terms –HTML (look at the HTML for the class web page), Hypertext, link/hyperlink, inlink, outlink, anchor text, link graph –Search engine, meta search engine –Information retrieval, crawl, index Terms that we will discuss later –PageRank, proximity, barrel, …

Discussion Points Motivation for Google –Human-maintained lists –Keyword matching only –Advertising --- conflict of interest

Discussion Points Design Goal #1: High-quality search results –Hypertext –Proximity –PageRank Design Goal #2: Good performance Design Goal #3: Support for research activities

Next Problem: User types in a keyword-based search query. We have to (i) find result pages to answer this query, and (ii) rank these result pages –Proximity of terms –Anchor text –PageRank

Proximity Of terms on a web page E.g., phrases E.g., “anatomy”, “search”, “anatomy search” E.g., “google freshman seminar duke” Other examples?

Anchor text Text around the link Often accurate and concise description of page May have terms that the page does not contain –“search engine” –Other examples? Can return pages that have not been crawled

PageRank First cut: count inlinks Basic idea --- “recursive” counting Interpretation based on probability Demo

Assigned Readings For Tue (1/23) –Continuation of the anatomy paper –Paper on “Taxonomy of Web Search”