Structure of The World Wide Web From “Networks, Crowds and Markets” Chapter 13 Eyal Feder Nov, 14.

Slides:



Advertisements
Similar presentations
Texts and Digital Objects What seems to have changed.
Advertisements

Imagining the Future. WORLD WIDE WEB Tim Berners-Lee invented the World Wide Web.World Wide Web A graduate of Oxford University, England, in 1989, Tim.
Internet Basics Instructors : Connie Hutchison & Christopher McCoy.
The internet. Background Created in 1969, connected computers at UCLA, Stanford Research Institute, U. of Utah, and UC at Santa Barbara With an estimated.
Computer Technology Timpview High School. A collection of local, regional, national, and international computer networks that are linked together to exchange.
Scale Free Networks.
Internet Technology Introduction Review the history of the Internet, Introducing Web Technology Web development Environment : Describe different HTML standards.
Databases vs the Internet Coconino Community College Revised August 2010.
Web as Network: A Case Study Networked Life CIS 112 Spring 2010 Prof. Michael Kearns.
 To publish information for global distribution, one needs a universally understood language, a kind of publishing mother tongue that all computers may.
HYPERMEDIA Chang-Yang Lin Eastern Kentucky University
Skills: none Concepts: history of the Web, Internet culture, the contributions of Vannevar Bush, JCR Licklider, Doug Engelbart, Tim Berners-Lee, evolution.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
T.Sharon-A.Frank 1 Multimedia Hypertext and Hypermedia.
Hypertext Computer Science 01i Introduction to the Internet Neal Sample 6 February 2001.
Ch. 13 Structure of the Web Padmini Srinivasan Computer Science Department Department of Management Sciences
Internet Basics.
1 Internet History Internet made up of thousands of networks worldwide No one in charge of Internet - No governing body Internet backbone owned by private.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
1 Networks and the Internet A network is a structure linking computers together for the purpose of sharing resources such as printers and files Users typically.
Lecturer: Ghadah Aldehim
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
 I believe that it is essential to go to the very beginning of how networking started to fully understand how we reached to today’s position.  “The.
Chapter 6 The World Wide Web. Web Pages Each page is an interactive multimedia publication It can include: text, graphics, music and videos Pages are.
Programming the Web Web = Computer Network + Hypertext.
The Internet and the World Wide Web. The Internet A Network is a collection of computers and devices that are connected together. The Internet is a worldwide.
How did the internet develop?. What is Internet? The internet is a network of computers linking many different types of computers all over the world.
The Internet & The Manager. Introduction Most companies realize that the Internet is here to stay. Business leaders realize that in order to maintain.
Microsoft Office XP Illustrated Introductory, Enhanced Started with Internet Explorer Getting.
The World Wide Web (abbreviated as WWW or W3 and commonly known as the Web) is a system of interlinked hypertext documents accessed via the Internet.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Introduction to HTML Tutorial 1 eXtensible Markup Language (XML)
HTML ~ Web Design.
The Internet and World Wide Web
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
Hypermedia Cooper and Davis. What Is Hypermedia?  The combination of text, video, graphic images, sound, hyperlinks, and other elements in the form typical.
Introducing the Internet and The Web Computer Concepts Unit A What Is Internet.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
1 UNIT 13 The World Wide Web Lecturer: Kholood Baselm.
Meet the web: First impressions How big is the web and how do you measure it? How many people use the web? How many use search engines? What is the shape.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
 History (WWW & Internet)  Search tools  Search Engines vs. Subject Directory  Meta search Engines  Steps for Searching  Effective Strategies.
The Structure of the Web. Getting to knowing the Web How big is the web and how do you measure it? How many people use the web? How many use search engines?
 A website, also written Web site, web site, or simply site, is a group of Web pages and related text, databases, graphics, audio, and video files that.
Introduction to Web Session 01 Subject: L0182 / Web & Animation Design Year: 2009.
and Internet Explorer.  The transmission of messages and files via a computer network  Messages can consist of simple text or can contain attachments,
PYP002 Intro.to Computer Science Brwosing the Web1 Browsing the Web Chapter 19.
The Internet. The Internet and Systems that Use It Internet –A group of computer networks that encircle the entire globe –Began in 1969 Protocol –Language.
1 UNIT 13 The World Wide Web. Introduction 2 The World Wide Web: ▫ Commonly referred to as WWW or the Web. ▫ Is a service on the Internet. It consists.
Tutorial 1 Getting Started with Adobe Dreamweaver CS5.
The World Wide Web.
Objectives Overview Identify the four categories of application software Describe characteristics of a user interface Identify the key features of widely.
Lecture 1: Introduction and Multimedia Data Representations
Evolution of Internet.
Browsing and Searching the Web
E-commerce | WWW World Wide Web - Concepts
E-commerce | WWW World Wide Web - Concepts
Microsoft Office Illustrated Introductory, Premium Edition
Tim Berners Lee By Jack Neus.
A Brief Introduction to the Internet
Navigating The World Wide Web
The Internet An Overview.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
A worldwide system of interconnected computer networks.
CS246: Web Characteristics
Browsing the Web Chapter 19 PYP002 Intro.to Computer Science
Presentation transcript:

Structure of The World Wide Web From “Networks, Crowds and Markets” Chapter 13 Eyal Feder Nov, 14

What Is the Web?

Not really The Web != Internet None of the are made of cats The World Wide Web is an application of the Internet

Information networks Vs. social networks The basic units connected (nodes) are pieces of information The edges symbolize some kind of connection between them Share a lot of the ideas mentioned in earlier sessions

Back to the web Created by Tim Burners-Lee A research project in at CERN An application of the internet Two basic features: Make documents on your computer publically accessible Easily access these documents using a browser

The first browser

Some are still there

The web as a network The nodes are documents (pages) The edges are links (figure 13.2) How do links work? Hypertext

Hypertext (The coolest thing about the web)

Different ways to manage information Alphabetically Hierarchy (like folders) Classification systems All of these have one thing in common Linearrrr

Earlier non linear connections Academic references (also in legal decisions and patents) Relevant to the web?

Earlier non linear connections Cross-reference encyclopedia (figure 13.4)

Memex Vannevar Bush, 1945 Article: “As We May Think” Our memory is not linear. Hypothetical model – the Memex Inspired the idea of hypertext

Introducing: Hypertext The ultimate reason text is blue. The ultimate reason text is blue Invented by Burners-Lee The way web pages are connected An associative way to organize information

Changes in the web over time

Static pages >> Query pages In the early days – static pages of contact Today? More and more transactional actions, which create query pages

Importance of static pages “The Backbone of the Internet” Reliable over time Include most links Navigational vs. transactional Our focus when thinking about structure

Time for math! (just a little bit, sorry…)

The web as a directed graph The best mathematical approximation – a graph Why directed?

What is a path in a directed graph? “A Path from node A to a node B in a directed graph is a sequence of nodes, beginig with A and ending with B, with the property that each consecutive pair of nodes in the sequence is connected by an edge pointing in the forward direction”

What is Strong Connectivity in a directed graph? “A directed graph is Strongly connected if there is a path from every node to every other node”

The Concept of Reachability Since connectivity does not describe all of the connections in a graph, we need another concept – Reachability Reachability describes the nodes that are reacheable from a certain node or vice versa How do we check this?

Strongly connected components Parts of a graph that have strong connectivity In other words – a group of nodes in which each node is reachable from all other nodes. Formal: We say that a strongly connected component (SCC) in a directed graph is a subset of the nodes such that: (i) every node in the subset has a path to every other; and (ii) the subset is not part of some larger set with the property that every node can reach every other.

How does all that help us understand the web? We can map reachability Using the super-graph

The Bow Tie Structure

History Short reminder – the Web is not the Internet! Created in 1999 by Andrei Broder and his colleagues Used data from biggest search engine back then – AltaVista. Afterwards – reevaluated many times

The bow tie structure

Why a giant component? Counter-intuative, ha? Let’s think probability

Different kinds of nodes In the SCC In the “inbound” part In the “outbound” part Tendrils Disconnected nodes

Limitations The bow-tie structure is a “mile high” view Not understanding the role of specific nodes (sites)

Web 2.0

What is web 2.0? A concept made popular by Tim O’railey in 2004 Basically – the web’s move towards a “Prosumer” crowd Three main charachteristics: (i) the growth of Web authoring styles that enabled many people to collectively create and maintain shared content; (ii) the movement of people’s personal on-line data (including , calendars, photos, and videos) from their own computers to services offered and hosted by large companies; (iii) the growth of linking styles that emphasize on-line connections between people, not just between documents.

Different implications of web 2.0 “Software that gets better as more people use it” “The wisdom of the crowds” “The Long Tail”

A little bit more a bout the structure of the web From: Albert R., Jeong H, & Barabasi A. - Diameter of the World Wide Web (2000)

About the research Trying to map reachability on the web Their main finding – the probability of a node to have k links (inbound and out) follow a power law Meaning – the web is a Small World Graph, typically found in biological and social networks This was proven more by the short path research