Presentation is loading. Please wait.

Presentation is loading. Please wait.

Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg Presented By: Talin Kevorkian Summer 2010 18 June 20151.

Similar presentations


Presentation on theme: "Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg Presented By: Talin Kevorkian Summer 2010 18 June 20151."— Presentation transcript:

1 Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg Presented By: Talin Kevorkian Summer 2010 18 June 20151

2 Overview Why Do We Care? Introduction Information Objective Approaches and Observed Results Related Work Generalization Conclusion Evaluation of Pros and Cons 18 June 20152 Authoritative Sources in a Hyperlinked Environment

3 Why Do We care? Complexity of WWW as a Hypertext Corpus Nature of the Hyperlinked Environment Structure Efficiency (Longer Response Time) and Storage Problems Because of Huge Amount of Results Return to the User 18 June 20153 Authoritative Sources in a Hyperlinked Environment

4 Introduction Information Query Types  Specific E.g. ”Does Windows 7 Support Oracle 10g?” Scarcity Problem  Broad-Topic E.g. “Sql Programming Language ” Abundance Problem Authority Notion  Similar-Page  E.g. “Similar Pages to Oracle.com” 18 June 20154 Authoritative Sources in a Hyperlinked Environment

5 Introduction Information Link-Based Model  Encoding latent human judgment Conferred Authority  Creating Balance Between Popularity and Relevance  Relation Between Authority and Hubs 18 June 20155 Authoritative Sources in a Hyperlinked Environment

6 Objective Presenting the Link-Based Model for the Conferral Authority Exploring Authoritative WWW Sources in the Global Range 18 June 20156 Authoritative Sources in a Hyperlinked Environment

7 Approaches and Observed Results Focused Subgraph Algorithm for WWW Authorities and Hubs Computation Approach for Similar-Page Queries Sample Observed Results 18 June 20157 Authoritative Sources in a Hyperlinked Environment

8 Focused Subgraph Algorithm for WWW Inputs:  Query String σ  Text-based Search Engine Outputs:  Set of Hyperlinked Pages as a Directed Graph G(V,E)  Root Set Rσ  Sub Set Sσ Almost Small in size Containing Most of Relevant Pages Covering Most of the Strongest Authorities Links Type in G[Sσ]  Transverse  Intrinsic 18 June 2015 8 Authoritative Sources in a Hyperlinked Environment

9 Authorities and Hubs Computation Solution to the approach of Ordering Pages by Their In-degree  Confusion Between Strong “Authorities” and “Universally Popular“ Pages Containing Mutually Reinforcing Relationship Concept 18 June 2015 9 Authoritative Sources in a Hyperlinked Environment

10 Authorities and Hubs Computation Iterate Algorithm  Input: Set of n linked pages G σ  Outputs: Updated Authority Weight (thru operation I) Updated Hub Weight (thru Operation O) Filter Algorithm  Input: Set of n linked pages G σ  Outputs: Reporting Pages with Top c Authorities Reporting Pages with Top c Hubs 18 June 201510 Authoritative Sources in a Hyperlinked Environment

11 Approach for Similar-Page Queries First Step: What Do Users of the WWW Decide to be Related to a Page When They Create any Pages and Hyperlinks Second Step: Applying Link Structure to the Concept of “Similarity” Third Step: Using concept of Authorities and Hubs 18 June 201511 Authoritative Sources in a Hyperlinked Environment

12 Sample Observed Results ( For Broad-Specific Queries) Query StringAuthoritiesDescription “Search Engine”.346 http://www.yahoo.com/.291 http://www.excite.com/.239 http://www.mckinley.com/.231 http://www.lycos.com/.231 http://www.altavista.digital.com/ Yahoo! Excite Welcome to Magellan! Lycos Home Page AltaVista: Main Page “Gates”.643 http://www.roadahead.com/ Bill.458 http://www.microsoft.com/.440 http://www.microsoft.com/corpinfo/bill -g.htm Gates: The Road Ahead Welcome to Microsoft 18 June 2015 12 Authoritative Sources in a Hyperlinked Environment

13 Sample Observed Results (For Similar-Pages Queries) Query StringAuthoritiesDescription “www.honda.com”.202 http://www.toyota.com/.199 http://www.honda.com/.192 http://www.ford.com/.173 http://www.bmwusa.com/.162 http://www.volvocars.com/.158 http://www.saturncars.com/.155 http://www.nissanmotors.com/.145 http://www.audi.com/.139 http://www.4adodge.com/.136 http://www.chryslercars.com/ Welcome to @Toyota Honda Ford Motor Company BMW of North America, Inc. VOLVO Welcome to the Saturn Web Site NISSAN - ENJOY THE RIDE Audi Homepage 1997 Dodge Site Welcome to Chrysler 18 June 201513 Authoritative Sources in a Hyperlinked Environment

14 Related Work Link Structure is Related to: Definition of Standing, Impact and Influence Concepts WWW Ranking Techniques Data Clustering 18 June 201514 Authoritative Sources in a Hyperlinked Environment

15 Standing, Impact and Influence Concepts Social Network  Proposed Standing Measure Katz Theory: Based on Path-Counting Hubbell Theory : Based on Nodes Weight-Propagation Scientific Citations  Proposed Impact/Influence Measure Garfield’s Impact Theory Pinski-Narin Influence Theory 18 June 2015 15 Authoritative Sources in a Hyperlinked Environment

16 WWW Ranking TechniquesWWW Ranking Techniques Ranking Measure Proposal:  Botafogo-Rivlin-Shniderman Theory  Carriere-Kanzman Theory  Brin-Page Theory and Contrast with This Paper Approach 18 June 201516 Authoritative Sources in a Hyperlinked Environment

17 Data Clustering Clustering needs :  Similarity Functions Bibliographic Coupling Co-Citation  Cluster Producer Functions Small-Griffith Approach Dimension-Reduction Spectral Graph partitioning Centroid Scaling 18 June 201517 Authoritative Sources in a Hyperlinked Environment

18 Generalization Specific Queries  Diffusion Concept Set of Hubs and Authorities can be Separated from each other Because:  Query String has different Meaning like “Jaguar”  Query String is a Highly Polarized Subject Like “Abortion”  Query String can be Applied in Multiple Communities like “Randomized Algorithms” 18 June 201518 Authoritative Sources in a Hyperlinked Environment

19 Generaliztion Sample Results Query StringAuthoritiesDescription “Jaguar” 2nd non-principal vector, positive end.255 http://www.jaguarsnfl.com/.137 http://www.nando.net/SportServer/fo otball/nfl/jax.html 3rd non-principal vector, positive end.227 http://www.jaguarvehicles.com/.227 http://www.collection.co.uk/ Official Jacksonville Jaguars NFL Website Jacksonville Jaguars Home Page Jaguar Cars Global Home Page The Jaguar Collection 18 June 2015 19 Authoritative Sources in a Hyperlinked Environment

20 Conclusion Basic Elements of Paper Approach  Applying Notation of Authoritative Sources  Selecting High Quality of Results  Dealing with Scale Problem  Exploring Structure of Hubs and Authorities 18 June 201520 Authoritative Sources in a Hyperlinked Environment

21 Evaluation of Pros and Cons Pros:  Clearly Describe the Algorithms and Applied Approaches  Provide Tangible Examples and Results  Enough Connection to Related Works Cons:  Ignoring the Textual Contents of pages  Complexity in the Nature of Quality Judgment  Concentrating mostly on Broad-Topic Queries 18 June 201521 Authoritative Sources in a Hyperlinked Environment

22 Q & A 18 June 201522 Authoritative Sources in a Hyperlinked Environment


Download ppt "Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg Presented By: Talin Kevorkian Summer 2010 18 June 20151."

Similar presentations


Ads by Google