Download presentation
Presentation is loading. Please wait.
Published byCora Lee Modified over 9 years ago
1
Intelligent Database Systems Lab Presenter: CHANG, SHIH-JIE Authors: Longzhuang Li, Yi Shang, Wei Zhang 2002.ACM. Improvement of HITS-based Algorithms on Web Documents
2
Intelligent Database Systems Lab Outlines Motivation Objectives Methodology Experiments Conclusions Comments
3
Intelligent Database Systems Lab Motivation Content analysis usually takes a long time, and it is almost impossible to get users' feedback or visiting times for most Web documents.
4
Intelligent Database Systems Lab Objectives Present two ways to improve the precision of HITS-based algorithms on Web documents.
5
Intelligent Database Systems Lab Methodology – HITS algorithm limit authority hub New weighted HITS-BASED algorithm
6
Intelligent Database Systems Lab Methodology – HITS algorithm limit
7
Intelligent Database Systems Lab Methodology – Vector Space Model(VSM) Inner Product Weight a query q document Xi Vector
8
Intelligent Database Systems Lab Methodology – Vector Space Model(VSM) coverage of Google
9
Intelligent Database Systems Lab Methodology – Okapi Similarity Measurement(Okapi)
10
Intelligent Database Systems Lab Methodology – Cover Density Ranking (CDR ) In CDR, the results of phrase queries are ranked in two steps: The score of the cover set
11
Intelligent Database Systems Lab Methodology – Three-Level Scoring Method (TLS) Compute the relevance of a Web page to a query two steps: (1) (2)
12
Intelligent Database Systems Lab Experiments
13
Intelligent Database Systems Lab Experiments
14
Intelligent Database Systems Lab Experiments
15
Intelligent Database Systems Lab Conclusions The weighted HITS-based method performs better than Bharat's improved HITS algorithm.
16
Intelligent Database Systems Lab Comments Advantages - Effective. Applications - Information retrieval 、 Rank web pages.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.