J Gevrey 2001: Hubs and Authorities
Home
We assess a family of ranking mechanisms for search engines based on
linkage analysis using a carefully engineered subset of the World Wide
Web, WT10g, and a set of relevance judgements for 50 different queries
from Trec-9 to evaluate the performance of several link-based ranking
techniques.
Among these link-based algorithms, Kleinberg's HITS and Larry Page and
Sergey Brin's PageRank are assessed. Link analysis seems to yield poor
results in Trec's Web Ad Hoc Task. We suggest some alternative algorithms
which reuse both text-based search similarity measures and linkage
analysis. Although these algorithms yield better results, improving
text-only search recall-precision curves in the Web Ad Hoc Task remains
elusive; only a certain category of queries seems to benefit from linkage
analysis. Among these queries, homepage searches may be good candidates.