To read the full version of this content please select one of the options below:

New versions of PageRank employing alternative Web document models

Mike Thelwall (Reader in Computer Science, School of Computing and Information Technology, University of Wolverhampton, UK. E‐mail: m.thelwall@wlv.ac.uk)
Liwen Vaughan (Professor, Faculty of Information and Media Studies, University of Western Ontario, London, Canada. E‐mail: lvaughan@uwo.ca)

Aslib Proceedings

ISSN: 0001-253X

Article publication date: 1 February 2004

Abstract

Introduces several new versions of PageRank (the link based Web page ranking algorithm), based on an information science perspective on the concept of the Web document. Although the Web page is the typical indivisible unit of information in search engine results and most Web information retrieval algorithms, other research has suggested that aggregating pages based on directories and domains gives promising alternatives, particularly when Web links are the object of study. The new algorithms introduced based on these alternatives were used to rank four sets of Web pages. The ranking results were compared with human subjects’ rankings. The results of the tests were somewhat inconclusive: the new approach worked well for the set that includes pages from different Web sites; however, it does not work well in ranking pages that are from the same site. It seems that the new algorithms may be effective for some tasks but not for others, especially when only low numbers of links are involved or the pages to be ranked are from the same site or directory.

Keywords

Citation

Thelwall, M. and Vaughan, L. (2004), "New versions of PageRank employing alternative Web document models", Aslib Proceedings, Vol. 56 No. 1, pp. 24-33. https://doi.org/10.1108/00012530410516840

Publisher

:

Emerald Group Publishing Limited

Copyright © 2004, Emerald Group Publishing Limited