To read this content please select one of the options below:

Analyzing the results of automatic new topic identification

Seda Ozmutlu (Industrial Engineering Department, Uludag University, Bursa, Turkey)
Gencer C. Cosar (Industrial Engineering Department, Uludag University, Bursa, Turkey)

Library Hi Tech

ISSN: 0737-8831

Article publication date: 5 September 2008

364

Abstract

Purpose

Identification of topic changes within a user search session is a key issue in content analysis of search engine user queries. Recently, various studies have focused on new topic identification/session identification of search engine transaction logs, and several problems regarding the estimation of topic shifts and continuations were observed in these studies. This study aims to analyze the reasons for the problems that were encountered as a result of applying automatic new topic identification.

Design/methodology/approach

Measures, such as cleaning the data of common words and analyzing the errors of automatic new topic identification, are applied to eliminate the problems in estimating topic shifts and continuations.

Findings

The findings show that the resulting errors of automatic new topic identification have a pattern, and further research is required to improve the performance of automatic new topic identification.

Originality/value

Improving the performance of automatic new topic identification would be valuable to search engine designers, so that they can develop new clustering and query recommendation algorithms, as well as custom‐tailored graphical user interfaces for search engine users.

Keywords

Citation

Ozmutlu, S. and Cosar, G.C. (2008), "Analyzing the results of automatic new topic identification", Library Hi Tech, Vol. 26 No. 3, pp. 466-487. https://doi.org/10.1108/07378830810903373

Publisher

:

Emerald Group Publishing Limited

Copyright © 2008, Emerald Group Publishing Limited

Related articles