To read this content please select one of the options below:

An interactive query-based approach for summarizing scientific documents

Farnoush Bayatmakou (Department of Industrial Engineering and Management Systems, Amirkabir University of Technology, Tehran, Iran)
Azadeh Mohebi (Department of Information Technology, Iranian Research Institute for Information Science and Technology (IranDoc), Tehran, Iran)
Abbas Ahmadi (Department of Industrial Engineering and Management Systems, Amirkabir University of Technology, Tehran, Iran)

Information Discovery and Delivery

ISSN: 2398-6247

Article publication date: 14 June 2021

Issue publication date: 18 April 2022

142

Abstract

Purpose

Query-based summarization approaches might not be able to provide summaries compatible with the user’s information need, as they mostly rely on a limited source of information, usually represented as a single query by the user. This issue becomes even more challenging when dealing with scientific documents, as they contain more specific subject-related terms, while the user may not be able to express his/her specific information need in a query with limited terms. This study aims to propose an interactive multi-document text summarization approach that generates an eligible summary that is more compatible with the user’s information need. This approach allows the user to interactively specify the composition of a multi-document summary.

Design/methodology/approach

This approach exploits the user’s opinion in two stages. The initial query is refined by user-selected keywords/keyphrases and complete sentences extracted from the set of retrieved documents. It is followed by a novel method for sentence expansion using the genetic algorithm, and ranking the final set of sentences using the maximal marginal relevance method. Basically, for implementation, the Web of Science data set in the artificial intelligence (AI) category is considered.

Findings

The proposed approach receives feedback from the user in terms of favorable keywords and sentences. The feedback eventually improves the summary as the end. To assess the performance of the proposed system, this paper has asked 45 users who were graduate students in the field of AI to fill out a questionnaire. The quality of the final summary has been also evaluated from the user’s perspective and information redundancy. It has been investigated that the proposed approach leads to higher degrees of user satisfaction compared to the ones with no or only one step of the interaction.

Originality/value

The interactive summarization approach goes beyond the initial user’s query, while it includes the user’s preferred keywords/keyphrases and sentences through a systematic interaction. With respect to these interactions, the system gives the user a more clear idea of the information he/she is looking for and consequently adjusting the final result to the ultimate information need. Such interaction allows the summarization system to achieve a comprehensive understanding of the user’s information needs while expanding context-based knowledge and guiding the user toward his/her information journey.

Keywords

Acknowledgements

This research was supported by Robodoc Human and Machine Interaction Lab at the Iranian Research Institute for Information Science and Technology (IranDoc). We thank our colleagues from this Lab who provided insight and expertize that greatly assisted this research. Also, we would like to thank a group of Computer Engineering-AI graduate students at the Amirkabir University of Technology who assisted us in result evaluation process.

Citation

Bayatmakou, F., Mohebi, A. and Ahmadi, A. (2022), "An interactive query-based approach for summarizing scientific documents", Information Discovery and Delivery, Vol. 50 No. 2, pp. 176-191. https://doi.org/10.1108/IDD-10-2020-0124

Publisher

:

Emerald Publishing Limited

Copyright © 2021, Emerald Publishing Limited

Related articles