Language resources used in multi‐lingual question‐answering systems
Abstract
Purpose
In the field of information retrieval, some multi‐lingual tools are being created to help the users to overcome the language barriers. Nevertheless, these tools are not developed completely and it is necessary to investigate more for their improvement and application. One of their main problems is the choice of the linguistic resources to offer better coverage and to solve the translation problems in the context of the multi‐lingual information retrieval. This paper aims to address this issue.
Design/methodology/approach
This research is focused on the analysis of resources used by the multi‐lingual question‐answering systems, which respond to users' queries with short answers, rather than just offering a list of documents related to the search. An analysis of the main publications about the multi‐lingual QA systems was carried out, with the aim of identifying the typology, the advantages and disadvantages, and the real use and trend of each of the linguistic resources and tools used in this new kind of system.
Findings
Five of the resources most used in the cross‐languages QA systems were identified and studied: databases, dictionaries, corpora, ontologies and thesauri. The three most popular traditional resources (automatic translators, dictionaries, and corpora) are gradually leaving a widening gap for others – such as ontologies and the free encyclopaedia Wikipedia.
Originality/value
The perspective offered by the translation discipline can improve the effectiveness of QA systems.
Keywords
Citation
Olvera‐Lobo, M. and Gutierrez‐Artacho, J. (2011), "Language resources used in multi‐lingual question‐answering systems", Online Information Review, Vol. 35 No. 4, pp. 543-557. https://doi.org/10.1108/14684521111161927
Publisher
:Emerald Group Publishing Limited
Copyright © 2011, Emerald Group Publishing Limited