The purpose of this study is to examine the predictors of high‐quality answers in a community‐driven question answering service (Yahoo! Answers).
The identified predictors were organised into two categories: social and content features. Social features refer to the community aspects of the users and are extracted from explicit user interaction and feedback. Content features refer to the intrinsic and extrinsic content quality of answers that could be used to select the high‐quality answers. In total the framework built in this study comprises 17 features from two categories. Based on a randomly selected dataset of 1,600 question‐answer pairs from Yahoo! Answers, high‐quality answer predictors were identified.
The results of the analysis showed the importance of content appraisal features over social and textual content features. The features identified as strongly associated with high‐quality answers include positive votes, completeness, presentation, reliability and accuracy. Features weakly associated with high‐quality answers were high frequency words, answer length, and best answers answered. Features related to the asker's user history were found not to be associated with high‐quality answers.
This work could help in the reuse of answers for new questions. The study identified features that most influence the selection of high‐quality answers. Hence they could be used to select high‐quality answers for answering similar questions posed by users in the future. When a new question is posed, similar questions are first identified, and the answers for these questions are extracted and routed to the proposed quality framework for identifying high‐quality answers. Based on the overall quality index computed, the high‐quality answer could be returned to the asker.
Previous studies in identifying high‐quality answers were conducted using either of two approaches. First using social and textual content features found in community‐driven question answering services and second using content appraisal features by thorough assessment of answer quality provided by experts. However no study had integrated both approaches. Hence this study addresses this gap by developing an integrated generalisable framework to identify features that influence high‐quality answers.
CitationDownload as .RIS
Emerald Group Publishing Limited
Copyright © 2012, Emerald Group Publishing Limited