To read this content please select one of the options below:

An improved approach based on dynamic mixed sampling and transfer learning for topic recognition: a case study on online patient reviews

Yaotan Xie (Health Science Center, Chongqing University, Chongqing, China)
Fei Xiang (School of Medical and Health Management, Huazhong University of Science and Technology, Wuhan, China)

Online Information Review

ISSN: 1468-4527

Article publication date: 1 February 2022

Issue publication date: 26 September 2022

183

Abstract

Purpose

This study aimed to adapt existing text-mining techniques and propose a novel topic recognition approach for textual patient reviews.

Design/methodology/approach

The authors first transformed multilabel samples for adapting model training forms. Then, an improved method was proposed based on dynamic mixed sampling and transfer learning to improve the learning problem caused by imbalanced samples. Specifically, the training of our model was based on the framework of a convolutional neural network and self-trained Word2Vector on large-scale corpora.

Findings

Compared with the SVM and other CNN-based models, the CNN+ DMS + TL model proposed in this study has made significant improvement in F1 score.

Originality/value

The improved methods based on dynamic mixed sampling and transfer learning can adequately manage the learning problem caused by the skewed distribution of samples and achieve the effective and automatic topic recognition of textual patient reviews.

Peer review

The peer-review history for this article is available at: https://publons.com/publon/10.1108/OIR-01-2021-0059.

Keywords

Acknowledgements

Funding: This study received funding from the Fundamental Research Funds for the Central Universities (Award Number: HUST: 2021WKYXZX007).

Citation

Xie, Y. and Xiang, F. (2022), "An improved approach based on dynamic mixed sampling and transfer learning for topic recognition: a case study on online patient reviews", Online Information Review, Vol. 46 No. 6, pp. 1017-1033. https://doi.org/10.1108/OIR-01-2021-0059

Publisher

:

Emerald Publishing Limited

Copyright © 2022, Emerald Publishing Limited

Related articles