Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape
ISSN: 0264-0473
Article publication date: 22 April 2024
Issue publication date: 26 July 2024
Abstract
Purpose
This study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.
Design/methodology/approach
This study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data.
Findings
The filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others.
Originality/value
Their sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.
Keywords
Citation
Zhang, R. and Ren, C. (2024), "Sentiment time series clustering of Danmu videos based on BERT fine-tuning and SBD-K-shape", The Electronic Library, Vol. 42 No. 4, pp. 553-575. https://doi.org/10.1108/EL-10-2023-0243
Publisher
:Emerald Publishing Limited
Copyright © 2024, Emerald Publishing Limited