To read this content please select one of the options below:

Music sentiment classification based on an optimized CNN-RF-QPSO model

Rui Tian (Southeast University, Nanjing, China)
Ruheng Yin (Southeast University, Nanjing, China)
Feng Gan (Southeast University, Nanjing, China)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 17 March 2023

Issue publication date: 15 November 2023

227

Abstract

Purpose

Music sentiment analysis helps to promote the diversification of music information retrieval methods. Traditional music emotion classification tasks suffer from high manual workload and low classification accuracy caused by difficulty in feature extraction and inaccurate manual determination of hyperparameter. In this paper, the authors propose an optimized convolution neural network-random forest (CNN-RF) model for music sentiment classification which is capable of optimizing the manually selected hyperparameters to improve the accuracy of music sentiment classification and reduce labor costs and human classification errors.

Design/methodology/approach

A CNN-RF music sentiment classification model is designed based on quantum particle swarm optimization (QPSO). First, the audio data are transformed into a Mel spectrogram, and feature extraction is conducted by a CNN. Second, the music features extracted are processed by RF algorithm to complete a preliminary emotion classification. Finally, to select the suitable hyperparameters for a CNN, the QPSO algorithm is adopted to extract the best hyperparameters and obtain the final classification results.

Findings

The model has gone through experimental validations and achieved a classification accuracy of 97 per cent for different sentiment categories with shortened training time. The proposed method with QPSO achieved 1.2 and 1.6 per cent higher accuracy than that with particle swarm optimization and genetic algorithm, respectively. The proposed model had great potential for music sentiment classification.

Originality/value

The dual contribution of this work comprises the proposed model which integrated two deep learning models and the introduction of a QPSO into model optimization. With these two innovations, the efficiency and accuracy of music emotion recognition and classification have been significantly improved.

Keywords

Acknowledgements

Funding: This research was supported by the National Social Science Foundation (21ZD11) and Jiangsu Provincial University Philosophy and Social Science Research Fund (2020SJZDA078).

Citation

Tian, R., Yin, R. and Gan, F. (2023), "Music sentiment classification based on an optimized CNN-RF-QPSO model", Data Technologies and Applications, Vol. 57 No. 5, pp. 719-733. https://doi.org/10.1108/DTA-07-2022-0267

Publisher

:

Emerald Publishing Limited

Copyright © 2023, Emerald Publishing Limited

Related articles