To read this content please select one of the options below:

Topic features for machine learning-based sentiment analysis in Indonesian tweets

Hendri Murfi (Department of Mathematics, Universitas Indonesia, Depok, Indonesia)
Furida Lusi Siagian (Department of Mathematics, Universitas Indonesia, Depok, Indonesia)
Yudi Satria (Department of Mathematics, Universitas Indonesia, Depok, Indonesia)

International Journal of Intelligent Computing and Cybernetics

ISSN: 1756-378X

Article publication date: 9 January 2019

Issue publication date: 22 February 2019

363

Abstract

Purpose

The purpose of this paper is to analyze topics as alternative features for sentiment analysis in Indonesian tweets.

Design/methodology/approach

Given Indonesian tweets, the processes of sentiment analysis start by extracting features from the tweets. The features are words or topics. The authors use non-negative matrix factorization to extract the topics and apply a support vector machine to classify the tweets into its sentiment class.

Findings

The authors analyze the accuracy using the two-class and three-class sentiment analysis data sets. Both data sets are about sentiments of candidates for Indonesian presidential election. The experiments show that the standard word features give better accuracies than the topics features for the two-class sentiment analysis. Moreover, the topic features can slightly improve the accuracy of the standard word features. The topic features can also improve the accuracy of the standard word features for the three-class sentiment analysis.

Originality/value

The standard textual data representation for sentiment analysis using machine learning is bag of word and its extensions mainly created by natural language processing. This paper applies topics as novel features for the machine learning-based sentiment analysis in Indonesian tweets.

Keywords

Citation

Murfi, H., Siagian, F.L. and Satria, Y. (2019), "Topic features for machine learning-based sentiment analysis in Indonesian tweets", International Journal of Intelligent Computing and Cybernetics, Vol. 12 No. 1, pp. 70-81. https://doi.org/10.1108/IJICC-04-2018-0057

Publisher

:

Emerald Publishing Limited

Copyright © 2019, Emerald Publishing Limited

Related articles