To read this content please select one of the options below:

Multi-class Twitter sentiment classification with emojis

Mengdi Li (NVIDIA Joint-Lab on Mixed Reality (Visualisation and AI), University of Nottingham Ningbo China, Ningbo, China) (International Doctoral Innovation Centre, University of Nottingham Ningbo China, Ningbo, China)
Eugene Ch’ng (NVIDIA Joint-Lab on Mixed Reality (Visualisation and AI), University of Nottingham Ningbo China, Ningbo, China)
Alain Yee Loong Chong (Nottingham University Business School China, University of Nottingham Ningbo China, Ningbo, China)
Simon See (NVIDIA Technology Centre APJ, Singapore)

Industrial Management & Data Systems

ISSN: 0263-5577

Article publication date: 5 September 2018

Issue publication date: 28 September 2018

1433

Abstract

Purpose

Recently, various Twitter Sentiment Analysis (TSA) techniques have been developed, but little has paid attention to the microblogging feature – emojis, and few works have been conducted on the multi-class sentiment analysis of tweets. The purpose of this paper is to consider the popularity of emojis on Twitter and investigate the feasibility of an emoji training heuristic for multi-class sentiment classification of tweets. Tweets from the “2016 Orlando nightclub shooting” were used as a source of study. Besides, this study also aims to demonstrate how mapping can contribute to interpreting sentiments.

Design/methodology/approach

The authors presented a methodological framework to collect, pre-process, analyse and map public Twitter postings related to the shooting. The authors designed and implemented an emoji training heuristic, which automatically prepares the training data set, a feature needed in Big Data research. The authors improved upon the previous framework by advancing the pre-processing techniques, enhancing feature engineering and optimising the classification models. The authors constructed the sentiment model with a logistic regression classifier and selected features. Finally, the authors presented how to visualise citizen sentiments on maps dynamically using Mapbox.

Findings

The sentiment model constructed with the automatically annotated training sets using an emoji approach and selected features performs well in classifying tweets into five different sentiment classes, with a macro-averaged F-measure of 0.635, a macro-averaged accuracy of 0.689 and the MAEM of 0.530. Compared to those experimental results in related works, the results are satisfactory, indicating the model is effective and the proposed emoji training heuristic is useful and feasible in multi-class TSA. The maps authors created, provide a much easier-to-understand visual representation of the data, and make it more efficient to monitor citizen sentiments and distributions.

Originality/value

This work appears to be the first to conduct multi-class sentiment classification on Twitter with automatic annotation of training sets using emojis. Little attention has been paid to applying TSA to monitor the public’s attitudes towards terror attacks and country’s gun policies, the authors consider this work to be a pioneering work. Besides, the authors have introduced a new data set of 2016 Orlando Shooting tweets, which will be made available for other researchers to mine the public’s political opinions about gun policies.

Keywords

Acknowledgements

The authors acknowledge the financial support from the International Doctoral Innovation Centre, Ningbo Education Bureau, Ningbo Science and Technology Bureau, and the University of Nottingham. This work was also supported by the UK Engineering and Physical Sciences Research Council (Grant No. EP/L015463/1).

Citation

Li, M., Ch’ng, E., Chong, A.Y.L. and See, S. (2018), "Multi-class Twitter sentiment classification with emojis", Industrial Management & Data Systems, Vol. 118 No. 9, pp. 1804-1820. https://doi.org/10.1108/IMDS-12-2017-0582

Publisher

:

Emerald Publishing Limited

Copyright © 2018, Emerald Publishing Limited

Related articles