Search results

1 – 10 of 472

Article

Publication date: 13 August 2024

X-News dataset for online news categorization

Samia Nawaz Yousafzai, Hooria Shahbaz, Armughan Ali, Amreen Qamar, Inzamam Mashood Nasir, Sara Tehsin and Robertas Damaševičius

Abstract

Purpose

The objective is to develop a more effective model that simplifies and accelerates the news classification process using advanced text mining and deep learning (DL) techniques. A distributed framework utilizing Bidirectional Encoder Representations from Transformers (BERT) was developed to classify news headlines. This approach leverages various text mining and DL techniques on a distributed infrastructure, aiming to offer an alternative to traditional news classification methods.

Design/methodology/approach

This study focuses on the classification of distinct types of news by analyzing tweets from various news channels. It addresses the limitations of using benchmark datasets for news classification, which often result in models that are impractical for real-world applications.

Findings

The framework’s effectiveness was evaluated on a newly proposed dataset and two additional benchmark datasets from the Kaggle repository, assessing the performance of each text mining and classification method across these datasets. The results of this study demonstrate that the proposed strategy significantly outperforms other approaches in terms of accuracy and execution time. This indicates that the distributed framework, coupled with the use of BERT for text analysis, provides a robust solution for analyzing large volumes of data efficiently. The findings also highlight the value of the newly released corpus for further research in news classification and emotion classification, suggesting its potential to facilitate advancements in these areas.

Originality/value

This research introduces an innovative distributed framework for news classification that addresses the shortcomings of models trained on benchmark datasets. By utilizing cutting-edge techniques and a novel dataset, the study offers significant improvements in accuracy and processing speed. The release of the corpus represents a valuable contribution to the field, enabling further exploration into news and emotion classification. This work sets a new standard for the analysis of news data, offering practical implications for the development of more effective and efficient news classification systems.

Details

International Journal of Intelligent Computing and Cybernetics, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 1756-378X

Article

Publication date: 19 July 2024

The research landscape on generative artificial intelligence: a bibliometric analysis of transformer-based models

Giulio Marchena Sekli

Abstract

Purpose

The aim of this study is to offer valuable insights to businesses and facilitate a better understanding of transformer-based models (TBMs), which are among the widely employed generative artificial intelligence (GAI) models and have garnered substantial attention due to their ability to process and generate complex data.

Design/methodology/approach

Existing studies on TBMs tend to be limited in scope, either focusing on specific fields or being highly technical. To bridge this gap, this study conducts robust bibliometric analysis to explore the trends across journals, authors, affiliations, countries and research trajectories using science mapping techniques – co-citation, co-words and strategic diagram analysis.

Findings

Identified research gaps encompass the evolution of new closed and open-source TBMs; limited exploration across industries like education and disciplines like marketing; a lack of in-depth exploration of TBMs' adoption in the health sector; a scarcity of research on TBMs' ethical considerations; and potential research on TBMs' performance in diverse applications, such as image processing.

Originality/value

The study offers an updated TBMs landscape and proposes a theoretical framework for TBMs' adoption in organizations. Implications for managers and researchers along with suggested research questions to guide future investigations are provided.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 0368-492X

Open Access

Article

Publication date: 2 April 2024

Automated Dewey Decimal Classification of Swedish library metadata using Annif software

Koraljka Golub, Osma Suominen, Ahmed Taiye Mohammed, Harriet Aagaard and Olof Osterman

Abstract

Purpose

In order to estimate the value of semi-automated subject indexing in operative library catalogues, the study aimed to investigate five different automated implementations of an open source software package on a large set of Swedish union catalogue metadata records, with Dewey Decimal Classification (DDC) as the target classification system. It also aimed to contribute to the body of research on aboutness and related challenges in automated subject indexing and evaluation.

Design/methodology/approach

On a sample of over 230,000 records with close to 12,000 distinct DDC classes, the open source tool Annif, developed by the National Library of Finland, was applied in the following implementations: lexical algorithm, support vector classifier, fastText, Omikuji Bonsai and an ensemble approach combining the former four. A qualitative study involving two senior catalogue librarians and three students of library and information studies was also conducted to investigate the value and inter-rater agreement of automatically assigned classes, on a sample of 60 records.
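
As a rough illustration of the ensemble idea, the sketch below averages per-class scores from several backends and picks the highest-scoring DDC class. It is a minimal sketch, not Annif's implementation or the study's configuration: the function names `ensemble_scores` and `top_class`, the equal weights and all scores are assumptions made up for illustration.

```python
from collections import defaultdict


def ensemble_scores(backend_outputs, weights=None):
    """Weighted average of per-class scores (hypothetical helper).

    backend_outputs: list of dicts mapping a DDC class to a score in [0, 1].
    A class missing from a backend's output counts as a score of 0.
    """
    if weights is None:
        weights = [1.0] * len(backend_outputs)  # assumption: equal weights
    total = sum(weights)
    combined = defaultdict(float)
    for scores, weight in zip(backend_outputs, weights):
        for ddc_class, score in scores.items():
            combined[ddc_class] += weight * score / total
    return dict(combined)


def top_class(backend_outputs, weights=None):
    """Return the DDC class with the highest combined score."""
    combined = ensemble_scores(backend_outputs, weights)
    return max(combined, key=combined.get)


# Made-up scores for one record from four backends (illustrative only).
lexical = {"025": 0.6, "020": 0.3}
svc = {"020": 0.5, "025": 0.4}
fasttext = {"025": 0.7}
omikuji = {"020": 0.6, "025": 0.5}

outputs = [lexical, svc, fasttext, omikuji]
best = top_class(outputs)
```

Averaging lets a class that several backends rank moderately well beat a class that only one backend ranks highly, which is one plausible reason an ensemble can outperform its individual members.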

Findings

The ensemble approach performed best, reaching 66.82% accuracy on the three-digit DDC classification task. The qualitative study confirmed earlier studies reporting low inter-rater agreement but also pointed to the potential value of automatically assigned classes as additional access points in information retrieval.

Originality/value

The paper presents an extensive study of automated classification in an operative library catalogue, accompanied by a qualitative study of automated classes. It demonstrates the value of applying semi-automated indexing in operative information retrieval systems.

Details

Journal of Documentation, vol. 80 no. 5

Type: Research Article

ISSN: 0022-0418

Article

Publication date: 30 July 2024

Data-driven decision-making method for determining the handling department for online appeals

Sheng-Qun Chen, Ting You and Jing-Lin Zhang

Abstract

Purpose

This study aims to enhance the classification and processing of online appeals by employing a deep-learning-based method. This method is designed to meet the requirements for precise information categorization and decision support across various management departments.

Design/methodology/approach

This study leverages the ALBERT–TextCNN algorithm to determine the appropriate department for managing online appeals. ALBERT is selected for its advanced dynamic word representation capabilities, rooted in a multi-layer bidirectional transformer architecture and enriched text vector representation. TextCNN is integrated to facilitate the development of multi-label classification models.

Findings

Comparative experiments demonstrate the effectiveness of the proposed approach and its significant superiority over traditional classification methods in terms of accuracy.

Originality/value

The original contribution of this study lies in its utilization of the ALBERT–TextCNN algorithm for the classification of online appeals, resulting in a substantial improvement in accuracy. This research offers valuable insights for management departments, enabling enhanced understanding of public appeals and fostering more scientifically grounded and effective decision-making processes.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 0368-492X

Article

Publication date: 18 August 2023

Text classification using deep learning techniques: a bibliometric analysis and future research directions

Gaurav Sarin, Pradeep Kumar and M. Mukund

Abstract

Purpose

Text classification is a widely accepted and adopted technique in organizations to mine and analyze unstructured and semi-structured data. With advances in computing technology, deep learning has become more popular among academics and professionals for mining and analytical operations. In this work, the authors study the research carried out in the field of text classification using deep learning techniques to identify gaps and opportunities for further research.

Design/methodology/approach

The authors adopted a bibliometric approach in conjunction with visualization techniques to uncover new insights and findings. The authors collected two decades of data from the Scopus database to perform this study. The authors also discuss business applications of deep learning techniques for text classification.

Findings

The study provides an overview of the publication sources that cover text classification and deep learning together. It also presents a list of prominent authors working in this field and their countries, along with a list of the most cited articles by citation count and country of research. Visualization techniques such as word clouds, network diagrams and thematic maps were used to identify collaboration networks.

Originality/value

The study helps to identify research gaps, which is its original contribution to the body of literature. To the best of the authors' knowledge, no in-depth study of text classification and deep learning together has previously been performed. The study is of high value to scholars and professionals because it points them to research opportunities in this area.

Details

Benchmarking: An International Journal, vol. 31 no. 8

Type: Research Article

ISSN: 1463-5771

Article

Publication date: 12 July 2024

Early identification of high attention content for online mental health community users based on multi-level fusion model

Song Wang, Ying Luo and Xinmin Liu

Abstract

Purpose

The overload of user-generated content in online mental health community makes the focus and resonance tendencies of the participating groups less clear. Thus, the purpose of this paper is to build an early identification mechanism for users' high attention content to promote early intervention and effective dissemination of professional medical guidance.

Design/methodology/approach

We decouple the identification mechanism into two processes: early feature combing and algorithmic model construction. Firstly, based on the differentiated needs and concerns of the participant groups, the multiple features of “information content + source users” are refined. Secondly, a multi-level fusion model is constructed for feature processing. Specifically, a Bidirectional Encoder Representations from Transformers (BERT), Bi-directional Long Short-Term Memory (BiLSTM) and Linear stack is used to refine the semantic features, while a Graph Attention Network (GAT) is used to capture the entity attributes and relation features. Finally, a Convolutional Neural Network (CNN) is used to optimize the multi-level fusion features.

Findings

The results show that the multi-level fusion model achieves an accuracy (ACC) of 84.42%, an F1 score of 79.43% and a recall (R) of 76.71%. Compared with other baseline models and single feature elements, the ACC and F1 values are improved to different degrees.
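
For readers unfamiliar with these metrics, the sketch below shows how ACC, F1 and R are conventionally computed from binary confusion counts. It assumes the standard definitions and that R denotes recall; the abstract does not state which averaging scheme the authors used, and the counts in the example are made up, not taken from the paper.

```python
def classification_metrics(tp, fp, fn, tn):
    """Standard binary metrics from confusion-matrix counts.

    tp/fp/fn/tn: true positives, false positives, false negatives,
    true negatives. Returns a dict with accuracy, precision, recall, F1.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"acc": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Illustrative counts only (not from the study).
metrics = classification_metrics(tp=6, fp=2, fn=4, tn=8)
```

Because F1 combines precision and recall, it can sit well below accuracy when the positive class is harder to detect than the negative one, which is why the three numbers are usually reported together.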

Originality/value

The originality of this paper lies in analyzing multiple features based on early stages and constructing a new multi-level fusion model for processing. Further, the study is valuable for the orientation of psychological patients' needs and early guidance of professional medical care.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 2514-9288

Article

Publication date: 11 September 2024

Speaking with a “forked tongue” – misalignment between user ratings and textual emotions in LLMs

Yixing Yang and Jianxiong Huang

Abstract

Purpose

The study aims to provide concrete service remediation and enhancement for LLM developers such as getting user forgiveness and breaking through perceived bottlenecks. It also aims to improve the efficiency of app users' usage decisions.

Design/methodology/approach

This paper uses user reviews from app stores in 21 countries and 10 languages as research data, extracts latent factors with an LDA model, exploratively treats the misalignment between user ratings and textual emotions as user forgiveness and perceived bottlenecks, and uses a Word2vec-SVM model to analyze sentiment. Finally, attributions are made based on empathy.

Findings

The results show that AI-based LLMs are more likely than regular apps to show a mismatch between user ratings and textual content. Functional and economic remedies are effective in awakening empathy and forgiveness, while empathic remedies are effective in reducing perceived bottlenecks. Interestingly, empathetic users are “pickier”. Further social network analysis reveals that problem-solving timeliness, software flexibility, model updating and special data (voice and image) analysis capabilities are beneficial in breaking perceived bottlenecks. In addition, heterogeneity analysis shows that eastern users are more sensitive to the price factor and are more likely to generate forgiveness through economic remedy, and that there is a dual interaction between basic attributes and extra boosts in the East and West.

Originality/value

The “gap” between negative (positive) user reviews and ratings, that is consumer forgiveness and perceived bottlenecks, is identified in unstructured text; the study finds that empathy helps to awaken user forgiveness and understanding, while it is limited to bottleneck breakthroughs; the dataset includes a wide range of countries and regions, findings are tested in a cross-language and cross-cultural perspective, which makes the study more robust, and the heterogeneity of users' cultural backgrounds is also analyzed.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 0368-492X

Article

Publication date: 17 September 2024

Evolutionary analysis of value co-destruction in urban village renovation using SEM-FCM model

Wei Chen, Mingyu Yu, Yong Wang, Xuteng Lu and Jia Hou

Abstract

Purpose

Urban villages are prevalent informal settlements within Chinese cities, arising from urban expansion. These areas frequently face systematic demolition during urban renewal due to their disorderly layout and outdated appearance. Urban village renovation (UVR) entails balancing diverse interests and navigating complex conflicts, particularly within China’s dual property rights system encompassing urban and rural land. The purpose of this study is to help avoid fierce conflicts of interest in UVR.

Design/methodology/approach

This study utilized the theoretical framework of value co-destruction. Initially, text mining and literature analysis were employed to identify concept nodes and interaction relationships. Subsequently, the structural equation model (SEM) was used to verify the causal model. Finally, the fuzzy cognitive map (FCM) was developed to dynamically simulate value co-destruction scenarios within UVR across various hypothetical situations.
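
To make the FCM simulation step concrete, the sketch below iterates a small fuzzy cognitive map until the node activations settle. It is a minimal sketch under stated assumptions: the update rule (previous activation plus weighted inputs, squashed by a sigmoid) is one common FCM formulation and not necessarily the one the authors used, and the weights, node layout and function names are hypothetical. The layout uses three cause nodes feeding one result node, mirroring the node counts reported in the findings.

```python
import math


def sigmoid(x, lam=1.0):
    """Squashing function that keeps activations in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-lam * x))


def fcm_step(state, weights):
    """One synchronous FCM update.

    weights[j][i] is the (assumed) causal influence of node j on node i.
    """
    n = len(state)
    return [
        sigmoid(state[i] + sum(weights[j][i] * state[j] for j in range(n)))
        for i in range(n)
    ]


def simulate(state, weights, max_steps=100, tol=1e-6):
    """Iterate until activations change by less than tol, or max_steps."""
    for _ in range(max_steps):
        nxt = fcm_step(state, weights)
        if max(abs(a - b) for a, b in zip(nxt, state)) < tol:
            return nxt
        state = nxt
    return state


# Nodes 0-2: cause nodes; node 3: result node (value co-destruction).
# All weights are made up for illustration.
W = [
    [0.0, 0.0, 0.0, 0.5],
    [0.0, 0.0, 0.0, 0.3],
    [0.0, 0.0, 0.0, 0.7],
    [0.0, 0.0, 0.0, 0.0],
]
final_state = simulate([0.8, 0.2, 0.5, 0.0], W)
```

Different hypothetical scenarios can then be compared by changing the initial activations or the weights and rerunning `simulate`.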

Findings

The concept nodes influencing value co-destruction in UVR form a complex system with multiple levels. This includes three cause nodes and one result node. Among these, actor-to-actor emerges as a primary and underlying cause influencing value co-destruction in these projects. Furthermore, strategies for UVR should prioritize integrated interventions that enhance actor-to-actor relationships.

Originality/value

This study introduced a novel mixed methodology aimed at systematically simulating the dynamic process of value co-destruction during UVR. It also provided a fresh perspective on reverse assessment to mitigate the prevalent interest conflicts in UVR, thereby contributing to theoretical advancements and practical strategies for UVR.

Details

Engineering, Construction and Architectural Management, vol. ahead-of-print no. ahead-of-print

Type: Research Article

ISSN: 0969-9988

Article

Publication date: 1 August 2024

An application of machine learning to classify food waste interventions from a food supply chain perspective

Qiongfang Zou, Carel Nicolaas Bezuidenhout and Imran Ishrat

Abstract

Purpose

The purpose of this paper is to demonstrate the efficacy of machine learning (ML) in managing natural language processing tasks, specifically by developing two ML models to systematically classify a substantial number of food waste interventions.

Design/methodology/approach

A literature review was undertaken to gather global food waste interventions. Subsequently, two ML models were designed and trained to classify these interventions into predefined supply chain-related groups and intervention types. To demonstrate the use of the models, a meta-analysis was performed to uncover patterns amongst the interventions.

Findings

The performance of the two classification models underscores the capabilities of ML in natural language processing, significantly enhancing the efficiency of text classification. This facilitated the rapid and effective classification of a large dataset consisting of 2,469 food waste interventions into six distinct types and their assignment to seven involved supply chain stakeholder groups. The meta-analysis reveals the most dominant intervention types and the strategies most widely adopted: 672 interventions are related to “Process and Operations Optimisation”, 457 to “Awareness and Behaviour Interventions” and 403 to “Technological and Engineering Solutions”. Prominent stakeholder groups, including “Processing and Manufacturing”, “Retail”, “Government and Local Authorities” and “NGOs, Charitable Organisations and Research and Advocacy Groups”, are actively involved in over a thousand interventions each.

Originality/value

This study bridges a notable gap in food waste intervention research, a domain previously characterised by fragmentation and incomplete classification of the full range of interventions along the whole food supply chain. To the best of the authors’ knowledge, this is the first study to systematically classify a broad spectrum of food waste interventions while demonstrating ML capabilities. The study provides a clear, systematic framework for interventions to reduce food waste, offering valuable insight for practitioners in the food system, policymakers and consumers. Additionally, it lays the foundation for future in-depth research in the food waste reduction domain.

Details

British Food Journal, vol. 126 no. 9

Type: Research Article

ISSN: 0007-070X

Article

Publication date: 17 July 2023

Machine learning-based model for customer emotion detection in hotel booking services

Nghia Nguyen, Thuy-Hien Nguyen, Yen-Nhi Nguyen, Dung Doan, Minh Nguyen and Van-Ho Nguyen

Abstract

Purpose

The purpose of this paper is to analyze customer emotions in greater depth, make the levels of positive and negative emotion concrete, and use machine learning methods to build a model that identifies customer emotions.

Design/methodology/approach

The study proposed a customer emotion detection model and data mining method based on the collected dataset, including 80,593 online reviews on agoda.com and booking.com from 2009 to 2022.

Findings

By discerning specific emotions expressed in customers' comments, emotion detection, which refers to the process of identifying users' emotional states, assumes a crucial role in evaluating the brand value of a product. The research capitalizes on the vast and diverse data sources available on hotel booking websites, which, despite their richness, remain largely unexplored and unanalyzed. The outcomes of the model, pertaining to the detection and classification of customer emotions based on ratings and reviews into four distinct emotional states, offer a means to address the challenge of determining customer satisfaction regarding their actual service experiences. These findings hold substantial value for businesses operating in this domain, as the findings facilitate the evaluation and formulation of improvement strategies within their business models. The experimental study reveals that the proposed model attains an exact match ratio, precision, and recall rates of up to 81%, 90% and 90%, respectively.
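
The exact match ratio mentioned above is usually defined, for multi-label classification, as the share of samples whose predicted label set equals the true label set exactly. The sketch below assumes that standard definition; the function name `exact_match_ratio` and the sample reviews are hypothetical, and only the four emotion labels come from the abstract.

```python
def exact_match_ratio(true_labels, predicted_labels):
    """Fraction of samples whose predicted label set matches exactly.

    Both arguments are sequences of label collections, one per sample.
    Order and duplicates within a sample's labels are ignored.
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("true and predicted label lists differ in length")
    if not true_labels:
        return 0.0
    matches = sum(
        set(truth) == set(pred)
        for truth, pred in zip(true_labels, predicted_labels)
    )
    return matches / len(true_labels)


# Made-up labels for four reviews (illustrative only).
y_true = [{"happy"}, {"angry", "depressed"}, {"hopeful"}, {"happy", "hopeful"}]
y_pred = [{"happy"}, {"angry"}, {"hopeful"}, {"hopeful", "happy"}]

emr = exact_match_ratio(y_true, y_pred)
```

The second review counts as a miss even though one of its two labels was predicted correctly, which is why the exact match ratio is typically lower than per-label precision and recall.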

Research limitations/implications

The study has yet to mine real-time data. Prediction results may be influenced because the amount of data collected from the web is insufficient and preprocessing is not completely suppressed. Furthermore, the model in the study was not tested using all algorithms and multi-label classifiers. Future research should build databases to mine data in real-time and collect more data and enhance the current model.

Practical implications

The study's results suggest that the emotion detection models can be applied to the real world to quickly analyze customer feedback. The proposed models enable the identification of customers' emotions, the discovery of customer demand, the enhancement of service, and the general customer experience. The established models can be used by many service sectors to learn more about customer satisfaction with the offered goods and services from customer reviews.

Social implications

The research paper helps businesses in the hospitality sector analyze customer emotions in each specific aspect to ensure customer satisfaction. In addition, managers can devise appropriate strategies to bring better products and services to society. This in turn fosters the growth of the hotel tourism sector within the nation, thereby facilitating sustainable economic development on a national scale.

Originality/value

This study developed a customer emotion detection model that classifies customer ratings and reviews into four specific emotions: happy, angry, depressed and hopeful. The model is based on a dataset of 80,593 reviews in Vietnamese from the online hotel booking websites agoda.com and booking.com. The research results help businesses check and evaluate the quality of their services and thereby devise appropriate improvement strategies to increase customer satisfaction and demand more effectively.

Details

Journal of Hospitality and Tourism Insights, vol. 7 no. 3

Type: Research Article

ISSN: 2514-9792
