Search results

1 – 10 of over 1000
Article
Publication date: 19 June 2023

Sunil Kumar Jauhar, B. Ripon Chakma, Sachin S. Kamble and Amine Belhadi

Abstract

Purpose

As e-commerce has expanded rapidly, online shopping platforms have become widespread in India and throughout the world. Product return, which harms the economic and ecological sustainability of the e-commerce industry, is one of the industry's greatest challenges given the substantial increase in online transactions. The authors analyzed customers' purchasing patterns to better understand their product purchase and return behavior.

Design/methodology/approach

The authors used recency, frequency and monetary (RFM) models, based on digital transformation techniques, to better understand and segment potential customers so that personalized strategies can be devised to increase sales. The authors also clustered sellers using k-means and hierarchical clustering to determine why some sellers have the most sales and which of their products entice customers to purchase.
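
As a rough illustration of the recency, frequency and monetary idea mentioned above, the sketch below derives the three values per customer from a list of transactions. The function name `rfm_table`, the tuple layout and the toy data are assumptions made for this example and are not taken from the article; the authors' actual feature construction and scoring may differ.

```python
from datetime import date

def rfm_table(transactions, today):
    """Return {customer: (recency_days, frequency, monetary)}."""
    table = {}
    for customer, day, amount in transactions:
        # Track the latest purchase date, the purchase count and the total spend.
        last, freq, total = table.get(customer, (day, 0, 0.0))
        table[customer] = (max(last, day), freq + 1, total + amount)
    return {c: ((today - last).days, f, m) for c, (last, f, m) in table.items()}

# Hypothetical toy data: (customer id, purchase date, amount spent).
transactions = [
    ("A", date(2023, 1, 5), 40.0),
    ("A", date(2023, 3, 1), 60.0),
    ("B", date(2022, 11, 20), 15.0),
]
rfm = rfm_table(transactions, today=date(2023, 3, 31))
```

Customers with low recency and high frequency and monetary values would typically fall into a loyal segment; a clustering step such as k-means could then be run on these three columns.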

Findings

The authors found, by applying digital transformation models to customer segmentation, that over 61.15% of consumers are loyal customers who are likely to purchase and to use the firm's services, whereas approximately 35% of customers have either stopped purchasing or have relatively low spending. Retaining these consumer segments requires special consideration and enticing offers. Digging deeper into the seller clustering, the authors found that the maximum number of clusters is six, and certain clusters indicate that prompt delivery of goods plays a crucial role in customer feedback and high sales volume.

Originality/value

This is one of the rare studies that develops a seller segmentation strategy by utilizing digital transformation-based methods in order to achieve seller group division.

Details

Journal of Enterprise Information Management, vol. 37 no. 2
Type: Research Article
ISSN: 1741-0398

Article
Publication date: 3 November 2023

Nihan Yildirim, Derya Gultekin, Cansu Hürses and Abdullah Mert Akman

Abstract

Purpose

This paper aims to use text mining methods to explore the similarities and differences between countries’ national digital transformation (DT) and Industry 4.0 (I4.0) policies. The study examines the applicability of text mining as an alternative for comprehensive clustering of national I4.0 and DT strategies, encouraging policy researchers toward data science that can offer rapid policy analysis and benchmarking.

Design/methodology/approach

With an exploratory research approach, topic modeling, principal component analysis and unsupervised machine learning algorithms (k-means and hierarchical clustering) are used for clustering national I4.0 and DT strategies. This paper uses a corpus of policy documents and related scientific publications from several countries and integrates their science and technology performance. The paper also presents the positioning of Türkiye’s I4.0 and DT national policy as a case from a developing country context.

Findings

Text mining provides meaningful clustering results on similarities and differences between countries regarding their national I4.0 and DT policies, aligned with their geographic, economic and political circumstances. Findings also shed light on the DT strategic landscape and the key themes spanning various policy dimensions. Drawing from the Turkish case, political options are discussed in the context of developing (follower) countries’ I4.0 and DT.

Practical implications

The paper reveals meaningful clustering results on similarities and differences between countries regarding their national I4.0 and DT policies, reflecting political proximities aligned with their geographic, economic and political circumstances. This can help policymakers to understand national DT and I4.0 policies comparatively and to use this knowledge to reflect collaborative and competitive measures in their policies.

Originality/value

This paper provides a unique combined methodology for text mining-based policy analysis in the DT context, which has not previously been adopted. In an era where computational social science and machine learning have gained importance in, and been adapted to, political and social science fields and the technology and innovation management discipline, clustering applications revealed similar and different policy patterns in a timely and unbiased manner.

Details

Journal of Science and Technology Policy Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2053-4620

Article
Publication date: 30 May 2023

Benjamin Leiby and Darryl Ahner

Abstract

Purpose

This paper aims to examine how the regional variable in country conflict modeling affects forecast accuracy and identifies a methodology to further improve the predictions.

Design/methodology/approach

This paper uses statistical learning methods both to evaluate the quantity of data for clustering countries and to quantify accuracy according to the number of clusters used.

Findings

This study demonstrates that increasing the number of clusters for modeling improves the ability to predict conflict as long as the models are robust.

Originality/value

This study investigates the quantity of clusters used in conflict modeling, while previous research assumes a specific quantity before modeling.

Details

Journal of Defense Analytics and Logistics, vol. 7 no. 1
Type: Research Article
ISSN: 2399-6439

Article
Publication date: 17 April 2024

Charitha Sasika Hettiarachchi, Nanfei Sun, Trang Minh Quynh Le and Naveed Saleem

Abstract

Purpose

The COVID-19 pandemic has posed many challenges in almost all sectors around the globe. Because of the pandemic, government entities responsible for managing health-care resources face challenges in managing and distributing their limited and valuable health resources. In addition, severe outbreaks may occur in a small or large geographical area. Therefore, county-level preparation is crucial for officials and organizations who manage such disease outbreaks. However, most COVID-19-related research projects have focused on either the state or the country level. Only a few studies have considered county-level preparations, such as identifying high-risk counties of a particular state to fight against the COVID-19 pandemic. Therefore, the purpose of this research is to prioritize counties in a state based on their COVID-19-related risks to manage the COVID outbreak effectively.

Design/methodology/approach

In this research, the authors use a systematic hybrid approach that uses a clustering technique to group counties that share similar COVID conditions and use a multi-criteria decision-making approach – the analytic hierarchy process – to rank clusters with respect to the severity of the pandemic. The clustering was performed using two methods, k-means and fuzzy c-means, but only one of them was used at a time during the experiment.
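
The ranking step described above can be sketched as follows. This is only an illustration of how the analytic hierarchy process turns a pairwise comparison matrix into priority weights; it uses the row geometric-mean approximation rather than the principal-eigenvector computation, and the function name `ahp_weights` and the comparison values are hypothetical, not taken from the article.

```python
import math

def ahp_weights(pairwise):
    """Approximate AHP priority weights with the row geometric-mean method."""
    n = len(pairwise)
    geo_means = [math.prod(row) ** (1.0 / n) for row in pairwise]
    total = sum(geo_means)
    return [g / total for g in geo_means]

# Hypothetical judgments comparing three county clusters by severity:
# entry [i][j] says how much more severe cluster i is than cluster j.
pairwise = [
    [1.0, 3.0, 5.0],
    [1 / 3, 1.0, 2.0],
    [1 / 5, 1 / 2, 1.0],
]
weights = ahp_weights(pairwise)
# Cluster indices ordered from most to least severe.
ranking = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)
```

In a full application the comparison matrix would come from expert judgments or from cluster-level indicators, and a consistency check would normally accompany the weights.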

Findings

The results of this study indicate that the proposed approach can effectively identify and rank the most vulnerable counties in a particular state. Hence, entities that manage state health resources can identify counties in desperate need of more attention before they allocate their resources and better prepare those counties before another surge.

Originality/value

To the best of the authors’ knowledge, this study is the first to use both an unsupervised learning approach and the analytic hierarchy process to identify and rank state counties in accordance with the severity of COVID-19.

Details

Journal of Systems and Information Technology, vol. 26 no. 2
Type: Research Article
ISSN: 1328-7265

Article
Publication date: 22 April 2024

Ruoxi Zhang and Chenhan Ren

Abstract

Purpose

This study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.

Design/methodology/approach

This study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods such as comprehensive weights. In the second part, the shape-based distance (SBD)-K-shape method was used to cluster the actual collected data.
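
For readers unfamiliar with the distance measure named above, the following is a minimal sketch of shape-based distance: one minus the peak of the normalized cross-correlation over all shifts. It assumes two equal-length series and omits the z-normalization that normally precedes it in K-shape; the function name `sbd` and the toy series are made up for the example and do not reproduce the authors' pipeline.

```python
import math

def sbd(x, y):
    """Shape-based distance: 1 minus the peak normalized cross-correlation."""
    n = len(x)
    norm = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    if norm == 0.0:
        return 1.0
    best = -math.inf
    for shift in range(-(n - 1), n):
        # Correlate x with y slid by `shift` positions (zero padding at the ends).
        cc = sum(x[i] * y[i - shift] for i in range(n) if 0 <= i - shift < n)
        best = max(best, cc)
    return 1.0 - best / norm

# Hypothetical sentiment curves: the second is the first delayed by one step.
curve = [0, 1, 2, 1, 0, 0]
delayed = [0, 0, 1, 2, 1, 0]
flipped = [-v for v in curve]
```

Because the measure searches over shifts, `sbd(curve, delayed)` is close to zero even though the peaks are not aligned, which is the property that makes it tolerant of temporal phase shift.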

Findings

The filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others.

Originality/value

Their sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.

Details

The Electronic Library, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0264-0473

Article
Publication date: 30 April 2021

Faruk Bulut, Melike Bektaş and Abdullah Yavuz

Abstract

Purpose

In this study, supervision and control of the possible problems among people over a large area with a limited number of drone cameras and security staff is established.

Design/methodology/approach

These drones, namely unmanned aerial vehicles (UAVs), will be adaptively and automatically distributed over the crowds by the proposed system to control and track the communities. Since crowds are mobile, the design of the drone clusters will be simultaneously re-organized according to the densities and distributions of people. An adaptive and dynamic distribution and routing mechanism of UAV fleets for crowds is implemented to control a specific given region. Nine popular clustering algorithms have been used and tested in the presented mechanism to gain better performance.

Findings

Nine popular clustering algorithms have been used and tested in the presented mechanism to gain better performance. The aggregated model outperformed a single clustering method across five different test cases of human crowd distributions. This study has three basic components. The first one is to divide the human crowds into clusters. The second one is to determine an optimum route of UAVs over clusters. The last one is to direct the most appropriate security personnel to the events that occurred.

Originality/value

This study has three basic components. The first one is to divide the human crowds into clusters. The second one is to determine an optimum route of UAVs over clusters. The last one is to direct the most appropriate security personnel to the events that occurred.

Details

International Journal of Intelligent Unmanned Systems, vol. 12 no. 1
Type: Research Article
ISSN: 2049-6427

Article
Publication date: 20 July 2023

Elaheh Hosseini, Kimiya Taghizadeh Milani and Mohammad Shaker Sabetnasab

Abstract

Purpose

This research aimed to visualize and analyze the co-word network and thematic clusters of the intellectual structure in the field of linked data during 1900–2021.

Design/methodology/approach

This applied research employed a descriptive and analytical method, scientometric indicators, co-word techniques, and social network analysis. VOSviewer, SPSS, Python programming, and UCINet software were used for data analysis and network structure visualization.

Findings

The top ranks of the Web of Science (WOS) subject categorization belonged to various fields of computer science. Besides, the USA was the most prolific country. The keyword ontology had the highest frequency of co-occurrence. Ontology and semantic formed the most frequent co-word pair. In terms of the network structure, nine major topic clusters were identified based on co-occurrence, and 29 thematic clusters were identified based on hierarchical clustering. Comparisons between the two clustering techniques indicated that three clusters, namely semantic bioinformatics, knowledge representation, and semantic tools, were in common. The most mature and mainstream thematic clusters were natural language processing techniques to boost modeling and visualization, context-aware knowledge discovery, probabilistic latent semantic analysis (PLSA), semantic tools, latent semantic indexing, web ontology language (OWL) syntax, and ontology-based deep learning.
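
The co-occurrence counts that underlie a co-word analysis like the one reported above can be sketched in a few lines. The function name `coword_counts` and the three keyword records are hypothetical and only illustrate the counting step; the study itself used VOSviewer, SPSS, Python and UCINet on Web of Science data.

```python
from collections import Counter
from itertools import combinations

def coword_counts(keyword_lists):
    """Count how often each unordered keyword pair appears in the same record."""
    counts = Counter()
    for keywords in keyword_lists:
        # Sorting makes ("a", "b") and ("b", "a") map to the same pair.
        for pair in combinations(sorted(set(keywords)), 2):
            counts[pair] += 1
    return counts

# Hypothetical keyword records, one list per publication.
records = [
    ["ontology", "semantic", "linked data"],
    ["ontology", "semantic"],
    ["linked data", "rdf"],
]
counts = coword_counts(records)
top_pair, top_count = counts.most_common(1)[0]
```

The resulting pair counts form the weighted edges of a co-word network, which can then be clustered or visualized.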

Originality/value

This study adopted various techniques such as co-word analysis, social network analysis, network structure visualization, and hierarchical clustering to present a suitable, visual, methodical, and comprehensive perspective on linked data.

Details

Library Hi Tech, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0737-8831

Article
Publication date: 15 June 2023

Abena Owusu and Aparna Gupta

Abstract

Purpose

Although risk culture is a key determinant of effective risk management, identifying the risk culture of a firm can be challenging due to the abstract concept of culture. This paper proposes a novel approach that uses unsupervised machine learning techniques to identify significant features needed to assess and differentiate between different forms of risk culture.

Design/methodology/approach

To convert the unstructured text in our sample of banks' 10K reports into structured data, a two-dimensional dictionary for text mining is built to capture risk culture characteristics and the bank's attitude towards the risk culture characteristics. A principal component analysis (PCA) reduction technique is applied to extract the significant features that define risk culture, before K-means unsupervised learning is used to cluster the reports into distinct risk culture groups.
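
To make the clustering step above concrete, here is a minimal sketch of K-means (Lloyd's algorithm) applied to points that stand in for PCA component scores. Only the K-means step is shown; the PCA is assumed to have been done already. The function name `kmeans`, the deterministic seeding with the first k points, and the toy scores are assumptions for the example, not the authors' setup.

```python
def kmeans(points, k, iterations=20):
    """Plain Lloyd's algorithm; the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

# Hypothetical two-component PCA scores for six reports.
scores = [(0.0, 0.1), (5.0, 5.1), (0.2, 0.0), (5.2, 4.9), (0.1, 0.2), (4.9, 5.0)]
labels, centroids = kmeans(scores, k=2)
```

Production code would normally use several random restarts or k-means++ seeding, since the result of Lloyd's algorithm depends on the initial centroids.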

Findings

The PCA identifies uncertainty, litigious and constraining sentiments among risk culture features to be significant in defining the risk culture of banks. Cluster analysis on the PCA factors proposes three distinct risk culture clusters: good, fair and poor. Consistent with regulatory expectations, a good or fair risk culture in banks is characterized by high profitability ratios, bank stability, lower default risk and good governance.

Originality/value

The relationship between culture and risk management can be difficult to study given that it is hard to measure culture from traditional data sources that are messy and diverse. This study offers a better understanding of risk culture using an unsupervised machine learning approach.

Details

International Journal of Managerial Finance, vol. 20 no. 2
Type: Research Article
ISSN: 1743-9132

Article
Publication date: 31 May 2022

Jianfang Qi, Yue Li, Haibin Jin, Jianying Feng and Weisong Mu

Abstract

Purpose

The purpose of this study is to propose a new consumer value segmentation method for low-dimensional dense market datasets to quickly detect and cluster the most profitable customers for the enterprises.

Design/methodology/approach

In this study, the comprehensive segmentation bases (CSB) with richer meanings were obtained by introducing the weighted recency-frequency-monetary (RFM) model into the common segmentation bases (SB). Further, a new market segmentation method, the CSB-MBK algorithm was proposed by integrating the CSB model and the mini-batch k-means (MBK) clustering algorithm.

Findings

The results show that our proposed CSB model can reflect consumers' contributions to a market, as well as improve the clustering performance. Moreover, the proposed CSB-MBK algorithm is demonstrably superior to the SB-MBK, CSB-KMA and CSB-Chameleon algorithms with respect to the Silhouette Coefficient (SC), the Calinski-Harabasz (CH) Index and the average running time, and superior to the SB-MBK, RFM-MBK and WRFM-MBK algorithms in terms of the inter-market value and characteristic differentiation.
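
The Silhouette Coefficient used as a quality measure above can be computed as in the sketch below, which averages (b - a) / max(a, b) over all points, where a is the mean distance to the point's own cluster and b is the smallest mean distance to another cluster. The function name `silhouette` and the four toy points are hypothetical, and the sketch assumes at least two clusters and Euclidean distance.

```python
import math

def silhouette(points, labels):
    """Mean silhouette coefficient over all points (Euclidean distance)."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # common convention for singleton clusters
            continue
        # a: mean distance to the other members of the same cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in members) / len(members)
            for other, members in clusters.items() if other != lab
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

# Hypothetical points with one sensible and one poor labeling.
points = [(0, 0), (0, 1), (10, 0), (10, 1)]
good = silhouette(points, [0, 0, 1, 1])
bad = silhouette(points, [0, 1, 0, 1])
```

Values near 1 indicate compact, well-separated clusters, while negative values suggest that points sit closer to another cluster than to their own.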

Practical implications

This paper provides a tool for decision-makers and marketers to segment a market quickly, which can help them grasp consumers' activity, loyalty, purchasing power and other characteristics in a target market in a timely manner and achieve precision marketing.

Originality/value

This study is the first to introduce the CSB-MBK algorithm for identifying valuable customers through the comprehensive consideration of the clustering quality, consumer value and segmentation speed. Moreover, the CSB-MBK algorithm can be considered for applications in other markets.

Details

Kybernetes, vol. 52 no. 10
Type: Research Article
ISSN: 0368-492X

Article
Publication date: 28 February 2023

Meltem Aksoy, Seda Yanık and Mehmet Fatih Amasyali

Abstract

Purpose

When a large number of project proposals are evaluated to allocate available funds, grouping them based on their similarities is beneficial. Current approaches to group proposals are primarily based on manual matching of similar topics, discipline areas and keywords declared by project applicants. When the number of proposals increases, this task becomes complex and requires excessive time. This paper aims to demonstrate how to effectively use the rich information in the titles and abstracts of Turkish project proposals to group them automatically.

Design/methodology/approach

This study proposes a model that effectively groups Turkish project proposals by combining word embedding, clustering and classification techniques. The proposed model uses FastText, BERT and term frequency/inverse document frequency (TF/IDF) word-embedding techniques to extract terms from the titles and abstracts of project proposals in Turkish. The extracted terms were grouped using both the clustering and classification techniques. Natural groups contained within the corpus were discovered using k-means, k-means++, k-medoids and agglomerative clustering algorithms. Additionally, this study employs classification approaches to predict the target class for each document in the corpus. To classify project proposals, various classifiers, including k-nearest neighbors (KNN), support vector machines (SVM), artificial neural networks (ANN), classification and regression trees (CART) and random forest (RF), are used. Empirical experiments were conducted to validate the effectiveness of the proposed method by using real data from the Istanbul Development Agency.
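
Of the representations listed above, the frequency-based one is the simplest to sketch. The example below builds TF-IDF vectors with the textbook weighting tf * log(N / df) and compares documents by cosine similarity, which is the kind of input a clustering or classification algorithm would receive. The function names, the whitespace tokenizer and the English toy titles are assumptions for illustration; the study worked on Turkish proposals and also used FastText and BERT, which are not shown here.

```python
import math
from collections import Counter

def tfidf_vectors(documents):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0

# Hypothetical proposal titles.
titles = [
    "solar energy storage project",
    "wind energy storage project",
    "urban youth education program",
]
vecs = tfidf_vectors(titles)
```

Here the two energy-related titles share weighted terms and therefore have a positive similarity, while the education title shares none with them.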

Findings

The results show that the generated word embeddings can effectively represent proposal texts as vectors, and can be used as inputs for clustering or classification algorithms. Using clustering algorithms, the document corpus is divided into five groups. In addition, the results demonstrate that the proposals can easily be categorized into predefined categories using classification algorithms. SVM-Linear achieved the highest prediction accuracy (89.2%) with the FastText word embedding method. A comparison of manual grouping with automatic classification and clustering results revealed that both classification and clustering techniques have a high success rate.

Research limitations/implications

The proposed model automatically benefits from the rich information in project proposals and significantly reduces numerous time-consuming tasks that managers must perform manually. Thus, it eliminates the drawbacks of the current manual methods and yields significantly more accurate results. In the future, additional experiments should be conducted to validate the proposed method using data from other funding organizations.

Originality/value

This study presents the application of word embedding methods to effectively use the rich information in the titles and abstracts of Turkish project proposals. Existing research studies focus on the automatic grouping of proposals and use traditional frequency-based word embedding methods for feature extraction to represent project proposals. Unlike previous research, this study employs two better-performing neural network-based textual feature extraction techniques to obtain terms representing the proposals: BERT as a contextual word embedding method and FastText as a static word embedding method. Moreover, to the best of our knowledge, there has been no research conducted on the grouping of project proposals in Turkish.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 16 no. 3
Type: Research Article
ISSN: 1756-378X
