Search results

1 – 10 of over 18000
Article
Publication date: 28 October 2014

Minchen Zhu, Weizhi Wang and Jingshan Huang

It is well known that the selection of initial cluster centers can significantly affect K-means clustering results. The purpose of this paper is to propose an improved, efficient…

Abstract

Purpose

It is well known that the selection of initial cluster centers can significantly affect K-means clustering results. The purpose of this paper is to propose an improved, efficient methodology to handle such a challenge.

Design/methodology/approach

According to the fact that the inner-class distance among samples within the same cluster is supposed to be smaller than the inter-class distance among clusters, the algorithm will dynamically adjust initial cluster centers that are randomly selected. Consequently, such adjusted initial cluster centers will be highly representative in the sense that they are distributed among as many samples as possible. As a result, local optima that are common in K-means clustering can then be effectively reduced. In addition, the algorithm is able to obtain all initial cluster centers simultaneously (instead of one center at a time) during the dynamic adjustment.

Findings

Experimental results demonstrate that the proposed algorithm greatly improves the accuracy of traditional K-means clustering results and, in a more efficient manner.

Originality/value

The authors presented in this paper an efficient algorithm, which is able to dynamically adjust initial cluster centers that are randomly selected. The adjusted centers are highly representative, i.e. they are distributed among as many samples as possible. As a result, local optima that are common in K-means clustering can be effectively reduced so that the authors can achieve an improved clustering accuracy. In addition, the algorithm is a cost-efficient one and the enhanced clustering accuracy can be obtained in a more efficient manner compared with traditional K-means algorithm.

Details

Engineering Computations, vol. 31 no. 8
Type: Research Article
ISSN: 0264-4401

Keywords

Article
Publication date: 1 March 1993

P. Gu

The grouping of parts and machines for design of cellularmanufacturing systems is carried out by clustering analysis. Two majordrawbacks of some clustering algorithms have been…

Abstract

The grouping of parts and machines for design of cellular manufacturing systems is carried out by clustering analysis. Two major drawbacks of some clustering algorithms have been identified in handling bottleneck machines for forming machine cells. These drawbacks include solution inconsistency and possible misclustering which result in unnecessary bottleneck machines required. Presents a more robust clustering algorithm to overcome these drawbacks. The algorithm consists of four stages: selection of initial cluster centres; cluster‐seeking analysis; eliminating unnecessary bottleneck machines; and new parts assignments. The decision functions based on the formed machine cells are defined to assign new parts to the machine cells. The algorithm is capable of selecting an ideal set of initial cluster centres, and minimizing the number of bottleneck machines required for forming the desired number of machine cells. It can also provide alternative design of machine cells to accommodate the existing production environment.

Details

Integrated Manufacturing Systems, vol. 4 no. 3
Type: Research Article
ISSN: 0957-6061

Keywords

Article
Publication date: 5 September 2016

Runhai Jiao, Shaolong Liu, Wu Wen and Biying Lin

The large volume of big data makes it impractical for traditional clustering algorithms which are usually designed for entire data set. The purpose of this paper is to focus on…

Abstract

Purpose

The large volume of big data makes it impractical for traditional clustering algorithms which are usually designed for entire data set. The purpose of this paper is to focus on incremental clustering which divides data into series of data chunks and only a small amount of data need to be clustered at each time. Few researches on incremental clustering algorithm address the problem of optimizing cluster center initialization for each data chunk and selecting multiple passing points for each cluster.

Design/methodology/approach

Through optimizing initial cluster centers, quality of clustering results is improved for each data chunk and then quality of final clustering results is enhanced. Moreover, through selecting multiple passing points, more accurate information is passed down to improve the final clustering results. The method has been proposed to solve those two problems and is applied in the proposed algorithm based on streaming kernel fuzzy c-means (stKFCM) algorithm.

Findings

Experimental results show that the proposed algorithm demonstrates more accuracy and better performance than streaming kernel stKFCM algorithm.

Originality/value

This paper addresses the problem of improving the performance of increment clustering through optimizing cluster center initialization and selecting multiple passing points. The paper analyzed the performance of the proposed scheme and proved its effectiveness.

Details

Kybernetes, vol. 45 no. 8
Type: Research Article
ISSN: 0368-492X

Keywords

Open Access
Article
Publication date: 24 June 2021

Bo Wang, Guanwei Wang, Youwei Wang, Zhengzheng Lou, Shizhe Hu and Yangdong Ye

Vehicle fault diagnosis is a key factor in ensuring the safe and efficient operation of the railway system. Due to the numerous vehicle categories and different fault mechanisms…

Abstract

Purpose

Vehicle fault diagnosis is a key factor in ensuring the safe and efficient operation of the railway system. Due to the numerous vehicle categories and different fault mechanisms, there is an unbalanced fault category problem. Most of the current methods to solve this problem have complex algorithm structures, low efficiency and require prior knowledge. This study aims to propose a new method which has a simple structure and does not require any prior knowledge to achieve a fast diagnosis of unbalanced vehicle faults.

Design/methodology/approach

This study proposes a novel K-means with feature learning based on the feature learning K-means-improved cluster-centers selection (FKM-ICS) method, which includes the ICS and the FKM. Specifically, this study defines cluster centers approximation to select the initialized cluster centers in the ICS. This study uses improved term frequency-inverse document frequency to measure and adjust the feature word weights in each cluster, retaining the top τ feature words with the highest weight in each cluster and perform the clustering process again in the FKM. With the FKM-ICS method, clustering performance for unbalanced vehicle fault diagnosis can be significantly enhanced.

Findings

This study finds that the FKM-ICS can achieve a fast diagnosis of vehicle faults on the vehicle fault text (VFT) data set from a railway station in the 2017 (VFT) data set. The experimental results on VFT indicate the proposed method in this paper, outperforms several state-of-the-art methods.

Originality/value

This is the first effort to address the vehicle fault diagnostic problem and the proposed method performs effectively and efficiently. The ICS enables the FKM-ICS method to exclude the effect of outliers, solves the disadvantages of the fault text data contained a certain amount of noisy data, which effectively enhanced the method stability. The FKM enhances the distribution of feature words that discriminate between different fault categories and reduces the number of feature words to make the FKM-ICS method faster and better cluster for unbalanced vehicle fault diagnostic.

Details

Smart and Resilient Transportation, vol. 3 no. 2
Type: Research Article
ISSN: 2632-0487

Keywords

Article
Publication date: 22 February 2024

Yumeng Feng, Weisong Mu, Yue Li, Tianqi Liu and Jianying Feng

For a better understanding of the preferences and differences of young consumers in emerging wine markets, this study aims to propose a clustering method to segment the super-new…

Abstract

Purpose

For a better understanding of the preferences and differences of young consumers in emerging wine markets, this study aims to propose a clustering method to segment the super-new generation wine consumers based on their sensitivity to wine brand, origin and price and then conduct user profiles for segmented consumer groups from the perspectives of demographic attributes, eating habits and wine sensory attribute preferences.

Design/methodology/approach

We first proposed a consumer clustering perspective based on their sensitivity to wine brand, origin and price and then conducted an adaptive density peak and label propagation layer-by-layer (ADPLP) clustering algorithm to segment consumers, which improved the issues of wrong centers' selection and inaccurate classification of remaining sample points for traditional DPC (DPeak clustering algorithm). Then, we built a consumer profile system from the perspectives of demographic attributes, eating habits and wine sensory attribute preferences for segmented consumer groups.

Findings

In this study, 10 typical public datasets and 6 basic test algorithms are used to evaluate the proposed method, and the results showed that the ADPLP algorithm was optimal or suboptimal on 10 datasets with accuracy above 0.78. The average improvement in accuracy over the base DPC algorithm is 0.184. As an outcome of the wine consumer profiles, sensitive consumers prefer wines with medium prices of 100–400 CNY and more personalized brands and origins, while casual consumers are fond of popular brands, popular origins and low prices within 50 CNY. The wine sensory attributes preferred by super-new generation consumers are red, semi-dry, semi-sweet, still, fresh tasting, fruity, floral and low acid.

Practical implications

Young Chinese consumers are the main driver of wine consumption in the future. This paper provides a tool for decision-makers and marketers to identify the preferences of young consumers quickly which is meaningful and helpful for wine marketing.

Originality/value

In this study, the ADPLP algorithm was introduced for the first time. Subsequently, the user profile label system was constructed for segmented consumers to highlight their characteristics and demand partiality from three aspects: demographic characteristics, consumers' eating habits and consumers' preferences for wine attributes. Moreover, the ADPLP algorithm can be considered for user profiles on other alcoholic products.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 23 August 2022

Kamlesh Kumar Pandey and Diwakar Shukla

The K-means (KM) clustering algorithm is extremely responsive to the selection of initial centroids since the initial centroid of clusters determines computational effectiveness…

Abstract

Purpose

The K-means (KM) clustering algorithm is extremely responsive to the selection of initial centroids since the initial centroid of clusters determines computational effectiveness, efficiency and local optima issues. Numerous initialization strategies are to overcome these problems through the random and deterministic selection of initial centroids. The random initialization strategy suffers from local optimization issues with the worst clustering performance, while the deterministic initialization strategy achieves high computational cost. Big data clustering aims to reduce computation costs and improve cluster efficiency. The objective of this study is to achieve a better initial centroid for big data clustering on business management data without using random and deterministic initialization that avoids local optima and improves clustering efficiency with effectiveness in terms of cluster quality, computation cost, data comparisons and iterations on a single machine.

Design/methodology/approach

This study presents the Normal Distribution Probability Density (NDPD) algorithm for big data clustering on a single machine to solve business management-related clustering issues. The NDPDKM algorithm resolves the KM clustering problem by probability density of each data point. The NDPDKM algorithm first identifies the most probable density data points by using the mean and standard deviation of the datasets through normal probability density. Thereafter, the NDPDKM determines K initial centroid by using sorting and linear systematic sampling heuristics.

Findings

The performance of the proposed algorithm is compared with KM, KM++, Var-Part, Murat-KM, Mean-KM and Sort-KM algorithms through Davies Bouldin score, Silhouette coefficient, SD Validity, S_Dbw Validity, Number of Iterations and CPU time validation indices on eight real business datasets. The experimental evaluation demonstrates that the NDPDKM algorithm reduces iterations, local optima, computing costs, and improves cluster performance, effectiveness, efficiency with stable convergence as compared to other algorithms. The NDPDKM algorithm minimizes the average computing time up to 34.83%, 90.28%, 71.83%, 92.67%, 69.53% and 76.03%, and reduces the average iterations up to 40.32%, 44.06%, 32.02%, 62.78%, 19.07% and 36.74% with reference to KM, KM++, Var-Part, Murat-KM, Mean-KM and Sort-KM algorithms.

Originality/value

The KM algorithm is the most widely used partitional clustering approach in data mining techniques that extract hidden knowledge, patterns and trends for decision-making strategies in business data. Business analytics is one of the applications of big data clustering where KM clustering is useful for the various subcategories of business analytics such as customer segmentation analysis, employee salary and performance analysis, document searching, delivery optimization, discount and offer analysis, chaplain management, manufacturing analysis, productivity analysis, specialized employee and investor searching and other decision-making strategies in business.

Article
Publication date: 28 July 2020

Ming K. Lim, Jianxin Wang, Chao Wang and Ming-Lang Tseng

Increasing academic communities and practitioners begin to explore a novel method to reduce environmental pollution and realize green logistics delivery. Additionally, China's…

Abstract

Purpose

Increasing academic communities and practitioners begin to explore a novel method to reduce environmental pollution and realize green logistics delivery. Additionally, China's Statistical Yearbook shows that the number of private cars has reached 165 million in China. Under this background, this study proposes a green delivery method by the combination of sharing vehicle (private cars) and IoT (Internet of things) from the perspective of vehicle energy efficiency and aims to improve the energy efficiency of social vehicles and provides more convenient delivery services.

Design/methodology/approach

This study builds an IoT architecture consisting of customer data layer, information collection layer, cloud optimization layer and delivery task execution layer. Especially in the IoT architecture, a clustering analysis method is used to determine the critical value of customers' classification and shared delivery, a routing optimization method is used to solve the initial solution in could layer and shared technology is used in the implementation of shared delivery.

Findings

The results show that the delivery method considering shared vehicles has a positive effect on improving the energy utilization of vehicles. But if all of delivery tasks are performed by the shared vehicle, the application effect may be counterproductive, such as delivery cost increases and energy efficiency decreases. This study provides a good reference for the implementation of green intelligent delivery business, which has a positive effect on the improvement of logistics operation efficiency.

Originality/value

This study designs a novel method to solve the green and shared delivery issues under the IoT environment, which integrates the IoT architecture. The proposed methodology is applied in a real case in China.

Details

Industrial Management & Data Systems, vol. 120 no. 9
Type: Research Article
ISSN: 0263-5577

Keywords

Article
Publication date: 27 February 2024

Jianhua Zhang, Liangchen Li, Fredrick Ahenkora Boamah, Dandan Wen, Jiake Li and Dandan Guo

Traditional case-adaptation methods have poor accuracy, low efficiency and limited applicability, which cannot meet the needs of knowledge users. To address the shortcomings of…

Abstract

Purpose

Traditional case-adaptation methods have poor accuracy, low efficiency and limited applicability, which cannot meet the needs of knowledge users. To address the shortcomings of the existing research in the industry, this paper proposes a case-adaptation optimization algorithm to support the effective application of tacit knowledge resources.

Design/methodology/approach

The attribute simplification algorithm based on the forward search strategy in the neighborhood decision information system is implemented to realize the vertical dimensionality reduction of the case base, and the fuzzy C-mean (FCM) clustering algorithm based on the simulated annealing genetic algorithm (SAGA) is implemented to compress the case base horizontally with multiple decision classes. Then, the subspace K-nearest neighbors (KNN) algorithm is used to induce the decision rules for the set of adapted cases to complete the optimization of the adaptation model.

Findings

The findings suggest the rapid enrichment of data, information and tacit knowledge in the field of practice has led to low efficiency and low utilization of knowledge dissemination, and this algorithm can effectively alleviate the problems of users falling into “knowledge disorientation” in the era of the knowledge economy.

Practical implications

This study provides a model with case knowledge that meets users’ needs, thereby effectively improving the application of the tacit knowledge in the explicit case base and the problem-solving efficiency of knowledge users.

Social implications

The adaptation model can serve as a stable and efficient prediction model to make predictions for the effects of the many logistics and e-commerce enterprises' plans.

Originality/value

This study designs a multi-decision class case-adaptation optimization study based on forward attribute selection strategy-neighborhood rough sets (FASS-NRS) and simulated annealing genetic algorithm-fuzzy C-means (SAGA-FCM) for tacit knowledgeable exogenous cases. By effectively organizing and adjusting tacit knowledge resources, knowledge service organizations can maintain their competitive advantages. The algorithm models established in this study develop theoretical directions for a multi-decision class case-adaptation optimization study of tacit knowledge.

Details

Journal of Advances in Management Research, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0972-7981

Keywords

Article
Publication date: 21 August 2017

Xiangyu Liu, Ping Zhang, Guanglong Du, Ziping He and Guohao Chen

The purpose of this paper is to provide a novel training-responding controlling approach for human–robot interaction. The approach is inspired by the processes of muscle memory…

Abstract

Purpose

The purpose of this paper is to provide a novel training-responding controlling approach for human–robot interaction. The approach is inspired by the processes of muscle memory and conditioned reflex. The approach is significant for dealing with the problems of robot’s redundant movements and operator’s fatigue in human–robot interaction system.

Design/methodology/approach

This paper presented a directional double clustering algorithm (DDCA) to achieve the training process. The DDCA ensured that the initial clustering centers uniformly distributed in every desired cluster. A minimal resource allocation network was used to construct a memory responding algorithm (MRA). When the human–robot interaction system needed to carry out a task for more than one time, the desired movements of the robot were given by the MRA without repeated training. Experimentally demonstrated results showed the proposed training-responding controlling approach could successfully accomplish human–robot interaction tasks.

Findings

The training-responding controlling approach improved the robustness and reliability of the human–robot interaction system, which presented a novel controlling method for the operator.

Practical implications

This approach has significant commercial applications, as a means of controlling for human–robot interaction could serve to point to the desired target and arrive at the appointed positions in industrial and household environment.

Originality/value

This work presented a novel training-responding human-robot controlling method. The human-robot controlling method dealt with the problems of robot’s redundant movements and operator’s fatigue. To the authors’ knowledge, the working processes of muscle memory and conditioned reflex have not been reported to apply to human-robot controlling.

Details

Industrial Robot: An International Journal, vol. 44 no. 5
Type: Research Article
ISSN: 0143-991X

Keywords

Open Access
Article
Publication date: 3 June 2021

Lulu Ge, Zheming Yang and Wen Ji

The evolution of crowd intelligence is a mainly concerns issue in the field of crowd science. It is a kind of group behavior that is superior to the individual’s ability to…

Abstract

Purpose

The evolution of crowd intelligence is a mainly concerns issue in the field of crowd science. It is a kind of group behavior that is superior to the individual’s ability to complete tasks through the cooperation of many agents. In this study, the evolution of crowd intelligence is studied through the clustering method and the particle swarm optimization (PSO) algorithm.

Design/methodology/approach

This study proposes a crowd evolution method based on intelligence level clustering. Based on clustering, this method uses the agents’ intelligence level as the metric to cluster agents. Then, the agents evolve within the cluster on the basis of the PSO algorithm.

Findings

Two main simulation experiments are designed for the proposed method. First, agents are classified based on their intelligence level. Then, when evolving the agents, two different evolution centers are set. Besides, this paper uses different numbers of clusters to conduct experiments.

Practical implications

The experimental results show that the proposed method can effectively improve the crowd intelligence level and the cooperation ability between agents.

Originality/value

This paper proposes a crowd evolution method based on intelligence level clustering, which is based on the clustering method and the PSO algorithm to analyze the evolution.

Details

International Journal of Crowd Science, vol. 5 no. 2
Type: Research Article
ISSN: 2398-7294

Keywords

1 – 10 of over 18000