Search results

1 – 10 of 278
Article
Publication date: 13 July 2018

Nadjia Khatir and Safia Nait-bahloul

This study aims to evaluate a new fusion technique of visual and textual clusters of objects from a real multimedia data-driven collection to improve the performance of multimedia…

Abstract

Purpose

This study aims to evaluate a new fusion technique of visual and textual clusters of objects from a real multimedia data-driven collection to improve the performance of multimedia applications.

Design/methodology/approach

The authors focused on using multi-criteria for clustering texts and images. The algorithm consists of these steps: first is text representation using the statistical method of weighting, second is image representation using a bag of words feature descriptors methods and finally application of multi-criteria clustering.

Findings

As an application for event detection based on social multimedia data, in particular, Flickr platform. Several experiments were conducted to choose the appropriate parameters for a better scheme of clustering. The new approach achieves better performance when aggregate text clustering is done with image clustering for event detection.

Research limitations/implications

Further researches would be investigated on other social media platforms such as Facebook and Twitter for a generalization of the technique.

Originality/value

This study contributes to multimedia data mining through the new fusion technique of clustering. The technique has its root in such strong field as the field of multi-criteria clustering and decision-making support.

Details

Kybernetes, vol. 47 no. 10
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 8 April 2022

Botond Benedek, Cristina Ciumas and Bálint Zsolt Nagy

The purpose of this paper is to survey the automobile insurance fraud detection literature in the past 31 years (1990–2021) and present a research agenda that addresses the…

1645

Abstract

Purpose

The purpose of this paper is to survey the automobile insurance fraud detection literature in the past 31 years (1990–2021) and present a research agenda that addresses the challenges and opportunities artificial intelligence and machine learning bring to car insurance fraud detection.

Design/methodology/approach

Content analysis methodology is used to analyze 46 peer-reviewed academic papers from 31 journals plus eight conference proceedings to identify their research themes and detect trends and changes in the automobile insurance fraud detection literature according to content characteristics.

Findings

This study found that automobile insurance fraud detection is going through a transformation, where traditional statistics-based detection methods are replaced by data mining- and artificial intelligence-based approaches. In this study, it was also noticed that cost-sensitive and hybrid approaches are the up-and-coming avenues for further research.

Practical implications

This paper’s findings not only highlight the rise and benefits of data mining- and artificial intelligence-based automobile insurance fraud detection but also highlight the deficiencies observable in this field such as the lack of cost-sensitive approaches or the absence of reliable data sets.

Originality/value

This paper offers greater insight into how artificial intelligence and data mining challenges traditional automobile insurance fraud detection models and addresses the need to develop new cost-sensitive fraud detection methods that identify new real-world data sets.

Details

Journal of Financial Regulation and Compliance, vol. 30 no. 4
Type: Research Article
ISSN: 1358-1988

Keywords

Article
Publication date: 7 March 2016

Seyed Hossein Razavi Hajiagha, Shide Sadat Hashemi and Hannan Amoozad Mahdiraji

Data envelopment analysis (DEA) is a non-parametric model that is developed for evaluating the relative efficiency of a set of homogeneous decision-making units that each unit…

Abstract

Purpose

Data envelopment analysis (DEA) is a non-parametric model that is developed for evaluating the relative efficiency of a set of homogeneous decision-making units that each unit transforms multiple inputs into multiple outputs. However, usually the decision-making units are not completely similar. The purpose of this paper is to propose an algorithm for DEA applications when considered DMUs are non-homogeneous.

Design/methodology/approach

To reach this aim, an algorithm is designed to mitigate the impact of heterogeneity on efficiency evaluation. Using fuzzy C-means algorithm, a fuzzy clustering is obtained for DMUs based on their inputs and outputs. Then, the fuzzy C-means based DEA approach is used for finding the efficiency of DMUs in different clusters. Finally, the different efficiencies of each DMU are aggregated based on the membership values of DMUs in clusters.

Findings

Heterogeneity causes some positive impact on some DMUs while it has negative impact on other ones. The proposed method mitigates this undesirable impact and a different distribution of efficiency score is obtained that neglects this unintended impacts.

Research limitations/implications

The proposed method can be applied in DEA applications with a large number of DMUs in different situations, where some of them enjoyed the good environmental conditions, while others suffered from bad conditions. Therefore, a better assessment of real performance can be obtained.

Originality/value

The paper proposed a hybrid algorithm combination of fuzzy C-means clustering method with classic DEA models for the first time.

Details

Kybernetes, vol. 45 no. 3
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 28 January 2014

Swarnalatha Purushotham and Balakrishna Tripathy

The purpose of this paper is to provide a way to analyze satellite images using various clustering algorithms and refined bitplane methods with other supporting techniques to…

Abstract

Purpose

The purpose of this paper is to provide a way to analyze satellite images using various clustering algorithms and refined bitplane methods with other supporting techniques to prove the superiority of RIFCM.

Design/methodology/approach

A comparative study has been carried out using RIFCM with other related algorithms from their suitability in analysis of satellite images with other supporting techniques which segments the images for further process for the benefit of societal problems. Four images were selected dealing with hills, freshwater, freshwatervally and drought satellite images.

Findings

The superiority of the proposed algorithm, RIFCM with refined bitplane towards other clustering techniques with other supporting methods clustering, has been found and as such the comparison, has been made by applying four metrics (Otsu (Max-Min), PSNR and RMSE (40%-60%-Min-Max), histogram analysis (Max-Max), DB index and D index (Max-Min)) and proved that the RIFCM algorithm with refined bitplane yielded robust results with efficient performance, reduction in the metrics and time complexity of depth computation of satellite images for further process of an image.

Practical implications

For better clustering of satellite images like lands, hills, freshwater, freshwatervalley, drought, etc. of satellite images is an achievement.

Originality/value

The existing system extends the novel framework to provide a more explicit way to analyze an image by removing distortions with refined bitplane slicing using the proposed algorithm of rough intuitionistic fuzzy c-means to show the superiority of RIFCM.

Article
Publication date: 17 April 2024

Charitha Sasika Hettiarachchi, Nanfei Sun, Trang Minh Quynh Le and Naveed Saleem

The COVID-19 pandemic has posed many challenges in almost all sectors around the globe. Because of the pandemic, government entities responsible for managing health-care resources…

Abstract

Purpose

The COVID-19 pandemic has posed many challenges in almost all sectors around the globe. Because of the pandemic, government entities responsible for managing health-care resources face challenges in managing and distributing their limited and valuable health resources. In addition, severe outbreaks may occur in a small or large geographical area. Therefore, county-level preparation is crucial for officials and organizations who manage such disease outbreaks. However, most COVID-19-related research projects have focused on either state- or country-level. Only a few studies have considered county-level preparations, such as identifying high-risk counties of a particular state to fight against the COVID-19 pandemic. Therefore, the purpose of this research is to prioritize counties in a state based on their COVID-19-related risks to manage the COVID outbreak effectively.

Design/methodology/approach

In this research, the authors use a systematic hybrid approach that uses a clustering technique to group counties that share similar COVID conditions and use a multi-criteria decision-making approach – the analytic hierarchy process – to rank clusters with respect to the severity of the pandemic. The clustering was performed using two methods, k-means and fuzzy c-means, but only one of them was used at a time during the experiment.

Findings

The results of this study indicate that the proposed approach can effectively identify and rank the most vulnerable counties in a particular state. Hence, state health resources managing entities can identify counties in desperate need of more attention before they allocate their resources and better prepare those counties before another surge.

Originality/value

To the best of the authors’ knowledge, this study is the first to use both an unsupervised learning approach and the analytic hierarchy process to identify and rank state counties in accordance with the severity of COVID-19.

Details

Journal of Systems and Information Technology, vol. 26 no. 2
Type: Research Article
ISSN: 1328-7265

Keywords

Article
Publication date: 26 September 2018

Tarik Kucukdeniz and Sakir Esnaf

The purpose of this paper is to propose hybrid revised weighted fuzzy c-means (RWFCM) clustering and Nelder–Mead (NM) simplex algorithm, called as RWFCM-NM, for generalized…

Abstract

Purpose

The purpose of this paper is to propose hybrid revised weighted fuzzy c-means (RWFCM) clustering and Nelder–Mead (NM) simplex algorithm, called as RWFCM-NM, for generalized multisource Weber problem (MWP).

Design/methodology/approach

Although the RWFCM claims that there is no obligation to sequentially use different methods together, NM’s local search advantage is investigated and performance of the proposed hybrid algorithm for generalized MWP is tested on well-known research data sets.

Findings

Test results state the outstanding performance of new hybrid RWFCM and NM simplex algorithm in terms of cost minimization and CPU times.

Originality/value

Proposed approach achieves better results in continuous facility location problems.

Details

Journal of Enterprise Information Management, vol. 31 no. 6
Type: Research Article
ISSN: 1741-0398

Keywords

Article
Publication date: 2 January 2024

Xiumei Cai, Xi Yang and Chengmao Wu

Multi-view fuzzy clustering algorithms are not widely used in image segmentation, and many of these algorithms are lacking in robustness. The purpose of this paper is to…

Abstract

Purpose

Multi-view fuzzy clustering algorithms are not widely used in image segmentation, and many of these algorithms are lacking in robustness. The purpose of this paper is to investigate a new algorithm that can segment the image better and retain as much detailed information about the image as possible when segmenting noisy images.

Design/methodology/approach

The authors present a novel multi-view fuzzy c-means (FCM) clustering algorithm that includes an automatic view-weight learning mechanism. Firstly, this algorithm introduces a view-weight factor that can automatically adjust the weight of different views, thereby allowing each view to obtain the best possible weight. Secondly, the algorithm incorporates a weighted fuzzy factor, which serves to obtain local spatial information and local grayscale information to preserve image details as much as possible. Finally, in order to weaken the effects of noise and outliers in image segmentation, this algorithm employs the kernel distance measure instead of the Euclidean distance.

Findings

The authors added different kinds of noise to images and conducted a large number of experimental tests. The results show that the proposed algorithm performs better and is more accurate than previous multi-view fuzzy clustering algorithms in solving the problem of noisy image segmentation.

Originality/value

Most of the existing multi-view clustering algorithms are for multi-view datasets, and the multi-view fuzzy clustering algorithms are unable to eliminate noise points and outliers when dealing with noisy images. The algorithm proposed in this paper has stronger noise immunity and can better preserve the details of the original image.

Details

Engineering Computations, vol. 41 no. 1
Type: Research Article
ISSN: 0264-4401

Keywords

Article
Publication date: 5 September 2016

Rjiba Sadika, Moez Soltani and Saloua Benammou

The purpose of this paper is to apply the Takagi-Sugeno (T-S) fuzzy model techniques in order to treat and classify textual data sets with and without noise. A comparative study…

Abstract

Purpose

The purpose of this paper is to apply the Takagi-Sugeno (T-S) fuzzy model techniques in order to treat and classify textual data sets with and without noise. A comparative study is done in order to select the most accurate T-S algorithm in the textual data sets.

Design/methodology/approach

From a survey about what has been termed the “Tunisian Revolution,” the authors collect a textual data set from a questionnaire targeted at students. Five clustering algorithms are mainly applied: the Gath-Geva (G-G) algorithm, the modified G-G algorithm, the fuzzy c-means algorithm and the kernel fuzzy c-means algorithm. The authors examine the performances of the four clustering algorithms and select the most reliable one to cluster textual data.

Findings

The proposed methodology was to cluster textual data based on the T-S fuzzy model. On one hand, the results obtained using the T-S models are in the form of numerical relationships between selected keywords and the rest of words constituting a text. Consequently, it allows the authors to interpret these results not only qualitatively but also quantitatively. On the other hand, the proposed method is applied for clustering text taking into account the noise.

Originality/value

The originality comes from the fact that the authors validate some economical results based on textual data, even if they have not been written by experts in the linguistic fields. In addition, the results obtained in this study are easy and simple to interpret by the analysts.

Article
Publication date: 23 March 2021

Hendri Murfi

The aim of this research is to develop an eigenspace-based fuzzy c-means method for scalable topic detection.

Abstract

Purpose

The aim of this research is to develop an eigenspace-based fuzzy c-means method for scalable topic detection.

Design/methodology/approach

The eigenspace-based fuzzy c-means (EFCM) combines representation learning and clustering. The textual data are transformed into a lower-dimensional eigenspace using truncated singular value decomposition. Fuzzy c-means is performed on the eigenspace to identify the centroids of each cluster. The topics are provided by transforming back the centroids into the nonnegative subspace of the original space. In this paper, we extend the EFCM method for scalability by using the two approaches, i.e. single-pass and online. We call the developed topic detection methods as oEFCM and spEFCM.

Findings

Our simulation shows that both oEFCM and spEFCM methods provide faster running times than EFCM for data sets that do not fit in memory. However, there is a decrease in the average coherence score. For both data sets that fit and do not fit into memory, the oEFCM method provides a tradeoff between running time and coherence score, which is better than spEFCM.

Originality/value

This research produces a scalable topic detection method. Besides this scalability capability, the developed method also provides a faster running time for the data set that fits in memory.

Details

Data Technologies and Applications, vol. 55 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 7 June 2022

Indranil Ghosh, Rabin K. Jana and Paritosh Pramanik

It is essential to validate whether a nation's economic strength always transpires into new business capacity. The present research strives to identify the key indicators to the…

Abstract

Purpose

It is essential to validate whether a nation's economic strength always transpires into new business capacity. The present research strives to identify the key indicators to the proxy new business ecosystem of countries and critically evaluate the similarity through the lens of advanced Fuzzy Clustering Frameworks over the years.

Design/methodology/approach

The authors use Fuzzy C Means, Type 2 Fuzzy C Means, Fuzzy Possibilistic C Means and Fuzzy Possibilistic Product Partition C Means Clustering algorithm to discover the inherent groupings of the considered countries in terms of intricate patterns of geospatial new business capacity during 2015–2018. Additionally, the authors propose a Particle Swarm Optimization driven Gradient Boosting Regression methodology to measure the influence of the underlying indicators for the overall surge in new business.

Findings

The Fuzzy Clustering frameworks suggest the existence of two clusters of nations across the years. Several developing countries have emerged to cater praiseworthy state of the new business ecosystem. The ease of running a business has appeared to be the most influential feature that governs the overall New Business Density.

Practical implications

It is of paramount practical importance to conduct a periodic review of nations' overall new business ecosystem to draw action plans to emphasize and augment the key enablers linked to new business growth. Countries found to lack new business capacity despite enjoying adequate economic strength can focus effectively on weaker dimensions.

Originality/value

The research proposes a robust systematic framework for new business capacity across different economies, indicating that economic strength does not necessarily transpire to equivalent new business capacity.

Details

Benchmarking: An International Journal, vol. 30 no. 4
Type: Research Article
ISSN: 1463-5771

Keywords

1 – 10 of 278