Search results

1 – 10 of over 5000
Article
Publication date: 14 May 2020

Minghua Wei

In order to solve the problem that the performance of the existing local feature descriptors in uncontrolled environment is greatly affected by illumination, background, occlusion…

135

Abstract

Purpose

In order to solve the problem that the performance of the existing local feature descriptors in uncontrolled environment is greatly affected by illumination, background, occlusion and other factors, we propose a novel face recognition algorithm in uncontrolled environment which combines the block central symmetry local binary pattern (CS-LBP) and deep residual network (DRN) model.

Design/methodology/approach

The algorithm first extracts the block CSP-LBP features of the face image, then incorporates the extracted features into the DRN model, and gives the face recognition results by using a well-trained DRN model. The features obtained by the proposed algorithm have the characteristics of both local texture features and deep features that robust to illumination.

Findings

Compared with the direct usage of the original image, the usage of local texture features of the image as the input of DRN model significantly improves the computation efficiency. Experimental results on the face datasets of FERET, YALE-B and CMU-PIE have shown that the recognition rate of the proposed algorithm is significantly higher than that of other compared algorithms.

Originality/value

The proposed algorithm fundamentally solves the problem of face identity recognition in uncontrolled environment, and it is particularly robust to the change of illumination, which proves its superiority.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 13 no. 2
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 1 April 2022

Jingtong Gao, Shaopeng Dong, Jin Cui, Mei Yuan and Juanru Zhao

The purpose of this paper is to propose a new deep learning-based model to carry out better maintenance for naval propulsion system.

Abstract

Purpose

The purpose of this paper is to propose a new deep learning-based model to carry out better maintenance for naval propulsion system.

Design/methodology/approach

This model is constructed by integrating different deep learning algorithms. The basic idea is to change the connection structure of the deep neural network by introducing a residual module, to limit the prediction output to a reasonable range. Then, connect the Deep Residual Network (DRN) with a Generative Adversarial Network (GAN), which helps achieve data expansion during the training process to improve the accuracy of the assessment model.

Findings

Study results show that the proposed model achieves a better prediction effect on the dataset. The average performance and accuracy of the proposed model outperform the traditional models and the basic deep learning models tested in the paper.

Originality/value

The proposed model proved to be better performed naval propulsion system maintenance than the traditional models and the basic deep learning models. Therefore, our model may provide better maintenance advice for the naval propulsion system and will lead to a more reliable environment for offshore operations.

Details

Engineering Computations, vol. 39 no. 6
Type: Research Article
ISSN: 0264-4401

Keywords

Article
Publication date: 23 December 2022

Jinchao Huang

Recently, the convolutional neural network (ConvNet) has a wide application in the classification of motor imagery EEG signals. However, the low signal-to-noise…

86

Abstract

Purpose

Recently, the convolutional neural network (ConvNet) has a wide application in the classification of motor imagery EEG signals. However, the low signal-to-noise electroencephalogram (EEG) signals are collected under the interference of noises. However, the conventional ConvNet model cannot directly solve this problem. This study aims to discuss the aforementioned issues.

Design/methodology/approach

To solve this problem, this paper adopted a novel residual shrinkage block (RSB) to construct the ConvNet model (RSBConvNet). During the feature extraction from EEG signals, the proposed RSBConvNet prevented the noise component in EEG signals, and improved the classification accuracy of motor imagery. In the construction of RSBConvNet, the author applied the soft thresholding strategy to prevent the non-related motor imagery features in EEG signals. The soft thresholding was inserted into the residual block (RB), and the suitable threshold for the current EEG signals distribution can be learned by minimizing the loss function. Therefore, during the feature extraction of motor imagery, the proposed RSBConvNet de-noised the EEG signals and improved the discriminative of classification features.

Findings

Comparative experiments and ablation studies were done on two public benchmark datasets. Compared with conventional ConvNet models, the proposed RSBConvNet model has obvious improvements in motor imagery classification accuracy and Kappa coefficient. Ablation studies have also shown the de-noised abilities of the RSBConvNet model. Moreover, different parameters and computational methods of the RSBConvNet model have been tested on the classification of motor imagery.

Originality/value

Based on the experimental results, the RSBConvNet constructed in this paper has an excellent recognition accuracy of MI-BCI, which can be used for further applications for the online MI-BCI.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 16 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 6 November 2020

Wenjuan Shen and Xiaoling Li

recent years, facial expression recognition has been widely used in human machine interaction, clinical medicine and safe driving. However, there is a limitation that conventional…

Abstract

Purpose

recent years, facial expression recognition has been widely used in human machine interaction, clinical medicine and safe driving. However, there is a limitation that conventional recurrent neural networks can only learn the time-series characteristics of expressions based on one-way propagation information.

Design/methodology/approach

To solve such limitation, this paper proposes a novel model based on bidirectional gated recurrent unit networks (Bi-GRUs) with two-way propagations, and the theory of identity mapping residuals is adopted to effectively prevent the problem of gradient disappearance caused by the depth of the introduced network. Since the Inception-V3 network model for spatial feature extraction has too many parameters, it is prone to overfitting during training. This paper proposes a novel facial expression recognition model to add two reduction modules to reduce parameters, so as to obtain an Inception-W network with better generalization.

Findings

Finally, the proposed model is pretrained to determine the best settings and selections. Then, the pretrained model is experimented on two facial expression data sets of CK+ and Oulu- CASIA, and the recognition performance and efficiency are compared with the existing methods. The highest recognition rate is 99.6%, which shows that the method has good recognition accuracy in a certain range.

Originality/value

By using the proposed model for the applications of facial expression, the high recognition accuracy and robust recognition results with lower time consumption will help to build more sophisticated applications in real world.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 13 no. 4
Type: Research Article
ISSN: 1756-378X

Keywords

Open Access
Article
Publication date: 9 December 2022

Rui Wang, Shunjie Zhang, Shengqiang Liu, Weidong Liu and Ao Ding

The purpose is using generative adversarial network (GAN) to solve the problem of sample augmentation in the case of imbalanced bearing fault data sets and improving residual

Abstract

Purpose

The purpose is using generative adversarial network (GAN) to solve the problem of sample augmentation in the case of imbalanced bearing fault data sets and improving residual network is used to improve the diagnostic accuracy of the bearing fault intelligent diagnosis model in the environment of high signal noise.

Design/methodology/approach

A bearing vibration data generation model based on conditional GAN (CGAN) framework is proposed. The method generates data based on the adversarial mechanism of GANs and uses a small number of real samples to generate data, thereby effectively expanding imbalanced data sets. Combined with the data augmentation method based on CGAN, a fault diagnosis model of rolling bearing under the condition of data imbalance based on CGAN and improved residual network with attention mechanism is proposed.

Findings

The method proposed in this paper is verified by the western reserve data set and the truck bearing test bench data set, proving that the CGAN-based data generation method can form a high-quality augmented data set, while the CGAN-based and improved residual with attention mechanism. The diagnostic model of the network has better diagnostic accuracy under low signal-to-noise ratio samples.

Originality/value

A bearing vibration data generation model based on CGAN framework is proposed. The method generates data based on the adversarial mechanism of GAN and uses a small number of real samples to generate data, thereby effectively expanding imbalanced data sets. Combined with the data augmentation method based on CGAN, a fault diagnosis model of rolling bearing under the condition of data imbalance based on CGAN and improved residual network with attention mechanism is proposed.

Details

Smart and Resilient Transportation, vol. 5 no. 1
Type: Research Article
ISSN: 2632-0487

Keywords

Article
Publication date: 6 June 2019

Shuang-Shuang Liu

The conventional pedestrian detection algorithms lack in scale sensitivity. The purpose of this paper is to propose a novel algorithm of self-adaptive scale pedestrian detection…

Abstract

Purpose

The conventional pedestrian detection algorithms lack in scale sensitivity. The purpose of this paper is to propose a novel algorithm of self-adaptive scale pedestrian detection, based on deep residual network (DRN), to address such lacks.

Design/methodology/approach

First, the “Edge boxes” algorithm is introduced to extract region of interests from pedestrian images. Then, the extracted bounding boxes are incorporated to different DRNs, one is a large-scale DRN and the other one is the small-scale DRN. The height of the bounding boxes is used to classify the results of pedestrians and to regress the bounding boxes to the entity of the pedestrian. At last, a weighted self-adaptive scale function, which combines the large-scale results and small-scale results, is designed for the final pedestrian detection.

Findings

To validate the effectiveness and feasibility of the proposed algorithm, some comparison experiments have been done on the common pedestrian detection data sets: Caltech, INRIA, ETH and KITTI. Experimental results show that the proposed algorithm is adapted for the various scales of the pedestrians. For the hard detected small-scale pedestrians, the proposed algorithm has improved the accuracy and robustness of detections.

Originality/value

By applying different models to deal with different scales of pedestrians, the proposed algorithm with the weighted calculation function has improved the accuracy and robustness for different scales of pedestrians.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 12 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 9 April 2024

Lu Wang, Jiahao Zheng, Jianrong Yao and Yuangao Chen

With the rapid growth of the domestic lending industry, assessing whether the borrower of each loan is at risk of default is a pressing issue for financial institutions. Although…

Abstract

Purpose

With the rapid growth of the domestic lending industry, assessing whether the borrower of each loan is at risk of default is a pressing issue for financial institutions. Although there are some models that can handle such problems well, there are still some shortcomings in some aspects. The purpose of this paper is to improve the accuracy of credit assessment models.

Design/methodology/approach

In this paper, three different stages are used to improve the classification performance of LSTM, so that financial institutions can more accurately identify borrowers at risk of default. The first approach is to use the K-Means-SMOTE algorithm to eliminate the imbalance within the class. In the second step, ResNet is used for feature extraction, and then two-layer LSTM is used for learning to strengthen the ability of neural networks to mine and utilize deep information. Finally, the model performance is improved by using the IDWPSO algorithm for optimization when debugging the neural network.

Findings

On two unbalanced datasets (category ratios of 700:1 and 3:1 respectively), the multi-stage improved model was compared with ten other models using accuracy, precision, specificity, recall, G-measure, F-measure and the nonparametric Wilcoxon test. It was demonstrated that the multi-stage improved model showed a more significant advantage in evaluating the imbalanced credit dataset.

Originality/value

In this paper, the parameters of the ResNet-LSTM hybrid neural network, which can fully mine and utilize the deep information, are tuned by an innovative intelligent optimization algorithm to strengthen the classification performance of the model.

Details

Kybernetes, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0368-492X

Keywords

Article
Publication date: 16 August 2019

Shuangshuang Liu and Xiaoling Li

Conventional image super-resolution reconstruction by the conventional deep learning architectures suffers from the problems of hard training and gradient disappearing. In order…

Abstract

Purpose

Conventional image super-resolution reconstruction by the conventional deep learning architectures suffers from the problems of hard training and gradient disappearing. In order to solve such problems, the purpose of this paper is to propose a novel image super-resolution algorithm based on improved generative adversarial networks (GANs) with Wasserstein distance and gradient penalty.

Design/methodology/approach

The proposed algorithm first introduces the conventional GANs architecture, the Wasserstein distance and the gradient penalty for the task of image super-resolution reconstruction (SRWGANs-GP). In addition, a novel perceptual loss function is designed for the SRWGANs-GP to meet the task of image super-resolution reconstruction. The content loss is extracted from the deep model’s feature maps, and such features are introduced to calculate mean square error (MSE) for the loss calculation of generators.

Findings

To validate the effectiveness and feasibility of the proposed algorithm, a lot of compared experiments are applied on three common data sets, i.e. Set5, Set14 and BSD100. Experimental results have shown that the proposed SRWGANs-GP architecture has a stable error gradient and iteratively convergence. Compared with the baseline deep models, the proposed GANs models have a significant improvement on performance and efficiency for image super-resolution reconstruction. The MSE calculated by the deep model’s feature maps gives more advantages for constructing contour and texture.

Originality/value

Compared with the state-of-the-art algorithms, the proposed algorithm obtains a better performance on image super-resolution and better reconstruction results on contour and texture.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 12 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 9 April 2024

Shola Usharani, R. Gayathri, Uday Surya Deveswar Reddy Kovvuri, Maddukuri Nivas, Abdul Quadir Md, Kong Fah Tee and Arun Kumar Sivaraman

Automation of detecting cracked surfaces on buildings or in any industrially manufactured products is emerging nowadays. Detection of the cracked surface is a challenging task for…

Abstract

Purpose

Automation of detecting cracked surfaces on buildings or in any industrially manufactured products is emerging nowadays. Detection of the cracked surface is a challenging task for inspectors. Image-based automatic inspection of cracks can be very effective when compared to human eye inspection. With the advancement in deep learning techniques, by utilizing these methods the authors can create automation of work in a particular sector of various industries.

Design/methodology/approach

In this study, an upgraded convolutional neural network-based crack detection method has been proposed. The dataset consists of 3,886 images which include cracked and non-cracked images. Further, these data have been split into training and validation data. To inspect the cracks more accurately, data augmentation was performed on the dataset, and regularization techniques have been utilized to reduce the overfitting problems. In this work, VGG19, Xception and Inception V3, along with Resnet50 V2 CNN architectures to train the data.

Findings

A comparison between the trained models has been performed and from the obtained results, Xception performs better than other algorithms with 99.54% test accuracy. The results show detecting cracked regions and firm non-cracked regions is very efficient by the Xception algorithm.

Originality/value

The proposed method can be way better back to an automatic inspection of cracks in buildings with different design patterns such as decorated historical monuments.

Details

International Journal of Structural Integrity, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1757-9864

Keywords

Article
Publication date: 31 August 2023

Hongwei Zhang, Shihao Wang, Hongmin Mi, Shuai Lu, Le Yao and Zhiqiang Ge

The defect detection problem of color-patterned fabric is still a huge challenge due to the lack of manual defect labeling samples. Recently, many fabric defect detection…

118

Abstract

Purpose

The defect detection problem of color-patterned fabric is still a huge challenge due to the lack of manual defect labeling samples. Recently, many fabric defect detection algorithms based on feature engineering and deep learning have been proposed, but these methods have overdetection or miss-detection problems because they cannot adapt to the complex patterns of color-patterned fabrics. The purpose of this paper is to propose a defect detection framework based on unsupervised adversarial learning for image reconstruction to solve the above problems.

Design/methodology/approach

The proposed framework consists of three parts: a generator, a discriminator and an image postprocessing module. The generator is able to extract the features of the image and then reconstruct the image. The discriminator can supervise the generator to repair defects in the samples to improve the quality of image reconstruction. The multidifference image postprocessing module is used to obtain the final detection results of color-patterned fabric defects.

Findings

The proposed framework is compared with state-of-the-art methods on the public dataset YDFID-1(Yarn-Dyed Fabric Image Dataset-version1). The proposed framework is also validated on several classes in the MvTec AD dataset. The experimental results of various patterns/classes on YDFID-1 and MvTecAD demonstrate the effectiveness and superiority of this method in fabric defect detection.

Originality/value

It provides an automatic defect detection solution that is convenient for engineering applications for the inspection process of the color-patterned fabric manufacturing industry. A public dataset is provided for academia.

Details

International Journal of Clothing Science and Technology, vol. 35 no. 6
Type: Research Article
ISSN: 0955-6222

Keywords

1 – 10 of over 5000