Search results

1 – 10 of 225
Article
Publication date: 13 July 2023

Luya Yang, Xinbo Huang, Yucheng Ren, Qi Han and Yanchen Huang

Abstract

Purpose

During continuous casting and rolling of steel plate, the rolling equipment and process can leave scratches, inclusions, patches, scabs and pitted surfaces on the plate. These defects not only reduce the corrosion resistance, wear resistance and fatigue strength of the plate but may also cause production accidents. Detection of steel plate surface defects must therefore be strengthened to ensure production quality and the smooth development of industrial construction.

Design/methodology/approach

(1) A steel plate surface defect detection technology based on small datasets is proposed, which can detect multiple surface defects and fills the gap in scab defect detection. (2) A detection system based on intelligent recognition technology is built: steel plate images are collected by the front-end monitoring device, transmitted to the back-end monitoring center and processed by the embedded intelligent algorithms. (3) To reduce the impact of external light on the image, an improved Multi-Scale Retinex (MSR) enhancement algorithm based on adaptive weight calculation is proposed, which lays the foundation for subsequent object segmentation and feature extraction. (4) Because defects differ in cause and shape, texture and shape features are combined to classify the different defects on the steel plate surface. A defect classification model is constructed and the classification results are recorded and stored, which has application value in the field of steel plate surface defect detection. (5) The practicability and effectiveness of the proposed method are verified by comparison with other methods, and field running tests are conducted at the equipment commissioning field of China Heavy Machinery Institute.
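As a rough illustration of the enhancement step in item (3), the following is a minimal sketch of standard Multi-Scale Retinex in pure Python. It is not the authors' improved algorithm: the adaptive weight calculation is not described in the abstract, so equal weights are assumed by default, and the function names, the sigma values and the clamped-border blur are all illustrative choices.

```python
import math

def gaussian_kernel(sigma):
    """1-D normalized Gaussian kernel with a radius of about 3 sigma."""
    radius = max(1, int(3 * sigma))
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def blur_1d(row, kernel):
    """Convolve one row with the kernel, clamping indices at the borders."""
    radius = len(kernel) // 2
    n = len(row)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)
            acc += w * row[j]
        out.append(acc)
    return out

def gaussian_blur(image, sigma):
    """Separable Gaussian blur of a 2-D list of floats (rows, then columns)."""
    kernel = gaussian_kernel(sigma)
    rows = [blur_1d(r, kernel) for r in image]
    cols = [blur_1d(list(c), kernel) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]

def multi_scale_retinex(image, sigmas=(1.0, 2.0, 4.0), weights=None):
    """Weighted sum over scales of log(image) - log(Gaussian surround).

    Equal weights are an assumption; the paper computes them adaptively.
    """
    if weights is None:
        weights = [1.0 / len(sigmas)] * len(sigmas)
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for sigma, wt in zip(sigmas, weights):
        surround = gaussian_blur(image, sigma)
        for y in range(h):
            for x in range(w):
                # +1.0 avoids log(0) on black pixels
                out[y][x] += wt * (math.log(image[y][x] + 1.0)
                                   - math.log(surround[y][x] + 1.0))
    return out
```

A uniformly lit image maps to roughly zero everywhere, while a pixel brighter than its neighbourhood maps to a positive value, which is the illumination-normalizing behaviour that makes later segmentation less sensitive to external light.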

Findings

When applied to a small dataset, the proposed method achieves a precision of 94.5% with a detection time of 23.7 ms. For comparison with deep learning methods, the image dataset was expanded; on the expanded dataset the precision and detection time are 94.8% and 24.2 ms, respectively. The proposed method outperforms the other traditional image processing and deep learning methods it was compared with, and the field recognition precision is 91.7%.

Originality/value

In brief, steel plate surface defect detection based on computer vision is effective, but previous methods are not comprehensive and their accuracy and detection speed need to be improved. A more practical and comprehensive technology is therefore developed in this paper. The main contributions are as follows: (1) A steel plate surface defect detection technology based on small datasets is proposed, which can detect multiple surface defects and fills the gap in scab defect detection. (2) A detection system based on intelligent recognition technology is built: steel plate images are collected by the front-end monitoring device, transmitted to the back-end monitoring center and processed by the embedded intelligent algorithms. (3) To reduce the impact of external light on the image, an improved MSR enhancement algorithm based on adaptive weight calculation is proposed, which lays the foundation for subsequent object segmentation and feature extraction. (4) Because defects differ in cause and shape, texture and shape features are combined to classify the different defects on the steel plate surface. A defect classification model is constructed and the classification results are recorded and stored, which has application value in the field of steel plate surface defect detection. (5) The practicability and effectiveness of the proposed method are verified by comparison with other methods, and field running tests are conducted at the equipment commissioning field of China Heavy Machinery Institute.

Details

Engineering Computations, vol. 40 no. 6
Type: Research Article
ISSN: 0264-4401

Article
Publication date: 29 September 2021

Swetha Parvatha Reddy Chandrasekhara, Mohan G. Kabadi and Srivinay

Abstract

Purpose

This study compares two different image processing algorithms that are well suited to detecting prostate cancer using wearable Internet of Things (IoT) devices. Cancer is still considered one of the most dreaded diseases and has continued to afflict humankind over the past few decades. According to the Indian Council of Medical Research, India alone registers about 11.5 lakh (1.15 million) cancer-related cases every year, and close to 8 lakh (800,000) people die of cancer-related causes each year. Prostate cancer was previously seen mostly in men aged above 60 years, but a recent study has revealed that it is on the rise in men between 35 and 60 years of age as well. These findings make it even more necessary to prioritize research on diagnosing prostate cancer at an early stage, so that patients can be cured and lead a normal life.

Design/methodology/approach

The research focuses on two feature extraction algorithms commonly used in medical image processing, namely scale invariant feature transform (SIFT) and gray level co-occurrence matrix (GLCM), in an attempt to identify and close the gap in prostate cancer detection in medical IoT. The features obtained by each strategy are then classified separately using a machine learning based classification model, the multi-class support vector machine (SVM). Owing to their better tissue discrimination and contrast resolution, magnetic resonance imaging (MRI) images have been used for this study. The classification results for the SIFT and GLCM methods are then compared to determine which feature extraction strategy provides the most accurate results for diagnosing prostate cancer.
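To make the GLCM feature extraction named above more concrete, here is a minimal pure-Python sketch of a gray level co-occurrence matrix and two common texture descriptors derived from it. The offset, the number of gray levels and the choice of contrast and energy as features are assumptions for illustration; the abstract does not state which settings or descriptors the study used.

```python
def glcm(image, levels, dx=1, dy=0):
    """Normalized co-occurrence matrix of gray levels at pixel offset (dx, dy).

    `image` is a 2-D list of integers in the range 0..levels-1.
    """
    counts = [[0] * levels for _ in range(levels)]
    h, w = len(image), len(image[0])
    for y in range(h):
        for x in range(w):
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w:
                counts[image[y][x]][image[ny][nx]] += 1
    total = sum(sum(row) for row in counts)
    if total == 0:
        raise ValueError("offset leaves no pixel pairs inside the image")
    return [[c / total for c in row] for row in counts]

def contrast(p):
    """Sum of p(i, j) * (i - j)^2; large when neighbouring levels differ a lot."""
    n = len(p)
    return sum(p[i][j] * (i - j) ** 2 for i in range(n) for j in range(n))

def energy(p):
    """Sum of squared entries; large for uniform or highly regular textures."""
    return sum(v * v for row in p for v in row)
```

Descriptors such as these would form the feature vector passed to a classifier; in the study that classifier is a multi-class SVM.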

Findings

Both models were evaluated on three measures: accuracy, sensitivity and specificity. Each model was checked against a range of training-to-test data set ratios. The SIFT-multiclass SVM model achieved the highest performance, with 99.9451% accuracy, 100% sensitivity and 99% specificity at a 40:60 ratio of training to testing data.
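The three measures reported above follow directly from confusion-matrix counts. The sketch below uses the standard definitions for a single positive class; the function name and the example counts in the usage note are hypothetical and are not taken from the study.

```python
def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, sensitivity, specificity) from confusion-matrix counts.

    tp/fn count positive cases that were detected/missed;
    tn/fp count negative cases that were correctly/incorrectly flagged.
    """
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    sensitivity = tp / (tp + fn)   # true positive rate (recall)
    specificity = tn / (tn + fp)   # true negative rate
    return accuracy, sensitivity, specificity
```

For example, with hypothetical counts `tp=50, fp=1, tn=99, fn=0`, every positive case is found, so sensitivity is 1.0, while the single false positive gives a specificity of 0.99. In a multi-class setting these values are typically computed per class, one class against the rest.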

Originality/value

The comparison of SIFT-multi SVM against GLCM-multi SVM is introduced here for the first time to identify the best model for the accurate diagnosis of prostate cancer. The classification performance of each feature extraction strategy is reported in terms of accuracy, sensitivity and specificity.

Details

International Journal of Pervasive Computing and Communications, vol. 20 no. 1
Type: Research Article
ISSN: 1742-7371

Article
Publication date: 27 February 2023

Fatima-Zahrae Nakach, Hasnae Zerouaoui and Ali Idri

Abstract

Purpose

Histopathology biopsy imaging is currently the gold standard for the diagnosis of breast cancer in clinical practice. Pathologists examine the images at various magnifications to identify the type of tumor because if only one magnification is taken into account, the decision may not be accurate. This study explores the performance of transfer learning and late fusion to construct multi-scale ensembles that fuse different magnification-specific deep learning models for the binary classification of breast tumor slides.

Design/methodology/approach

Three pretrained deep learning techniques (DenseNet 201, MobileNet v2 and Inception v3) were used to classify breast tumor images over the four magnification factors of the Breast Cancer Histopathological Image Classification dataset (40×, 100×, 200× and 400×). To fuse the predictions of the models trained on different magnification factors, different aggregators were used, including weighted voting and seven meta-classifiers trained on slide predictions using class labels and the probabilities assigned to each class. The best cluster of the outperforming models was chosen using the Scott–Knott statistical test, and the top models were ranked using the Borda count voting system.
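Two of the mechanisms mentioned above, weighted voting over magnification-specific models and Borda count ranking, can be sketched in a few lines. This is a generic illustration rather than the authors' pipeline: the dictionary layout, the weights and the tie-breaking rule (alphabetical) are assumptions, and the meta-classifier aggregators and the Scott–Knott test are not covered.

```python
def weighted_vote(prob_by_model, weights):
    """Late fusion: weighted average of per-model class probabilities.

    prob_by_model maps a model name (e.g. a magnification) to a list of
    class probabilities; weights maps the same names to non-negative weights.
    """
    n_classes = len(next(iter(prob_by_model.values())))
    total_w = sum(weights[m] for m in prob_by_model)
    fused = [0.0] * n_classes
    for model, probs in prob_by_model.items():
        for c, p in enumerate(probs):
            fused[c] += weights[model] * p / total_w
    return fused

def borda_count(rankings):
    """Rank candidates from several best-first rankings.

    A candidate in position k of an n-item ranking earns n - 1 - k points;
    ties are broken alphabetically (an arbitrary choice for this sketch).
    """
    scores = {}
    for ranking in rankings:
        n = len(ranking)
        for pos, cand in enumerate(ranking):
            scores[cand] = scores.get(cand, 0) + (n - 1 - pos)
    return sorted(scores, key=lambda c: (-scores[c], c))
```

In this setup, each ranking passed to `borda_count` would correspond to one performance criterion (for instance accuracy or F1-score), so a model that ranks consistently well across criteria comes out on top.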

Findings

This study recommends the use of transfer learning and late fusion for histopathological breast cancer image classification by constructing multi-magnification ensembles because they perform better than models trained on each magnification separately.

Originality/value

The best multi-scale ensembles outperformed state-of-the-art integrated models and achieved a mean accuracy of 98.82 per cent, precision of 98.46 per cent, recall of 100 per cent and F1-score of 99.20 per cent.

Details

Data Technologies and Applications, vol. 57 no. 5
Type: Research Article
ISSN: 2514-9288

Article
Publication date: 22 December 2021

C. Ganeshkumar, Sanjay Kumar Jena, A. Sivakumar and T. Nambirajan

Abstract

Purpose

This paper is a literature review on use of artificial intelligence (AI) among agricultural value chain (AVC) actors, and it brings out gaps in research in this area and provides directions for future research.

Design/methodology/approach

The authors systematically collected literature from several databases covering 25 years (1994–2020). They classified literature based on AVC actors present in different stages of AVC. The literature was analysed using Nvivo 12 (qualitative software) for descriptive and content analysis.

Findings

Fifty percent of the reviewed studies were empirical, and 35% were conceptual. The review showed that AI adoption in the AVC could increase agricultural income, enhance competitiveness and reduce cost. Among the AVC stages, AI research on agricultural processing and the consumer sector was very limited compared with input, production and quality testing. Most AVC actors widely used artificial neural network-based deep learning algorithms for tasks such as water resource management, yield prediction, price/demand forecasting, energy efficiency, optimization of fertilizer/pesticide usage, crop planning, personalized advisement and predicting consumer behaviour.

Research limitations/implications

The authors considered only AI in the AVC; AI use in other sectors and AI use not related to value chain actors were not included in the study.

Originality/value

Earlier studies focussed on AI use in specific areas and actors in the AVC such as inputs, farming, processing, distribution and so on. There were no studies focussed on the entire AVC and the use of AI. This review has filled that literature gap.

Details

Journal of Agribusiness in Developing and Emerging Economies, vol. 13 no. 3
Type: Research Article
ISSN: 2044-0839

Article
Publication date: 17 June 2021

Ambica Ghai, Pradeep Kumar and Samrat Gupta

Abstract

Purpose

Web users rely heavily on online content to make decisions, without assessing the veracity of that content. Online content comprising text, image, video or audio may be tampered with to influence public opinion. Since consumers of online information (and misinformation) tend to trust content when images supplement the text, image manipulation software is increasingly being used to forge images. To address the crucial problem of image manipulation, this study focusses on developing a deep-learning-based image forgery detection framework.

Design/methodology/approach

The proposed deep-learning-based framework aims to detect images forged using copy-move and splicing techniques. An image transformation technique aids the identification of relevant features so that the network can train effectively. The pre-trained customized convolutional neural network is then trained on public benchmark datasets, and its performance is evaluated on the test dataset using various parameters.

Findings

The comparative analysis of image transformation techniques and experiments conducted on benchmark datasets from a variety of socio-cultural domains establishes the effectiveness and viability of the proposed framework. These findings affirm the potential applicability of proposed framework in real-time image forgery detection.

Research limitations/implications

This study bears implications for several important aspects of research on image forgery detection. First, this research adds to the recent discussion on feature extraction and learning for image forgery detection. While prior research on image forgery detection relied on hand-crafted features, the proposed solution contributes to the stream of literature that automatically learns the features and classifies the images. Second, this research contributes to the ongoing effort to curtail the spread of misinformation using images. The extant literature on the spread of misinformation has prominently focussed on textual data shared over social media platforms. The study addresses the call for greater emphasis on the development of robust image transformation techniques.

Practical implications

This study carries important practical implications for various domains such as forensic sciences, media and journalism where image data is increasingly being used to make inferences. The integration of image forgery detection tools can be helpful in determining the credibility of the article or post before it is shared over the Internet. The content shared over the Internet by the users has become an important component of news reporting. The framework proposed in this paper can be further extended and trained on more annotated real-world data so as to function as a tool for fact-checkers.

Social implications

In the current scenario wherein most of the image forgery detection studies attempt to assess whether the image is real or forged in an offline mode, it is crucial to identify any trending or potential forged image as early as possible. By learning from historical data, the proposed framework can aid in early prediction of forged images to detect the newly emerging forged images even before they occur. In summary, the proposed framework has a potential to mitigate physical spreading and psychological impact of forged images on social media.

Originality/value

This study focusses on copy-move and splicing techniques while integrating transfer learning concepts to classify forged images with high accuracy. The synergistic use of hitherto little explored image transformation techniques and customized convolutional neural network helps design a robust image forgery detection framework. Experiments and findings establish that the proposed framework accurately classifies forged images, thus mitigating the negative socio-cultural spread of misinformation.

Details

Information Technology & People, vol. 37 no. 2
Type: Research Article
ISSN: 0959-3845

Article
Publication date: 8 August 2023

Changro Lee

Abstract

Purpose

Unstructured data such as images have defied usage in property valuation for a long time. Instead, structured data in tabular format are commonly employed to estimate property prices. This study attempts to quantify the shape of land lots and uses the resultant output as an input variable for subsequent land valuation models.

Design/methodology/approach

Imagery data containing land lot shapes are fed into a convolutional neural network, and the shape of land lots is classified into two categories, regular and irregular-shaped. Then, the intermediate output (regularity score) is utilized in four downstream models to estimate land prices: random forest, gradient boosting, support vector machine and regression models.

Findings

Quantification of the land lot shapes and their exploitation in valuation led to an improvement in the predictive accuracy for all subsequent models.

Originality/value

The study findings are expected to promote the adoption of elusive price determinants such as the shape of a land lot, appearance of a house and the landscape of a neighborhood in property appraisal practices.

Details

Data Technologies and Applications, vol. 58 no. 2
Type: Research Article
ISSN: 2514-9288

Article
Publication date: 8 July 2022

Uzair Khan, Hikmat Ullah Khan, Saqib Iqbal and Hamza Munir

Abstract

Purpose

Image Processing is an emerging field that is used to extract information from images. In recent years, this field has received immense attention from researchers, especially in the research domains of object detection, biomedical imaging and semantic segmentation. In this study, a bibliometric analysis of publications related to image processing in the Science Citation Index Expanded (SCI-Expanded) has been performed. Several parameters have been analyzed, such as annual scientific production, citations per article, most cited documents, top 20 articles, most relevant authors, author evaluation using the Y-index, top and most relevant sources (journals) and hot topics.

Design/methodology/approach

The bibliographic data has been extracted from the Web of Science, a well-known, leading database of bibliographic citations across multidisciplinary areas that covers journals in computer science, engineering, medicine and the social sciences.

Findings

Research output in image processing was meager for much of the past decade but increased dramatically from 2014 to 2019. Recently, IEEE Access has been the most relevant source, with an average of 115 publications per year. The USA is the most productive country and its publications are highly cited, while China comes in second place. Image Segmentation, Feature Extraction and Medical Image Processing have been hot topics in recent years. The National Natural Science Foundation of China provides 8% of all funds for Image Processing. As Image Processing is now becoming one of the most critical fields, research productivity has risen during the past five years, whereas 2005–2013 was the period with the least work in this area.

Originality/value

This research is novel in that no previous research has focused on bibliometric analysis in the image processing domain, which is one of the hot research areas in computer science and engineering.

Article
Publication date: 24 January 2024

Stuart John Barnes

Abstract

Purpose

Color psychology theory reveals that complex images with very varied palettes and many different colors are likely to be considered unattractive by individuals. Notwithstanding, web content containing social signals may be more attractive via the initiation of a social connection. This research investigates a predictive model blending variables from these theoretical perspectives to determine crowdfunding success.

Design/methodology/approach

The research is based on data from 176,614 Kickstarter projects. A number of machine learning and artificial intelligence techniques were employed to analyze the listing images for color complexity and the presence of people, while specific language features, including socialness, were measured in the listing text. Logistic regression was applied, controlling for several additional variables, and a predictive model was developed.

Findings

The findings supported the color complexity and socialness effects on crowdfunding success. The model achieves notable predictive value explaining 56.4% of variance. Listing images containing fewer colors and that have more similar colors are more likely to be crowdfunded successfully. Listings that convey greater socialness have a greater likelihood of being funded.
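The two image properties highlighted in the findings, the number of colors and how similar they are, can be approximated with very simple measures. The sketch below is an assumption-laden illustration only: the abstract does not say how color complexity was computed, so the quantization into bins, the spread measure and both function names are hypothetical.

```python
import math

def color_count(pixels, bins_per_channel=4):
    """Number of distinct colors after coarse quantization of each RGB channel.

    `pixels` is an iterable of (r, g, b) tuples with values in 0..255.
    """
    step = 256 // bins_per_channel
    return len({(r // step, g // step, b // step) for r, g, b in pixels})

def color_spread(pixels):
    """Mean Euclidean distance of pixels from the average color.

    Zero for a single-color image; larger when the colors are less similar.
    """
    pixels = list(pixels)
    n = len(pixels)
    mean = [sum(p[c] for p in pixels) / n for c in range(3)]
    return sum(
        math.sqrt(sum((p[c] - mean[c]) ** 2 for c in range(3))) for p in pixels
    ) / n
```

Measures of this kind could serve as predictor variables in a logistic regression of funding success alongside the text-based socialness features, which is the general modelling setup the study describes.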

Originality/value

This investigation contributes a unique understanding of the effect of both socialness and color complexity on the success of crowdfunding ventures. A second contribution comes from the process and methods employed in the study, which provide a clear blueprint for large-scale processing of soft information (images and text) so that it can be used as variables in the scientific testing of theory.

Details

Industrial Management & Data Systems, vol. 124 no. 3
Type: Research Article
ISSN: 0263-5577

Article
Publication date: 19 December 2023

Jinchao Huang

Abstract

Purpose

Single-shot multi-category clothing recognition and retrieval play a crucial role in online searching and offline settlement scenarios. Existing clothing recognition methods based on RGBD clothing images often suffer from high-dimensional feature representations, leading to compromised performance and efficiency.

Design/methodology/approach

To address this issue, this paper proposes a novel method called Manifold Embedded Discriminative Feature Selection (MEDFS) to select global and local features, thereby reducing the dimensionality of the feature representation and improving performance. Specifically, by combining three global features and three local features, a low-dimensional embedding is constructed to capture the correlations between features and categories. The MEDFS method designs an optimization framework utilizing manifold mapping and sparse regularization to achieve feature selection. The optimization objective is solved using an alternating iterative strategy, ensuring convergence.
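One common way that sparse regularization yields feature selection is to learn a row-sparse projection matrix and keep the features whose rows have the largest norm. The sketch below shows only that final selection step, under the assumption that such a matrix `W` (one row per feature, one column per embedding dimension) has already been learned; it is not the MEDFS optimization itself, and the function name is hypothetical.

```python
import math

def select_features(W, k):
    """Return the indices of the k features whose rows of W have the largest L2 norm.

    Rows driven close to zero by the sparsity penalty correspond to features
    that contribute little to the embedding and are dropped.
    """
    norms = [math.sqrt(sum(v * v for v in row)) for row in W]
    order = sorted(range(len(W)), key=lambda i: (-norms[i], i))
    return sorted(order[:k])
```

For a matrix whose second and fourth rows are near zero, `select_features(W, 2)` keeps the first and third features, reducing the dimensionality of the representation passed to the classifier.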

Findings

Empirical studies conducted on a publicly available RGBD clothing image dataset demonstrate that the proposed MEDFS method achieves highly competitive clothing classification performance while maintaining efficiency in clothing recognition and retrieval.

Originality/value

This paper introduces a novel approach for multi-category clothing recognition and retrieval, incorporating the selection of global and local features. The proposed method holds potential for practical applications in real-world clothing scenarios.

Details

International Journal of Intelligent Computing and Cybernetics, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1756-378X

Article
Publication date: 2 April 2024

R.S. Vignesh and M. Monica Subashini

Abstract

Purpose

An abundance of techniques has been presented so far for waste classification, but they deliver inefficient results with low accuracy. Their performance varies across repositories, and large-scale databases for training are lacking. The purpose of the study is to provide high security.

Design/methodology/approach

In this research, optimization-assisted federated learning (FL) is introduced for thermoplastic waste segregation and classification. A deep learning (DL) network trained by Archimedes Henry gas solubility optimization (AHGSO) is used to classify plastic and resin types. A deep quantum neural network (DQNN) is used for first-level classification and a deep max-out network (DMN) for second-level classification. AHGSO is obtained by blending the features of the Archimedes optimization algorithm (AOA) and Henry gas solubility optimization (HGSO). The entities in this approach are nodes and servers. Each node trains locally on its local data and sends its updates to the server, where the model is aggregated. Each node then downloads the global model and continues training from the downloaded global model and its local model until the stopping condition is satisfied. Finally, local update and aggregation at the server are adjusted based on the average method. The Data tag suite (DATS_2022) dataset is used for multilevel thermoplastic waste segregation and classification.
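The node-and-server loop described above resembles the widely used federated averaging pattern. The following is a minimal sketch of that generic pattern with models represented as flat lists of floats; it does not include the AHGSO optimizer or the DQNN and DMN networks, and the function names, the learning rate and the optional sample-count weighting are assumptions.

```python
def local_update(global_model, gradient, lr=0.1):
    """One local training step at a node: start from the downloaded global
    model and move against the gradient computed on the node's local data."""
    return [w - lr * g for w, g in zip(global_model, gradient)]

def federated_average(local_models, sample_counts=None):
    """Server-side aggregation: average the parameter vectors sent by the nodes.

    If sample_counts is given, each node is weighted by its number of local
    samples; otherwise a plain average is used.
    """
    if sample_counts is None:
        sample_counts = [1] * len(local_models)
    total = sum(sample_counts)
    dim = len(local_models[0])
    return [
        sum(m[i] * n for m, n in zip(local_models, sample_counts)) / total
        for i in range(dim)
    ]
```

One communication round consists of every node calling `local_update` on the current global model and the server replacing the global model with the output of `federated_average`; rounds repeat until the stopping condition is met. Raw training data never leaves the nodes, only model parameters do.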

Findings

Using the DQNN for first-level classification, the designed optimization-assisted FL achieved an accuracy of 0.930, mean average precision (MAP) of 0.933, false positive rate (FPR) of 0.213, loss function of 0.211, mean square error (MSE) of 0.328 and root mean square error (RMSE) of 0.572. In second-level classification using the DMN, the accuracy, MAP, FPR, loss function, MSE and RMSE are 0.932, 0.935, 0.093, 0.068, 0.303 and 0.551, respectively.
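Since RMSE is by definition the square root of MSE, the two error figures reported at each level can be checked against one another. The short sketch below does this with the values quoted above; the helper name is hypothetical.

```python
import math

def rmse_from_mse(mse):
    """Root mean square error is the square root of the mean square error."""
    return math.sqrt(mse)

# Reported (MSE, RMSE) pairs for the first- and second-level classifiers
reported = {"DQNN": (0.328, 0.572), "DMN": (0.303, 0.551)}
differences = {
    name: abs(rmse_from_mse(mse) - rmse) for name, (mse, rmse) in reported.items()
}
```

Both differences are below 0.001, so the reported RMSE values are consistent with the reported MSE values up to rounding.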

Originality/value

The multilevel thermoplastic waste segregation and classification using the proposed model is accurate and improves the effectiveness of the classification.
