Search results

1 – 10 of 496
Article
Publication date: 14 December 2018

Erion Çano and Maurizio Morisio

The fabulous results of convolution neural networks in image-related tasks attracted attention of text mining, sentiment analysis and other text analysis researchers. It is…

Abstract

Purpose

The fabulous results of convolution neural networks in image-related tasks attracted attention of text mining, sentiment analysis and other text analysis researchers. It is, however, difficult to find enough data for feeding such networks, optimize their parameters, and make the right design choices when constructing network architectures. The purpose of this paper is to present the creation steps of two big data sets of song emotions. The authors also explore usage of convolution and max-pooling neural layers on song lyrics, product and movie review text data sets. Three variants of a simple and flexible neural network architecture are also compared.

Design/methodology/approach

The intention was to spot any important patterns that can serve as guidelines for parameter optimization of similar models. The authors also wanted to identify architecture design choices which lead to high performing sentiment analysis models. To this end, the authors conducted a series of experiments with neural architectures of various configurations.

Findings

The results indicate that parallel convolutions of filter lengths up to 3 are usually enough for capturing relevant text features. Also, max-pooling region size should be adapted to the length of text documents for producing the best feature maps.

Originality/value

Top results the authors got are obtained with feature maps of lengths 6–18. An improvement on future neural network models for sentiment analysis could be generating sentiment polarity prediction of documents using aggregation of predictions on smaller excerpt of the entire text.

Details

Data Technologies and Applications, vol. 53 no. 1
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 11 January 2021

Rajit Nair, Santosh Vishwakarma, Mukesh Soni, Tejas Patel and Shubham Joshi

The latest 2019 coronavirus (COVID-2019), which first appeared in December 2019 in Wuhan's city in China, rapidly spread around the world and became a pandemic. It has had a…

Abstract

Purpose

The latest 2019 coronavirus (COVID-2019), which first appeared in December 2019 in Wuhan's city in China, rapidly spread around the world and became a pandemic. It has had a devastating impact on daily lives, the public's health and the global economy. The positive cases must be identified as soon as possible to avoid further dissemination of this disease and swift care of patients affected. The need for supportive diagnostic instruments increased, as no specific automated toolkits are available. The latest results from radiology imaging techniques indicate that these photos provide valuable details on the virus COVID-19. User advanced artificial intelligence (AI) technologies and radiological imagery can help diagnose this condition accurately and help resolve the lack of specialist doctors in isolated areas. In this research, a new paradigm for automatic detection of COVID-19 with bare chest X-ray images is displayed. Images are presented. The proposed model DarkCovidNet is designed to provide correct binary classification diagnostics (COVID vs no detection) and multi-class (COVID vs no results vs pneumonia) classification. The implemented model computed the average precision for the binary and multi-class classification of 98.46% and 91.352%, respectively, and an average accuracy of 98.97% and 87.868%. The DarkNet model was used in this research as a classifier for a real-time object detection method only once. A total of 17 convolutionary layers and different filters on each layer have been implemented. This platform can be used by the radiologists to verify their initial application screening and can also be used for screening patients through the cloud.

Design/methodology/approach

This study also uses the CNN-based model named Darknet-19 model, and this model will act as a platform for the real-time object detection system. The architecture of this system is designed in such a way that they can be able to detect real-time objects. This study has developed the DarkCovidNet model based on Darknet architecture with few layers and filters. So before discussing the DarkCovidNet model, look at the concept of Darknet architecture with their functionality. Typically, the DarkNet architecture consists of 5 pool layers though the max pool and 19 convolution layers. Assume as a convolution layer, and as a pooling layer.

Findings

The work discussed in this paper is used to diagnose the various radiology images and to develop a model that can accurately predict or classify the disease. The data set used in this work is the images bases on COVID-19 and non-COVID-19 taken from the various sources. The deep learning model named DarkCovidNet is applied to the data set, and these have shown signification performance in the case of binary classification and multi-class classification. During the multi-class classification, the model has shown an average accuracy 98.97% for the detection of COVID-19, whereas in a multi-class classification model has achieved an average accuracy of 87.868% during the classification of COVID-19, no detection and Pneumonia.

Research limitations/implications

One of the significant limitations of this work is that a limited number of chest X-ray images were used. It is observed that patients related to COVID-19 are increasing rapidly. In the future, the model on the larger data set which can be generated from the local hospitals will be implemented, and how the model is performing on the same will be checked.

Originality/value

Deep learning technology has made significant changes in the field of AI by generating good results, especially in pattern recognition. A conventional CNN structure includes a convolution layer that extracts characteristics from the input using the filters it applies, a pooling layer that reduces calculation efficiency and the neural network's completely connected layer. A CNN model is created by integrating one or more of these layers, and its internal parameters are modified to accomplish a specific mission, such as classification or object recognition. A typical CNN structure has a convolution layer that extracts features from the input with the filters it applies, a pooling layer to reduce the size for computational performance and a fully connected layer, which is a neural network. A CNN model is created by combining one or more such layers, and its internal parameters are adjusted to accomplish a particular task, such as classification or object recognition.

Details

World Journal of Engineering, vol. 19 no. 1
Type: Research Article
ISSN: 1708-5284

Keywords

Article
Publication date: 25 March 2020

Wang Zhao and Long Lu

Facial expression provides abundant information for social interaction, and the analysis and utilization of facial expression data are playing a huge driving role in all areas of…

Abstract

Purpose

Facial expression provides abundant information for social interaction, and the analysis and utilization of facial expression data are playing a huge driving role in all areas of society. Facial expression data can reflect people's mental state. In health care, the analysis and processing of facial expression data can promote the improvement of people's health. This paper introduces several important public facial expression databases and describes the process of facial expression recognition. The standard facial expression database FER2013 and CK+ were used as the main training samples. At the same time, the facial expression image data of 16 Chinese children were collected as supplementary samples. With the help of VGG19 and Resnet18 algorithm models of deep convolution neural network, this paper studies and develops an information system for the diagnosis of autism by facial expression data.

Design/methodology/approach

The facial expression data of the training samples are based on the standard expression database FER2013 and CK+. FER2013 and CK+ databases are a common facial expression data set, which is suitable for the research of facial expression recognition. On the basis of FER2013 and CK+ facial expression database, this paper uses the machine learning model support vector machine (SVM) and deep convolution neural network model CNN, VGG19 and Resnet18 to complete the facial expression recognition.

Findings

In this study, ten normal children and ten autistic patients were recruited to test the accuracy of the information system and the diagnostic effect of autism. After testing, the accuracy rate of facial expression recognition is 81.4 percent. This information system can easily identify autistic children. The feasibility of recognizing autism through facial expression is verified.

Research limitations/implications

The CK+ facial expression database contains some adult facial expression images. In order to improve the accuracy of facial expression recognition for children, more facial expression data of children will be collected as training samples. Therefore, the recognition rate of the information system will be further improved.

Originality/value

This research uses facial expression data and the latest artificial intelligence technology, which is advanced in technology. The diagnostic accuracy of autism is higher than that of traditional systems, so this study is innovative. Research topics come from the actual needs of doctors, and the contents and methods of research have been discussed with doctors many times. The system can diagnose autism as early as possible, promote the early treatment and rehabilitation of patients, and then reduce the economic and mental burden of patients. Therefore, this information system has good social benefits and application value.

Details

Library Hi Tech, vol. 38 no. 4
Type: Research Article
ISSN: 0737-8831

Keywords

Article
Publication date: 21 December 2021

Shadrack Fred Mahenge and Ala Alsanabani

In the purpose of the section, the cracks that are in the construction domain may be common and usually fixed with the human inspection which is at the visible range, but for the…

Abstract

Purpose

In the purpose of the section, the cracks that are in the construction domain may be common and usually fixed with the human inspection which is at the visible range, but for the cracks which may exist at the distant place for the human eye in the same building but can be captured with the camera. If the crack size is quite big can be visible but few cracks will be present due to the flaws in the construction of walls which needs authentic information and confirmation about it for the successful completion of the wall cracks, as these cracks in the wall will result in the structure collapse.

Design/methodology/approach

In the modern era of digital image processing, it has captured the importance in all the domain of engineering and all the fields irrespective of the division of the engineering, hence, in this research study an attempt is made to deal with the wall cracks which are found or searched during the building inspection process, in the present context in association with the unique U-net architecture is used with convolutional neural network method.

Findings

In the construction domain, the cracks may be common and usually fixed with the human inspection which is at the visible range, but for the cracks which may exist at the distant place for the human eye in the same building but can be captured with the camera. If the crack size is quite big can be visible but few cracks will be present due to the flaws in the construction of walls which needs authentic information and confirmation about it for the successful completion of the wall cracks, as these cracks in the wall will result in the structure collapse. Hence, for the modeling of the proposed system, it is considered with the image database from the Mendeley portal for the analysis. With the experimental analysis, it is noted and observed that the proposed system was able to detect the wall cracks, search the flat surface by the result of no cracks found and it is successful in dealing with the two phases of operation, namely, classification and segmentation with the deep learning technique. In contrast to other conventional methodologies, the proposed methodology produces excellent performance results.

Originality/value

The originality of the paper is to find the portion of the cracks on the walls using deep learning architecture.

Details

International Journal of Pervasive Computing and Communications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1742-7371

Keywords

Article
Publication date: 15 January 2024

Faris Elghaish, Sandra Matarneh, Essam Abdellatef, Farzad Rahimian, M. Reza Hosseini and Ahmed Farouk Kineber

Cracks are prevalent signs of pavement distress found on highways globally. The use of artificial intelligence (AI) and deep learning (DL) for crack detection is increasingly…

Abstract

Purpose

Cracks are prevalent signs of pavement distress found on highways globally. The use of artificial intelligence (AI) and deep learning (DL) for crack detection is increasingly considered as an optimal solution. Consequently, this paper introduces a novel, fully connected, optimised convolutional neural network (CNN) model using feature selection algorithms for the purpose of detecting cracks in highway pavements.

Design/methodology/approach

To enhance the accuracy of the CNN model for crack detection, the authors employed a fully connected deep learning layers CNN model along with several optimisation techniques. Specifically, three optimisation algorithms, namely adaptive moment estimation (ADAM), stochastic gradient descent with momentum (SGDM), and RMSProp, were utilised to fine-tune the CNN model and enhance its overall performance. Subsequently, the authors implemented eight feature selection algorithms to further improve the accuracy of the optimised CNN model. These feature selection techniques were thoughtfully selected and systematically applied to identify the most relevant features contributing to crack detection in the given dataset. Finally, the authors subjected the proposed model to testing against seven pre-trained models.

Findings

The study's results show that the accuracy of the three optimisers (ADAM, SGDM, and RMSProp) with the five deep learning layers model is 97.4%, 98.2%, and 96.09%, respectively. Following this, eight feature selection algorithms were applied to the five deep learning layers to enhance accuracy, with particle swarm optimisation (PSO) achieving the highest F-score at 98.72. The model was then compared with other pre-trained models and exhibited the highest performance.

Practical implications

With an achieved precision of 98.19% and F-score of 98.72% using PSO, the developed model is highly accurate and effective in detecting and evaluating the condition of cracks in pavements. As a result, the model has the potential to significantly reduce the effort required for crack detection and evaluation.

Originality/value

The proposed method for enhancing CNN model accuracy in crack detection stands out for its unique combination of optimisation algorithms (ADAM, SGDM, and RMSProp) with systematic application of multiple feature selection techniques to identify relevant crack detection features and comparing results with existing pre-trained models.

Details

Smart and Sustainable Built Environment, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2046-6099

Keywords

Article
Publication date: 8 February 2021

Adireddy Rajasekhar Reddy and Appini Narayana Rao

In modern technology, the wireless sensor networks (WSNs) are generally most promising solutions for better reliability, object tracking, remote monitoring and more, which is…

Abstract

Purpose

In modern technology, the wireless sensor networks (WSNs) are generally most promising solutions for better reliability, object tracking, remote monitoring and more, which is directly related to the sensor nodes. Received signal strength indication (RSSI) is main challenges in sensor networks, which is fully depends on distance measurement. The learning algorithm based traditional models are involved in error correction, distance measurement and improve the accuracy of effectiveness. But, most of the existing models are not able to protect the user’s data from the unknown or malicious data during the signal transmission. The simulation outcomes indicate that proposed methodology may reach more constant and accurate position states of the unknown nodes and the target node in WSNs domain than the existing methods.

Design/methodology/approach

This paper present a deep convolutional neural network (DCNN) from the adaptation of machine learning to identify the problems on deep ranging sensor networks and overthrow the problems of unknown sensor nodes localization in WSN networks by using instance parameters of elephant herding optimization (EHO) technique and which is used to optimize the localization problem.

Findings

In this proposed method, the signal propagation properties can be extracted automatically because of this image data and RSSI data values. Rest of this manuscript shows that the ECO can find the better performance analysis of distance estimation accuracy, localized nodes and its transmission range than those traditional algorithms. ECO has been proposed as one of the main tools to promote a transformation from unsustainable development to one of sustainable development. It will reduce the material intensity of goods and services.

Originality/value

The proposed technique is compared to existing systems to show the proposed method efficiency. The simulation results indicate that this proposed methodology can achieve more constant and accurate position states of the unknown nodes and the target node in WSNs domain than the existing methods.

Details

International Journal of Pervasive Computing and Communications, vol. 18 no. 2
Type: Research Article
ISSN: 1742-7371

Keywords

Article
Publication date: 22 March 2022

Shiva Sumanth Reddy and C. Nandini

The present research work is carried out for determining haemoprotozoan diseases in cattle and breast cancer diseases in humans at early stage. The combination of LeNet and…

Abstract

Purpose

The present research work is carried out for determining haemoprotozoan diseases in cattle and breast cancer diseases in humans at early stage. The combination of LeNet and bidirectional long short-term memory (Bi-LSTM) model is used for the classification of heamoprotazoan samples into three classes such as theileriosis, babesiosis and anaplasmosis. Also, BreaKHis dataset image samples are classified into two major classes as malignant and benign. The hyperparameter optimization is used for selecting the prominent features. The main objective of this approach is to overcome the manual identification and classification of samples into different haemoprotozoan diseases in cattle. The traditional laboratory approach of identification is time-consuming and requires human expertise. The proposed methodology will help to identify and classify the heamoprotozoan disease in early stage without much of human involvement.

Design/methodology/approach

LeNet-based Bi-LSTM model is used for the classification of pathology images into babesiosis, anaplasmosis, theileriosis and breast images classified into malignant or benign. An optimization-based super pixel clustering algorithm is used for segmentation once the normalization of histopathology images is conducted. The edge information in the normalized images is considered for identifying the irregular shape regions of images, which are structurally meaningful. Also, it is compared with another segmentation approach circular Hough Transform (CHT). The CHT is used to separate the nuclei from non-nuclei. The Canny edge detection and gaussian filter is used for extracting the edges before sending to CHT.

Findings

The existing methods such as artificial neural network (ANN), convolution neural network (CNN), recurrent neural network (RNN), LSTM and Bi-LSTM model have been compared with the proposed hyperparameter optimization approach with LeNET and Bi-LSTM. The results obtained by the proposed hyperparameter optimization-Bi-LSTM model showed the accuracy of 98.99% when compared to existing models like Ensemble of Deep Learning Models of 95.29% and Modified ReliefF Algorithm of 95.94%.

Originality/value

In contrast to earlier research done using Modified ReliefF, the suggested LeNet with Bi-LSTM model, there is an improvement in accuracy, precision and F-score significantly. The real time data set is used for the heamoprotozoan disease samples. Also, for anaplasmosis and babesiosis, the second set of datasets were used which are coloured datasets obtained by adding a chemical acetone and stain.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 16 no. 4
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 14 May 2020

Minghua Wei

In order to solve the problem that the performance of the existing local feature descriptors in uncontrolled environment is greatly affected by illumination, background, occlusion…

135

Abstract

Purpose

In order to solve the problem that the performance of the existing local feature descriptors in uncontrolled environment is greatly affected by illumination, background, occlusion and other factors, we propose a novel face recognition algorithm in uncontrolled environment which combines the block central symmetry local binary pattern (CS-LBP) and deep residual network (DRN) model.

Design/methodology/approach

The algorithm first extracts the block CSP-LBP features of the face image, then incorporates the extracted features into the DRN model, and gives the face recognition results by using a well-trained DRN model. The features obtained by the proposed algorithm have the characteristics of both local texture features and deep features that robust to illumination.

Findings

Compared with the direct usage of the original image, the usage of local texture features of the image as the input of DRN model significantly improves the computation efficiency. Experimental results on the face datasets of FERET, YALE-B and CMU-PIE have shown that the recognition rate of the proposed algorithm is significantly higher than that of other compared algorithms.

Originality/value

The proposed algorithm fundamentally solves the problem of face identity recognition in uncontrolled environment, and it is particularly robust to the change of illumination, which proves its superiority.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 13 no. 2
Type: Research Article
ISSN: 1756-378X

Keywords

Book part
Publication date: 25 October 2023

Md Sakib Ullah Sourav, Huidong Wang, Mohammad Raziuddin Chowdhury and Rejwan Bin Sulaiman

One of the most neglected sources of energy loss is streetlights that generate too much light in areas where it is not required. Energy waste has enormous economic and…

Abstract

One of the most neglected sources of energy loss is streetlights that generate too much light in areas where it is not required. Energy waste has enormous economic and environmental effects. In addition, due to the conventional manual nature of operation, streetlights are frequently seen being turned ‘ON’ during the day and ‘OFF’ in the evening, which is regrettable even in the twenty-first century. These issues require automated streetlight control in order to be resolved. This study aims to develop a novel streetlight controlling method by combining a smart transport monitoring system powered by computer vision technology with a closed circuit television (CCTV) camera that allows the light-emitting diode (LED) streetlight to automatically light up with the appropriate brightness by detecting the presence of pedestrians or vehicles and dimming the streetlight in their absence using semantic image segmentation from the CCTV video streaming. Consequently, our model distinguishes daylight and nighttime, which made it feasible to automate the process of turning the streetlight ‘ON’ and ‘OFF’ to save energy consumption costs. According to the aforementioned approach, geo-location sensor data could be utilised to make more informed streetlight management decisions. To complete the tasks, we consider training the U-net model with ResNet-34 as its backbone. Validity of the models is guaranteed with the use of assessment matrices. The suggested concept is straightforward, economical, energy-efficient, long-lasting and more resilient than conventional alternatives.

Details

Technology and Talent Strategies for Sustainable Smart Cities
Type: Book
ISBN: 978-1-83753-023-6

Keywords

Article
Publication date: 6 June 2022

Guoyang Wan, Fudong Li, Bingyou Liu, Shoujun Bai, Guofeng Wang and Kaisheng Xing

This paper aims to study six degrees-of-freedom (6DOF) pose measurement of reflective metal casts by machine vision, analyze the problems existing in the positioning of metal…

Abstract

Purpose

This paper aims to study six degrees-of-freedom (6DOF) pose measurement of reflective metal casts by machine vision, analyze the problems existing in the positioning of metal casts by stereo vision sensor in unstructured environment and put forward the visual positioning and grasping strategy that can be used in industrial robot cell.

Design/methodology/approach

A multikeypoints detection network Binocular Attention Hourglass Net is constructed, which can complete the two-dimensional positioning of the left and right cameras of the stereo vision system at the same time and provide reconstruction information for three-dimensional pose measurement. Generate adversarial networks is introduced to enhance the image of local feature area of object surface, and the three-dimensional pose measurement of object is completed by combining RANSAC ellipse fitting algorithm and triangulation method.

Findings

The proposed method realizes the high-precision 6DOF positioning and grasping of reflective metal casts by industrial robots; it has been applied in many fields and solves the problem of difficult visual measurement of reflective casts. The experimental results show that the system exhibits superior recognition performance, which meets the requirements of the grasping task.

Research limitations/implications

Because of the chosen research approach, the research results may lack generalizability. The proposed method is more suitable for objects with plane positioning features.

Originality/value

This paper realizes the 6DOF pose measurement of reflective casts by vision system, and solves the problem of positioning and grasping such objects by industrial robot.

Details

Assembly Automation, vol. 42 no. 4
Type: Research Article
ISSN: 0144-5154

Keywords

1 – 10 of 496