Search results

1 – 10 of over 4000
Article
Publication date: 26 September 2022

Hong Wang, Yong Xie, Shasha Tian, Lu Zheng, Xiaojie Dong and Yu Zhu

The purpose of the study is to address the problems of low accuracy and missed detection of occluded pedestrians and small target pedestrians when using the YOLOX general object…

Abstract

Purpose

The purpose of the study is to address the problems of low accuracy and missed detection of occluded pedestrians and small target pedestrians when using the YOLOX general object detection algorithm for pedestrian detection. This study proposes a multi-level fine-grained YOLOX pedestrian detection algorithm.

Design/methodology/approach

First, to address the problem of the original YOLOX algorithm in obtaining a single perceptual field for the feature map before feature fusion, this study improves the PAFPN structure by adding the ResCoT module to increase the diversity of the perceptual field of the feature map and divides the pedestrian multi-scale features into finer granularity. Second, for the CSPLayer of the PAFPN, a weight gain-based normalization-based attention module (NAM) is proposed to make the model pay more attention to the context information when extracting pedestrian features and highlight the salient features of pedestrians. Finally, the authors experimentally determined the optimal values for the confidence loss function.

Findings

The experimental results show that, compared with the original YOLOX algorithm, the AP of the improved algorithm increased by 2.90%, the Recall increased by 3.57%, and F1 increased by 2% on the pedestrian dataset.

Research limitations/implications

The multi-level fine-grained YOLOX pedestrian detection algorithm can effectively improve the detection of occluded pedestrians and small target pedestrians.

Originality/value

The authors introduce a multi-level fine-grained ResCoT module and a weight gain-based NAM attention module.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 16 no. 2
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 22 April 2024

Ruoxi Zhang and Chenhan Ren

This study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.

Abstract

Purpose

This study aims to construct a sentiment series generation method for danmu comments based on deep learning, and explore the features of sentiment series after clustering.

Design/methodology/approach

This study consisted of two main parts: danmu comment sentiment series generation and clustering. In the first part, the authors proposed a sentiment classification model based on BERT fine-tuning to quantify danmu comment sentiment polarity. To smooth the sentiment series, they used methods, such as comprehensive weights. In the second part, the shaped-based distance (SBD)-K-shape method was used to cluster the actual collected data.

Findings

The filtered sentiment series or curves of the microfilms on the Bilibili website could be divided into four major categories. There is an apparently stable time interval for the first three types of sentiment curves, while the fourth type of sentiment curve shows a clear trend of fluctuation in general. In addition, it was found that “disputed points” or “highlights” are likely to appear at the beginning and the climax of films, resulting in significant changes in the sentiment curves. The clustering results show a significant difference in user participation, with the second type prevailing over others.

Originality/value

Their sentiment classification model based on BERT fine-tuning outperformed the traditional sentiment lexicon method, which provides a reference for using deep learning as well as transfer learning for danmu comment sentiment analysis. The BERT fine-tuning–SBD-K-shape algorithm can weaken the effect of non-regular noise and temporal phase shift of danmu text.

Details

The Electronic Library , vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 5 May 2023

Nguyen Thi Dinh, Nguyen Thi Uyen Nhi, Thanh Manh Le and Thanh The Van

The problem of image retrieval and image description exists in various fields. In this paper, a model of content-based image retrieval and image content extraction based on the…

Abstract

Purpose

The problem of image retrieval and image description exists in various fields. In this paper, a model of content-based image retrieval and image content extraction based on the KD-Tree structure was proposed.

Design/methodology/approach

A Random Forest structure was built to classify the objects on each image on the basis of the balanced multibranch KD-Tree structure. From that purpose, a KD-Tree structure was generated by the Random Forest to retrieve a set of similar images for an input image. A KD-Tree structure is applied to determine a relationship word at leaves to extract the relationship between objects on an input image. An input image content is described based on class names and relationships between objects.

Findings

A model of image retrieval and image content extraction was proposed based on the proposed theoretical basis; simultaneously, the experiment was built on multi-object image datasets including Microsoft COCO and Flickr with an average image retrieval precision of 0.9028 and 0.9163, respectively. The experimental results were compared with those of other works on the same image dataset to demonstrate the effectiveness of the proposed method.

Originality/value

A balanced multibranch KD-Tree structure was built to apply to relationship classification on the basis of the original KD-Tree structure. Then, KD-Tree Random Forest was built to improve the classifier performance and retrieve a set of similar images for an input image. Concurrently, the image content was described in the process of combining class names and relationships between objects.

Details

Data Technologies and Applications, vol. 57 no. 4
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 24 July 2023

Haonan Fan, Qin Dong and Naixuan Guo

This paper aims to propose a classification method for steel strip surface defects based on a mixed attention mechanism to achieve fast and accurate classification performance…

Abstract

Purpose

This paper aims to propose a classification method for steel strip surface defects based on a mixed attention mechanism to achieve fast and accurate classification performance. The traditional method of classifying surface defects of hot-rolled steel strips has the problems of low recognition accuracy and low efficiency in the industrial complex production environment.

Design/methodology/approach

The authors selected min–max scaling comparison method to filter the training results of multiple network models on the steel strip surface defect data set. Then, the best comprehensive performance model EfficientNet-B0 was refined. Based on this, the authors proposed two mixed attention addition methods, which include squeeze-excitation spatial mixed module and multilayer mixed attention mechanism (MMAM) module, respectively.

Findings

With these two methods, the authors achieved 96.72% and 97.70% recognition accuracy on the steel strip data set after data augmentation for adapting to the complex production environment, respectively. Using the transfer learning method, the EfficientNet-B0 based on MMAM obtained 100% recognition accuracy.

Originality/value

This study not only focuses on improving the recognition accuracy of the network model itself but also considers other performance indicators of the network, which are rarely considered by many researchers. The authors further improve the intelligent production technique and address this issue. Both methods proposed in this paper can be applied to embedded equipment, which can effectively improve steel strip factory production efficiency and reduce material and time loss.

Details

Robotic Intelligence and Automation, vol. 43 no. 4
Type: Research Article
ISSN: 2754-6969

Keywords

Article
Publication date: 3 January 2023

Sangchul Park and Hyun-Woo Lee

Fitness service companies often promote the companies' personal training service by attributing trainers' competent characteristics, qualifications or/and service provision to…

Abstract

Purpose

Fitness service companies often promote the companies' personal training service by attributing trainers' competent characteristics, qualifications or/and service provision to their effort or talent. This promotion is called performance attribution promotion. Utilizing attribution theory and the theory's adjacent studies, this study investigated whether and why performance attribution promotion affects consumers' service purchase of personal fitness training.

Design/methodology/approach

The authors developed the experimental stimuli of performance attribution promotion and validated those through a pretest (N = 200). Using the validated stimuli, the authors conducted an experiment with employing a single factor between-subject design (performance attribution promotion: effort vs talent) based on random assignment (N = 200).

Findings

The analysis results revealed that attributing trainers' competent characteristics, qualifications or/and service provision to effort (vs talent) leads to a higher level of service registration intention. Moreover, this effect was mediated by the perceived teaching expertise but not by the perceived teaching trustworthiness.

Originality/value

These findings enrich the literature by illuminating a new mechanism and consequence of performance attribution promotion. The authors' study also extends the marketing studies related to expertise perception by presenting the attribution of visible features as one of the characteristics determining expertise perception. Finally, the authors' findings also have implications for fitness service companies and other stakeholders that seek to effectively leverage trainers' competent outcomes for consumer acquisition.

Details

International Journal of Sports Marketing and Sponsorship, vol. 24 no. 2
Type: Research Article
ISSN: 1464-6668

Keywords

Article
Publication date: 19 January 2024

Ping Huang, Haitao Ding, Hong Chen, Jianwei Zhang and Zhenjia Sun

The growing availability of naturalistic driving datasets (NDDs) presents a valuable opportunity to develop various models for autonomous driving. However, while current NDDs…

Abstract

Purpose

The growing availability of naturalistic driving datasets (NDDs) presents a valuable opportunity to develop various models for autonomous driving. However, while current NDDs include data on vehicles with and without intended driving behavior changes, they do not explicitly demonstrate a type of data on vehicles that intend to change their driving behavior but do not execute the behaviors because of safety, efficiency, or other factors. This missing data is essential for autonomous driving decisions. This study aims to extract the driving data with implicit intentions to support the development of decision-making models.

Design/methodology/approach

According to Bayesian inference, drivers who have the same intended changes likely share similar influencing factors and states. Building on this principle, this study proposes an approach to extract data on vehicles that intended to execute specific behaviors but failed to do so. This is achieved by computing driving similarities between the candidate vehicles and benchmark vehicles with incorporation of the standard similarity metrics, which takes into account information on the surrounding vehicles' location topology and individual vehicle motion states. By doing so, the method enables a more comprehensive analysis of driving behavior and intention.

Findings

The proposed method is verified on the Next Generation SIMulation dataset (NGSim), which confirms its ability to reveal similarities between vehicles executing similar behaviors during the decision-making process in nature. The approach is also validated using simulated data, achieving an accuracy of 96.3 per cent in recognizing vehicles with specific driving behavior intentions that are not executed.

Originality/value

This study provides an innovative approach to extract driving data with implicit intentions and offers strong support to develop data-driven decision-making models for autonomous driving. With the support of this approach, the development of autonomous vehicles can capture more real driving experience from human drivers moving towards a safer and more efficient future.

Details

Data Technologies and Applications, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 16 December 2022

Kinjal Bhargavkumar Mistree, Devendra Thakor and Brijesh Bhatt

According to the Indian Sign Language Research and Training Centre (ISLRTC), India has approximately 300 certified human interpreters to help people with hearing loss. This paper…

Abstract

Purpose

According to the Indian Sign Language Research and Training Centre (ISLRTC), India has approximately 300 certified human interpreters to help people with hearing loss. This paper aims to address the issue of Indian Sign Language (ISL) sentence recognition and translation into semantically equivalent English text in a signer-independent mode.

Design/methodology/approach

This study presents an approach that translates ISL sentences into English text using the MobileNetV2 model and Neural Machine Translation (NMT). The authors have created an ISL corpus from the Brown corpus using ISL grammar rules to perform machine translation. The authors’ approach converts ISL videos of the newly created dataset into ISL gloss sequences using the MobileNetV2 model and the recognized ISL gloss sequence is then fed to a machine translation module that generates an English sentence for each ISL sentence.

Findings

As per the experimental results, pretrained MobileNetV2 model was proven the best-suited model for the recognition of ISL sentences and NMT provided better results than Statistical Machine Translation (SMT) to convert ISL text into English text. The automatic and human evaluation of the proposed approach yielded accuracies of 83.3 and 86.1%, respectively.

Research limitations/implications

It can be seen that the neural machine translation systems produced translations with repetitions of other translated words, strange translations when the total number of words per sentence is increased and one or more unexpected terms that had no relation to the source text on occasion. The most common type of error is the mistranslation of places, numbers and dates. Although this has little effect on the overall structure of the translated sentence, it indicates that the embedding learned for these few words could be improved.

Originality/value

Sign language recognition and translation is a crucial step toward improving communication between the deaf and the rest of society. Because of the shortage of human interpreters, an alternative approach is desired to help people achieve smooth communication with the Deaf. To motivate research in this field, the authors generated an ISL corpus of 13,720 sentences and a video dataset of 47,880 ISL videos. As there is no public dataset available for ISl videos incorporating signs released by ISLRTC, the authors created a new video dataset and ISL corpus.

Details

International Journal of Intelligent Computing and Cybernetics, vol. 16 no. 3
Type: Research Article
ISSN: 1756-378X

Keywords

Article
Publication date: 25 September 2023

Emelia Ohene Afriyie, Yan Jin, Mariama Yakubu and Iddrisu Awudu

This study aims to examine the influence of training and development (TAD), including performance appraisal (PFA), on the performance of microfinance institutions in a developing…

Abstract

Purpose

This study aims to examine the influence of training and development (TAD), including performance appraisal (PFA), on the performance of microfinance institutions in a developing economy.

Design/methodology/approach

A random sampling drew 100 microfinance institutions in Ghana's Greater Accra Region. Then, a purposive sampling approach selected a cross-section of employees in these institutions. Finally, the data were collected from a sample of 367 respondents, such as managers, utilizing a survey questionnaire. Structural equation modeling (SEM) was used to test hypothesized relationships.

Findings

The study results indicate that PFA has a statistically significant positive relationship with organizational performance, and this relationship is partially mediated by job satisfaction (JBS). Interestingly, the TAD process does not have a statistically significant positive relationship with organizational performance when JBS is present in the model. In fact, JBS fully mediates the relationship between TAD and organizational performance.

Research limitations/implications

The study is limited to microfinance in tiers two and three in Greater Accra city of Ghana and did not include the entire country. Although the city of Accra provides a generalized representation of the research, which can be replicated, some variables and results may be impacted if other tiers of microfinance organizations are incorporated.

Practical implications

TAD, as well as PFA, enhance the performance of microfinance and can be utilized as tools for competitive advantage in small and medium-sized enterprises (SMEs) (e.g. microfinance institutions). The study accentuates the value of TAD, PFA and JBS in microfinance in a developing country like Ghana.

Originality/value

This is an original study investigating the effect of TAD and PFA practices on the performance of SMEs in a developing country like Ghana. Also, the study analyses JBS as a mediation variable to performance using SEM, which advances the research methodology in this research field.

Details

International Journal of Productivity and Performance Management, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 1741-0401

Keywords

Article
Publication date: 22 July 2022

Ying Tao Chai and Ting-Kwei Wang

Defects in concrete surfaces are inevitably recurring during construction, which needs to be checked and accepted during construction and completion. Traditional manual inspection…

Abstract

Purpose

Defects in concrete surfaces are inevitably recurring during construction, which needs to be checked and accepted during construction and completion. Traditional manual inspection of surface defects requires inspectors to judge, evaluate and make decisions, which requires sufficient experience and is time-consuming and labor-intensive, and the expertise cannot be effectively preserved and transferred. In addition, the evaluation standards of different inspectors are not identical, which may lead to cause discrepancies in inspection results. Although computer vision can achieve defect recognition, there is a gap between the low-level semantics acquired by computer vision and the high-level semantics that humans understand from images. Therefore, computer vision and ontology are combined to achieve intelligent evaluation and decision-making and to bridge the above gap.

Design/methodology/approach

Combining ontology and computer vision, this paper establishes an evaluation and decision-making framework for concrete surface quality. By establishing concrete surface quality ontology model and defect identification quantification model, ontology reasoning technology is used to realize concrete surface quality evaluation and decision-making.

Findings

Computer vision can identify and quantify defects, obtain low-level image semantics, and ontology can structurally express expert knowledge in the field of defects. This proposed framework can automatically identify and quantify defects, and infer the causes, responsibility, severity and repair methods of defects. Through case analysis of various scenarios, the proposed evaluation and decision-making framework is feasible.

Originality/value

This paper establishes an evaluation and decision-making framework for concrete surface quality, so as to improve the standardization and intelligence of surface defect inspection and potentially provide reusable knowledge for inspecting concrete surface quality. The research results in this paper can be used to detect the concrete surface quality, reduce the subjectivity of evaluation and improve the inspection efficiency. In addition, the proposed framework enriches the application scenarios of ontology and computer vision, and to a certain extent bridges the gap between the image features extracted by computer vision and the information that people obtain from images.

Details

Engineering, Construction and Architectural Management, vol. 30 no. 10
Type: Research Article
ISSN: 0969-9988

Keywords

Book part
Publication date: 24 April 2023

Yoonseok Lee and Donggyu Sul

The authors develop a novel forecast combination approach based on the order statistics of individual predictability from panel data forecasts. To this end, the authors define the…

Abstract

The authors develop a novel forecast combination approach based on the order statistics of individual predictability from panel data forecasts. To this end, the authors define the notion of forecast depth, which provides a ranking among different forecasts based on their normalized forecast errors during the training period. The forecast combination is in the form of a depth-weighted trimmed mean. The authors derive the limiting distribution of the depth-weighted forecast combination, based on which the authors can readily construct prediction intervals. Using this novel forecast combination, the authors predict the national level of new COVID-19 cases in the United States and compare it with other approaches including the ensemble forecast from the Centers for Disease Control and Prevention (CDC). The authors find that the depth-weighted forecast combination yields more accurate and robust predictions compared with other popular forecast combinations and reports much narrower prediction intervals.

Details

Essays in Honor of Joon Y. Park: Econometric Methodology in Empirical Applications
Type: Book
ISBN: 978-1-83753-212-4

Keywords

1 – 10 of over 4000