To read this content please select one of the options below:

Auto-attentional mechanism in multi-domain convolutional neural networks for improving object tracking

Jinchao Huang (College of Mathematics and Information Engineering, Longyan University, Longyan, China)

International Journal of Intelligent Computing and Cybernetics

ISSN: 1756-378X

Article publication date: 30 August 2021

Issue publication date: 2 February 2022

4011

Abstract

Purpose

Multi-domain convolutional neural network (MDCNN) model has been widely used in object recognition and tracking in the field of computer vision. However, if the objects to be tracked move rapid or the appearances of moving objects vary dramatically, the conventional MDCNN model will suffer from the model drift problem. To solve such problem in tracking rapid objects under limiting environment for MDCNN model, this paper proposed an auto-attentional mechanism-based MDCNN (AA-MDCNN) model for the rapid moving and changing objects tracking under limiting environment.

Design/methodology/approach

First, to distinguish the foreground object between background and other similar objects, the auto-attentional mechanism is used to selectively aggregate the weighted summation of all feature maps to make the similar features related to each other. Then, the bidirectional gated recurrent unit (Bi-GRU) architecture is used to integrate all the feature maps to selectively emphasize the importance of the correlated feature maps. Finally, the final feature map is obtained by fusion the above two feature maps for object tracking. In addition, a composite loss function is constructed to solve the similar but different attribute sequences tracking using conventional MDCNN model.

Findings

In order to validate the effectiveness and feasibility of the proposed AA-MDCNN model, this paper used ImageNet-Vid dataset to train the object tracking model, and the OTB-50 dataset is used to validate the AA-MDCNN tracking model. Experimental results have shown that the augmentation of auto-attentional mechanism will improve the accuracy rate 2.75% and success rate 2.41%, respectively. In addition, the authors also selected six complex tracking scenarios in OTB-50 dataset; over eleven attributes have been validated that the proposed AA-MDCNN model outperformed than the comparative models over nine attributes. In addition, except for the scenario of multi-objects moving with each other, the proposed AA-MDCNN model solved the majority rapid moving objects tracking scenarios and outperformed than the comparative models on such complex scenarios.

Originality/value

This paper introduced the auto-attentional mechanism into MDCNN model and adopted Bi-GRU architecture to extract key features. By using the proposed AA-MDCNN model, rapid object tracking under complex background, motion blur and occlusion objects has better effect, and such model is expected to be further applied to the rapid object tracking in the real world.

Keywords

Acknowledgements

Funding: This work was supported by the Education and Scientific Research Project for Young and Middle-aged Teachers in Fujian Province (No. JAT200581).

Citation

Huang, J. (2022), "Auto-attentional mechanism in multi-domain convolutional neural networks for improving object tracking", International Journal of Intelligent Computing and Cybernetics, Vol. 15 No. 1, pp. 41-60. https://doi.org/10.1108/IJICC-04-2021-0067

Publisher

:

Emerald Publishing Limited

Copyright © 2021, Emerald Publishing Limited

Related articles