Zhengzheng Tu
A novel domain activation mapping-guided network (DA-GNT) for visual tracking
Tu, Zhengzheng; Zhou, Ajian; Gan, Chuang; Jiang, Bo; Hussain, Amir; Luo, Bin
Abstract
Conventional convolution neural network (CNN)-based visual trackers are easily influenced by too much background information in candidate samples. Further, extreme imbalance of foreground and background samples has a negative impact on training the classifier, whereas features learned from limited data are insufficient to train the classifier. To address these problems, we propose a novel deep neural network for visual tracking, termed the domain activation mapping guided network (DA-GNT). First, we introduce the class activation mapping with weakly supervised localization in multi-domain to identify the most discriminative regions in the bounding box and suppress the background in the positive sample. Next, to further increase the discriminability of deep feature representation, we utilize an ensemble network to achieve a kind of multi-view feature representation and a channel attention mechanism for adaptive feature selection. Finally, we propose a simple but effective data augmentation method to further increase the positive samples for our network training. Extensive experiments on two widely used benchmark datasets demonstrate the effectiveness of the proposed tracking method against many state-of-the-art trackers. The novel DA-GNT is thus posited as a potential benchmark resource for the computer vision and machine learning research community.
Journal Article Type | Article |
---|---|
Acceptance Date | Mar 22, 2021 |
Online Publication Date | Mar 26, 2021 |
Publication Date | 2021-08 |
Deposit Date | Jun 17, 2021 |
Journal | Neurocomputing |
Print ISSN | 0925-2312 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 449 |
Pages | 443-454 |
DOI | https://doi.org/10.1016/j.neucom.2021.03.056 |
Keywords | Visual tracking, Deep neural networks, Weakly supervised localization, Attention mechanism, Data augmentation |
Public URL | http://researchrepository.napier.ac.uk/Output/2781116 |
You might also like
Applications of Deep Learning and Reinforcement Learning to Biological Data
(2018)
Journal Article
Guided Policy Search for Sequential Multitask Learning
(2018)
Journal Article
Learning Latent Features With Infinite Nonnegative Binary Matrix Trifactorization
(2018)
Journal Article
Cross-modality interactive attention network for multispectral pedestrian detection
(2018)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search