Aihua Zheng
Diverse features discovery transformer for pedestrian attribute recognition
Zheng, Aihua; Wang, Huimin; Wang, Jiaxiang; Huang, Huaibo; He, Ran; Hussain, Amir
Authors
Abstract
Recently, Swin Transformer has been widely explored as a general backbone for computer vision, which helps to improve the performance of vision tasks due to the ability to establish associations for long-range dependencies of different spatial locations. By implementing the pedestrian attribute recognition with Swin Transformer, we observe that Swin Transformer tends to focus on a relatively small number of local regions within which attributes may be correlated with other attributes, which leads Swin Transformer to predict attributes in those neglected regions based on such correlation. In fact, discriminative information may exist within these neglected regions, which is crucial for attribute identification. To address this problem, we propose a novel diverse features discovery transformer (DFDT) which can find more attribute relationship regions for robust pedestrian attribute recognition. First, Swin Transformer is used as a feature extraction network to acquire attribute features with the long-distance association, which predicts the corresponding attribute information. Second, we propose a diverse features suppression module (DFSM) to obtain semantic features directly associated with attributes by suppressing the peak locations of the most discriminative features and randomly selected feature regions to spread the feature regions that Swin Transformer is interested in. Third, we plug the diverse features suppression module into different stages of Swin Transformer to learn detailed texture features to help recognition. In addition, we have divided the attribute features into multiple vertical feature regions to improve the focus on local attribute features. Experiments on three benchmark datasets validate the effectiveness of the proposed algorithm.
Citation
Zheng, A., Wang, H., Wang, J., Huang, H., He, R., & Hussain, A. (2023). Diverse features discovery transformer for pedestrian attribute recognition. Engineering Applications of Artificial Intelligence, 119, Article 105708. https://doi.org/10.1016/j.engappai.2022.105708
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 1, 2022 |
Online Publication Date | Dec 26, 2022 |
Publication Date | 2023-03 |
Deposit Date | Feb 15, 2023 |
Publicly Available Date | Dec 27, 2023 |
Journal | Engineering Applications of Artificial Intelligence |
Print ISSN | 0952-1976 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 119 |
Article Number | 105708 |
DOI | https://doi.org/10.1016/j.engappai.2022.105708 |
Keywords | Pedestrian attribute recognition, Vision transformer, Features suppression |
Public URL | http://researchrepository.napier.ac.uk/Output/3020292 |
Files
Diverse Features Discovery Transformer For Pedestrian Attribute Recognition (accepted version)
(719 Kb)
PDF
You might also like
MTFDN: An image copy‐move forgery detection method based on multi‐task learning
(2024)
Journal Article
Transition-aware human activity recognition using an ensemble deep learning framework
(2024)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search