Z. Tu
A New Spatio-Temporal Saliency-Based Video Object Segmentation
Tu, Z.; Abel, A.; Zhang, L.; Luo, B.; Hussain, A.
Abstract
Humans and animals are able to segment visual scenes by having the natural cognitive ability to quickly identify salient objects in both static and dynamic scenes. In this paper, we present a new spatio-temporal-based approach to video object segmentation that considers both motion- and image-based saliency to produce a weighted approach which can segment both static and dynamic objects. We perform fast optical flow and then calculate the motion saliency based on this temporal information, detecting the presence of global motion and adjusting the initial optical flow results accordingly. This is then fused with a region-based contrast image saliency method, with both techniques weighted. Finally, our joint weighted saliency map is used as part of a foreground–background labelling approach to produce the final segmented video files. Good results in a wide range of environments are presented, showing that our spatio-temporal system is more robust and consistent than a number of other state-of-the-art approaches.
Citation
Tu, Z., Abel, A., Zhang, L., Luo, B., & Hussain, A. (2016). A New Spatio-Temporal Saliency-Based Video Object Segmentation. Cognitive Computation, 8(4), 629-647. https://doi.org/10.1007/s12559-016-9387-7
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 19, 2016 |
Online Publication Date | Mar 8, 2016 |
Publication Date | 2016-08 |
Deposit Date | Oct 7, 2019 |
Journal | Cognitive Computation |
Print ISSN | 1866-9956 |
Electronic ISSN | 1866-9964 |
Publisher | BMC |
Peer Reviewed | Peer Reviewed |
Volume | 8 |
Issue | 4 |
Pages | 629-647 |
DOI | https://doi.org/10.1007/s12559-016-9387-7 |
Keywords | Video object segmentation, Global motion, Spatio-temporal saliency, Foreground–background labelling |
Public URL | http://researchrepository.napier.ac.uk/Output/1792582 |
You might also like
Guided Policy Search for Sequential Multitask Learning
(2018)
Journal Article
Learning Latent Features With Infinite Nonnegative Binary Matrix Trifactorization
(2018)
Journal Article
Cross-modality interactive attention network for multispectral pedestrian detection
(2018)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search