Muhammad Usman Yaseen
Spatial frequency based video stream analysis for object classification and recognition in clouds
Yaseen, Muhammad Usman; Anjum, Ashiq; Antonopoulos, Nick
Authors
Ashiq Anjum
Prof Nick Antonopoulos N.Antonopoulos@napier.ac.uk
Deputy Vice Chancellor and Vice Principal of Research & Innovation
Abstract
The recent rise in multimedia technology has made it easier to perform a number of tasks. One of these tasks is monitoring where cheap cameras are producing large amount of video data. This video data is then processed for object classification to extract useful information. However, the video data obtained by these cheap cameras is often of low quality and results in blur video content. Moreover, various illumination effects caused by lightning conditions also degrade the video quality. These effects present severe challenges for object classification. We present a cloud-based blur and illumination invariant approach for object classification from images and video data. The bi-dimensional empirical mode decomposition (BEMD) has been adopted to decompose a video frame into intrinsic mode functions (IMFs). These IMFs further undergo to first order Reisz transform to generate monogenic video frames. The analysis of each IMF has been carried out by observing its local properties (amplitude, phase and orientation) generated from each monogenic video frame. We propose a stack based hierarchy of local pattern features generated from the amplitudes of each IMF which results in blur and illumination invariant object classification. The extensive experimentation on video streams as well as publically available image datasets reveals that our system achieves high accuracy from 0.97 to 0.91 for increasing Gaussian blur ranging from 0.5 to 5 and outperforms state of the art techniques under uncontrolled conditions. The system also proved to be scalable with high throughput when tested on a number of video streams using cloud infrastructure.
Citation
Yaseen, M. U., Anjum, A., & Antonopoulos, N. (2016). Spatial frequency based video stream analysis for object classification and recognition in clouds. In Proceedings of the 3rd IEEE/ACM conference on Big Data Computing, Applications and Technologieshttps://doi.org/10.1145/3006299.3006322
Conference Name | 3rd IEEE/ACM conference on Big Data Computing, Applications and Technologies |
---|---|
Conference Location | Shanghai, China |
Start Date | Dec 6, 2016 |
End Date | Dec 9, 2016 |
Acceptance Date | Dec 6, 2016 |
Publication Date | Dec 6, 2016 |
Deposit Date | Feb 12, 2019 |
Journal | Proceedings of the 3rd IEEE/ACM conference on Big Data Computing, Applications and Technologies |
Publisher | Association for Computing Machinery (ACM) |
Book Title | Proceedings of the 3rd IEEE/ACM conference on Big Data Computing, Applications and Technologies |
ISBN | 9781450346177 |
DOI | https://doi.org/10.1145/3006299.3006322 |
Keywords | Empirical Mode Decomposition; Local Ternary Patterns; Riesz Transform; Amplitude Spectrum, Cloud Computing, Big Data Analytics, Object Classification |
Public URL | http://researchrepository.napier.ac.uk/Output/1557175 |
You might also like
An Ensemble Neural Model for Classification of LADA Diabetes Case, Control and Variable Importance
(2022)
Conference Proceeding
A survey on system level energy optimisation for MPSoCs in IoT and consumer electronics
(2021)
Journal Article
Latency-Based Analytic Approach to Forecast Cloud Workload Trend for Sustainable Datacenters
(2019)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search