Arfan Ghani
An Optimised CNN Hardware Accelerator Applicable to IoT End Nodes for Disruptive Healthcare
Ghani, Arfan; Aina, Akinyemi; Hwang See, Chan
Abstract
In the evolving landscape of computer vision, the integration of machine learning algorithms with cutting-edge hardware platforms is increasingly pivotal, especially in the context of disruptive healthcare systems. This study introduces an optimized implementation of a Convolutional Neural Network (CNN) on the Basys3 FPGA, designed specifically for accelerating the classification of cytotoxicity in human kidney cells. Addressing the challenges posed by constrained dataset sizes, compute-intensive AI algorithms, and hardware limitations, the approach presented in this paper leverages efficient image augmentation and pre-processing techniques to enhance both prediction accuracy and the training efficiency. The CNN, quantized to 8-bit precision and tailored for the FPGA’s resource constraints, significantly accelerates training by a factor of three while consuming only 1.33% of the power compared to a traditional software-based CNN running on an NVIDIA K80 GPU. The network architecture, composed of seven layers with excessive hyperparameters, processes downscale grayscale images, achieving notable gains in speed and energy efficiency. A cornerstone of our methodology is the emphasis on parallel processing, data type optimization, and reduced logic space usage through 8-bit integer operations. We conducted extensive image pre-processing, including histogram equalization and artefact removal, to maximize feature extraction from the augmented dataset. Achieving an accuracy of approximately 91% on unseen images, this FPGA-implemented CNN demonstrates the potential for rapid, low-power medical diagnostics within a broader IoT ecosystem where data could be assessed online. This work underscores the feasibility of deploying resource-efficient AI models in environments where traditional high-performance computing resources are unavailable, typically in healthcare settings, paving the way for and contributing to advanced computer vision techniques in embedded systems.
Citation
Ghani, A., Aina, A., & Hwang See, C. (2024). An Optimised CNN Hardware Accelerator Applicable to IoT End Nodes for Disruptive Healthcare. IoT, 5(4), 901-921. https://doi.org/10.3390/iot5040041
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 3, 2024 |
Online Publication Date | Dec 6, 2024 |
Publication Date | 2024 |
Deposit Date | Dec 5, 2024 |
Publicly Available Date | Dec 6, 2024 |
Journal | IoT |
Electronic ISSN | 2624-831X |
Publisher | MDPI |
Peer Reviewed | Peer Reviewed |
Volume | 5 |
Issue | 4 |
Pages | 901-921 |
DOI | https://doi.org/10.3390/iot5040041 |
Keywords | computer vision; applied artificial intelligence; IoT for healthcare; CNN; integer-based architecture |
Files
An Optimised CNN Hardware Accelerator Applicable to IoT End Nodes for Disruptive Healthcare
(12.6 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
Compact reconfigurable MIMO antenna for 5G and Wi-Fi applications
(2024)
Journal Article
Editorial on Antennas
(2023)
Journal Article
Enhancement of Precise Underwater Object Localization
(2023)
Journal Article
Dual-Band MIMO Antenna Design for 5G Smartphones Mobile Communications
(2023)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search