Arfan Ghani
An Optimised CNN Hardware Accelerator Applicable to IoT End Nodes for Disruptive Healthcare
Ghani, Arfan; Aina, Akinyemi; Hwang See, Chan
Abstract
In the evolving landscape of computer vision, the integration of machine learning algorithms with cutting-edge hardware platforms is increasingly pivotal, especially in the context of disruptive healthcare systems. This study introduces an optimized implementation of a Convolutional Neural Network (CNN) on the Basys3 FPGA, designed specifically for accelerating the classification of cytotoxicity in human kidney cells. Addressing the challenges posed by constrained dataset sizes, compute-intensive AI algorithms, and hardware limitations, the approach presented in this paper leverages efficient image augmentation and pre-processing techniques to enhance both prediction accuracy and the training efficiency. The CNN, quantized to 8-bit precision and tailored for the FPGA’s resource constraints, significantly accelerates training by a factor of three while consuming only 1.33% of the power compared to a traditional software-based CNN running on an NVIDIA K80 GPU. The network architecture, composed of seven layers with excessive hyperparameters, processes downscale grayscale images, achieving notable gains in speed and energy efficiency. A cornerstone of our methodology is the emphasis on parallel processing, data type optimization, and reduced logic space usage through 8-bit integer operations. We conducted extensive image pre-processing, including histogram equalization and artefact removal, to maximize feature extraction from the augmented dataset. Achieving an accuracy of approximately 91% on unseen images, this FPGA-implemented CNN demonstrates the potential for rapid, low-power medical diagnostics within a broader IoT ecosystem where data could be assessed online. This work underscores the feasibility of deploying resource-efficient AI models in environments where traditional high-performance computing resources are unavailable, typically in healthcare settings, paving the way for and contributing to advanced computer vision techniques in embedded systems.
Citation
Ghani, A., Aina, A., & Hwang See, C. (2024). An Optimised CNN Hardware Accelerator Applicable to IoT End Nodes for Disruptive Healthcare. IoT, 5(4), 901-921. https://doi.org/10.3390/iot5040041
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 3, 2024 |
Online Publication Date | Dec 6, 2024 |
Publication Date | 2024 |
Deposit Date | Dec 5, 2024 |
Publicly Available Date | Dec 6, 2024 |
Journal | IoT |
Electronic ISSN | 2624-831X |
Publisher | MDPI |
Peer Reviewed | Peer Reviewed |
Volume | 5 |
Issue | 4 |
Pages | 901-921 |
DOI | https://doi.org/10.3390/iot5040041 |
Keywords | computer vision; applied artificial intelligence; IoT for healthcare; CNN; integer-based architecture |
Files
An Optimised CNN Hardware Accelerator Applicable to IoT End Nodes for Disruptive Healthcare
(12.6 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
An Overview of THz Antenna Design for 5G/6G Wireless Communications
(2025)
Book Chapter
Antenna Design and Optimization for 5G, 6G, and IoT
(2025)
Journal Article
A PIFA Array with Enhanced Bandwidth and Isolation for 5G Smartphone Applications
(2025)
Presentation / Conference Contribution