Jihene Tmamna
Bare-bones particle swarm optimization-based quantization for fast and energy efficient convolutional neural networks
Authors
Tmamna, Jihene; Ayed, Emna Ben; Fourati, Rahma; Hussain, Amir; Ayed, Mounir Ben
Abstract
Neural network quantization is a critical method for reducing memory usage and computational complexity in deep learning models, making them more suitable for deployment on resource-constrained devices. In this article, we propose BBPSO-Quantizer, a method that uses an enhanced Bare-Bones Particle Swarm Optimization algorithm to address the challenging problem of mixed-precision quantization of convolutional neural networks (CNNs). The proposed algorithm combines a new population initialization, a robust screening process, and a local search strategy to improve search performance and guide the population towards a feasible region. Additionally, Deb's constraint handling method is incorporated to ensure that the optimized solutions satisfy the functional constraints. We evaluate BBPSO-Quantizer on several state-of-the-art CNN architectures, including VGG, DenseNet, ResNet, and MobileNetV2, using the CIFAR-10, CIFAR-100, and Tiny ImageNet datasets. Comparative results demonstrate that our method delivers an excellent tradeoff between accuracy and computational efficiency.
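For readers unfamiliar with the two building blocks named in the abstract, the following is a minimal sketch of a textbook bare-bones PSO loop combined with Deb's feasibility rules, applied to a toy per-layer bit-width search. It is not the authors' BBPSO-Quantizer: the enhanced initialization, screening process, and local search are not reproduced, and the candidate bit-widths, the proxy objective, the size constraint, and all function names (`decode`, `evaluate`, `deb_better`, `bbpso`) are assumptions made for illustration.

```python
import numpy as np

# Hypothetical candidate bit-widths per layer (an assumption, not from the article).
BIT_CHOICES = np.array([2, 4, 8])


def decode(position):
    """Map a continuous particle position to one bit-width per layer."""
    idx = np.clip(np.rint(position), 0, len(BIT_CHOICES) - 1).astype(int)
    return BIT_CHOICES[idx]


def evaluate(position, layer_sizes, sensitivity, budget_bits):
    """Toy stand-in for (accuracy loss, constraint violation); lower is better."""
    bits = decode(position)
    objective = float(np.sum(sensitivity / bits))  # fewer bits -> larger proxy error
    size = float(np.sum(layer_sizes * bits))       # total weight storage in bits
    violation = max(0.0, size - budget_bits)       # 0.0 means the budget is met
    return objective, violation


def deb_better(a, b):
    """Deb's rules: True if (objective, violation) pair a is preferred over b."""
    obj_a, vio_a = a
    obj_b, vio_b = b
    if vio_a == 0 and vio_b == 0:
        return obj_a < obj_b       # both feasible: better objective wins
    if vio_a == 0 or vio_b == 0:
        return vio_a == 0          # feasible beats infeasible
    return vio_a < vio_b           # both infeasible: smaller violation wins


def bbpso(layer_sizes, sensitivity, budget_bits, n_particles=20, n_iters=100, seed=0):
    """Textbook bare-bones PSO: positions are sampled from a Gaussian, no velocities."""
    layer_sizes = np.asarray(layer_sizes, dtype=float)
    sensitivity = np.asarray(sensitivity, dtype=float)
    rng = np.random.default_rng(seed)
    dim = len(layer_sizes)
    pbest = rng.uniform(0, len(BIT_CHOICES) - 1, size=(n_particles, dim))
    pbest_fit = [evaluate(p, layer_sizes, sensitivity, budget_bits) for p in pbest]
    g = 0
    for i in range(1, n_particles):
        if deb_better(pbest_fit[i], pbest_fit[g]):
            g = i
    gbest, gbest_fit = pbest[g].copy(), pbest_fit[g]
    for _ in range(n_iters):
        for i in range(n_particles):
            mean = (pbest[i] + gbest) / 2.0   # midpoint of personal and global best
            std = np.abs(pbest[i] - gbest)    # per-dimension spread
            cand = rng.normal(mean, std)
            fit = evaluate(cand, layer_sizes, sensitivity, budget_bits)
            if deb_better(fit, pbest_fit[i]):
                pbest[i], pbest_fit[i] = cand, fit
                if deb_better(fit, gbest_fit):
                    gbest, gbest_fit = cand.copy(), fit
    return decode(gbest), gbest_fit


if __name__ == "__main__":
    bits, (objective, violation) = bbpso([100, 200, 50], [5.0, 1.0, 3.0], budget_bits=1500)
    print(bits, objective, violation)
```

In a real setting the proxy objective would be replaced by a measured accuracy drop of the quantized network, and the constraint by whatever model size or energy limit applies; both choices here are placeholders.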
Journal Article Type | Article |
---|---|
Acceptance Date | Nov 27, 2023 |
Online Publication Date | Dec 17, 2023 |
Publication Date | 2024-04 |
Deposit Date | Jan 12, 2024 |
Publicly Available Date | Dec 18, 2024 |
Journal | Expert Systems |
Print ISSN | 0266-4720 |
Electronic ISSN | 1468-0394 |
Publisher | Wiley |
Peer Reviewed | Peer Reviewed |
Volume | 41 |
Issue | 4 |
Article Number | e13522 |
DOI | https://doi.org/10.1111/exsy.13522 |
Keywords | Barebone PSO, energy efficient model inference, mixed precision quantization, model compression |
Public URL | http://researchrepository.napier.ac.uk/Output/3436452 |
Files
This file is under embargo until Dec 18, 2024 due to copyright reasons.
Contact repository@napier.ac.uk to request a copy for personal use.