Imane Guellil
A semi-supervised approach for sentiment analysis of arab (ic+ izi) messages: Application to the algerian dialect
Guellil, Imane; Adeel, Ahsan; Azouaou, Faical; Benali, Fodil; Hachani, Ala-Eddine; Dashtipour, Kia; Gogate, Mandar; Ieracitano, Cosimo; Kashani, Reza; Hussain, Amir
Authors
Ahsan Adeel
Faical Azouaou
Fodil Benali
Ala-Eddine Hachani
Dr Kia Dashtipour K.Dashtipour@napier.ac.uk
Lecturer
Dr. Mandar Gogate M.Gogate@napier.ac.uk
Principal Research Fellow
Cosimo Ieracitano
Reza Kashani
Prof Amir Hussain A.Hussain@napier.ac.uk
Professor
Abstract
In this paper, we propose a semi-supervised approach for sentiment analysis of Arabic and its dialects. The approach is based on a sentiment corpus that is constructed automatically and then reviewed manually by native speakers of the Algerian dialect. It consists of constructing and applying a set of deep learning algorithms to classify the sentiment of Arabic messages as positive or negative. It was applied to Facebook messages written in Modern Standard Arabic (MSA) as well as in the Algerian dialect (DALG, a low-resource dialect spoken by more than 40 million people), in both Arabic script and Arabizi. To handle Arabizi, we consider two options: transliteration (widely used in the research literature for handling Arabizi) and translation (never used in the research literature for handling Arabizi). To highlight the effectiveness of the semi-supervised approach, we carried out experiments using both corpora for training (i.e. the corpus constructed automatically and the one reviewed manually). The experiments were run on several test corpora dedicated to MSA/DALG that were proposed and evaluated in the research literature. Both shallow and deep learning classifiers are used, such as Random Forest (RF), Logistic Regression (LR), Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM). These classifiers are combined with word embedding models such as Word2vec and fastText for sentiment classification. Experimental results (F1 score up to 95% for intrinsic experiments and up to 89% for extrinsic experiments) showed that the proposed system outperforms existing state-of-the-art methodologies (the best improvement is up to 25%).
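The abstract reports results as F1 scores for a positive/negative classification task. As a reading aid, the following is a minimal sketch of how a binary F1 score and its macro average over the two classes can be computed. The function names, the label strings, and the toy gold/predicted labels are illustrative assumptions, not data or code from the paper, and the abstract does not state which F1 variant (per-class, macro, or other) the authors report.

```python
def f1_score(y_true, y_pred, positive="positive"):
    """Binary F1 for the class named by `positive` (hypothetical helper)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are 0 or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true, y_pred, labels=("positive", "negative")):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1_score(y_true, y_pred, label) for label in labels) / len(labels)


# Toy labels for illustration only; these are NOT results from the paper.
gold = ["positive", "positive", "negative", "negative", "positive", "negative"]
pred = ["positive", "negative", "negative", "positive", "positive", "negative"]

f1_pos = f1_score(gold, pred, positive="positive")
f1_macro = macro_f1(gold, pred)
```

In this toy case the positive class has two true positives, one false positive and one false negative, so precision and recall are both 2/3 and the F1 score is 2/3 as well.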
Citation
Guellil, I., Adeel, A., Azouaou, F., Benali, F., Hachani, A.-E., Dashtipour, K., Gogate, M., Ieracitano, C., Kashani, R., & Hussain, A. (2021). A semi-supervised approach for sentiment analysis of arab (ic+ izi) messages: Application to the algerian dialect. SN Computer Science, 2, Article 118. https://doi.org/10.1007/s42979-021-00510-1
Journal Article Type | Article |
---|---|
Acceptance Date | Nov 27, 2020 |
Online Publication Date | Feb 27, 2021 |
Publication Date | 2021 |
Deposit Date | Apr 27, 2022 |
Publicly Available Date | Apr 27, 2022 |
Journal | SN Computer Science |
Publisher | Springer |
Peer Reviewed | Peer Reviewed |
Volume | 2 |
Article Number | 118 |
DOI | https://doi.org/10.1007/s42979-021-00510-1 |
Keywords | Arabizi, Sentiment analysis, Arabic, Arabic dialect, Translation, Transliteration |
Public URL | http://researchrepository.napier.ac.uk/Output/2866999 |
Files
A Semi-supervised Approach For Sentiment Analysis Of Arab (ic+ Izi) Messages: Application To The Algerian Dialect
(1.5 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/