Wenjuan Li
Towards Designing an Email Classification System Using Multi-view Based Semi-supervised Learning
Li, Wenjuan; Meng, Weizhi; Tan, Zhiyuan; Xiang, Yang
Abstract
The goal of email classification is to classify user emails into spam and legitimate ones. Many supervised learning algorithms have been invented in this domain to accomplish the task, and these algorithms require a large number of labeled training data. However, data labeling is a labor intensive task and requires in-depth domain knowledge. Thus, only a very small proportion of the data can be labeled in practice. This bottleneck greatly degrades the effectiveness of supervised email classification systems. In order to address this problem, in this work, we first identify some critical issues regarding supervised machine learning-based email classification. Then we propose an effective classification model based on multi-view disagreement-based semi-supervised learning. The motivation behind the attempt of using multi-view and semi-supervised learning is that multi-view can provide richer information for classification, which is often ignored by literature, and semi-supervised learning supplies with the capability of coping with labeled and unlabeled data. In the evaluation, we demonstrate that the multi-view data can improve the email classification than using a single view data, and that the proposed model working with our algorithm can achieve better performance as compared to the existing similar algorithms.
Citation
Li, W., Meng, W., Tan, Z., & Xiang, Y. (2014, September). Towards Designing an Email Classification System Using Multi-view Based Semi-supervised Learning. Presented at 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications
Presentation Conference Type | Conference Paper (Published) |
---|---|
Conference Name | 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications |
Start Date | Sep 24, 2014 |
End Date | Sep 26, 2014 |
Publication Date | 2014-09 |
Deposit Date | Nov 17, 2016 |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 174-181 |
Book Title | 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications, |
ISBN | 9781479965137 |
DOI | https://doi.org/10.1109/trustcom.2014.26 |
Keywords | Electronic mail, Semisupervised learning, Training, Supervised learning, Data models, Support vector machines, Feature extraction |
Public URL | http://researchrepository.napier.ac.uk/Output/425928 |
You might also like
How Much do Robots Understand Rudeness? Challenges in Human-Robot Interaction
(2024)
Presentation / Conference Contribution
Overtaking Feasibility Prediction for Mixed Connected and Connectionless Vehicles
(2024)
Journal Article
Detection of Ransomware
(2024)
Patent
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search