Detecting Spam Email With Machine Learning Optimized With Bio-Inspired Metaheuristic Algorithms

Simran Gibson, Biju Issac, Li Zhang, Seibu Mary Jacob

Research output: Contribution to journalArticlepeer-review

Abstract

Electronic mail has eased communication methods for many organisations as well as individuals. This method is exploited for fraudulent gain by spammers through sending unsolicited emails. This article aims to present a method for detection of spam emails with machine learning algorithms that are optimized with bio-inspired methods. A literature review is carried to explore the efficient methods applied on different datasets to achieve good results. An extensive research was done to implement machine learning models using Naïve Bayes, Support Vector Machine, Random Forest, Decision Tree and Multi-Layer Perceptron on seven different email datasets, along with feature extraction and pre-processing. The bio-inspired algorithms like Particle Swarm Optimization and Genetic Algorithm were implemented to optimize the performance of classifiers. Multinomial Naïve Bayes with Genetic Algorithm performed the best overall. The comparison of our results with other machine learning and bio-inspired models to show the best suitable model is also discussed.
Original languageEnglish
Pages (from-to)187914-187932
Number of pages19
JournalIEEE Access
Volume8
DOIs
Publication statusPublished - 13 Oct 2020

Cite this