Issue |
ITM Web Conf.
Volume 70, 2025
2024 2nd International Conference on Data Science, Advanced Algorithm and Intelligent Computing (DAI 2024)
|
|
---|---|---|
Article Number | 04013 | |
Number of page(s) | 8 | |
Section | AI and Advanced Applications | |
DOI | https://doi.org/10.1051/itmconf/20257004013 | |
Published online | 23 January 2025 |
Enhancing Spam Filtering: A Comparative Study of Modern Advanced Machine Learning Techniques
Qingdao No.2 Middle School, 266600 Qingdao, China
Corresponding author: yuexin@ldy.edu.rs
Spam remains a persistent issue that not only consumes time and bandwidth but also poses significant cybersecurity threats. As a result, effective spam filtering has become essential. With an emphasis on Naïve Bayes (NB), Decision Trees (DT), and Support Vector Machines (SVM), this study offers a thorough analysis of the major machine learning techniques utilized in contemporary spam filtering. This paper investigates underlying principles of these methods, compares their performance through extensive experiments conducted on the Kaggle dataset, and discusses the cunent challenges and future directions for spam filtering technology. The study reveals that SVM is particularly effective for handling high-dimensional data. DT offers superior interpretability, and NB simplifies probabilistic classification. Experimental results demonstrate that while each method has its strengths and weaknesses, combining SVM with NB notably enhances classification accuracy. Despite these advances, spam filters still face challenges due to evolving spamming tactics. In order to address these persistent problems, the conclusion part highlights the need for more reliable and flexible spam filtering teclmologies and makes recommendations for future research directions.
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.