Spam filter based on naïve bayes and levy-firefly algorithm

Alhallaq, Ahmed J.H.

View/Open

File_10300346 (3.542Mb)

Date

2019

Author

Alhallaq, Ahmed J.H.

Metadata

Show full item record

Abstract

İnternet, teknoloji ve iletişimdeki hızlı yenilikler nedeniyle hızla gelişmektedir. Kullanıcılar ve kuruluşlar, verileri, uygulamaları ve hizmetleri depolayabilir ve etkili bir şekilde erişebilir. Bir yerden veri almak ya da bir yere veri göndermek için kullanılan en iyi ve en hızlı araçlardan biri de elektronik posta (e-posta) olmuştur. Ancak bu mecra zaman içinde e-posta dolandırıcılığı (e-kimlik hırsızlığı), sahtekârlık ve spam e-postalar gibi çeşitli kötü amaçlı saldırılar için açık hedef haline gelmiştir.Kullanıcıları spam e-postalardan korumak amacıyla hem akademide hem de endüstri camiasında çeşitli yaklaşım ve yöntemler geliştirilmiş olup bu konudaki çalışmalar devam etmektedir.Bu tez, hybrid meta-heuristic optimization algorithm (Levy flight + firefly algorithm) ve Naïve Bayes sınıflandırıcısı kullanılarak spam e-postalarını filtrelemek için yeni bir yaklaşım önermektedir. SPAM veri setinde yapılan çeşitli deneylerin ön sonuçları, önerilen karma yöntemin, yalnızca Naïve sınıflandırıcısını kullanırken yalnızca% 79'la karşılaştırıldığında % 95 doğrulukla istenmeyen spam e-postaları tespit ettiğini ortaya koymuştur. Başka bir deyişle % 16 iyileşme kaydedilmiştir. Ayrıca, uygulama süresi hızlıdır ve önerilen algoritma sonuçları oldukça kararlıdır; bu sonuç standart sapma ile kanıtlanmıştır.

Internet is rapidly developing due to quick innovation in technology and communication. Users and organization can store and effectively access data, applications, and services. Amongst the best and speediest approaches to connect and send/receive data from one place then onto the next is through electronic mail (e-mail). As a result, it became the target for several malicious attacks such as e-mail phishing, spoofing, and spam emails. Protecting various users from spam emails became the subject of several research, both in academia and industries, were several methods and approaches are developed and still being tested. Recently, the use of machine learning is proposed were several algorithms are trained to detect and separate spams from legitimate emails. This thesis proposes a new approach for filtering out spam e-mails through the use of effective hybrid meta-heuristic optimization algorithm (Levy flight + firefly algorithm) with Naïve Bayes classifier. Preliminary results of various experiments conducted on SPAM dataset revealed that the proposed hybrid method detects unwanted spam e-mails with ~95% accuracy compared to only 79% when using Naïve classifier alone, in other words, 16% improvement. Further, the execution time is fast and the proposed algorithm results are fairly stable; proven through standard deviation.

URI

https://acikbilim.yok.gov.tr/handle/20.500.12812/131011

Collections

TEZLER

Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess