Comparison of Automated Machine Learning Tools for SMS Spam Message Filtering

06/16/2021
by   Waddah Saeed, et al.
0

Short Message Service (SMS) is a very popular service used for communication by mobile users. However, this popular service can be abused by executing illegal activities and influencing security risks. Nowadays, many automatic machine learning (AutoML) tools exist which can help domain experts and lay users to build high-quality ML models with little or no machine learning knowledge. In this work, a classification performance comparison was conducted between three automatic ML tools for SMS spam message filtering. These tools are mljar-supervised AutoML, H2O AutoML, and Tree-based Pipeline Optimization Tool (TPOT) AutoML. Experimental results showed that ensemble models achieved the best classification performance. The Stacked Ensemble model, which was built using H2O AutoML, achieved the best performance in terms of Log Loss (0.8370), true positive (1088/1116), and true negative (281/287) metrics. There is a 19.05% improvement in Log Loss with respect to TPOT AutoML and 10.53% improvement with respect to mljar-supervised AutoML. The satisfactory filtering performance achieved with AutoML tools provides a potential application for AutoML tools to automatically determine the best ML model that can perform best for SMS spam message filtering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2020

Leveraging Automated Machine Learning for Text Classification: Evaluation of AutoML Tools and Comparison with Human Performance

Recently, Automated Machine Learning (AutoML) has registered increasing ...
research
01/14/2021

A Neophyte With AutoML: Evaluating the Promises of Automatic Machine Learning Tools

This paper discusses modern Auto Machine Learning (AutoML) tools from th...
research
01/17/2023

Vision Based Machine Learning Algorithms for Out-of-Distribution Generalisation

There are many computer vision applications including object segmentatio...
research
11/20/2022

Best-Effort Communication Improves Performance and Scales Robustly on Conventional Hardware

Here, we test the performance and scalability of fully-asynchronous, bes...
research
01/01/2022

AutoDES: AutoML Pipeline Generation of Classification with Dynamic Ensemble Strategy Selection

Automating machine learning has achieved remarkable technological develo...
research
05/11/2017

Content-based Approach for Vietnamese Spam SMS Filtering

Short Message Service (SMS) spam is a serious problem in Vietnam because...
research
05/25/2023

Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting

Ensembling is among the most popular tools in machine learning (ML) due ...

Please sign up or login with your details

Forgot password? Click here to reset