Semi-supervised classification for dynamic Android malware detection

04/19/2017
by   Li Chen, et al.

A growing number of threats to Android phones creates challenges for malware detection. Manually labeling samples as benign or as members of different malicious families requires tremendous human effort, while it is comparatively easy and cheap to obtain a large number of unlabeled APKs from various sources. Moreover, the fast-paced evolution of Android malware continuously generates derivative malware families. These families often contain new signatures, which can escape detection by static analysis. These practical challenges can also degrade the performance of traditional supervised machine learning algorithms. In this paper, we propose a framework that applies a model-based semi-supervised (MBSS) classification scheme to dynamic Android API call logs. The semi-supervised approach uses both the labeled and the unlabeled APKs to estimate a finite mixture model of Gaussian distributions via conditional expectation-maximization, and it efficiently detects malware during out-of-sample testing. We compare MBSS with popular malware detection classifiers such as support vector machine (SVM), k-nearest neighbor (kNN) and linear discriminant analysis (LDA). Under the ideal classification setting, MBSS has competitive performance, with 98% accuracy and a very low false positive rate for in-sample classification. In out-of-sample testing, the test data exhibit behavior similar to that of the in-sample training set: retrieving phone information and sending it to the network. When this similarity is strong, MBSS and SVM with a linear kernel maintain a 90% detection rate, while kNN and LDA suffer great performance degradation. When this similarity is slightly weaker, all classifiers degrade in performance, but MBSS still performs significantly better than the other classifiers.
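To make the idea of fitting a Gaussian mixture from labeled and unlabeled samples concrete, here is a minimal sketch in Python with NumPy. It is not the paper's implementation: it uses plain EM with one full-covariance Gaussian per class rather than the conditional EM variant the abstract mentions, and the function names (`fit_semi_supervised_gmm`, `predict`), the regularization term `reg`, and the feature matrices are all hypothetical. Labeled samples keep fixed one-hot class memberships, while the memberships of unlabeled samples are re-estimated at every iteration.

```python
import numpy as np

def _m_step(X, R, reg):
    """Update mixing weights, means and covariances from memberships R."""
    n_k = R.sum(axis=0)
    weights = n_k / n_k.sum()
    means = (R.T @ X) / n_k[:, None]
    d = X.shape[1]
    covs = np.empty((R.shape[1], d, d))
    for k in range(R.shape[1]):
        diff = X - means[k]
        # reg * I keeps each covariance invertible (an assumption of this sketch)
        covs[k] = (R[:, k, None] * diff).T @ diff / n_k[k] + reg * np.eye(d)
    return weights, means, covs

def _log_joint(X, weights, means, covs):
    """log(weight_k * N(x | mean_k, cov_k)) for every sample and class."""
    n, d = X.shape
    out = np.empty((n, len(weights)))
    for k in range(len(weights)):
        chol = np.linalg.cholesky(covs[k])
        z = np.linalg.solve(chol, (X - means[k]).T)
        maha = (z ** 2).sum(axis=0)
        logdet = 2.0 * np.log(np.diag(chol)).sum()
        out[:, k] = np.log(weights[k]) - 0.5 * (d * np.log(2 * np.pi) + logdet + maha)
    return out

def posterior(X, params):
    """E-step: class membership probabilities under the current model."""
    lj = _log_joint(X, *params)
    lj -= lj.max(axis=1, keepdims=True)  # stabilize before exponentiating
    p = np.exp(lj)
    return p / p.sum(axis=1, keepdims=True)

def fit_semi_supervised_gmm(X_lab, y_lab, X_unl, n_classes, n_iter=50, reg=1e-6):
    """Fit one Gaussian per class using labeled and unlabeled samples."""
    R_lab = np.eye(n_classes)[y_lab]       # labeled memberships stay fixed
    params = _m_step(X_lab, R_lab, reg)    # initialize from labeled data only
    X = np.vstack([X_lab, X_unl])
    for _ in range(n_iter):
        R = np.vstack([R_lab, posterior(X_unl, params)])  # E-step on unlabeled
        params = _m_step(X, R, reg)                       # M-step on all data
    return params

def predict(X, params):
    """Out-of-sample classification: pick the most probable class."""
    return posterior(X, params).argmax(axis=1)
```

In this sketch every class needs at least a few labeled samples, because the initial parameters come from the labeled data alone. A fixed iteration count stands in for a proper convergence check on the log-likelihood.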


