A Multi-modal Neural Embeddings Approach for Detecting Mobile Counterfeit Apps: A Case Study on Google Play Store

06/02/2020
by   Naveen Karunanayake, et al.
0

Counterfeit apps impersonate existing popular apps in attempts to misguide users to install them for various reasons such as collecting personal information or spreading malware. Many counterfeits can be identified once installed, however even a tech-savvy user may struggle to detect them before installation. To this end, this paper proposes to leverage the recent advances in deep learning methods to create image and text embeddings so that counterfeit apps can be efficiently identified when they are submitted for publication. We show that a novel approach of combining content embeddings and style embeddings outperforms the baseline methods for image similarity such as SIFT, SURF, and various image hashing methods. We first evaluate the performance of the proposed method on two well-known datasets for evaluating image similarity methods and show that content, style, and combined embeddings increase precision@k and recall@k by 10 retrieving five nearest neighbours. Second, specifically for the app counterfeit detection problem, combined content and style embeddings achieve 12 baseline methods. Third, we present an analysis of approximately 1.2 million apps from Google Play Store and identify a set of potential counterfeits for top-10,000 popular apps. Under a conservative assumption, we were able to find 2,040 potential counterfeits that contain malware in a set of 49,608 apps that showed high similarity to one of the top-10,000 popular apps in Google Play Store. We also find 1,565 potential counterfeits asking for at least five additional dangerous permissions than the original app and 1,407 potential counterfeits having at least five extra third party advertisement libraries.

READ FULL TEXT

page 1

page 3

page 9

page 10

page 15

research
04/26/2018

A Neural Embeddings Approach for Detecting Mobile Counterfeit Apps

Counterfeit apps impersonate existing popular apps in attempts to misgui...
research
06/20/2022

The Cost of the GDPR for Apps? Nearly Impossible to Study without Platform Data

A recently published pre-print titled 'GDPR and the Lost Generation of I...
research
02/09/2022

Erasing Labor with Labor: Dark Patterns and Lockstep Behaviors on the Google Play Store

Google Play Store's policy forbids the use of incentivized installs, rat...
research
11/19/2021

RacketStore: Measurements of ASO Deception in Google Play via Mobile and App Usage

Online app search optimization (ASO) platforms that provide bulk install...
research
04/01/2021

Studying Ad Library Integration Strategies of Top Free-to-Download Apps

In-app advertisements have become a major revenue source for app develop...
research
08/23/2020

Emerging App Issue Identification via Online Joint Sentiment-Topic Tracing

Millions of mobile apps are available in app stores, such as Apple's App...
research
03/01/2021

CHAMP: Characterizing Undesired App Behaviors from User Comments based on Market Policies

Millions of mobile apps have been available through various app markets....

Please sign up or login with your details

Forgot password? Click here to reset