Modeling User Behavior With Interaction Networks for Spam Detection

07/21/2022
by Prabhat Agarwal, et al.

Spam is a serious problem plaguing web-scale digital platforms that facilitate user content creation and distribution. It compromises a platform's integrity, the performance of services such as recommendation and search, and the overall business. Spammers engage in a variety of abusive and evasive behaviors that are distinct from those of non-spammers. Users' complex behavior can be well represented by a heterogeneous graph rich with node and edge attributes. Learning to identify spammers in such a graph for a web-scale platform is challenging because of its structural complexity and size. In this paper, we propose SEINE (Spam DEtection using Interaction NEtworks), a spam detection model over a novel graph framework. Our graph simultaneously captures rich user details and behavior and enables learning on a billion-scale graph. Our model considers a node's neighborhood along with edge types and attributes, allowing it to capture a wide range of spammers. SEINE, trained on a real dataset of tens of millions of nodes and billions of edges, achieves 80% recall at a 1% false positive rate. SEINE achieves performance comparable to state-of-the-art techniques on a public dataset while remaining practical for use in a large-scale production system.
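To make the idea of considering a user's neighborhood "along with edge types and attributes" concrete, here is a minimal sketch in plain Python. It is not SEINE's architecture, which the abstract does not specify. It only illustrates one simple way to summarize a user's neighbors separately per edge type while folding in edge attributes. The edge types (`login_from`, `posted`), the toy features, and the function names are all hypothetical.

```python
from collections import defaultdict

# Hypothetical toy graph: (user, edge type, neighbor node, edge attributes).
EDGES = [
    ("u1", "login_from", "ip1", [3.0]),  # e.g. an interaction count
    ("u1", "login_from", "ip2", [1.0]),
    ("u1", "posted", "link1", [10.0]),
    ("u2", "login_from", "ip1", [1.0]),
]
# Hypothetical per-node feature vectors for the non-user nodes.
NODE_FEATS = {"ip1": [1.0, 0.0], "ip2": [0.0, 1.0], "link1": [0.5, 0.5]}
# Fixed ordering of edge types so every user gets a same-length vector.
EDGE_TYPES = ["login_from", "posted"]


def mean_vectors(vectors, dim):
    """Element-wise mean of equal-length vectors; zeros if there are none."""
    if not vectors:
        return [0.0] * dim
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def user_representation(user, edges, node_feats, edge_types, dim):
    """Concatenate, per edge type, the mean of [neighbor features + edge attributes]."""
    grouped = defaultdict(list)
    for src, etype, dst, attrs in edges:
        if src == user:
            grouped[etype].append(node_feats[dst] + attrs)
    rep = []
    for etype in edge_types:
        rep.extend(mean_vectors(grouped[etype], dim))
    return rep


# dim = 2 node features + 1 edge attribute
print(user_representation("u1", EDGES, NODE_FEATS, EDGE_TYPES, 3))
print(user_representation("u2", EDGES, NODE_FEATS, EDGE_TYPES, 3))
```

Keeping one slot per edge type means that a user with no edges of a given type still contributes a zero vector there, so a downstream classifier can distinguish, for example, heavy posting from heavy login activity. A learned model would typically replace the plain mean with trainable aggregation.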


