Overlapping Clustering Models, and One (class) SVM to Bind Them All

06/18/2018
by   Xueyu Mao, et al.
0

People belong to multiple communities, words belong to multiple topics, and books cover multiple genres; overlapping clusters are commonplace. Many existing overlapping clustering methods model each person (or word, or book) as a non-negative weighted combination of "exemplars" who belong solely to one community, with some small noise. Geometrically, each person is a point on a cone whose corners are these exemplars. This basic form encompasses the widely used Mixed Membership Stochastic Blockmodel of networks (Airoldi et al., 2008) and its degree-corrected variants (Karrer et al. 2011; Jin et al., 2017), as well as topic models such as LDA (Blei et al., 2003). We show that a simple one-class SVM yields provably consistent parameter inference for all such models, and scales to large datasets. Experimental results on several simulated and real datasets show our algorithm (called SVM-cone) is both accurate and scalable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2017

Estimating Mixed Memberships with Sharp Eigenvector Deviations

We consider the problem of estimating overlapping community memberships....
research
07/01/2016

On Mixed Memberships and Symmetric Nonnegative Matrix Factorizations

The problem of finding overlapping communities in networks has gained mu...
research
07/26/2023

Optimal Estimation in Mixed-Membership Stochastic Block Models

Community detection is one of the most critical problems in modern netwo...
research
06/11/2019

A mixed-integer linear programming approach for soft graph clustering

This paper proposes a Mixed-Integer Linear Programming approach for the ...
research
09/20/2020

Overlapping community detection in networks via sparse spectral decomposition

We consider the problem of estimating overlapping community memberships ...
research
07/05/2017

ProtoDash: Fast Interpretable Prototype Selection

In this paper we propose an efficient algorithm ProtoDash for selecting ...
research
06/13/2013

Learning Using Privileged Information: SVM+ and Weighted SVM

Prior knowledge can be used to improve predictive performance of learnin...

Please sign up or login with your details

Forgot password? Click here to reset