EZLDA: Efficient and Scalable LDA on GPUs

07/17/2020
by   Shilong Wang, et al.
0

LDA is a statistical approach for topic modeling with a wide range of applications. However, there exist very few attempts to accelerate LDA on GPUs which come with exceptional computing and memory throughput capabilities. To this end, we introduce EZLDA which achieves efficient and scalable LDA training on GPUs with the following three contributions: First, EZLDA introduces three-branch sampling method which takes advantage of the convergence heterogeneity of various tokens to reduce the redundant sampling task. Second, to enable sparsity-aware format for both D and W on GPUs with fast sampling and updating, we introduce hybrid format for W along with corresponding token partition to T and inverted index designs. Third, we design a hierarchical workload balancing solution to address the extremely skewed workload imbalance problem on GPU and scaleEZLDA across multiple GPUs. Taken together, EZLDA achieves superior performance over the state-of-the-art attempts with lower memory consumption.

READ FULL TEXT
research
03/13/2018

CuLDA_CGS: Solving Large-scale LDA Problems on GPUs

Latent Dirichlet Allocation(LDA) is a popular topic model. Given the fac...
research
10/08/2016

SaberLDA: Sparsity-Aware Learning of Topic Models on GPUs

Latent Dirichlet Allocation (LDA) is a popular tool for analyzing discre...
research
09/16/2021

Dr. Top-k: Delegate-Centric Top-k on GPUs

Recent top-k computation efforts explore the possibility of revising var...
research
07/02/2020

GSoFa: Scalable Sparse LU Symbolic Factorization on GPUs

Decomposing a matrix A into a lower matrix L and an upper matrix U, whic...
research
10/29/2015

WarpLDA: a Cache Efficient O(1) Algorithm for Latent Dirichlet Allocation

Developing efficient and scalable algorithms for Latent Dirichlet Alloca...
research
09/18/2020

C-SAW: A Framework for Graph Sampling and Random Walk on GPUs

Many applications require to learn, mine, analyze and visualize large-sc...
research
08/01/2021

Experimental Findings on the Sources of Detected Unrecoverable Errors in GPUs

We investigate the sources of Detected Unrecoverable Errors (DUEs) in GP...

Please sign up or login with your details

Forgot password? Click here to reset