
-
Energy-based Out-of-distribution Detection
Determining whether inputs are out-of-distribution (OOD) is an essential...
-
Fast Gunrock Subgraph Matching (GSM) on GPUs
In this paper, we propose a novel method, GSM (Gunrock Subgraph Matching...
-
Unsupervised Object Segmentation with Explicit Localization Module
In this paper, we propose a novel architecture that iteratively discover...
-
Fast BFS-Based Triangle Counting on GPUs
In this paper, we propose a novel method to compute triangle counting on...
-
GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
High-performance implementations of graph algorithms are challenging to ...
-
VoroCrust: Voronoi Meshing Without Clipping
Polyhedral meshes are increasingly becoming an attractive option with pa...
-
Object Localization and Motion Transfer learning with Capsules
Inspired by CapsNet's routing-by-agreement mechanism, with its ability t...
-
A Comparative Study on Exact Triangle Counting Algorithms on the GPU
We implement exact triangle counting in graphs on the GPU using three di...
-
Implementing Push-Pull Efficiently in GraphBLAS
We factor Beamer's push-pull, also known as direction-optimized breadth-...
-
Design Principles for Sparse Matrix Multiplication on the GPU
We implement two novel algorithms for sparse-matrix dense-matrix multipl...
-
Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm
We study the problem of decomposing a volume bounded by a smooth surface...
-
Scalable Breadth-First Search on a GPU Cluster
On a GPU cluster, the ratio of high computing power to communication ban...
-
A Dynamic Hash Table for the GPU
We design and implement a fully concurrent dynamic hash table for GPUs w...
-
Mathematical Foundations of the GraphBLAS
The GraphBLAS standard (GraphBlas.org) is being developed to bring the p...
-
Piko: A Design Framework for Programmable Graphics Pipelines
We present Piko, a framework for designing, optimizing, and retargeting ...
-
k-d Darts: Sampling by k-Dimensional Flat Searches
We formalize the notion of sampling a function using k-d darts. A k-d da...
-
Finding Convex Hulls Using Quickhull on the GPU
We present a convex hull algorithm that is accelerated on commodity grap...
-
Efficient Synchronization Primitives for GPUs
In this paper, we revisit the design of synchronization primitives---spe...