Text Classification and Clustering with Annealing Soft Nearest Neighbor Loss

07/23/2021
by   Abien Fred Agarap, et al.
0

We define disentanglement as how far class-different data points from each other are, relative to the distances among class-similar data points. When maximizing disentanglement during representation learning, we obtain a transformed feature representation where the class memberships of the data points are preserved. If the class memberships of the data points are preserved, we would have a feature representation space in which a nearest neighbour classifier or a clustering algorithm would perform well. We take advantage of this method to learn better natural language representation, and employ it on text classification and text clustering tasks. Through disentanglement, we obtain text representations with better-defined clusters and improve text classification performance. Our approach had a test classification accuracy of as high as 90.11 88 other training tricks or regularization.

READ FULL TEXT
research
06/05/2020

Improving k-Means Clustering Performance with Disentangled Internal Representations

Deep clustering algorithms combine representation learning and clusterin...
research
11/18/2019

Basic Principles of Clustering Methods

Clustering methods group a set of data points into a few coherent groups...
research
02/05/2019

Analyzing and Improving Representations with the Soft Nearest Neighbor Loss

We explore and expand the Soft Nearest Neighbor Loss to measure the enta...
research
11/30/2022

Task-Specific Embeddings for Ante-Hoc Explainable Text Classification

Current state-of-the-art approaches to text classification typically lev...
research
12/16/2020

Predictive K-means with local models

Supervised classification can be effective for prediction but sometimes ...
research
12/14/2017

Adaptive kNN using Expected Accuracy for Classification of Geo-Spatial Data

The k-Nearest Neighbor (kNN) classification approach is conceptually sim...
research
12/30/2008

A New Clustering Algorithm Based Upon Flocking On Complex Network

We have proposed a model based upon flocking on a complex network, and t...

Please sign up or login with your details

Forgot password? Click here to reset