Mimicking Human Process: Text Representation via Latent Semantic Clustering for Classification

06/18/2019
by Xiaoye Tan, et al.

Because words with different characteristics contribute differently to classification, grouping them separately can strengthen the semantic expression of each group. We therefore propose a new text representation scheme that clusters words according to their latent semantics and composes the words in each cluster into a cluster vector; the cluster vectors are then concatenated to form the final text representation. Evaluation on five classification benchmarks demonstrates the effectiveness of our method. We further conduct a visualization analysis that shows statistical clustering results and supports the validity of our motivation.
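
The scheme described above (cluster words by latent semantics, compose each cluster into a vector, concatenate) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the soft assignment via a softmax over dot products, the weighted-sum composition, the random embeddings, and the names `cluster_representation`, `word_vecs`, and `centroids` are all assumptions made for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cluster_representation(word_vecs, centroids):
    """Hypothetical sketch: soft-cluster words, compose, concatenate.

    word_vecs: (n_words, dim) word embeddings for one text.
    centroids: (n_clusters, dim) latent cluster vectors (assumed learnable).
    """
    # Similarity of every word to every cluster (assumed: dot product).
    scores = word_vecs @ centroids.T              # (n_words, n_clusters)
    # Each word gets a probability distribution over clusters.
    assign = softmax(scores, axis=1)              # rows sum to 1
    # Compose each cluster as the assignment-weighted sum of word vectors.
    cluster_vecs = assign.T @ word_vecs           # (n_clusters, dim)
    # Concatenate cluster vectors into the final text representation.
    return cluster_vecs.reshape(-1), assign

# Toy input: 6 words, 4-dimensional embeddings, 3 clusters (random values).
rng = np.random.default_rng(0)
word_vecs = rng.normal(size=(6, 4))
centroids = rng.normal(size=(3, 4))
text_vec, assign = cluster_representation(word_vecs, centroids)
```

Here `text_vec` has length `n_clusters * dim` and could be fed to any downstream classifier; in a trained model the centroids and embeddings would presumably be learned jointly with the classifier rather than sampled at random.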

