Improving Image Clustering using Sparse Text and the Wisdom of the Crowds

05/08/2014
by Anna Ma, et al.

We propose a method to improve image clustering using sparse text and the wisdom of the crowds. Specifically, we fuse two kinds of document features, image features and text features, and use a common dictionary, the "wisdom of the crowds", as the connection between the two kinds of documents. We then cluster the documents by applying topic modeling via non-negative matrix factorization to the resulting fusion matrix.
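The abstract does not say how the fusion matrix is built, so the following is only a minimal sketch of the general idea under stated assumptions: image and text features are non-negative, documents are columns, and "fusion" is taken to mean stacking the two feature blocks row-wise (a stand-in, not the paper's construction). The names `nmf`, `cluster_fused`, and `text_weight` are hypothetical. The factorization uses the standard multiplicative updates for the Frobenius-norm objective, and each document is assigned to the topic with the largest coefficient.

```python
import numpy as np


def nmf(X, k, n_iter=200, eps=1e-9, seed=0):
    """Approximate non-negative X (features x documents) as W @ H.

    Uses multiplicative updates for the Frobenius-norm objective.
    """
    rng = np.random.default_rng(seed)
    n_features, n_docs = X.shape
    W = rng.random((n_features, k)) + eps  # topic dictionary
    H = rng.random((k, n_docs)) + eps      # per-document topic weights
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H


def cluster_fused(image_feats, text_feats, k, text_weight=1.0):
    """Cluster documents from stacked image and text features.

    Assumption: fusion is plain row-wise stacking; the paper's actual
    fusion matrix may be constructed differently.
    """
    fused = np.vstack([image_feats, text_weight * text_feats])
    W, H = nmf(fused, k)
    labels = H.argmax(axis=0)  # dominant topic per document
    return labels, W, H


# Toy data: 6 documents (columns), 4 image features, 2 text features.
# The text block is sparse: most documents carry no text at all.
image_feats = np.array([
    [5, 4, 5, 0, 0, 0],
    [4, 5, 4, 0, 0, 0],
    [0, 0, 0, 5, 4, 5],
    [0, 0, 0, 4, 5, 4],
], dtype=float)
text_feats = np.array([
    [1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 0],
], dtype=float)

labels, W, H = cluster_fused(image_feats, text_feats, k=2)
print(labels)
```

In this sketch `text_weight` controls how strongly the sparse text block influences the factorization relative to the image block; choosing it is a design decision the abstract does not address.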

research
01/31/2022

Guided Semi-Supervised Non-negative Matrix Factorization on Legal Documents

Classification and topic modeling are popular techniques in machine lear...
research
08/24/2021

Hybrid Multisource Feature Fusion for the Text Clustering

The text clustering technique is an unsupervised text mining method whic...
research
05/06/2023

Two to Five Truths in Non-Negative Matrix Factorization

In this paper, we explore the role of matrix scaling on a matrix of coun...
research
07/08/2021

Assigning Topics to Documents by Successive Projections

Topic models provide a useful tool to organize and understand the struct...
research
07/18/2013

Video Text Localization using Wavelet and Shearlet Transforms

Text in video is useful and important in indexing and retrieving the vid...
research
02/23/2017

Stability of Topic Modeling via Matrix Factorization

Topic models can provide us with an insight into the underlying latent s...
research
04/21/2021

Clustering Introductory Computer Science Exercises Using Topic Modeling Methods

Manually determining concepts present in a group of questions is a chall...
