Character-based Neural Embeddings for Tweet Clustering

03/15/2017
by   Svitlana Vakulenko, et al.
0

In this paper we show how the performance of tweet clustering can be improved by leveraging character-based neural networks. The proposed approach overcomes the limitations related to the vocabulary explosion in the word-based models and allows for the seamless processing of the multilingual content. Our evaluation results and code are available on-line at https://github.com/vendi12/tweet2vec_clustering

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset