VCWE: Visual Character-Enhanced Word Embeddings

02/23/2019
by   Chi Sun, et al.
0

Chinese is a logographic writing system, and the shape of Chinese characters contain rich syntactic and semantic information. In this paper, we propose a model to learn Chinese word embeddings via two-level composition: (1) a convolutional neural network to extract the intra-character compositionality from the visual shape of a character; (2) a recurrent neural network with self-attention to compose character representation into word embeddings. The word embeddings along with the network parameters are learned in the Skip-Gram framework. Evaluations demonstrate the superior performance of our model on four tasks: word similarity, sentiment analysis, named entity recognition and part-of-speech tagging.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset