Semantic Hilbert Space for Text Representation Learning

02/26/2019
by   Benyou Wang, et al.
0

Capturing the meaning of sentences has long been a challenging task. Current models tend to apply linear combinations of word features to conduct semantic composition for bigger-granularity units e.g. phrases, sentences, and documents. However, the semantic linearity does not always hold in human language. For instance, the meaning of the phrase `ivory tower' can not be deduced by linearly combining the meanings of `ivory' and `tower'. To address this issue, we propose a new framework that models different levels of semantic units (e.g. sememe, word, sentence, and semantic abstraction) on a single Semantic Hilbert Space, which naturally admits a non-linear semantic composition by means of a complex-valued vector word representation. An end-to-end neural network [https://github.com/wabyking/qnn] is proposed to implement the framework in the text classification task, and evaluation results on six benchmarking text classification datasets demonstrate the effectiveness, robustness and self-explanation power of the proposed model. Furthermore, intuitive case studies are conducted to help end users to understand how the framework works.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/15/2022

A semantic hierarchical graph neural network for text classification

The key to the text classification task is language representation and i...
research
05/29/2017

Character-Based Text Classification using Top Down Semantic Model for Sentence Representation

Despite the success of deep learning on many fronts especially image and...
research
07/02/2021

Transformer-F: A Transformer network with effective methods for learning universal sentence representation

The Transformer model is widely used in natural language processing for ...
research
03/29/2021

Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic Applications

Most unsupervised NLP models represent each word with a single point or ...
research
12/05/2018

Improving Medical Short Text Classification with Semantic Expansion Using Word-Cluster Embedding

Automatic text classification (TC) research can be used for real-world p...
research
09/19/2023

Semantic Text Compression for Classification

We study semantic compression for text where meanings contained in the t...
research
04/10/2019

CNM: An Interpretable Complex-valued Network for Matching

This paper seeks to model human language by the mathematical framework o...

Please sign up or login with your details

Forgot password? Click here to reset