Hyperbolic Deep Learning for Chinese Natural Language Understanding

12/11/2018
by   Marko Valentin Micic, et al.
0

Recently hyperbolic geometry has proven to be effective in building embeddings that encode hierarchical and entailment information. This makes it particularly suited to modelling the complex asymmetrical relationships between Chinese characters and words. In this paper we first train a large scale hyperboloid skip-gram model on a Chinese corpus, then apply the character embeddings to a downstream hyperbolic Transformer model derived from the principles of gyrovector space for Poincare disk model. In our experiments the character-based Transformer outperformed its word-based Euclidean equivalent. To the best of our knowledge, this is the first time in Chinese NLP that a character-based model outperformed its word-based counterpart, allowing the circumvention of the challenging and domain-dependent task of Chinese Word Segmentation (CWS).

READ FULL TEXT
research
08/30/2018

Skip-gram word embeddings in hyperbolic space

Embeddings of tree-like graphs in hyperbolic space were recently shown t...
research
03/19/2020

Temporal Embeddings and Transformer Models for Narrative Text Understanding

We present two deep learning approaches to narrative text understanding ...
research
02/23/2019

VCWE: Visual Character-Enhanced Word Embeddings

Chinese is a logographic writing system, and the shape of Chinese charac...
research
01/17/2019

Robust Chinese Word Segmentation with Contextualized Word Representations

In recent years, after the neural-network-based method was proposed, the...
research
06/03/2019

Chinese Embedding via Stroke and Glyph Information: A Dual-channel View

Recent studies have consistently given positive hints that morphology is...
research
03/22/2023

Evaluating Transformer Models and Human Behaviors on Chinese Character Naming

Neural network models have been proposed to explain the grapheme-phoneme...
research
12/27/2017

A Gap-Based Framework for Chinese Word Segmentation via Very Deep Convolutional Networks

Most previous approaches to Chinese word segmentation can be roughly cla...

Please sign up or login with your details

Forgot password? Click here to reset