LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching

02/25/2021
by   Boer Lyu, et al.
0

Chinese short text matching is a fundamental task in natural language processing. Existing approaches usually take Chinese characters or words as input tokens. They have two limitations: 1) Some Chinese words are polysemous, and semantic information is not fully utilized. 2) Some models suffer potential issues caused by word segmentation. Here we introduce HowNet as an external knowledge base and propose a Linguistic knowledge Enhanced graph Transformer (LET) to deal with word ambiguity. Additionally, we adopt the word lattice graph as input to maintain multi-granularity information. Our model is also complementary to pre-trained language models. Experimental results on two Chinese datasets show that our models outperform various typical text matching approaches. Ablation study also indicates that both semantic information and multi-granularity information are important for text matching modeling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2021

Lattice-BERT: Leveraging Multi-Granularity Representations in Chinese Pre-trained Language Models

Chinese pre-trained language models usually process text as a sequence o...
research
02/25/2019

Lattice CNNs for Matching Based Chinese Question Answering

Short text matching often faces the challenges that there are great word...
research
04/08/2023

The Short Text Matching Model Enhanced with Knowledge via Contrastive Learning

In recent years, short Text Matching tasks have been widely applied in t...
research
12/15/2022

RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis

With the advent of deep learning, a huge number of text-to-speech (TTS) ...
research
03/14/2023

Good Neighbors Are All You Need for Chinese Grapheme-to-Phoneme Conversion

Most Chinese Grapheme-to-Phoneme (G2P) systems employ a three-stage fram...
research
01/22/2022

Chinese Word Segmentation with Heterogeneous Graph Neural Network

In recent years, deep learning has achieved significant success in the C...
research
07/30/2023

Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation

Tone is a crucial component of the prosody of Shanghainese, a Wu Chinese...

Please sign up or login with your details

Forgot password? Click here to reset