A Hybrid Word-Character Model for Abstractive Summarization

02/27/2018
by   Chieh-Teng Chang, et al.
0

Abstractive summarization is the popular research topic nowadays. Due to the difference in language property, Chinese summarization also gains lots of attention. Most of studies use character-based representation instead of word-based to keep out the error introduced by word segmentation and OOV problem. However, we believe that word-based representation can capture the semantics of the articles more accurately. We proposed a hybrid word-character model preserves the advantage of both word-based and character-based representations. Our method also enables us to use larger word vocabulary size than anyone else. We call this new method HWC (Hybrid Word-Character). We conduct the experiments on LCSTS Chinese summarization dataset, and out-perform the current state-of-the-art by at least 8 ROUGE points.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/17/2019

Robust Chinese Word Segmentation with Contextualized Word Representations

In recent years, after the neural-network-based method was proposed, the...
research
10/03/2019

Character Feature Engineering for Japanese Word Segmentation

On word segmentation problems, machine learning architecture engineering...
research
11/07/2019

Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention

Most Chinese pre-trained encoders take a character as a basic unit and l...
research
05/01/2018

Nugget Proposal Networks for Chinese Event Detection

Neural network based models commonly regard event detection as a word-wi...
research
12/12/2017

Tracing a Loose Wordhood for Chinese Input Method Engine

Chinese input methods are used to convert pinyin sequence or other Latin...
research
10/08/2020

Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding

Intent detection and slot filling are two closely related tasks for buil...
research
11/25/2019

Chinese Spelling Error Detection Using a Fusion Lattice LSTM

Spelling error detection serves as a crucial preprocessing in many natur...

Please sign up or login with your details

Forgot password? Click here to reset