Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF

04/05/2017
by Yan Shao, et al.

We present a character-based model for joint segmentation and POS tagging of Chinese. We adapt the bidirectional RNN-CRF architecture for general sequence tagging and apply it with novel vector representations of Chinese characters that capture rich contextual information and sub-character-level features. We evaluate the proposed model extensively and compare it with a state-of-the-art tagger on CTB5, CTB9 and UD Chinese. The experimental results indicate that our model is accurate and robust across datasets of different sizes, genres and annotation schemes. We obtain state-of-the-art performance on CTB5, achieving an F1-score of 94.38 for joint segmentation and POS tagging.
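To make the task formulation concrete, the sketch below shows one common way to cast joint segmentation and POS tagging as character-level sequence tagging, together with a joint F1 metric. It is an illustration only: the abstract does not specify the tag scheme, so the B/I/E/S boundary prefixes, the helper names (`encode`, `decode`, `joint_f1`) and the example sentence with its POS labels are all assumptions, and the bidirectional RNN-CRF model itself is not shown.

```python
from typing import List, Set, Tuple

Word = Tuple[str, str]        # (word, POS tag)
TaggedChar = Tuple[str, str]  # (character, joint tag such as "B-VV")


def encode(words: List[Word]) -> List[TaggedChar]:
    """Turn (word, POS) pairs into one joint boundary-POS tag per character.

    Assumed scheme: B = first character of a word, I = inside, E = last,
    S = single-character word.
    """
    tagged = []
    for word, pos in words:
        if len(word) == 1:
            tagged.append((word, f"S-{pos}"))
            continue
        for i, ch in enumerate(word):
            boundary = "B" if i == 0 else "E" if i == len(word) - 1 else "I"
            tagged.append((ch, f"{boundary}-{pos}"))
    return tagged


def decode(tagged: List[TaggedChar]) -> List[Word]:
    """Recover (word, POS) pairs from per-character joint tags."""
    words, chars = [], []
    for ch, tag in tagged:
        boundary, pos = tag.split("-", 1)
        chars.append(ch)
        if boundary in ("E", "S"):  # a word ends at this character
            words.append(("".join(chars), pos))
            chars = []
    if chars:  # ill-formed tail: flush the remaining characters as one word
        words.append(("".join(chars), pos))
    return words


def spans(words: List[Word]) -> Set[Tuple[int, int, str]]:
    """Represent each word as (start offset, end offset, POS)."""
    out, start = set(), 0
    for word, pos in words:
        out.add((start, start + len(word), pos))
        start += len(word)
    return out


def joint_f1(gold: List[Word], pred: List[Word]) -> float:
    """F1 where a word counts as correct only if both its boundaries
    and its POS tag match the gold standard."""
    g, p = spans(gold), spans(pred)
    correct = len(g & p)
    if correct == 0:
        return 0.0
    precision, recall = correct / len(p), correct / len(g)
    return 2 * precision * recall / (precision + recall)


# Hypothetical example sentence and labels, for illustration only.
gold = [("我", "PN"), ("喜欢", "VV"), ("北京", "NR")]
pred = [("我", "PN"), ("喜欢", "VV"), ("北", "NR"), ("京", "NR")]
print(encode(gold))
print(round(joint_f1(gold, pred), 4))
```

Under this formulation a tagger predicts a single label per character, so segmentation and POS decisions are made jointly. In the example, the prediction splits the last word in two, so only two of its four words match the three gold words, giving precision 0.5 and recall 2/3.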


