Word Segmentation on Micro-blog Texts with External Lexicon and Heterogeneous Data

08/04/2016
by Qingrong Xia, et al.

This paper describes our system designed for the NLPCC 2016 shared task on word segmentation on micro-blog texts.
