Ancient-Modern Chinese Translation with a Large Training Dataset

08/11/2018
by   Dayiheng Liu, et al.
0

Ancient Chinese brings the wisdom and spirit culture of the Chinese nation. Automatically translation from ancient Chinese to modern Chinese helps to inherit and carry forward the quintessence of the ancients. In this paper, we propose an Ancient-Modern Chinese clause alignment approach and apply it to create a large scale Ancient-Modern Chinese parallel corpus which contains about 1.24M bilingual pairs. To our best knowledge, this is the first large high-quality Ancient-Modern Chinese dataset. Furthermore, we train the SMT and various NMT based models on this dataset and provide a strong baseline for this task

READ FULL TEXT
research
10/11/2022

Applying FrameNet to Chinese(Poetry)

FrameNet( Fillmore and Baker [2009] ) is well-known for its wide use for...
research
06/03/2021

CCPM: A Chinese Classical Poetry Matching Dataset

Poetry is one of the most important art forms of human languages. Recent...
research
02/27/2019

CN-Probase: A Data-driven Approach for Large-scale Chinese Taxonomy Construction

Taxonomies play an important role in machine intelligence. However, most...
research
11/08/2015

A Chinese POS Decision Method Using Korean Translation Information

In this paper we propose a method that imitates a translation expert usi...
research
03/05/2018

Automatic Transferring between Ancient Chinese and Contemporary Chinese

During the long time of development, Chinese language has evolved a grea...
research
06/27/2017

Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis

In this paper, we investigate the Chinese calligraphy synthesis problem:...
research
05/04/2020

Compose Like Humans: Jointly Improving the Coherence and Novelty for Modern Chinese Poetry Generation

Chinese poetry is an important part of worldwide culture, and classical ...

Please sign up or login with your details

Forgot password? Click here to reset