CN-Probase: A Data-driven Approach for Large-scale Chinese Taxonomy Construction

02/27/2019
by   Jindong Chen, et al.
0

Taxonomies play an important role in machine intelligence. However, most well-known taxonomies are in English, and non-English taxonomies, especially Chinese ones, are still very rare. In this paper, we focus on automatic Chinese taxonomy construction and propose an effective generation and verification framework to build a large-scale and high-quality Chinese taxonomy. In the generation module, we extract isA relations from multiple sources of Chinese encyclopedia, which ensures the coverage. To further improve the precision of taxonomy, we apply three heuristic approaches in verification module. As a result, we construct the largest Chinese taxonomy with high precision about 95 called CN-Probase. Our taxonomy has been deployed on Aliyun, with over 82 million API calls in six months.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/11/2018

Ancient-Modern Chinese Translation with a Large Training Dataset

Ancient Chinese brings the wisdom and spirit culture of the Chinese nati...
research
07/19/2023

RaTE: a Reproducible automatic Taxonomy Evaluation by Filling the Gap

Taxonomies are an essential knowledge representation, yet most studies o...
research
10/13/2020

CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring

Taxonomy is not only a fundamental form of knowledge representation, but...
research
12/17/2018

TechKG: A Large-Scale Chinese Technology-Oriented Knowledge Graph

Knowledge graph is a kind of valuable knowledge base which would benefit...
research
11/26/2022

SGCE-Font: Skeleton Guided Channel Expansion for Chinese Font Generation

The automatic generation of Chinese fonts is an important problem involv...
research
05/04/2021

Large-scale Taxonomy Induction Using Entity and Word Embeddings

Taxonomies are an important ingredient of knowledge organization, and se...
research
11/28/2017

Classification of entities via their descriptive sentences

Hypernym identification of open-domain entities is crucial for taxonomy ...

Please sign up or login with your details

Forgot password? Click here to reset