BCWS: Bilingual Contextual Word Similarity

10/21/2018
by   Ta-Chung Chi, et al.
0

This paper introduces the first dataset for evaluating English-Chinese Bilingual Contextual Word Similarity, namely BCWS (https://github.com/MiuLab/BCWS). The dataset consists of 2,091 English-Chinese word pairs with the corresponding sentential contexts and their similarity scores annotated by the human. Our annotated dataset has higher consistency compared to other similar datasets. We establish several baselines for the bilingual embedding task to benchmark the experiments. Modeling cross-lingual sense representations as provided in this dataset has the potential of moving artificial intelligence from monolingual understanding towards multilingual understanding.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2019

COS960: A Chinese Word Similarity Dataset of 960 Word Pairs

Word similarity computation is a widely recognized task in the field of ...
research
09/15/2018

CLUSE: Cross-Lingual Unsupervised Sense Embeddings

This paper proposes a modularized sense induction and representation lea...
research
06/07/2021

Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference

Multilingual transformers (XLM, mT5) have been shown to have remarkable ...
research
05/17/2023

Smart Word Suggestions for Writing Assistance

Enhancing word usage is a desired feature for writing assistance. To fur...
research
03/11/2018

Generating Bilingual Pragmatic Color References

Contextual influences on language exhibit substantial language-independe...
research
08/01/2015

Separated by an Un-common Language: Towards Judgment Language Informed Vector Space Modeling

A common evaluation practice in the vector space models (VSMs) literatur...
research
06/02/2023

LyricSIM: A novel Dataset and Benchmark for Similarity Detection in Spanish Song LyricS

In this paper, we present a new dataset and benchmark tailored to the ta...

Please sign up or login with your details

Forgot password? Click here to reset