Cross-Lingual Induction and Transfer of Verb Classes Based on Word Vector Space Specialisation

07/21/2017
by Ivan Vulić, et al.

Existing approaches to automatic VerbNet-style verb classification depend heavily on feature engineering and are therefore limited to languages with mature NLP pipelines. In this work, we propose a novel cross-lingual transfer method for inducing VerbNets for multiple languages. To the best of our knowledge, this is the first study that demonstrates how architectures for learning word embeddings can be applied to this challenging syntactic-semantic task. Our method uses cross-lingual translation pairs to tie each of the six target languages into a bilingual vector space with English, jointly specialising the representations to encode the relational information from English VerbNet. A standard clustering algorithm is then run on top of the VerbNet-specialised representations, using the vector dimensions as features for learning verb classes. Our results show that the proposed cross-lingual transfer approach sets a new state of the art in verb classification across all six target languages explored in this work.
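The final step described above, clustering verbs with vector dimensions as features, can be illustrated with a small sketch. The abstract does not name the clustering algorithm, so plain k-means with a deterministic farthest-first initialisation is assumed here; the three-dimensional vectors in `toy_vectors` are invented for illustration and stand in for VerbNet-specialised embeddings. The helper names (`kmeans`, `farthest_first_init`) are hypothetical and not taken from the paper.

```python
# Toy stand-ins for specialised verb vectors (invented values, not real data).
toy_vectors = {
    "run": [0.90, 0.10, 0.00],
    "walk": [0.80, 0.20, 0.10],
    "jog": [0.85, 0.15, 0.05],
    "say": [0.10, 0.90, 0.00],
    "tell": [0.20, 0.80, 0.10],
    "break": [0.00, 0.10, 0.90],
    "shatter": [0.10, 0.00, 0.80],
}


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def farthest_first_init(points, k):
    """Pick k initial centroids: the first point, then repeatedly the point
    farthest from all centroids chosen so far (deterministic)."""
    centroids = [points[0]]
    while len(centroids) < k:
        nxt = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(nxt)
    return centroids


def kmeans(points, k, iters=20):
    """Assumed clustering step: Lloyd's k-means; returns one label per point."""
    centroids = farthest_first_init(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points
        ]
        # Recompute centroids; keep the old one if a cluster is empty.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            new_centroids.append(mean(members) if members else centroids[j])
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return labels


verbs = list(toy_vectors)
labels = kmeans([toy_vectors[v] for v in verbs], k=3)
classes = dict(zip(verbs, labels))
print(classes)
```

On this toy input the motion, communication, and breaking verbs fall into three separate clusters. With real embeddings, the number of clusters and the choice of algorithm would need to be set according to the evaluation setup, which this listing does not specify.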


