Semantic Vector Machines

05/14/2011
by   Etter Vincent, et al.
0

We first present our work in machine translation, during which we used aligned sentences to train a neural network to embed n-grams of different languages into an d-dimensional space, such that n-grams that are the translation of each other are close with respect to some metric. Good n-grams to n-grams translation results were achieved, but full sentences translation is still problematic. We realized that learning semantics of sentences and documents was the key for solving a lot of natural language processing problems, and thus moved to the second part of our work: sentence compression. We introduce a flexible neural network architecture for learning embeddings of words and sentences that extract their semantics, propose an efficient implementation in the Torch framework and present embedding results comparable to the ones obtained with classical neural language models, while being more powerful.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2017

Learning Joint Multilingual Sentence Representations with Neural Machine Translation

In this paper, we use the framework of neural machine translation to lea...
research
09/28/2017

A Deep Neural Network Approach To Parallel Sentence Extraction

Parallel sentence extraction is a task addressing the data sparsity prob...
research
06/13/2018

Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine Translation

Parallel sentence extraction is a task addressing the data sparsity prob...
research
08/16/2018

Paraphrase Thought: Sentence Embedding Module Imitating Human Language Recognition

Sentence embedding is an important research topic in natural language pr...
research
07/09/2021

Using Machine Translation to Localize Task Oriented NLG Output

One of the challenges in a task oriented natural language application li...
research
07/19/2019

Structure-Invariant Testing for Machine Translation

In recent years, machine translation software has increasingly been inte...
research
05/24/2022

Lack of Fluency is Hurting Your Translation Model

Many machine translation models are trained on bilingual corpus, which c...

Please sign up or login with your details

Forgot password? Click here to reset