Practice in Synonym Extraction at Large Scale

12/06/2014
by Liangliang Cao, et al.

Synonym extraction is an important task in natural language processing and is often used as a submodule in query expansion, question answering, and other applications. Automatic synonym extractors are highly preferred for large-scale applications. Previous studies of synonym extraction are mostly limited to small-scale datasets. In this paper, we build a large dataset with 3.4 million synonym/non-synonym pairs to capture the challenges of real-world scenarios. We propose (1) a new cost function to accommodate the unbalanced learning problem, and (2) a feature-learning-based deep neural network to model the complicated relationships in synonym pairs. We compare several approaches based on SVMs and neural networks, and find that a novel feature-learning-based neural network outperforms the methods with hand-assigned features. Specifically, the best performance of our model surpasses the SVM baseline with a significant 97% relative improvement.
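
The abstract does not spell out the proposed cost function, so the sketch below is not the authors' method. It only illustrates one common way to handle unbalanced binary data: a cross-entropy loss in which errors on the rare positive class (here, synonym pairs) are scaled up. The function name `weighted_log_loss`, the `pos_weight` parameter, and the toy labels are all hypothetical.

```python
import math


def weighted_log_loss(labels, probs, pos_weight):
    """Mean binary cross-entropy with the positive-class term scaled by pos_weight.

    Illustrative only: a generic class-weighted loss, not the cost function
    proposed in the paper.
    """
    eps = 1e-12  # keeps log() away from zero
    total = 0.0
    for y, p in zip(labels, probs):
        p = min(max(p, eps), 1.0 - eps)
        if y == 1:
            total += -pos_weight * math.log(p)
        else:
            total += -math.log(1.0 - p)
    return total / len(labels)


# Toy unbalanced data (assumed): 1 synonym pair among 10 candidates.
labels = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0]
# A classifier that assigns a low synonym probability to every pair.
probs = [0.1] * 10

plain = weighted_log_loss(labels, probs, pos_weight=1.0)
# Hypothetical weight: ratio of negatives to positives in the toy data.
weighted = weighted_log_loss(labels, probs, pos_weight=9.0)
```

With `pos_weight=1.0` the missed synonym pair contributes little to the mean loss; raising the weight makes the same mistake much more expensive, which is the general idea behind cost functions designed for unbalanced data.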
