Siamese Neural Networks with Random Forest for detecting duplicate question pairs

01/22/2018
by   Ameya Godbole, et al.
0

Determining whether two given questions are semantically similar is a fairly chal- lenging task given the different struc- tures and forms that the questions can take. In this paper, we use Gated Re- current Units(GRU) in combination with other highly used machine learning algo- rithms like Random Forest, Adaboost and SVM for the similarity prediction task on a dataset released by Quora, consisting of about 400k labeled question pairs. We got the best result by using the Siamese adaptation of a Bidirectional GRU with a Random Forest classifier, which landed us among the top 24 Quora Question Pairs hosted on Kaggle.

READ FULL TEXT
research
04/27/2017

A Siamese Deep Forest

A Siamese Deep Forest (SDF) is proposed in the paper. It is based on the...
research
09/27/2020

Machine Learning for Searching the Dark Energy Survey for Trans-Neptunian Objects

In this paper we investigate how implementing machine learning could imp...
research
09/03/2021

Contextualized Embeddings based Convolutional Neural Networks for Duplicate Question Identification

Question Paraphrase Identification (QPI) is a critical task for large-sc...
research
03/24/2022

Random Forest Regression for continuous affect using Facial Action Units

In this paper we describe our approach to the arousal and valence track ...
research
11/12/2019

Prediction of Missing Semantic Relations in Lexical-Semantic Network using Random Forest Classifier

This study focuses on the prediction of missing six semantic relations (...
research
07/29/2016

Authorship Verification - An Approach based on Random Forest

Authorship attribution, being an important problem in many areas in-clud...
research
04/22/2016

Detecting state of aggression in sentences using CNN

In this article we study verbal expression of aggression and its detecti...

Please sign up or login with your details

Forgot password? Click here to reset