Natural Language Understanding with the Quora Question Pairs Dataset

07/01/2019
by Lakshay Sharma, et al.

This paper explores the task of Natural Language Understanding (NLU) through duplicate question detection on the Quora Question Pairs dataset. We conducted extensive exploration of the dataset and evaluated various machine learning models, including linear and tree-based models. Our main finding was that a simple Continuous Bag of Words neural network model performed best, outperforming more complicated recurrent and attention-based models. We also conducted error analysis and found some subjectivity in the labeling of the dataset.
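
To make the Continuous Bag of Words idea concrete, the sketch below encodes each question as the average of its word vectors and compares the two encodings. It is only an illustration under stated assumptions, not the authors' model: the names `embed_word`, `cbow_encode`, and `cosine` are hypothetical, the embedding size is arbitrary, the word vectors are deterministic pseudo-random stand-ins for a learned embedding table, and cosine similarity stands in for whatever trained classifier the paper places on top of the encodings.

```python
import math
import random

DIM = 16  # hypothetical embedding size, chosen only for illustration


def embed_word(word, dim=DIM):
    """Stand-in for a learned embedding table (assumption).

    Seeding a generator with the word gives the same vector every time,
    so the example is deterministic without any trained weights.
    """
    rng = random.Random(word)
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def cbow_encode(question, dim=DIM):
    """Encode a question as the mean of its word vectors.

    Word order is discarded, which is what makes this a bag of words.
    """
    tokens = question.lower().split()
    if not tokens:
        return [0.0] * dim
    vectors = [embed_word(token, dim) for token in tokens]
    return [sum(column) / len(tokens) for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    q1 = "How do I learn Python"
    q2 = "What is the best way to learn Python"
    print(cosine(cbow_encode(q1), cbow_encode(q2)))
```

Because the encoder averages word vectors, two questions containing the same words in a different order receive the same encoding. With untrained stand-in vectors the similarity score itself carries no meaning; in a real system the embeddings and the decision layer would be learned from the labeled question pairs.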


research
05/13/2020

The Unstoppable Rise of Computational Linguistics in Deep Learning

In this paper, we trace the history of neural networks applied to natura...
research
11/24/2015

Natural Language Understanding with Distributed Representation

This is a lecture note for the course DS-GA 3001 <Natural Language Under...
research
10/31/2017

Whodunnit? Crime Drama as a Case for Natural Language Understanding

In this paper we argue that crime drama exemplified in television progra...
research
09/02/2021

Do Prompt-Based Models Really Understand the Meaning of their Prompts?

Recently, a boom of papers have shown extraordinary progress in few-shot...
research
08/12/2018

Interpreting Recurrent and Attention-Based Neural Models: a Case Study on Natural Language Inference

Deep learning models have achieved remarkable success in natural languag...
research
06/04/2020

Experiments on Paraphrase Identification Using Quora Question Pairs Dataset

We modeled the Quora question pairs dataset to identify a similar questi...
research
08/05/2019

A Weakly-Supervised Attention-based Visualization Tool for Assessing Political Affiliation

In this work, we seek to finetune a weakly-supervised expert-guided Deep...
