Continuous Representation of Location for Geolocation and Lexical Dialectology using Mixture Density Networks

08/14/2017
by   Afshin Rahimi, et al.
0

We propose a method for embedding two-dimensional locations in a continuous vector space using a neural network-based model incorporating mixtures of Gaussian distributions, presenting two model variants for text-based geolocation and lexical dialectology. Evaluated over Twitter data, the proposed model outperforms conventional regression-based geolocation and provides a better estimate of uncertainty. We also show the effectiveness of the representation for predicting words from location in lexical dialectology, and evaluate it using the DARE dataset.

READ FULL TEXT

page 7

page 8

research
06/30/2023

Japanese Lexical Complexity for Non-Native Readers: A New Dataset

Lexical complexity prediction (LCP) is the task of predicting the comple...
research
10/17/2017

Specialising Word Vectors for Lexical Entailment

We present LEAR (Lexical Entailment Attract-Repel), a novel post-process...
research
12/20/2014

Word Representations via Gaussian Embedding

Current work in lexical distributed representations maps each word to a ...
research
05/08/2017

Density Estimation for Geolocation via Convolutional Mixture Density Network

Nowadays, geographic information related to Twitter is crucially importa...
research
06/03/2016

Learning Stylometric Representations for Authorship Analysis

Authorship analysis (AA) is the study of unveiling the hidden properties...
research
09/15/2023

Encoded Summarization: Summarizing Documents into Continuous Vector Space for Legal Case Retrieval

We present our method for tackling a legal case retrieval task by introd...
research
05/31/2020

A Unified Feature Representation for Lexical Connotations

Ideological attitudes and stance are often expressed through subtle mean...

Please sign up or login with your details

Forgot password? Click here to reset