Document Classification by Inversion of Distributed Language Representations

04/27/2015
by   Matt Taddy, et al.
0

There have been many recent advances in the structure and measurement of distributed language models: those that map from words to a vector-space that is rich in information about word choice and composition. This vector-space is the distributed language representation. The goal of this note is to point out that any distributed representation can be turned into a classifier through inversion via Bayes rule. The approach is simple and modular, in that it will work with any language representation whose training can be formulated as optimizing a probability model. In our application to 2 million sentences from Yelp reviews, we also find that it performs as well as or better than complex purpose-built algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/09/2017

Neural Vector Spaces for Unsupervised Information Retrieval

We propose the Neural Vector Space Model (NVSM), a method that learns re...
research
11/19/2015

Compressing Word Embeddings

Recent methods for learning vector space representations of words have s...
research
10/06/2020

Embedding Words in Non-Vector Space with Unsupervised Graph Learning

It has become a de-facto standard to represent words as elements of a ve...
research
06/28/2016

Hierarchical Neural Language Models for Joint Representation of Streaming Documents and their Content

We consider the problem of learning distributed representations for docu...
research
08/05/2020

6VecLM: Language Modeling in Vector Space for IPv6 Target Generation

Fast IPv6 scanning is challenging in the field of network measurement as...
research
05/12/2018

Weight Initialization in Neural Language Models

Semantic Similarity is an important application which finds its use in m...
research
07/08/2021

Vector Space Morphology with Linear Discriminative Learning

This paper presents three case studies of modeling aspects of lexical pr...

Please sign up or login with your details

Forgot password? Click here to reset