Language Recognition using Random Indexing

12/22/2014
by   Aditya Joshi, et al.

Random Indexing is a simple implementation of Random Projections with a wide range of applications. It can solve a variety of problems with good accuracy without introducing much complexity. Here we use it to identify the language of text samples. We present a novel method of generating language representation vectors using letter blocks. Further, we show that the method is easily implemented and requires little computational power and space. Experiments on a number of model parameters illustrate certain properties of high-dimensional sparse vector representations of data. The high success rate across various language recognition tasks shows that the language vectors are statistically relevant. On a difficult data set of 21,000 short sentences from 21 different languages, our model performs a language recognition task and achieves 97.8% accuracy, comparable to state-of-the-art methods.
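
The abstract does not spell out how letter blocks become language vectors, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes dense random +1/-1 letter vectors (rather than sparse ones), encodes each block of n letters by rotating the letter vectors by position and multiplying them elementwise, sums the block vectors into a text vector, and picks the language with the highest cosine similarity. The names `make_letter_vectors`, `encode_block`, `encode_text`, `cosine`, and `classify`, the 27-character alphabet, the dimension of 10,000, the block size of 3, and the two toy training sentences are all assumptions made for illustration.

```python
import numpy as np

# Assumed alphabet: lowercase Latin letters plus space.
ALPHABET = "abcdefghijklmnopqrstuvwxyz "

def make_letter_vectors(dim=10_000, seed=0):
    """Assign each letter a fixed random vector of +1/-1 entries."""
    rng = np.random.default_rng(seed)
    return {ch: rng.choice([-1, 1], size=dim) for ch in ALPHABET}

def encode_block(block, letters):
    """Encode a letter block: rotate each letter vector by its position
    (first letter rotated most), then multiply elementwise."""
    n = len(block)
    vec = np.ones_like(letters[block[0]])
    for i, ch in enumerate(block):
        vec = vec * np.roll(letters[ch], n - 1 - i)
    return vec

def encode_text(text, letters, n=3):
    """Sum the vectors of all sliding n-letter blocks in the text."""
    # Characters outside the assumed alphabet are simply dropped.
    clean = "".join(ch for ch in text.lower() if ch in letters)
    dim = len(next(iter(letters.values())))
    total = np.zeros(dim, dtype=np.int64)
    for i in range(len(clean) - n + 1):
        total += encode_block(clean[i:i + n], letters)
    return total

def cosine(a, b):
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

def classify(text, language_vectors, letters, n=3):
    """Return the language whose vector is closest to the text vector."""
    query = encode_text(text, letters, n)
    return max(language_vectors,
               key=lambda lang: cosine(query, language_vectors[lang]))

# Toy training data (hypothetical; a real model would use large corpora).
letters = make_letter_vectors()
training = {
    "en": "the quick brown fox jumps over the lazy dog and the cat",
    "es": "el rapido zorro marron salta sobre el perro perezoso y el gato",
}
language_vectors = {lang: encode_text(t, letters) for lang, t in training.items()}

print(classify("the dog and the fox", language_vectors, letters))
print(classify("el perro y el gato", language_vectors, letters))
```

In this sketch a whole language is summarized by a single fixed-length vector, so storage and comparison cost depend on the dimension rather than on the number of distinct letter blocks, which is in keeping with the abstract's claim of low computational and space requirements.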
