TheanoLM - An Extensible Toolkit for Neural Network Language Modeling

05/03/2016
by   Seppo Enarvi, et al.
0

We present a new tool for training neural network language models (NNLMs), scoring sentences, and generating text. The tool has been written using Python library Theano, which allows researcher to easily extend it and tune any aspect of the training process. Regardless of the flexibility, Theano is able to generate extremely fast native code that can utilize a GPU or multiple CPU cores in order to parallelize the heavy numerical computations. The tool has been evaluated in difficult Finnish and English conversational speech recognition tasks, and significant improvement was obtained over our best back-off n-gram models. The results that we obtained in the Finnish task were compared to those from existing RNNLM and RWTHLM toolkits, and found to be as good or better, while training times were an order of magnitude shorter.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2017

Dual Language Models for Code Mixed Speech Recognition

In this work, we present a new approach to language modeling for bilingu...
research
11/05/2018

The Marchex 2018 English Conversational Telephone Speech Recognition System

In this paper, we describe recent improvements to the production Marchex...
research
08/27/2018

Large Margin Neural Language Model

We propose a large margin criterion for training neural language models....
research
05/04/2018

Pytrec_eval: An Extremely Fast Python Interface to trec_eval

We introduce pytrec_eval, a Python interface to the tree_eval informatio...
research
01/30/2018

Accelerating recurrent neural network language model based online speech recognition system

This paper presents methods to accelerate recurrent neural network based...
research
04/27/2016

The IBM 2016 English Conversational Telephone Speech Recognition System

We describe a collection of acoustic and language modeling techniques th...
research
02/14/2020

Integrating Discrete and Neural Features via Mixed-feature Trans-dimensional Random Field Language Models

There has been a long recognition that discrete features (n-gram feature...

Please sign up or login with your details

Forgot password? Click here to reset