Optimizing Neural Network Hyperparameters with Gaussian Processes for Dialog Act Classification

09/27/2016
by   Franck Dernoncourt, et al.

Systems based on artificial neural networks (ANNs) have achieved state-of-the-art results in many natural language processing tasks. Although ANNs do not require manually engineered features, they have many hyperparameters that must be optimized, and the choice of hyperparameters significantly affects model performance. ANN hyperparameters are typically chosen by manual, grid, or random search, which either require expertise or are computationally expensive. Recent approaches based on Bayesian optimization using Gaussian processes (GPs) offer a more systematic way to automatically find optimal or near-optimal machine learning hyperparameters. Using a previously published ANN model that yields state-of-the-art results for dialog act classification, we demonstrate that optimizing hyperparameters with GPs further improves the results and reduces the computational time by a factor of 4 compared to a random search. GP-based optimization is therefore a useful technique for tuning ANN models to obtain the best performance on natural language processing tasks.
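
To make the idea of GP-based hyperparameter search concrete, here is a minimal sketch in plain Python. It is not the paper's implementation or setup. The toy `objective` is a hypothetical stand-in for validation accuracy as a function of a single hyperparameter scaled to [0, 1]. The squared-exponential kernel, its length scale, the expected-improvement acquisition function, the candidate grid, and all function names are assumptions chosen for illustration.

```python
import math
import random

def rbf(a, b, length=0.2):
    """Squared-exponential kernel between two scalars (assumed length scale)."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(A[i][i] - s)
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]
    return L

def solve_lower(L, b):
    """Solve L y = b by forward substitution."""
    y = []
    for i in range(len(b)):
        y.append((b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y

def solve_upper_t(L, y):
    """Solve L^T x = y by back substitution."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_fit(xs, ys, noise=1e-4):
    """Factor the kernel matrix once; the GP prior mean is the mean of ys."""
    K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(xs)]
         for i, a in enumerate(xs)]
    L = cholesky(K)
    mean_y = sum(ys) / len(ys)
    alpha = solve_upper_t(L, solve_lower(L, [y - mean_y for y in ys]))
    return list(xs), L, alpha, mean_y

def gp_predict(model, x_new):
    """Posterior mean and standard deviation at x_new."""
    xs, L, alpha, mean_y = model
    k_star = [rbf(x_new, a) for a in xs]
    mu = mean_y + sum(k * a for k, a in zip(k_star, alpha))
    v = solve_lower(L, k_star)
    var = max(rbf(x_new, x_new) - sum(vi * vi for vi in v), 1e-12)
    return mu, math.sqrt(var)

def expected_improvement(mu, sigma, best):
    """Expected improvement over the best observed value (maximization)."""
    z = (mu - best) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)
    cdf = 0.5 * (1 + math.erf(z / math.sqrt(2)))
    return (mu - best) * cdf + sigma * pdf

def objective(x):
    """Hypothetical stand-in for validation accuracy; peaks at x = 0.3."""
    return math.exp(-((x - 0.3) / 0.15) ** 2)

def bayes_opt(n_init=3, n_iter=10, seed=0):
    """Evaluate a few random points, then pick each next point by maximizing EI."""
    rng = random.Random(seed)
    grid = [i / 100 for i in range(101)]
    xs = rng.sample(grid, n_init)
    ys = [objective(x) for x in xs]
    for _ in range(n_iter):
        model = gp_fit(xs, ys)
        best = max(ys)
        candidates = [x for x in grid if x not in xs]
        x_next = max(candidates,
                     key=lambda x: expected_improvement(*gp_predict(model, x), best))
        xs.append(x_next)
        ys.append(objective(x_next))
    i = max(range(len(ys)), key=ys.__getitem__)
    return xs[i], ys[i]

if __name__ == "__main__":
    x_best, y_best = bayes_opt()
    print(f"best hyperparameter: {x_best:.2f}, objective: {y_best:.4f}")
```

In a real setting, each call to `objective` would train and evaluate the network, which is the expensive step. The surrogate model exists to decide which configuration is worth that cost, rather than sampling configurations blindly as random search does.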


research
04/25/2016

CMA-ES for Hyperparameter Optimization of Deep Neural Networks

Hyperparameters of deep neural networks are often optimized by grid sear...
research
08/19/2019

Towards Assessing the Impact of Bayesian Optimization's Own Hyperparameters

Bayesian Optimization (BO) is a common approach for hyperparameter optim...
research
11/10/2017

Efficient Representation for Natural Language Processing via Kernelized Hashcodes

Kernel similarity functions have been successfully applied in classifica...
research
12/18/2021

GPEX, A Framework For Interpreting Artificial Neural Networks

Machine learning researchers have long noted a trade-off between interpr...
research
08/10/2015

Learning Structural Kernels for Natural Language Processing

Structural kernels are a flexible learning paradigm that has been widely...
research
05/25/2021

Extending the Abstraction of Personality Types based on MBTI with Machine Learning and Natural Language Processing

A data-centric approach with Natural Language Processing (NLP) to predic...
research
04/11/2018

Word2Vec applied to Recommendation: Hyperparameters Matter

Skip-gram with negative sampling, a popular variant of Word2vec original...
