LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks

by Hendrik Strobelt, et al.
Ingenieurbüro Strobelt

Recurrent neural networks, and in particular long short-term memory (LSTM) networks, are a remarkably effective tool for sequence modeling that learns a dense black-box hidden representation of its sequential input. Researchers interested in better understanding these models have studied the changes in hidden state representations over time and noticed some interpretable patterns but also significant noise. In this work, we present LSTMVis, a visual analysis tool for recurrent neural networks with a focus on understanding these hidden state dynamics. The tool allows users to select a hypothesis input range to focus on local state changes, to match these state changes to similar patterns in a large data set, and to align these results with structural annotations from their domain. We show several use cases of the tool for analyzing specific hidden state properties on datasets containing nesting, phrase structure, and chord progressions, and demonstrate how the tool can be used to isolate patterns for further statistical analysis. We characterize the domain, the different stakeholders, and their goals and tasks.
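
The select-then-match workflow described above can be illustrated with a small sketch. The abstract does not say how a selected range is turned into a pattern, so this example assumes a simple rule: a hidden unit belongs to the pattern if its value stays at or above a user-chosen threshold for every step in the selected range, and a match is any window of the same length in which all of those units stay above the threshold. The function names (`active_units`, `matching_ranges`), the threshold rule, and the toy data are hypothetical and are not taken from the tool itself.

```python
from typing import List, Set

def active_units(states: List[List[float]], start: int, end: int,
                 threshold: float) -> Set[int]:
    """Hidden units whose value is >= threshold at every step in [start, end).

    Assumption: this threshold rule stands in for whatever selection
    criterion the actual tool uses.
    """
    dims = len(states[0])
    return {
        d for d in range(dims)
        if all(states[t][d] >= threshold for t in range(start, end))
    }

def matching_ranges(states: List[List[float]], units: Set[int],
                    length: int, threshold: float) -> List[int]:
    """Start offsets of every window of `length` steps in which all of the
    given units stay >= threshold."""
    if not units:
        return []
    return [
        s for s in range(len(states) - length + 1)
        if all(states[t][d] >= threshold
               for t in range(s, s + length) for d in units)
    ]

# Toy hidden-state sequence: 6 time steps, 3 hidden units (made-up values).
states = [
    [0.9, -0.2, 0.1],
    [0.8, 0.7, 0.0],
    [0.7, 0.6, -0.3],
    [0.1, 0.2, 0.4],
    [0.6, 0.9, 0.5],
    [0.9, 0.8, -0.1],
]

# Hypothesis: something interesting happens over time steps 1..2.
units = active_units(states, start=1, end=3, threshold=0.5)
# Look for the same units being "on" elsewhere in the sequence.
matches = matching_ranges(states, units, length=2, threshold=0.5)
print(units, matches)
```

In this toy data, units 0 and 1 are active over the selected range, and the same pattern recurs in the window starting at step 4 (in addition to the selected window itself at step 1).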


Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural Networks

The standard LSTM recurrent neural networks while very powerful in long-...

State-Denoised Recurrent Neural Networks

Recurrent neural networks (RNNs) are difficult to train on sequence proc...

Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models

As deep neural networks continue to revolutionize various application do...

Differential Recurrent Neural Networks for Action Recognition

The long short-term memory (LSTM) neural network is capable of processin...

PyRCN: Exploration and Application of ESNs

As a family member of Recurrent Neural Networks and similar to Long-Shor...

Tustin neural networks: a class of recurrent nets for adaptive MPC of mechanical systems

The use of recurrent neural networks to represent the dynamics of unstab...