Extracting Different Levels of Speech Information from EEG Using an LSTM-Based Model

Decoding the speech signal that a person is listening to from electroencephalography (EEG) recordings of the brain can help us understand how the auditory system works. Linear models have been used to reconstruct the EEG from speech or vice versa. Recently, artificial neural networks (ANNs) such as convolutional neural network (CNN) and long short-term memory (LSTM) based architectures have outperformed linear models in modeling the relation between EEG and speech. Before attempting to use these models in real-world applications such as hearing tests or (second) language comprehension assessment, we need to know what level of speech information these models utilize. In this study, we analyze the performance of an LSTM-based model using different levels of speech features. The task of the model is to determine which of two given speech segments matches the recorded EEG. We used low- and high-level speech features: envelope, mel spectrogram, voice activity, phoneme identity, and word embedding. Our results suggest that the model exploits information about silences, intensity, and broad phonetic classes from the EEG. Furthermore, the mel spectrogram, which contains all this information, yields the highest accuracy (84
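
The two-candidate task described in the abstract can be illustrated with a minimal sketch. It assumes that some model (in the paper, an LSTM-based one) has already mapped the EEG segment and both candidate speech segments to fixed-length embedding vectors. The cosine-similarity decision rule and the names `cosine_similarity`, `pick_matched_segment`, and `accuracy` are hypothetical and chosen for illustration; the abstract does not specify how the model compares the segments.

```python
import math
from typing import Sequence, Tuple, List

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector carries no direction
    return dot / (norm_a * norm_b)


def pick_matched_segment(eeg: Vector, cand_a: Vector, cand_b: Vector) -> int:
    """Return 0 if cand_a is judged to match the EEG, otherwise 1.

    Assumption: the candidate whose embedding is more similar to the
    EEG embedding is taken as the matched speech segment.
    """
    sim_a = cosine_similarity(eeg, cand_a)
    sim_b = cosine_similarity(eeg, cand_b)
    return 0 if sim_a >= sim_b else 1


def accuracy(trials: List[Tuple[Vector, Vector, Vector, int]]) -> float:
    """Fraction of trials whose predicted index equals the true index."""
    correct = sum(
        1 for eeg, a, b, label in trials
        if pick_matched_segment(eeg, a, b) == label
    )
    return correct / len(trials)
```

Chance level in this setup is 50%, since each trial offers exactly two candidates, which is the baseline against which a reported accuracy should be read.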
