Brain2Char: A Deep Architecture for Decoding Text from Brain Recordings

09/03/2019
by   Pengfei Sun, et al.
0

Decoding language representations directly from the brain can enable new Brain-Computer Interfaces (BCI) for high bandwidth human-human and human-machine communication. Clinically, such technologies can restore communication in people with neurological conditions affecting their ability to speak. In this study, we propose a novel deep network architecture Brain2Char, for directly decoding text (specifically character sequences) from direct brain recordings (called Electrocorticography, ECoG). Brain2Char framework combines state-of-the-art deep learning modules --- 3D Inception layers for multiband spatiotemporal feature extraction from neural data and bidirectional recurrent layers, dilated convolution layers followed by language model weighted beam search to decode character sequences, optimizing a connectionist temporal classification (CTC) loss. Additionally, given the highly non-linear transformations that underlie the conversion of cortical function to character sequences, we perform regularizations on the network's latent representations motivated by insights into cortical encoding of speech production and artifactual aspects specific to ECoG data acquisition. To do this, we impose auxiliary losses on latent representations for articulatory movements, speech acoustics and session specific non-linearities. In 3 participants tested here, Brain2Char achieves 10.6%, 8.5% and 7.0% Word Error Rates (WER) respectively on vocabulary sizes ranging from 1200 to 1900 words. Brain2Char also performs well when 2 participants silently mimed sentences. These results set a new state-of-the-art on decoding text from brain and demonstrate the potential of Brain2Char as a high-performance communication BCI.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/15/2023

Decoding Chinese phonemes from intracortical brain signals with hyperbolic-space neural representations

Speech brain-computer interfaces (BCIs), which translate brain signals i...
research
08/25/2022

Decoding speech from non-invasive brain recordings

Decoding language from brain activity is a long-awaited goal in both hea...
research
12/05/2021

Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification

State-of-the-art brain-to-text systems have achieved great success in de...
research
11/13/2022

Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding

Decoding visual stimuli from brain recordings aims to deepen our underst...
research
01/19/2023

Subject-Independent Classification of Brain Signals using Skip Connections

Untapped potential for new forms of human-to-human communication can be ...
research
07/09/2019

Translating neural signals to text using a Brain-Machine Interface

Brain-Computer Interfaces (BCI) help patients with faltering communicati...

Please sign up or login with your details

Forgot password? Click here to reset