Text-mining the NeuroSynth corpus using Deep Boltzmann Machines

05/01/2016
by   Ricardo Pio Monti, et al.
0

Large-scale automated meta-analysis of neuroimaging data has recently established itself as an important tool in advancing our understanding of human brain function. This research has been pioneered by NeuroSynth, a database collecting both brain activation coordinates and associated text across a large cohort of neuroimaging research papers. One of the fundamental aspects of such meta-analysis is text-mining. To date, word counts and more sophisticated methods such as Latent Dirichlet Allocation have been proposed. In this work we present an unsupervised study of the NeuroSynth text corpus using Deep Boltzmann Machines (DBMs). The use of DBMs yields several advantages over the aforementioned methods, principal among which is the fact that it yields both word and document embeddings in a high-dimensional vector space. Such embeddings serve to facilitate the use of traditional machine learning techniques on the text corpus. The proposed DBM model is shown to learn embeddings with a clear semantic structure.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2016

Topic Modeling Using Distributed Word Embeddings

We propose a new algorithm for topic modeling, Vec2Topic, that identifie...
research
04/11/2018

Learning Topics using Semantic Locality

The topic modeling discovers the latent topic probability of the given t...
research
04/24/2015

Efficient Non-parametric Estimation of Multiple Embeddings per Word in Vector Space

There is rising interest in vector-space word embeddings and their use i...
research
07/12/2018

Tracking the Evolution of Words with Time-reflective Text Representations

More than 80 unstructured datasets evolving over time. A large part of t...
research
09/10/2023

Chat2Brain: A Method for Mapping Open-Ended Semantic Queries to Brain Activation Maps

Over decades, neuroscience has accumulated a wealth of research results ...
research
05/20/2016

As Cool as a Cucumber: Towards a Corpus of Contemporary Similes in Serbian

Similes are natural language expressions used to compare unlikely things...
research
01/30/2022

Recognition of Implicit Geographic Movement in Text

Analyzing the geographic movement of humans, animals, and other phenomen...

Please sign up or login with your details

Forgot password? Click here to reset