Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning

09/26/2022
by   Thanh-Dung Le, et al.
0

When dealing with clinical text classification on a small dataset recent studies have confirmed that a well-tuned multilayer perceptron outperforms other generative classifiers, including deep learning ones. To increase the performance of the neural network classifier, feature selection for the learning representation can effectively be used. However, most feature selection methods only estimate the degree of linear dependency between variables and select the best features based on univariate statistical tests. Furthermore, the sparsity of the feature space involved in the learning representation is ignored. Goal: Our aim is therefore to access an alternative approach to tackle the sparsity by compressing the clinical representation feature space, where limited French clinical notes can also be dealt with effectively. Methods: This study proposed an autoencoder learning algorithm to take advantage of sparsity reduction in clinical note representation. The motivation was to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space. The classification performance of the classifiers was then evaluated in the trained and compressed feature space. Results: The proposed approach provided overall performance gains of up to 3 achieved a 92 detecting the patient's condition. Furthermore, the compression working mechanism and the autoencoder prediction process were demonstrated by applying the theoretic information bottleneck framework.

READ FULL TEXT
research
04/08/2021

Detecting of a Patient's Condition From Clinical Narratives Using Natural Language Representation

This paper proposes a joint clinical natural language representation lea...
research
01/07/2018

Graph Autoencoder-Based Unsupervised Feature Selection with Broad and Local Data Structure Preservation

Feature selection is a dimensionality reduction technique that selects a...
research
01/05/2014

Feature Selection Using Classifier in High Dimensional Data

Feature selection is frequently used as a pre-processing step to machine...
research
12/06/2017

Sparsity Regularization for classification of large dimensional data

Feature selection has evolved to be a very important step in several mac...
research
12/20/2019

Group-Connected Multilayer Perceptron Networks

Despite the success of deep learning in domains such as image, voice, an...
research
04/08/2021

Machine Learning Based on Natural Language Processing to Detect Cardiac Failure in Clinical Narratives

The purpose of the study presented herein is to develop a machine learni...
research
08/21/2023

Feature Extraction Using Deep Generative Models for Bangla Text Classification on a New Comprehensive Dataset

The selection of features for text classification is a fundamental task ...

Please sign up or login with your details

Forgot password? Click here to reset