Automatic Identification of Alzheimer's Disease using Lexical Features extracted from Language Samples

07/16/2023
by   M. Zakaria Kurdi, et al.
0

Objective: this study has a twofold goal. First, it aims to improve the understanding of the impact of Dementia of type Alzheimer's Disease (AD) on different aspects of the lexicon. Second, it aims to demonstrate that such aspects of the lexicon, when used as features of a machine learning classifier, can help achieve state-of-the-art performance in automatically identifying language samples produced by patients with AD. Methods: data is derived from the ADDreSS challenge, which is a part of the DementiaBank corpus. The used dataset consists of transcripts of Cookie Theft picture descriptions, produced by 54 subjects in the training part and 24 subjects in the test part. The number of narrative samples is 108 in the training set and 48 in the test set. First, the impact of AD on 99 selected lexical features is studied using both the training and testing parts of the dataset. Then some machine learning experiments were conducted on the task of classifying transcribed speech samples with text samples that were produced by people with AD from those produced by normal subjects. Several experiments were conducted to compare the different areas of lexical complexity, identify the subset of features that help achieve optimal performance, and study the impact of the size of the input on the classification. To evaluate the generalization of the models built on narrative speech, two generalization tests were conducted using written data from two British authors, Iris Murdoch and Agatha Christie, and the transcription of some speeches by former President Ronald Reagan. Results: using lexical features only, state-of-the-art classification, F1 and accuracies, of over 91 by individuals with AD from the ones produced by healthy control subjects. This confirms the substantial impact of AD on lexicon processing.

READ FULL TEXT

page 5

page 13

page 15

page 16

page 17

page 18

page 19

page 21

research
11/18/2020

Combining Prosodic, Voice Quality and Lexical Features to Automatically Detect Alzheimer's Disease

Alzheimer's Disease (AD) is nowadays the most common form of dementia, a...
research
11/29/2018

The Effect of Heterogeneous Data for Alzheimer's Disease Detection from Speech

Speech datasets for identifying Alzheimer's disease (AD) are generally r...
research
05/07/2020

A Tale of Two Perplexities: Sensitivity of Neural Language Models to Lexical Retrieval Deficits in Dementia of the Alzheimer's Type

In recent years there has been a burgeoning interest in the use of compu...
research
10/25/2021

ML-Based Analysis to Identify Speech Features Relevant in Predicting Alzheimer's Disease

Alzheimer's disease (AD) is a neurodegenerative disease that affects nea...
research
06/28/2022

Exploring linguistic feature and model combination for speech recognition based automatic AD detection

Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating p...
research
08/11/2023

Evaluating Picture Description Speech for Dementia Detection using Image-text Alignment

Using picture description speech for dementia detection has been studied...
research
11/22/2021

Longitudinal Speech Biomarkers for Automated Alzheimer's Detection

We introduce a novel audio processing architecture, the Open Voice Brain...

Please sign up or login with your details

Forgot password? Click here to reset