Detail publikačního výsledku

Toward Inclusive Large-Scale Alzheimer’s Disease Detection via Speech and Language Modeling

FAVARO, A.; NOVOTNÝ, K.; HE, Y.; LENG, Y.; DAS, S.; MEKYSKA, J.; MORO-VELÁZQUEZ, L.; DEHAK, N.

Originální název

Toward Inclusive Large-Scale Alzheimer’s Disease Detection via Speech and Language Modeling

Anglický název

Toward Inclusive Large-Scale Alzheimer’s Disease Detection via Speech and Language Modeling

Druh

Stať ve sborníku v databázi WoS či Scopus

Originální abstrakt

Previous speech-based detection methods for Alzheimer’s Disease and Related Dementias (ADRD) have been constrained by small sample sizes, reliance on single corpora, languages, tasks, and recording conditions, limiting their generalizability. Moreover, many studies have simplified cognitive decline progression through binary classification. To address these limitations, we developed a multimodal framework integrating language-agnostic, multilingual, and language-dependent models with demographic data to enhance adaptability across diverse cohorts. We applied this model to a three-class classification problem—cognitively normal controls (CNs), Mild Cognitive Impairment (MCI), and ADRD—using the PREPARE Challenge corpus, which includes 2058 speakers (1140 CNs, 268 MCI, and 650 ADRD). Our best-performing model achieved an F1 score of 0.71 and a log loss of 0.63 on the internal test set, with strong generalization to external test data. Bias mitigation strategies addressed demographic imbalances, including model fusion, data augmentation, and weighted cross-entropy loss. However, challenges remain for underrepresented subgroups. This study highlights the importance of integrating generalizable and language-specific features for scalable, accurate ADRD detection. Future work will expand the dataset to include more languages, improve task diversity, and refine fusion strategies to enhance robustness and scalability in clinical settings.

Anglický abstrakt

Previous speech-based detection methods for Alzheimer’s Disease and Related Dementias (ADRD) have been constrained by small sample sizes, reliance on single corpora, languages, tasks, and recording conditions, limiting their generalizability. Moreover, many studies have simplified cognitive decline progression through binary classification. To address these limitations, we developed a multimodal framework integrating language-agnostic, multilingual, and language-dependent models with demographic data to enhance adaptability across diverse cohorts. We applied this model to a three-class classification problem—cognitively normal controls (CNs), Mild Cognitive Impairment (MCI), and ADRD—using the PREPARE Challenge corpus, which includes 2058 speakers (1140 CNs, 268 MCI, and 650 ADRD). Our best-performing model achieved an F1 score of 0.71 and a log loss of 0.63 on the internal test set, with strong generalization to external test data. Bias mitigation strategies addressed demographic imbalances, including model fusion, data augmentation, and weighted cross-entropy loss. However, challenges remain for underrepresented subgroups. This study highlights the importance of integrating generalizable and language-specific features for scalable, accurate ADRD detection. Future work will expand the dataset to include more languages, improve task diversity, and refine fusion strategies to enhance robustness and scalability in clinical settings.

Klíčová slova

Alzheimer's disease; speech; language; diagnosis; acoustic models; linguistic models

Klíčová slova v angličtině

Alzheimer's disease; speech; language; diagnosis; acoustic models; linguistic models

Autoři

FAVARO, A.; NOVOTNÝ, K.; HE, Y.; LENG, Y.; DAS, S.; MEKYSKA, J.; MORO-VELÁZQUEZ, L.; DEHAK, N.

Vydáno

14.07.2025

ISBN

979-8-3315-8618-8

Kniha

47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Strany od

1

Strany do

7

Strany počet

7

URL

BibTex

@inproceedings{BUT199710,
  author="{} and Kryštof {Novotný} and  {} and  {} and  {} and Jiří {Mekyska} and  {} and  {}",
  title="Toward Inclusive Large-Scale Alzheimer’s Disease Detection via Speech and Language Modeling",
  booktitle="47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)",
  year="2025",
  pages="1--7",
  doi="10.1109/EMBC58623.2025.11254036",
  isbn="979-8-3315-8618-8",
  url="https://doi.org/10.1109/EMBC58623.2025.11254036"
}