Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Investigating Alzheimer's Disease Using a Sequential Analytical Pipeline for High-Dimensional, Low-Sample Biomedical Data. / Shchegoleva, N; Tonka, P; Zalutskaya, N.
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2025 WORKSHOPS, PT IX. Springer Nature, 2026. p. 340-355 (Lecture Notes in Computer Science; Vol. 15894 LNCS).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
}
TY - GEN
T1 - Investigating Alzheimer's Disease Using a Sequential Analytical Pipeline for High-Dimensional, Low-Sample Biomedical Data
AU - Shchegoleva, N
AU - Tonka, P
AU - Zalutskaya, N
N1 - Times Cited in Web of Science Core Collection: 0 Total Times Cited: 0 Cited Reference Count: 34
PY - 2026
Y1 - 2026
N2 - Dementia, particularly Alzheimer's disease, is a growing global medical and economic problem exacerbated by increasing life expectancy and aging population. This study demonstrates the critical need for efficient data processing methods in biomedical research, focusing on scenarios where the number of observations is smaller than the dimensionality of the data. Based on a dataset of patients with various neurodegenerative diseases including Alzheimer's disease, this paper describes a comprehensive data processing pipeline including dimensionality reduction and clustering. The methodology is based on the combined use of principal component analysis (PCA) and uniform manifold approximation and projection (UMAP) for dimensionality reduction, followed by clustering with optimized parameters. The study demonstrates how these methods can mitigate problems such as data sparsity and the curse of dimensionality, providing a pipeline to gain insights into potential biomarkers for early diagnosis of neurodegenerative diseases. Although this study does not directly interpret clustering results, it lays the groundwork for experts in the field to explore new hypotheses and improve diagnostic tools.
AB - Dementia, particularly Alzheimer's disease, is a growing global medical and economic problem exacerbated by increasing life expectancy and aging population. This study demonstrates the critical need for efficient data processing methods in biomedical research, focusing on scenarios where the number of observations is smaller than the dimensionality of the data. Based on a dataset of patients with various neurodegenerative diseases including Alzheimer's disease, this paper describes a comprehensive data processing pipeline including dimensionality reduction and clustering. The methodology is based on the combined use of principal component analysis (PCA) and uniform manifold approximation and projection (UMAP) for dimensionality reduction, followed by clustering with optimized parameters. The study demonstrates how these methods can mitigate problems such as data sparsity and the curse of dimensionality, providing a pipeline to gain insights into potential biomarkers for early diagnosis of neurodegenerative diseases. Although this study does not directly interpret clustering results, it lays the groundwork for experts in the field to explore new hypotheses and improve diagnostic tools.
KW - Alzheimer's disease
KW - dimensionality reduction
KW - clustering
KW - PCA
KW - UMAP
KW - HDBSCAN
UR - https://www.mendeley.com/catalogue/86199943-6e73-3843-8a10-c50458d152ea/
U2 - 10.1007/978-3-031-97648-3_23
DO - 10.1007/978-3-031-97648-3_23
M3 - статья в сборнике материалов конференции
SN - 9783031976476
T3 - Lecture Notes in Computer Science
SP - 340
EP - 355
BT - COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2025 WORKSHOPS, PT IX
PB - Springer Nature
Y2 - 30 June 2025 through 3 July 2025
ER -
ID: 151949548