Identification of Candidate Markers for Alzheimer’s Disease by Combined Machine Learning with GEO Database and Experimental Validation

ZHANG Huie 1 , XIAO Mengli 1 , JI Jinjin 1 , CHENG Yurong 2 , LU Fang 1

Neural Injury and Functional Reconstruction ›› 0

PDF(3434 KB)
中国科技核心期刊
美国《化学文摘》CAS数据库收录
日本科学技术振兴机构数据库收录
湖北省优秀期刊
中国知网网络首发期刊
PDF(3434 KB)
Neural Injury and Functional Reconstruction ›› 0 DOI: 10.16780/j.cnki.sjssgncj.20250944

Identification of Candidate Markers for Alzheimer’s Disease by Combined Machine Learning with GEO Database and Experimental Validation

  • ZHANG Huie 1 , XIAO Mengli 1 , JI Jinjin 1 , CHENG Yurong 2 , LU Fang
Author information +
History +

Abstract

To screen Alzheimer’s disease (AD) biomarkers using GEO (Gene Expression
Omnibus) database combined with machine learning. Methods: A total of 339 samples were included, including
168 AD samples and 171 samples from normal healthy people. The GEO database screened the datasets to derive
the differentially expressed genes, screened the predictive gene models by two algorithms: least absolute
shrinkage and selection operator (LASSO) logistic regression and random forest (RF), and plotted ROC curves to
evaluate the models. Clinical datasets (including multiple groups of AD patients and healthy control samples)
were used to validate the predicted genes. RT-qPCR quantitatively analyzed the expression of the predicted genes
in the normal and model groups of the AD cell model. Result: LASSO yielded 84 key markers, and RF
algorithm identified 7 genes. Venn diagram screening yielded overlapping genes for 2 algorithms, including
PLSCR4, GLIS3, PHYHD1, and HVCN1. ROC curves in the test set showed that the area under the curve of
these 4 candidate genes was greater than 0.7, and ROC curves in the validation set also showed that the area
under the curve of 3 of these candidates was greater than 0.7, among which GLIS3 (AUC=0.891) and HVCN1
(AUC=0.953) exhibited excellent diagnostic performance (AUC>0.89). The RT-qPCR method revealed that the
relative expression of PLSCR4, GLIS3, PHYHD1 and HVCN1 was elevated in the AD cell model compared
with the normal control group (all P<0.01), and the results were consistent with the bioinformatic predictions.
Conclusion: PLSCR4, GLIS3, PHYHD1 and HVCN1 may be used as molecular markers for clinical diagnosis
of AD.

Key words

Alzheimer’s disease
/ machine learning / diagnostic markers / experimental validation

Cite this article

Download Citations
ZHANG Huie 1 , XIAO Mengli 1 , JI Jinjin 1 , CHENG Yurong 2 , LU Fang 1.
Identification of Candidate Markers for Alzheimer’s Disease by Combined Machine Learning with GEO Database and Experimental Validation
[J]. Neural Injury and Functional Reconstruction. 0 https://doi.org/10.16780/j.cnki.sjssgncj.20250944
PDF(3434 KB)

Accesses

Citation

Detail

Sections
Recommended

/