Diagnostic Analysis of Depression Based on XGBoost Modeling

Authors

  • Yumeng Zhang
  • Baiyu Lu
  • Shuai Qing
  • Jing Li

DOI:

https://doi.org/10.54691/k4716c35

Keywords:

Depression Diagnosis; XGBoost; Feature Selection.

Abstract

This study aims to respond to the demand for research on the genetic mechanism of depression and to utilize gene expression data to construct an efficient depression diagnosis system. In view of the high-dimensional nature of gene expression data, cross-platform differences and limited sample size, this study first eliminates platform differences through Z-Score standardization, screens public gene features, and handles tag coding and missing values to ensure the consistency and completeness of the data. On this basis, the XGBoost binary classification model with AUC as the main performance index is constructed. The model gives full play to its feature selection and high-dimensional data processing capabilities to effectively cope with the situation of large number of features and small sample size. Key genes were identified through feature importance analysis, and the consistency and robustness of the model were verified using sensitivity analysis and stability test. This study provides a scientific and effective solution for clinical diagnosis of depression and precision medicine research.

Downloads

Download data is not yet available.

References

[1] Liu Yang, Su Zhaozhong, Zeng Fanjun, et al. Research progress of high-throughput sequencing standards[J]. Journal of Metrology,2024,45(01):128-134.

[2] Wu Yongzhi. Clinicopathological significance of gene expression profiling technology in detecting CUP patients[D]. Wannan Medical College, 2022.DOI: 10.27374/d.cnki. gwnyy.2022. 000174.

[3] Wang Cao. Research on depression diagnosis based on multimodal data [D]. North China Electric Power University (Beijing), 2022.DOI: 10.27140/d.cnki.ghbbu.2022.000354.

[4] Cui, Linshang. Application of composite XGBoost model on classification prediction of unbalanced datasets [D]. Lanzhou University,2018.

[5] Yang Yanping, Li Rong. Classification feature selection for high-dimensional data based on machine learning[J]. Journal of Hunan College of Arts and Sciences (Natural Science Edition),2025,37 (01): 23-31.

[6] Huang Zhiqiang, Zhong Shijiang. Research progress on the application of machine learning in the auxiliary diagnosis of depression[J]. Armed Police Medicine,2024,35(09): 806-812.DOI: 10. 14010/ j. cnki. wjyx.2024.09.016.

Downloads

Published

2025-03-19

Issue

Section

Articles

How to Cite

Zhang, Y., Lu, B., Qing, S., & Li, J. (2025). Diagnostic Analysis of Depression Based on XGBoost Modeling. Scientific Journal of Technology, 7(3), 223-229. https://doi.org/10.54691/k4716c35