Evaluation of risk factors and survival rates of patients with early-stage breast cancer with machine learning and traditional methods
•Comparison of prediction performances of traditional statistical methods and machine learning algorithms in terms of survival rate and C index. This article is aimed to make predictions in terms of prognostic factors and compare prediction methods by using Cox proportional hazards regression analys...
Saved in:
Published in: | International journal of medical informatics (Shannon, Ireland) Vol. 190; p. 105548 |
---|---|
Main Authors: | , , , |
Format: | Journal Article |
Language: | English |
Published: |
Ireland
Elsevier B.V
01-10-2024
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Comparison of prediction performances of traditional statistical methods and machine learning algorithms in terms of survival rate and C index.
This article is aimed to make predictions in terms of prognostic factors and compare prediction methods by using Cox proportional hazards regression analysis (CPH), some machine learning techniques and Accelerated Failure Time (AFT) model for post-treatment survival probabilities according to clinical presentations and pathological information of early-stage breast cancer patients.
The study was carried out in three stages. In the first stage, the CPH method was applied. In the second stage, the AFT model and in the last stage, machine learning methods were applied. The data set consists of 697 breast cancer patients who applied to Marmara University Hospital oncology clinic between 01.01.1994 and 31.12.2009. The models obtained by using various parameters of the patients were compared according to the C index, 5-year survival rate and 10-year survival rate.
According to the models obtained as a result of the analyses applied, MetLN and age were obtained as a significant risk factor as a result of CPH method and AFT methods, while MetLN, age, tumor size, LV1 and extracapsular involvement were obtained as risk factors in machine learning methods. In addition, when the c-index values of the handheld models are examined, it is obtained as 69.8 for the CPH model, 70.36 for the AFT model, 72.1 for the random survival forest and 72.8 for the gradient boosting machine. In conclusion, the study highlights the potential of comparing conventional statistical methods and machine-learning algorithms to improve the precision of risk factor determination in early-stage breast cancer prognosis. Additionally, efforts should be made to enhance the interpretability of machine-learning models, ensuring that the results obtained can be effectively communicated and utilized by clinical practitioners. This would enable more informed decision-making and personalized care in the treatment and follow-up processes for early-stage breast cancer patients. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1386-5056 1872-8243 1872-8243 |
DOI: | 10.1016/j.ijmedinf.2024.105548 |