Explainable Machine Learning Approach for Hepatitis C Diagnosis Using SFS Feature Selection

Hepatitis C is a significant public health concern, resulting in substantial morbidity and mortality worldwide. Early diagnosis and effective treatment are essential to prevent the disease’s progression to chronic liver disease. Machine learning algorithms have been increasingly used to develop pred...

Full description

Saved in:

Bibliographic Details
Published in:	Machines (Basel) Vol. 11; no. 3; p. 391
Main Authors:	Ali, Ali Mohd, Hassan, Mohammad R, Aburub, Faisal, Alauthman, Mohammad, Aldweesh, Amjad, Al-Qerem, Ahmad, Jebreen, Issam, Nabot, Ahmad
Format:	Journal Article
Language:	English
Published:	Basel MDPI AG 01-03-2023
Subjects:	Accuracy Algorithms Analysis classification algorithms Creatinine data augmentation Data mining Datasets Decision trees Development and progression Diagnosis Diagnostic systems Effectiveness Endoscopy Enzymes Esophagus Feature selection Fuzzy logic Health aspects Hepatitis C Identification methods Liver Liver cancer Liver cirrhosis Liver diseases Machine learning Medical records Medical research Medicine, Experimental Model accuracy Mortality Neural networks Patients Performance evaluation Prediction models Public health SHAP Support vector machines Variables Egypt
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Hepatitis C is a significant public health concern, resulting in substantial morbidity and mortality worldwide. Early diagnosis and effective treatment are essential to prevent the disease’s progression to chronic liver disease. Machine learning algorithms have been increasingly used to develop predictive models for various diseases, including hepatitis C. This study aims to evaluate the performance of several machine learning algorithms in diagnosing chronic liver disease, with a specific focus on hepatitis C, to improve the cost-effectiveness and efficiency of the diagnostic process. We collected a comprehensive dataset of 1801 patient records, each with 12 distinct features, from Jordan University Hospital. To assess the robustness and dependability of our proposed framework, we conducted two research scenarios, one with feature selection and one without. We also employed the Sequential Forward Selection (SFS) method to identify the most relevant features that can enhance the model’s accuracy. Moreover, we investigated the effect of the synthetic minority oversampling technique (SMOTE) on the accuracy of the model’s predictions. Our findings indicate that all machine learning models achieved an average accuracy of 83% when applied to the dataset. Furthermore, the use of SMOTE did not significantly affect the accuracy of the model’s predictions. Despite the increasing use of machine learning models in medical diagnosis, there is a growing concern about their interpretability. As such, we addressed this issue by utilizing the Shapley Additive Explanations (SHAP) method to explain the predictions of our machine learning model, which was specifically developed for hepatitis C prediction in Jordan. This work provides a comprehensive evaluation of various machine learning algorithms in diagnosing chronic liver disease, with a particular emphasis on hepatitis C. The results provide valuable insights into the cost-effectiveness and efficiency of the diagnostic process and highlight the importance of interpretability in medical diagnosis.
ISSN:	2075-1702 2075-1702
DOI:	10.3390/machines11030391