Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia

Balakrishnan, Vimala and Ramanathan, Ghayathri and Zhou, Siyi and Wong, Chee Kuan (2024) Optimized support vector regression predicting treatment duration among tuberculosis patients in Malaysia. Multimedia Tools and Applications, 83 (4). 11831 – 11844. ISSN 1380-7501, DOI https://doi.org/10.1007/s11042-023-16028-y.

Full text not available from this repository.


Machine learning models have emerged as an advanced tool for predicting diseases and their outcomes. This study developed a machine learning model to predict the treatment duration for Tuberculosis patients in Malaysia based on a real-life patient dataset. Six regression models, namely Support Vector Regression, Linear Regression, Lasso Regression, Ridge Regression, Random Forest Regression, and Gradient Boosting Regression were initially developed and then optimized through hyperparameter tuning to determine the best predictive model. Using a dataset of 435 Malaysian Tuberculosis patients, we compared our results with data from countries with high Tuberculosis prevalence rates, namely Belarus, Nigeria, and Georgia. Experimentations revealed Support Vector Regression emerged as the best performing model as it can predict treatment duration with the lowest error rates (Mean Absolute Error = 69.70; Root Mean Squared Error = 109.49). Eight significant risk factors were identified for the Malaysian dataset through Pearson correlation, namely, treatment outcome, treatment status, fixed dose combination dosage, maintenance phase regimen, chest X-ray findings, tuberculin skin test, location of treatment initiation, and levofloxacin-based regimen. Comparison with data from other countries confirmed the consistent performance of the optimized Support Vector Regression model in predicting Tuberculosis treatment duration, hence rendering the model generalizable. To the best of our knowledge, this is the first study to demonstrates the effectiveness of machine learning in predicting Tuberculosis treatment duration based on potential risk factors. These findings will help clinicians make informed decisions about the optimal treatment duration, prepare patients' expectations, and estimate the cost of Tuberculosis treatment. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

Item Type: Article
Uncontrolled Keywords: Machine learning; Malaysia; Risk factors; Treatment duration; Tuberculosis
Subjects: R Medicine
Divisions: Faculty of Medicine > Medicine Department
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 16 Apr 2024 04:50
Last Modified: 16 Apr 2024 04:50
URI: http://eprints.um.edu.my/id/eprint/44989

Actions (login required)

View Item View Item