Tuberculosis classification using random forest with K-prototype as a method to overcome missing value

Eka Mala Sari Rochman, Miswanto -, Herry Suprajitno, Istifadatul Kamilah, Aeri Rachmad, Iwan Santosa

Abstract


Tuberculosis is a disease that attacks the core of the respiratory organs, which affects many people. This disease is one of the contributors to high mortality cases, especially in Indonesia. Based on its anatomical location, tuberculosis is divided into two classes, namely pulmonary for tuberculosis detected in lung parenchymal tissue and extrapulmonary for tuberculosis detected in organs other than the lungs. Detecting the location of the infection in the lungs requires some analysis of laboratory results for the triggering parameters where the analysis process is still done manually, so it takes longer, and because the input process is still done manually, patient data which causes the possibility of human error to be very high. Therefore, the solution offered and the aim of this study is the ease of patient diagnosis in determining the classification of TB disease. The method used in this study is k-prototype imputation to repair missing values that have different data types, then for tuberculosis data classification methods and medical record data processing using the Random Forest, Support Vector Machine, and Backpropagation methods. Of the three classification methods proposed in this study, all three have an excellent level of accuracy. However, the Random Forest method performs more than other methods, reaching 98.8%.

Full Text: PDF

Published: 2023-08-11

How to Cite this Article:

Eka Mala Sari Rochman, Miswanto -, Herry Suprajitno, Istifadatul Kamilah, Aeri Rachmad, Iwan Santosa, Tuberculosis classification using random forest with K-prototype as a method to overcome missing value, Commun. Math. Biol. Neurosci., 2023 (2023), Article ID 81

Copyright © 2023 Eka Mala Sari Rochman, Miswanto -, Herry Suprajitno, Istifadatul Kamilah, Aeri Rachmad, Iwan Santosa. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Commun. Math. Biol. Neurosci.

ISSN 2052-2541

Editorial Office: office@scik.org

 

Copyright ©2024 CMBN