journal article Open Access Jul 31, 2021

Classification of Diseases Using Machine Learning Algorithms: A Comparative Study

Mathematics Vol. 9 No. 15 pp. 1817 · MDPI AG
View at Publisher Save 10.3390/math9151817
Abstract
Machine learning in the medical area has become a very important requirement. The healthcare professional needs useful tools to diagnose medical illnesses. Classifiers are important to provide tools that can be useful to the health professional for this purpose. However, questions arise: which classifier to use? What metrics are appropriate to measure the performance of the classifier? How to determine a good distribution of the data so that the classifier does not bias the medical patterns to be classified in a particular class? Then most important question: does a classifier perform well for a particular disease? This paper will present some answers to the questions mentioned above, making use of classification algorithms widely used in machine learning research with datasets relating to medical illnesses under the supervised learning scheme. In addition to state-of-the-art algorithms in pattern classification, we introduce a novelty: the use of meta-learning to determine, a priori, which classifier would be the ideal for a specific dataset. The results obtained show numerically and statistically that there are reliable classifiers to suggest medical diagnoses. In addition, we provide some insights about the expected performance of classifiers for such a task.
Topics

No keywords indexed for this article. Browse by subject →

References
58
[1]
A Survey on the Explainability of Supervised Machine Learning

Nadia Burkart, Marco F. Huber

Journal of Artificial Intelligence Research 2021 10.1613/jair.1.12228
[2]
Duda, R.O., Hart, P.E., and Stork, D.G. (2001). Pattern Classification, John Wiley & Sons. [2nd ed.].
[3]
No free lunch theorems for optimization

D.H. Wolpert, W.G. Macready

IEEE Transactions on Evolutionary Computation 1997 10.1109/4235.585893
[4]
Demetriou "No free lunch theorem: A review" Approximation and Optimization (2019) 10.1007/978-3-030-12767-1_5
[5]
Ruan "Class-Specific Deep Feature Weighting for Naïve Bayes Text Classifiers" IEEE Access (2020) 10.1109/access.2020.2968984
[6]
Paranjape "A novel classifier for multivariate instance using graph class signatures" Front. Comput. Sci. (2020) 10.1007/s11704-019-8263-5
[7]
Guan "Thorax disease classification with attention guided convolutional neural network" Pattern Recognit. Lett. (2020) 10.1016/j.patrec.2019.11.040
[8]
Dua, D., and Taniskidou, E.K. (2017). UCI Machine Learning Repository, University of California, School of Information and Computer Science. Available online: http://archive.ics.uci.edu/ml.
[9]
Luengo "Keel data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework" J. Mult.-Valued Log. Soft Comput. (2011)
[10]
Galar "Analysing the classification of unbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches" Knowl.-Based Syst. (2013) 10.1016/j.knosys.2013.01.018
[11]
Mullick "Appropriateness of performance indices for imbalanced data classification: An analysis" Pattern Recognit. (2020) 10.1016/j.patcog.2020.107197
[12]
The WEKA data mining software

Mark Hall, Eibe Frank, Geoffrey Holmes et al.

ACM SIGKDD Explorations Newsletter 2009 10.1145/1656274.1656278
[13]
Triguero "KEEL 3.0: An open source software for multi-stage analysis in data mining" Int. J. Comput. Intell. Syst. (2017) 10.2991/ijcis.10.1.82
[14]
Zhang "Cost-sensitive KNN classification" Neurocomputing (2020) 10.1016/j.neucom.2018.11.101
[15]
Zhao "Investigation and improvement of multi-layer perceptron neural networks for credit scoring" Expert Syst. Appl. (2015) 10.1016/j.eswa.2014.12.006
[16]
Livieris, I.E., Kanavos, A., Tampakas, V., and Pintelas, P. (2018). An Ensemble SSL Algorithm for Efficient Chest X-Ray Image Classification. J. Imaging, 4. 10.3390/jimaging4070095
[17]
Minaee, S., Yao, W., and Lui, Y.W. (2013, January 7). Prediction of Longterm Outcome of Neuropsychological Tests of MTBI Patients Using Imaging Features. Proceedings of the 2013 IEEE Signal Processing in Medicine and Biology Symposium (SPMB), Brooklyn, NY, USA. 10.1109/spmb.2013.6736783
[18]
Chan "Computer-Aided Diagnosis in the Era of Deep Learning" Med. Phys. (2020) 10.1002/mp.13764
[19]
Pathan, S., Kumar, P., Pai, R.M., and Bhandary, S.V. (2021). Automated Segmentation and Classifcation of Retinal Features for Glaucoma Diagnosis. Biomed. Signal Process. Control, 63. 10.1016/j.bspc.2020.102244
[20]
Gautam, A., and Raman, B. (2021). Towards Effective Classification of Brain Hemorrhagic and Ischemic Stroke Using CNN. Biomed. Signal Process. Control, 63. 10.1016/j.bspc.2020.102178
[21]
Mbarki, W., Bouchouicha, M., Frizzi, S., Tshibasu, F., Farhat, L.B., and Sayadi, M. (2020). Lumbar Spine Discs Classification Based on Deep Convolutional Neural Networks Using Axial View MRI. Interdiscip. Neurosurg. Adv. Tech. Case Manag., 22. 10.1016/j.inat.2020.100837
[22]
Martínez-Más, J., Bueno-Crespo, A., Martínez-España, R., Remezal-Solano, M., Ortiz-González, A., Ortiz-Reina, S., and Martínez-Cendán, J.P. (2020). Classifying Papanicolaou Cervical Smears through a Cell Merger Approach by Deep Learning Technique. Expert Syst. Appl., 160. 10.1016/j.eswa.2020.113707
[23]
Zhou "Online Transfer Learning for Differential Diagnosis of Benign and Malignant Thyroid Nodules with Ultrasound Images" IEEE Trans. Biomed. Eng. (2020) 10.1109/tbme.2020.2971065
[24]
Reyes-León, P., Salgado-Ramírez, J.C., and Velázquez-Rodríguez, J.L. (2020). Application of the Lernmatrix tau[9] to the classification of patterns in medical datasets. Int. J. Adv. Trends Comput. Sci. Eng., 9. 10.30534/ijatcse/2020/228952020
[25]
Ding "Fusion of intelligent learning for COVID-19: A state-of-the-art review and analysis on real medical data" Neurocomputing (2021) 10.1016/j.neucom.2021.06.024
[26]
Yu "ResGNet-C: A graph convolutional neural network for detection of COVID-19" Neurocomputing (2021) 10.1016/j.neucom.2020.07.144
[27]
Xu "MANet: A two-stage deep learning method for classification of COVID-19 from Chest X-ray images" Neurocomputing (2021) 10.1016/j.neucom.2021.03.034
[28]
Luján-García, J.E., Moreno-Ibarra, M.A., Villuendas-Rey, Y., and Yáñez-Márquez, C. (2020). Fast COVID-19 and Pneumonia Classification Using Chest X-Ray Images. Mathematics, 8. 10.3390/math8091423
[29]
Yazdani, S., Minaee, S., Kafieh, R., Saeedizadeh, N., and Sonka, M. (2020). COVID CT-Net: Predicting Covid-19 From Chest CT Images Using Attentional Convolutional Network. arXiv.
[30]
Gupta "InstaCovNet-19: A Deep Learning Classification Model for the Detection of COVID-19 Patients Using Chest X-Ray" Appl. Soft Comput. (2021) 10.1016/j.asoc.2020.106859
[31]
Junior "Automatic method for classifying COVID-19 patients based on chest X-ray images, using deep features and PSO-optimized XGBoost" Expert Syst. Appl. (2021) 10.1016/j.eswa.2021.115452
[32]
Rangarajan "A preliminary analysis of AI based smartphone application for diagnosis of COVID-19 using chest X-ray images" Expert Syst. Appl. (2021) 10.1016/j.eswa.2021.115401
[33]
Francisquini "Meteorological and human mobility data on predicting COVID-19 cases by a novel hybrid decomposition method with anomaly detection analysis: A case study in the capitals of Brazil" Expert Syst. Appl. (2021) 10.1016/j.eswa.2021.115190
[34]
Alhudhaif "Determination of COVID-19 pneumonia based on generalized convolutional neural network model from chest X-ray images" Expert Syst. Appl. (2021) 10.1016/j.eswa.2021.115141
[35]
Pasa "Efficient Deep Network Architectures for Fast Chest X-Ray Tuberculosis Screening and Visualization" Sci. Rep. (2019) 10.1038/s41598-019-42557-4
[36]
Khatibi, T., Shahsavari, A., and Farahani, A. (2021). Proposing a Novel Multi-Instance Learning Model for Tuberculosis Recognition from Chest X-Ray Images Based on CNNs, Complex Networks and Stacked Ensemble. Phys. Eng. Sci. Med., 44. 10.1007/s13246-021-00980-w
[37]
Shen, L., Margolies, L.R., Rothstein, J.H., Fluder, E., McBride, R., and Sieh, W. (2019). Deep Learning to Improve Breast Cancer Detection on Screening Mammography. Sci. Rep., 9. 10.1038/s41598-019-48995-4
[38]
Agarwal "Automatic Mass Detection in Mammograms Using Deep Convolutional Neural Networks" J. Med. Imaging (2019) 10.1117/1.jmi.6.3.031409
[39]
Deep Neural Networks Improve Radiologists’ Performance in Breast Cancer Screening

Nan Wu, Jason Phang, Jungkyu Park et al.

IEEE Transactions on Medical Imaging 2020 10.1109/tmi.2019.2945514
[40]
Lindberg "Developing Theory Through Integrating Human and Machine Pattern Recognition" J. Assoc. Inf. Syst. (2020)
[41]
Daqi "Classification methodologies of multilayer perceptrons with sigmoid activation functions" Pattern Recognit. (2005) 10.1016/j.patcog.2005.03.024
[42]
Quinlan "Improved use of continuous attributes in C4. 5" J. Artif. Intell. Res. (1996) 10.1613/jair.279
[43]
Induction of decision trees

J. R. Quinlan

Machine Learning 1986 10.1007/bf00116251
[44]
Otneim "Pairwise local Fisher and Naïve Bayes: Improving two standard discriminants" J. Econom. (2020) 10.1016/j.jeconom.2020.01.019
[45]
Nearest neighbor pattern classification

T. Cover, P. Hart

IEEE Transactions on Information Theory 1967 10.1109/tit.1967.1053964
[46]
Yamashita "Affine-transformation and 2D-projection invariant k-NN classification of handwritten characters via a new matching measure" Pattern Recognit. (2016) 10.1016/j.patcog.2015.10.002
[47]
Noh "Generative Local Metric Learning for Nearest Neighbor Classification" IEEE Trans. Pattern Anal. Mach. Intell. (2018) 10.1109/tpami.2017.2666151
[48]
Stoklasa "Efficient k-NN based HEp-2 cells classifier" Pattern Recognit. (2014) 10.1016/j.patcog.2013.09.021
[49]
Pernkopf "Bayesian network classifiers versus selective k-NN classifier" Pattern Recognit. (2005) 10.1016/j.patcog.2004.05.012
[50]
Ridge Estimators in Logistic Regression

S. Le Cessie, J. C. Van Houwelingen

Journal of the Royal Statistical Society Series C:... 1992 10.2307/2347628

Showing 50 of 58 references