journal article Open Access Nov 14, 2022

Explainable Machine Learning for Financial Distress Prediction: Evidence from Vietnam

Data Vol. 7 No. 11 pp. 160 · MDPI AG
View at Publisher Save 10.3390/data7110160
Abstract
The past decade has witnessed the rapid development of machine learning applied in economics and finance. Recent evidence suggests that machine learning models have produced superior results to traditional statistical models and have become the driving force for dramatic improvement in the financial industry. However, a much-debated question is whether the prediction results from black box machine learning models can be interpreted. In this study, we compared the predictive power of machine learning algorithms and applied SHAP values to interpret the prediction results on the dataset of listed companies in Vietnam from 2010 to 2021. The results showed that the extreme gradient boosting and random forest models outperformed other models. In addition, based on Shapley values, we also found that long-term debts to equity, enterprise value to revenues, account payable to equity, and diluted EPS had greatly influenced the outputs. In terms of practical contributions, the study helps credit rating companies have a new method for predicting the possibility of default of bond issuers in the market. The study also provides an early warning tool for policymakers about the risks of public companies in order to develop measures to protect retail investors against the risk of bond default.
Topics

No keywords indexed for this article. Browse by subject →

References
36
[1]
Beaver, W.H. (1966). Financial Ratios as Predictors of Failure. J. Account. Res., 71–111. 10.2307/2490171
[2]
FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY

Edward I. Altman

The Journal of Finance 1968 10.1111/j.1540-6261.1968.tb00843.x
[3]
Financial Ratios and the Probabilistic Prediction of Bankruptcy

James A. Ohlson

Journal of Accounting Research 1980 10.2307/2490395
[4]
Regression Models and Life-Tables

D. R. Cox

Journal of the Royal Statistical Society Series B:... 1972 10.1111/j.2517-6161.1972.tb00899.x
[5]
Kim "The Economic Explainability of Machine Learning and Standard Econometric Models-an Application to the US Mortgage Default Risk" Int. J. Strateg. Prop. Manag. (2021) 10.3846/ijspm.2021.15129
[6]
Olson "Comparative Analysis of Data Mining Methods for Bankruptcy Prediction" Decis. Support Syst. (2012) 10.1016/j.dss.2011.10.007
[7]
Chen "Alternative Diagnosis of Corporate Bankruptcy: A Neuro Fuzzy Approach" Expert Syst. Appl. (2009) 10.1016/j.eswa.2008.09.023
[8]
Random Forests

Leo Breiman

Machine Learning 2001 10.1023/a:1010933404324
[9]
Freund "A Short Introduction to Boosting" J. -Jpn. Soc. Artif. Intell. (1999)
[10]
Chen "Xgboost: Extreme Gradient Boosting" R Package Version 0.4-2 (2015)
[11]
Kruppa "Consumer Credit Risk: Individual Probability Estimates Using Machine Learning" Expert Syst. Appl. (2013) 10.1016/j.eswa.2013.03.019
[12]
The Nature of Statistical Learning Theory

Vladimir N. Vapnik

10.1007/978-1-4757-3264-1
[13]
Chen "Modeling Default Risk with Support Vector Machines" Quant. Financ. (2011) 10.1080/14697680903410015
[14]
Shin "An Application of Support Vector Machines in Bankruptcy Prediction Model" Expert Syst. Appl. (2005) 10.1016/j.eswa.2004.08.009
[15]
Zhao "Investigation and Improvement of Multi-Layer Perceptron Neural Networks for Credit Scoring" Expert Syst. Appl. (2015) 10.1016/j.eswa.2014.12.006
[16]
Geng "Prediction of Financial Distress: An Empirical Study of Listed Chinese Companies Using Data Mining" Eur. J. Oper. Res. (2015) 10.1016/j.ejor.2014.08.016
[17]
Machine learning models and bankruptcy prediction

Flavio Barboza, Herbert Kimura, Edward Altman

Expert Systems with Applications 2017 10.1016/j.eswa.2017.04.006
[18]
Chakraborty, C., and Joseph, A. (2017). Machine Learning at Central Banks, SSRN. 10.2139/ssrn.3031796
[19]
Fuster "Predictably Unequal? The Effects of Machine Learning on Credit Markets" J. Financ. (2022) 10.1111/jofi.13090
[20]
Dubyna "Mapping the Literature on Financial Behavior: A Bibliometric Analysis Using the VOSviewer Program" WSEAS Trans. Bus. Econ. (2022) 10.37394/23207.2022.19.22
[21]
Zhavoronok "The Role of Digital Technologies in the Transformation of Regional Models of Households’ Financial Behavior in the Conditions of the National Innovative Economy Development" Ingénierie Des Systèmes D’Inf. (2022) 10.18280/isi.270411
[22]
Doshi-Velez, F., and Kim, B. (2017). Towards a Rigorous Science of Interpretable Machine Learning. arXiv.
[23]
Explanation in artificial intelligence: Insights from the social sciences

Tim Miller

Artificial Intelligence 2019 10.1016/j.artint.2018.07.007
[24]
Bracke, P., Datta, A., Jung, C., and Sen, S. (2019). Machine Learning Explainability in Finance: An Application to Default Risk Analysis, SSRN. 10.2139/ssrn.3435104
[25]
Babaei, G., Giudici, P., and Raffinetti, E. (2021). Explainable Fintech Lending, SSRN. 10.2139/ssrn.3892652
[26]
Bussmann "Explainable Machine Learning in Credit Risk Management" Comput. Econ. (2021) 10.1007/s10614-020-10042-0
[27]
Lundberg, S.M., and Lee, S.-I. (2017, January 4). A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA.
[28]
Ribeiro, M.T., Singh, S., and Guestrin, C. (2016). “Why Should I Trust You?” Explaining the Predictions of Any Classifier. arXiv. 10.1145/2939672.2939778
[29]
Arroyo "Explainability of a Machine Learning Granting Scoring Model in Peer-to-Peer Lending" IEEE Access (2020) 10.1109/access.2020.2984412
[30]
Hadji Misheva, B., Hirsa, A., Osterrieder, J., Kulkarni, O., and Fung Lin, S. (2021). Explainable AI in Credit Risk Management. Credit. Risk Manag. 10.2139/ssrn.3795322
[31]
Array programming with NumPy

Charles R. Harris, K. Jarrod Millman, Stéfan J. van der Walt et al.

Nature 2020 10.1038/s41586-020-2649-2
[32]
McKinney, W. (July, January 28). Data Structures for Statistical Computing in Python. Proceedings of the 9th Python in Science Conference, Austin, TX, USA.
[33]
Pedregosa "Scikit-Learn: Machine Learning in Python" J. Mach. Learn. Res. (2011)
[34]
Waskom, M., Botvinnik, O., O’Kane, D., Hobson, P., Lukauskas, S., Gemperline, D.C., Augspurger, T., Halchenko, Y., Cole, J.B., and Warmenhoven, J. (2017). Mwaskom/Seaborn: V0. 8.1 (September 2017). Zenodo.
[35]
From local explanations to global understanding with explainable AI for trees

Scott M. Lundberg, Gabriel Erion, Hugh Chen et al.

Nature Machine Intelligence 2020 10.1038/s42256-019-0138-9
[36]
Castellano "A Comparative Study on Base Classifiers in Ensemble Methods for Credit Scoring" Expert Syst. Appl. (2017) 10.1016/j.eswa.2016.12.020
Metrics
49
Citations
36
References
Details
Published
Nov 14, 2022
Vol/Issue
7(11)
Pages
160
License
View
Funding
The Youth Incubator for Science and Technology Programe, managed by Youth Development Science and Technology Center - Ho Chi Minh Communist Youth Union and Department of Science and Technology of Ho Chi Minh City Award: 14/2021/ HĐ-KHCNT-VU
Cite This Article
Kim Long Tran, Hoang Anh Le, Thanh Hien Nguyen, et al. (2022). Explainable Machine Learning for Financial Distress Prediction: Evidence from Vietnam. Data, 7(11), 160. https://doi.org/10.3390/data7110160