journal article Open Access Aug 30, 2021

Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms

Electronics Vol. 10 No. 17 pp. 2099 · MDPI AG
View at Publisher Save 10.3390/electronics10172099
Abstract
One of the important research problems in the context of financial institutions is the assessment of credit risk and the decision to whether grant or refuse a loan. Recently, machine learning based methods are increasingly employed to solve such problems. However, the selection of appropriate feature selection technique, sampling mechanism, and/or classifiers for credit decision support is very challenging, and can affect the quality of the loan recommendations. To address this challenging task, this article examines the effectiveness of various data science techniques in issue of credit decision support. In particular, processing pipeline was designed, which consists of methods for data resampling, feature discretization, feature selection, and binary classification. We suggest building appropriate decision models leveraging pertinent methods for binary classification, feature selection, as well as data resampling and feature discretization. The selected models’ feasibility analysis was performed through rigorous experiments on real data describing the client’s ability for loan repayment. During experiments, we analyzed the impact of feature selection on the results of binary classification, and the impact of data resampling with feature discretization on the results of feature selection and binary classification. After experimental evaluation, we found that correlation-based feature selection technique and random forest classifier yield the superior performance in solving underlying problem.
Topics

No keywords indexed for this article. Browse by subject →

References
94
[1]
Koutanaei "A Hybrid Data Mining Model of Feature Selection Algorithms and Ensemble Learning Classifiers for Credit Scoring" J. Retail. Consum. Serv. (2015) 10.1016/j.jretconser.2015.07.003
[2]
Wang "A Hybrid System with Filter Approach and Multiple Population Genetic Algorithm for Feature Selection in Credit Scoring" J. Comput. Appl. Math. (2018) 10.1016/j.cam.2017.04.036
[3]
"Feature Selection in Credibility Study for Finance Sector" Procedia Comput. Sci. (2019) 10.1016/j.procs.2019.09.049
[4]
Tripathi "Credit Scoring Model Based on Weighted Voting and Cluster Based Feature Selection" Procedia Comput. Sci. (2018) 10.1016/j.procs.2018.05.055
[5]
Pawlak "Rough Sets and Fuzzy Sets" Fuzzy Sets Syst. (1985) 10.1016/s0165-0114(85)80029-4
[6]
Maldonado "Credit Scoring using Three-Way Decisions with Probabilistic Rough Sets" Inf. Sci. (2020) 10.1016/j.ins.2018.08.001
[7]
Capotorti "Credit Scoring Analysis using a Fuzzy Probabilistic Rough Set Model" Comput. Stat. Data Anal. (2012) 10.1016/j.csda.2011.06.036
[8]
Washio, T., Suzuki, E., Ting, K.M., and Inokuchi, A. (2008). A New Credit Scoring Method Based on Rough Sets and Decision Tree. Advances in Knowledge Discovery and Data Mining, Springer. 10.1007/978-3-540-68125-0
[9]
Zhou, J., and Tian, J. (2007). Credit Risk Assessment Based on Rough Set Theory and Fuzzy Support Vector Machine, Atlantis Press. 10.2991/iske.2007.157
[10]
Zhou, J., and Bai, T. (2008, January 25–28). Credit Risk Assessment using Rough Set Theory and GA-Based SVM. Proceedings of the 2008 the 3rd International Conference on Grid and Pervasive Computing—Workshops, Kunming, China. 10.1109/gpc.workshops.2008.56
[11]
Ziemba, P. (2021). Multi-Criteria Fuzzy Evaluation of the Planned Offshore Wind Farm Investments in Poland. Energies, 14. 10.3390/en14040978
[12]
Maldonado "Profit-Based Credit Scoring Based on Robust Optimization and Feature Selection" Inf. Sci. (2019) 10.1016/j.ins.2019.05.093
[13]
Liu "Data Mining Feature Selection for Credit Scoring Models" J. Oper. Res. Soc. (2005) 10.1057/palgrave.jors.2601976
[14]
Somol "Filter-versus Wrapper-Based Feature Selection for Credit Scoring" Int. J. Intell. Syst. (2005) 10.1002/int.20103
[15]
Ha "Credit Scoring with a Feature Selection Approach Based Deep Learning" MATEC Web of Conferences (2016) 10.1051/matecconf/20165405004
[16]
Aryuni "Feature Selection in Credit Scoring Model for Credit Card Applicants in XYZ Bank: A Comparative Study" Int. J. Multimed. Ubiquitous Eng. (2015) 10.14257/ijmue.2015.10.5.03
[17]
Boughaci "Three Local Search-Based Methods for Feature Selection in Credit Scoring" Vietnam J. Comput. Sci. (2018) 10.1007/s40595-018-0107-y
[18]
Van "A Hybrid Feature Selection Method for Credit Scoring" EAI Endorsed Trans. Context-Aware Syst. Appl. (2017)
[19]
Kozodoi "A Multi-Objective Approach for Profit-Driven Feature Selection in Credit Scoring" Decis. Support Syst. (2019) 10.1016/j.dss.2019.03.011
[20]
Guo, X., Yin, Y., Dong, C., Yang, G., and Zhou, G. (2008, January 18–20). On the Class Imbalance Problem. Proceedings of the Fourth International Conference on Natural Computation, Jinan, China. 10.1109/icnc.2008.871
[21]
Luengo "A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning" IEEE Trans. Knowl. Data Eng. (2013) 10.1109/tkde.2012.35
[22]
Ziemba "Client Evaluation Decision Models in the Credit Scoring Tasks" Procedia Comput. Sci. (2020) 10.1016/j.procs.2020.09.068
[23]
Becker "Rough Set Theory in the Classification of Loan Applications" Procedia Comput. Sci. (2020) 10.1016/j.procs.2020.09.125
[24]
Andersson "Credit Risk Optimization with Conditional Value-at Risk Criterion" Math. Program. (2001) 10.1007/pl00011399
[25]
Chen "Financial Credit Risk Assessment: A Recent Review" Artif. Intell. Rev. (2016) 10.1007/s10462-015-9434-x
[26]
Shen "The Prediction Model of Financial Crisis Based on the Combination of Principle Component Analysis and Support Vector Machine" Open J. Soc. Sci. (2014)
[27]
Altman "Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy" J. Financ. (1968) 10.1111/j.1540-6261.1968.tb00843.x
[28]
Kouki "Toward a Predicting Model of Firm Bankruptcy: Evidence from the Tunisian Context" Middle East. Financ. Econ. (2011)
[29]
Kwak "Bankruptcy Prediction for Korean Firms after the 1997 Financial Crisis: Using a Multiple Criteria Linear Programming Data Mining Approach" Rev. Quant. Financ. Account. (2012) 10.1007/s11156-011-0238-z
[30]
Cheng "Predicting Bankruptcy using the Discrete-Time Semiparametric Hazard Model" Quant. Financ. (2010) 10.1080/14697680902814274
[31]
Hwang "Predicting Issuer Credit Ratings using a Semiparametric Method" J. Empir. Financ. (2010) 10.1016/j.jempfin.2009.07.007
[32]
Klein "An Efficient Semiparametric Estimator for Binary Response Models" Econometrica (1993) 10.2307/2951556
[33]
Masten "CART-Based Selection of Bankruptcy Predictors for the Logit Model" Expert Syst. Appl. (2012) 10.1016/j.eswa.2012.02.125
[34]
Li "Parametric and Non-Parametric Combination Model to Enhance Overall Performance on Default Prediction" J. Syst. Sci. Complex. (2014) 10.1007/s11424-014-3273-8
[35]
Manzari "Financial Health Prediction Models using Artificial Neural Networks, Genetic Algorithm and Multivariate Discriminant Analysis: Iranian Evidence" Expert Syst. Appl. (2011) 10.1016/j.eswa.2011.02.082
[36]
Chen "A Stable Credit Rating Model Based on Learning Vector Quantization" Intell. Data Anal. (2011) 10.3233/ida-2010-0465
[37]
Blanco "Credit Scoring Models for the Microfinance Industry using Neural Networks: Evidence from Peru" Expert Syst. Appl. (2013) 10.1016/j.eswa.2012.07.051
[38]
Huang, F. (2008, January 4–6). A Genetic Fuzzy Neural Network for Bankruptcy Prediction in Chinese Corporations. Proceedings of the 2008 International Conference on Risk Management & Engineering Management, Beijing, China. 10.1109/icrmem.2008.93
[39]
Yang "Using Partial Least Squares and Support Vector Machines for Bankruptcy Prediction" Expert Syst. Appl. (2011) 10.1016/j.eswa.2011.01.021
[40]
Jeganathan "Bankruptcy Prediction using Svm and Hybrid Svm Survey" Int. J. Comput. Appl. (2011)
[41]
Li "Hybridizing Principles of TOPSIS with Case-Based Reasoning for Business Failure Prediction" Comput. Oper. Res. (2011) 10.1016/j.cor.2010.06.008
[42]
Wang "Big Data Analytics on Enterprise Credit Risk Evaluation of E-Business Platform" Inf. Syst. E-Bus. Manag. (2020) 10.1007/s10257-019-00414-x
[43]
Arora "A Bolasso Based Consistent Feature Selection Enabled Random Forest Classification Algorithm: An Application to Credit Risk Assessment" Appl. Soft Comput. (2020) 10.1016/j.asoc.2019.105936
[44]
Czarnowski, I., Howlett, R.J., and Jain, L.C. (2020). IVIFCM-TOPSIS for Bank Credit Risk Assessment. Intelligent Decision Technologies 2019, Springer. 10.1007/978-981-13-8311-3
[45]
Farazmehr "A Novel Dynamic Credit Risk Evaluation Method using Data Envelopment Analysis with Common Weights and Combination of Multi-Attribute Decision-Making Methods" Comput. Oper. Res. (2021) 10.1016/j.cor.2021.105223
[46]
Bellacosa, M. (2021, August 19). AI Can Transform Trade Finance through Better SME Credit Scoring. Available online: https://www.theglobaltreasurer.com/2018/06/08/ai-can-transform-trade-finance-through-better-sme-credit-scoring/.
[47]
Nguyen, N.T., and Kowalczyk, R. (2016). Web Projects Evaluation using the Method of Significant Website Assessment Criteria Detection. Transactions on Computational Collective Intelligence XXII, Springer.
[48]
Raitoharju "Human Experts vs. Machines in Taxa Recognition" Signal Process. Image Commun. (2020) 10.1016/j.image.2020.115917
[49]
Marous, J. (2021). Retail Banking Trends and Priorities, Temenos.
[50]
Sulikowski, P., and Zdziebko, T. (2020). Deep Learning-Enhanced Framework for Performance Evaluation of a Recommending Interface with Varied Recommendation Position and Intensity Based on Eye-Tracking Equipment Data Processing. Electronics, 9. 10.3390/electronics9020266

Showing 50 of 94 references

Cited By
26
Related

You May Also Like

Machine Learning Interpretability: A Survey on Methods and Metrics

Diogo V. Carvalho, Eduardo M. Pereira · 2019

1,384 citations

The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

Mohiuddin Ahmed, Raihan Seraj · 2020

1,342 citations

Sentiment Analysis Based on Deep Learning: A Comparative Study

Nhan Cach Dang, María N. Moreno-García · 2020

550 citations