journal article Open Access Aug 24, 2025

Predicting water quality index using stacked ensemble regression and SHAP based explainable artificial intelligence

View at Publisher Save 10.1038/s41598-025-09463-4
Topics

No keywords indexed for this article. Browse by subject →

References
59
[1]
Antwi, H. A., Zhou, L., Xu, X. & Mustafa, T. Progressing towards environmental health targets in china: an integrative review of achievements in air and water pollution under the ecological civilisation and the beautiful China dream. Sustainability 13, 3664 (2021). 10.3390/su13073664
[2]
Han, X. et al. Water strategies and management: current paths to sustainable water use. Appl. Water Sci. 14, 154 (2024). 10.1007/s13201-024-02214-2
[3]
Choudhury, M. et al. Investigation of groundwater and soil quality near to a municipal waste disposal site in silchar, assam, India. Int. J. Energy Water Resour. 6, 37–47 (2022). 10.1007/s42108-021-00117-5
[4]
Thakur, A., Devi, P. A. & Comprehensive Review on Water Quality Monitoring Devices. Materials advances, current status, and future perspective. Crit. Rev. Anal. Chem. 54, 193–218 (2024). 10.1080/10408347.2022.2070838
[5]
Singh, B. P., Choudhury, M., Gupta, P., Chadha, U. & Zewude, D. Physicochemical and biological characteristics of river hindon at Galheta station from 2009 to 2020. Environ. Qual. Manage. 33, 331–344 (2024). 10.1002/tqem.22115
[6]
M., T. R. et al. Water quality level Estimation using IoT sensors and probabilistic machine learning model. Hydrol. Res. 55, 775–789 (2024). 10.2166/nh.2024.048
[7]
Yadav, A. K., Khan, P. & Sharma, S. K. Water quality index assessment ofgroundwater in Todaraisingh tehsil of Rajasthan state, India-A greener approach. J Chem 7, S428-S432 (2010). 10.1155/2010/419432
[8]
Singh, B. P., Choudhury, M., Samanta, P., Gaur, M. & Kumar, M. Ecological risk assessment of heavy metals in adjoining sediment of river ecosystem. Sustainability 13, 10330 (2021). 10.3390/su131810330
[9]
Das, S., Khondakar, K. R., Mazumdar, H., Kaushik, A. & Mishra, Y. K. AI and iot: supported sixth generation sensing for water quality assessment to empower sustainable ecosystems. ACS ES&T Water. 5, 490–510 (2025). 10.1021/acsestwater.4c00360
[10]
Zhong, S. et al. Machine learning: new ideas and tools in environmental science and engineering. Environ Sci. Technol Acs Est. 1c01339. https://doi.org/10.1021/acs.est.1c01339 (2021). 10.1021/acs.est.1c01339
[11]
Abdullah, S., Ismail, M., Ahmed, A. N. & Abdullah, A. M. Forecasting particulate matter concentration using linear and Non-Linear approaches for air quality decision support. Atmos. (Basel). 10, 667 (2019).
[12]
Manfreda, S. et al. Advancing river monitoring using image-based techniques: challenges and opportunities. Hydrol. Sci. J. 69, 657–677 (2024). 10.1080/02626667.2024.2333846
[13]
Alqadhi, S. et al. Combining logistic regression-based hybrid optimized machine learning algorithms with sensitivity analysis to achieve robust landslide susceptibility mapping. Geocarto Int. 37, 9518–9543 (2022). 10.1080/10106049.2021.2022009
[14]
Dostmohammadi, M., Pedram, M. Z., Hoseinzadeh, S. & Garcia, D. A. A GA-stacking ensemble approach for forecasting energy consumption in a smart household: A comparative study of ensemble methods. J. Environ. Manage. 364, 121264 (2024). 10.1016/j.jenvman.2024.121264
[15]
Zhang, Y., Liu, J. & Shen, W. A. Review of ensemble learning algorithms used in remote sensing applications. Appl. Sci. 12, 8654 (2022). 10.3390/app12178654
[16]
A Comprehensive Survey on Imputation of Missing Data in Internet of Things

Deepak Adhikari, Wei Jiang, Jinyu Zhan et al.

ACM Computing Surveys 2023 10.1145/3533381
[17]
Interpreting Black-Box Models: A Review on Explainable Artificial Intelligence

Vikas Hassija, Vinay Chamola, Atmesh Mahapatra et al.

Cognitive Computation 2024 10.1007/s12559-023-10179-8
[18]
Kovari, A. & AI for Decision Support. Balancing accuracy, transparency, and trust across sectors. Information 15, 725 (2024). 10.3390/info15110725
[19]
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)

Amina Adadi, Mohammed Berrada

IEEE Access 2018 10.1109/access.2018.2870052
[20]
Zacharias, J., von Zahn, M., Chen, J. & Hinz, O. Designing a feature selection method based on explainable artificial intelligence. Electron. Markets. 32, 2159–2184 (2022). 10.1007/s12525-022-00608-1
[21]
Höhl, A. et al. Opening the black box: A systematic review on explainable artificial intelligence in remote sensing. IEEE Geosci. Remote Sens. Mag. 12, 261–304 (2024). 10.1109/mgrs.2024.3467001
[22]
Howard-Azzeh, M., Pearl, D. L., O’Sullivan, T. L. & Berke, O. Comparing the diagnostic performance of ordinary, mixed, and Lasso logistic regression models at identifying opioid and cannabinoid poisoning in U.S. Dogs using pet demographic and clinical data reported to an animal poison control center (2005–2014). PLoS One. 18, e0288339 (2023). 10.1371/journal.pone.0288339
[23]
Adewale Abayomi Adeniran. Amaka peace onebunne & Paul william. Explainable AI (XAI) in healthcare: enhancing trust and transparency in critical decision-making. World J. Adv. Res. Reviews. 23, 2447–2658 (2024). 10.30574/wjarr.2024.23.3.2936
[24]
Das Kangabam, R., Bhoominathan, S. D., Kanagaraj, S. & Govindaraju, M. Development of a water quality index (WQI) for the Loktak lake in India. Appl. Water Sci. 7, 2907–2918 (2017). 10.1007/s13201-017-0579-4
[25]
Nasir, N. et al. Water quality classification using machine learning algorithms. J. Water Process. Eng. 48, 102920 (2022). 10.1016/j.jwpe.2022.102920
[26]
Jan, F., Min-Allah, N. & Düştegör, D. IoT based smart water quality monitoring: recent techniques, trends and challenges for domestic applications. Water (Basel). 13, 1729 (2021).
[27]
Thakur, A. & Kumar, A. Recent advances on rapid detection and remediation of environmental pollutants utilizing nanomaterials-based (bio)sensors. Sci. Total Environ. 834, 155219 (2022). 10.1016/j.scitotenv.2022.155219
[28]
Zhou, W., Yan, Z. & Zhang, L. A comparative study of 11 non-linear regression models highlighting autoencoder, DBN, and SVR, enhanced by SHAP importance analysis in soybean branching prediction. Sci. Rep. 14, 5905 (2024). 10.1038/s41598-024-55243-x
[29]
Pedro, F. A. and Machine Learning Approaches. J. Comput. Nat. Sci. 169–181 https://doi.org/10.53759/181X/JCNS202303016 (2023). Review of Data Mining, Big Data Analytics. 10.53759/181x/jcns202303016
[30]
Yang, R., Liu, H. & Li, Y. Quantifying uncertainty of marine water quality forecasts for environmental management using a dynamic multi-factor analysis and multi-resolution ensemble approach. Chemosphere 331, 138831 (2023). 10.1016/j.chemosphere.2023.138831
[31]
Janković, R., Mihajlović, I., Štrbac, N. & Amelio, A. Machine learning models for ecological footprint prediction based on energy parameters. Neural Comput. Appl. 33, 7073–7087 (2021). 10.1007/s00521-020-05476-4
[32]
Akhtar, N. et al. Modification of the water quality index (WQI) process for simple calculation using the Multi-Criteria Decision-Making (MCDM) method: A review. Water (Basel). 13, 905 (2021).
[33]
Tharwat, A. Classification assessment methods. Appl. Comput. Inf. 17, 168–192 (2021).
[34]
Bennett, N. D. et al. Characterising performance of environmental models. Environ. Model. Softw. 40, 1–20 (2013). 10.1016/j.envsoft.2012.09.011
[35]
Simeone, A., Woolley, E., Escrig, J. & Watson, N. J. Intelligent industrial cleaning: A Multi-Sensor approach utilising machine Learning-Based regression. Sensors 20, 3642 (2020). 10.3390/s20133642
[36]
Wijethilake, C., Munir, R. & Appuhami, R. Environmental innovation strategy and organizational performance: enabling and controlling uses of management control systems. J. Bus. Ethics. 151, 1139–1160 (2018). 10.1007/s10551-016-3259-7
[37]
Bhatti, U. A. et al. Investigating the nexus between energy, socio-economic factors and environmental pollution: A geo-spatial multi regression approach. Gondwana Res. 130, 308–325 (2024). 10.1016/j.gr.2024.02.007
[38]
Assefa, A. A., Tian, W., Hundera, N. W. & Aftab, M. U. Crowd density Estimation in Spatial and Temporal distortion environment using parallel Multi-Size receptive fields and stack ensemble Meta-Learning. Symmetry (Basel). 14, 2159 (2022). 10.3390/sym14102159
[39]
Abimannan, S. et al. Ensemble multifeatured deep learning models and applications: A survey. IEEE Access. 11, 107194–107217 (2023). 10.1109/access.2023.3320042
[40]
Li, W. & Hsu, C. Y. GeoAI for Large-Scale image analysis and machine vision: recent progress of artificial intelligence in geography. ISPRS Int. J. Geoinf. 11, 385 (2022). 10.3390/ijgi11070385
[41]
Ganesan, S. & Somasiri, N. Navigating the integration of machine learning in healthcare: challenges, strategies, and ethical considerations. J. Comput. Cogn. Eng. https://doi.org/10.47852/bonviewJCCE42023600 (2024). 10.47852/bonviewjcce42023600
[42]
Linardatos, P., Papastefanopoulos, V. & Kotsiantis, S. Explainable AI: A review of machine learning interpretability methods. Entropy 23, 18 (2020). 10.3390/e23010018
[43]
Ihsanullah, I., Alam, G., Jamal, A. & Shaik, F. Recent advances in applications of artificial intelligence in solid waste management: A review. Chemosphere 309, 136631 (2022). 10.1016/j.chemosphere.2022.136631
[44]
Chu, W. et al. SHAP-powered insights into Spatiotemporal effects: unlocking explainable Bayesian-neural-network urban flood forecasting. Int. J. Appl. Earth Obs. Geoinf. 131, 103972 (2024).
[45]
Deepika & Pandove, G. Prediction of traffic time using XGBoost model with hyperparameter optimization. Multimed Tools Appl. https://doi.org/10.1007/s11042-025-20646-z (2025). 10.1007/s11042-025-20646-z
[46]
Extremely randomized trees

Pierre Geurts, Damien Ernst, Louis Wehenkel

Machine Learning 2006 10.1007/s10994-006-6226-1
[47]
de Werner, V., Schneider Aranda, J. A., dos Santos Costa, R. & da Silva Pereira, P. R. Victória barbosa, J. L. Imbalanced data preprocessing techniques for machine learning: a systematic mapping study. Knowl. Inf. Syst. 65, 31–57 (2023). 10.1007/s10115-022-01772-8
[48]
Reimann, C., Filzmoser, P., Hron, K., Kynčlová, P. & Garrett, R. G. A new method for correlation analysis of compositional (environmental) data – a worked example. Sci. Total Environ. 607–608, 965–971 (2017). 10.1016/j.scitotenv.2017.06.063
[49]
Khanal, G., Thapa, A., Devkota, N. & Paudel, U. R. A review on harvesting and Harnessing rainwater: an alternative strategy to Cope with drinking water scarcity. Water Supply. 20, 2951–2963 (2020). 10.2166/ws.2020.264
[50]
Emeka, C., Nweke, B., Osere, J. & Ihunwo, C. K. Water quality index for the assessment of selected borehole water quality in rivers state. European J. Environ. Earth Sciences 1, 1-4 (2020). 10.24018/ejgeo.2020.1.6.101

Showing 50 of 59 references

Cited By
27
Metrics
27
Citations
59
References
Details
Published
Aug 24, 2025
Vol/Issue
15(1)
License
View
Cite This Article
Rakesh Choudhary, AJAY KUMAR, Priyadharsini C., et al. (2025). Predicting water quality index using stacked ensemble regression and SHAP based explainable artificial intelligence. Scientific Reports, 15(1). https://doi.org/10.1038/s41598-025-09463-4