Log in

Intelligent regional subsurface prediction based on limited borehole data and interpretability stacking technique of ensemble learning

  • Original Paper
  • Published:
Bulletin of Engineering Geology and the Environment Aims and scope Submit manuscript

Abstract

This study introduces an intelligent method for regional subsurface prediction using a Stacking ensemble learning approach, which incorporates K-Nearest Neighbors (KNN), Decision Tree (DT), Random Forest (RF), Gradient Boosted Decision Trees (GBDT), and Xgboost as base classifiers, with Logistic Regression (LR) serving as the meta-classifier. Leveraging data from 1119 boreholes in Zigong City, China, this method achieves a prediction accuracy of 93%, and notably improves the prediction of weak layers, with accuracy rates ranging from 71.4% to 81.5%. This enhancement is particularly significant in areas with a random distribution of excavation and backfill. Furthermore, this study employs the SHAP method (SHapley Additive explanations) to interpret the Stacking ensemble learning model, revealing that the outputs of the base classifiers enhance the feature set for the meta-classifier, effectively addressing the insensitivity of the spatial coordinates x, y, and z as input features for lithology prediction. The findings demonstrate that the expansion of effective feature dimensions is key to the superior performance of the Stacking ensemble learning method in regional subsurface lithology prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (France)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

Download references

Acknowledgments

This paper has been supported by the National Natural Science Foundation of China (Grant No. 42072339, 41702388, U19A2097), the State Key Laboratory of Geohazard Prevention and Geoenvironment Protection (Grant No. SKLGP2022Z006), and Everest Technology Research Proposal of Chengdu University of Technology (Grant No. 80000-2020ZF11411).

Code availability

GeoStackingPredictor Contact: 2021010113@stu.cdut.edu.cn. Program language: Python. The source codes in this paper are available for download at the link: https://github.com/Lukacdut/GeoStackingPredictor.git.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sheng Wang.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bai, J., Wang, S., Xu, Q. et al. Intelligent regional subsurface prediction based on limited borehole data and interpretability stacking technique of ensemble learning. Bull Eng Geol Environ 83, 272 (2024). https://doi.org/10.1007/s10064-024-03758-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10064-024-03758-y

Keywords

Navigation