Large-scale digital mapping of topsoil total nitrogen using machine learning models and associated uncertainty map
- PMID: 33665671
- DOI: 10.1007/s10661-021-08947-w
Abstract
Understanding the spatial distribution of soil nutrients and the factors affecting their concentration and availability is crucial for soil fertility management and sustainable land use, yet the factors affecting soil nitrogen distribution in the Qorveh-Dehgolan plain have rarely been quantified. This study therefore aimed to model and map the spatial distribution of topsoil total nitrogen (TN) in the Qorveh-Dehgolan plain, which covers 150,000 ha, using random forest (RF), decision tree (DT), and Cubist (CB) algorithms. A total of 130 topsoil samples were collected from a depth of 0 to 30 cm following a random sampling pattern. Soil physicochemical properties, calcium carbonate equivalent, organic carbon, and topsoil total nitrogen were then measured. A total of 51 environmental variables were used as covariates for digital mapping of topsoil TN: 31 geomorphometric attributes derived from a digital elevation model with 12.5-m spatial resolution, 13 spectral indices and reflectance variables from the Sentinel-2 satellite (MSI sensor), five soil properties, and two spatial variables (latitude and longitude). The most appropriate covariates were then selected with the Boruta algorithm in the R software environment. A standard deviation map was produced to show model uncertainty. Covariate selection identified 14 effective covariates for the spatial prediction of topsoil TN with the data mining algorithms. Validation of the RF, DT, and CB models on an independent 20% of the data gave root mean square errors (RMSE) of 0.032, 0.035, and 0.043%; mean absolute errors (MAE) of 0.0008, 0.001, and 0.002%; and coefficients of determination of 0.42, 0.38, and 0.35, respectively.
The relative importance (RI) of the environmental covariates, measured with the %IncMSE index, highlighted two geomorphometric variables (midslope position and normalized height) along with the SAVI and NDVI remote sensing variables in the spatial modeling and distribution of total nitrogen in the study area. The RF prediction and associated uncertainty maps, which show high accuracy and low standard deviation across most of the study area, revealed little overfitting and overtraining in soil-landscape modeling; this model can therefore support the development of digital maps of soil surface properties with acceptable accuracy for sustainable land use.
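The three validation statistics reported in the abstract can be illustrated with a minimal Python sketch. The observed and predicted TN values below are invented for illustration only and are not data from the study, and the coefficient of determination is computed here as 1 - SS_res/SS_tot, which is an assumption: the abstract does not state which definition the authors used.

```python
import math

def rmse(observed, predicted):
    """Root mean square error between paired observations and predictions."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def mae(observed, predicted):
    """Mean absolute error between paired observations and predictions."""
    n = len(observed)
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / n

def r_squared(observed, predicted):
    """Coefficient of determination as 1 - SS_res / SS_tot (assumed definition)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Hypothetical hold-out values of topsoil TN in percent (illustrative only).
observed = [0.10, 0.12, 0.08, 0.15]
predicted = [0.11, 0.10, 0.09, 0.13]

print(f"RMSE = {rmse(observed, predicted):.4f} %")
print(f"MAE  = {mae(observed, predicted):.4f} %")
print(f"R2   = {r_squared(observed, predicted):.3f}")
```

In a workflow like the one described, these functions would be applied only to the 20% of samples withheld from model fitting.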
Keywords: Boruta feature selection; Digital soil mapping; Soil nitrogen mapping; Tree-based models.
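The abstract does not say how the standard deviation map was derived. One common approach for an ensemble such as random forest, assumed here, is to take the spread of the individual trees' predictions at each location. The sketch below shows that calculation in Python; `tree_predictions` and its values are hypothetical.

```python
from statistics import mean, pstdev

def ensemble_mean_and_sd(tree_predictions):
    """Return per-location mean and standard deviation across ensemble members.

    tree_predictions[t][i] is the prediction of tree t at location i.
    """
    per_location = list(zip(*tree_predictions))  # regroup values by location
    means = [mean(values) for values in per_location]
    sds = [pstdev(values) for values in per_location]
    return means, sds

# Hypothetical TN predictions (%) from three trees at three map cells.
tree_predictions = [
    [0.10, 0.14, 0.09],
    [0.12, 0.14, 0.11],
    [0.11, 0.14, 0.13],
]

means, sds = ensemble_mean_and_sd(tree_predictions)
for cell, (m, s) in enumerate(zip(means, sds)):
    print(f"cell {cell}: prediction = {m:.3f} %, sd = {s:.4f} %")
```

Cells where the trees agree get a standard deviation near zero, while cells where they disagree get larger values, which is what an uncertainty map of this kind would display.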