Datum
2023-06-24Schlagwort
004 Informatik 500 Naturwissenschaften DatenfusionValidierungInfrarotspektroskopieMaschinelles LernenStickstoffWasserstoffionenkonzentrationKohlenstoffTOCPhysikochemische BodeneigenschaftMetadata
Zur Langanzeige
Aufsatz
Optimised use of data fusion and memory-based learning with an Austrian soil library for predictions with infrared data
Zusammenfassung
Infrared spectroscopy in the visible to near-infrared (vis–NIR) and mid-infrared (MIR) regions is a well-established approach for the prediction of soil properties. Different data fusion and training approaches exist, and the optimal procedures are yet undefined and may depend on the heterogeneity present in the set and on the considered scale. The objectives were to test the usefulness of partial least squares regressions (PLSRs) for soil organic carbon (SOC), total carbon (Ct), total nitrogen (Nt) and pH using vis–NIR and MIR spectroscopy for an independent validation after standard calibration (use of a general PLSR model) or using memory-based learning (MBL) with and without spiking for a national spectral database. Data fusion approaches were simple concatenation of spectra, outer product analysis (OPA) and model averaging. In total, 481 soils from an Austrian forest soil archive were measured in the vis–NIR and MIR regions, and regressions were calculated. Fivefold calibration-validation approaches were carried out with a region-related split of spectra to implement independent validations with n ranging from 47 to 99 soils in different folds. MIR predictions were generally superior over vis–NIR predictions. For all properties, optimal predictions were obtained with data fusion, with OPA and spectra concatenation outperforming model averaging. The greatest robustness of performance was found for OPA and MBL with spiking with R2 ≥ 0.77 (N), 0.85 (SOC), 0.86 (pH) and 0.88 (Ct) in the validations of all folds. Overall, the results indicate that the combination of OPA for vis–NIR and MIR spectra with MBL and spiking has a high potential to accurately estimate properties when using large-scale soil spectral libraries as reference data. However, the reduction of cost-effectiveness using two spectrometers needs to be weighed against the potential increase in accuracy compared to a single MIR spectroscopy approach.
Highlights
- Optimal procedures for data fusion and training for SSL are yet undefined
- The combination of OPA with MBL and spiking has a high potential to estimate properties
- Besides the training and fusion effects, the IQR also affects model accuracies
Highlights
- Optimal procedures for data fusion and training for SSL are yet undefined
- The combination of OPA with MBL and spiking has a high potential to estimate properties
- Besides the training and fusion effects, the IQR also affects model accuracies
Zitierform
In: European Journal of Soil Science Volume 74 / Issue 4 (2023-06-24) eissn:1365-2389Förderhinweis
Gefördert im Rahmen des Projekts DEALZitieren
@article{doi:10.17170/kobra-202308058580,
author={Ludwig, Bernard and Greenberg, Isabel and Vohland, Michael and Michel, Kerstin},
title={Optimised use of data fusion and memory-based learning with an Austrian soil library for predictions with infrared data},
journal={European Journal of Soil Science},
year={2023}
}
0500 Oax 0501 Text $btxt$2rdacontent 0502 Computermedien $bc$2rdacarrier 1100 2023$n2023 1500 1/eng 2050 ##0##http://hdl.handle.net/123456789/15045 3000 Ludwig, Bernard 3010 Greenberg, Isabel 3010 Vohland, Michael 3010 Michel, Kerstin 4000 Optimised use of data fusion and memory-based learning with an Austrian soil library for predictions with infrared data / Ludwig, Bernard 4030 4060 Online-Ressource 4085 ##0##=u http://nbn-resolving.de/http://hdl.handle.net/123456789/15045=x R 4204 \$dAufsatz 4170 5550 {{Datenfusion}} 5550 {{Validierung}} 5550 {{Infrarotspektroskopie}} 5550 {{Maschinelles Lernen}} 5550 {{Stickstoff}} 5550 {{Wasserstoffionenkonzentration}} 5550 {{Kohlenstoff}} 5550 {{TOC}} 5550 {{Physikochemische Bodeneigenschaft}} 7136 ##0##http://hdl.handle.net/123456789/15045
2023-09-04T07:22:19Z 2023-09-04T07:22:19Z 2023-06-24 doi:10.17170/kobra-202308058580 http://hdl.handle.net/123456789/15045 Gefördert im Rahmen des Projekts DEAL eng Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/ data fusion independent validation infrared spectroscopy MBL nitrogen outer product analysis pH soil organic carbon spiking total carbon 004 500 Optimised use of data fusion and memory-based learning with an Austrian soil library for predictions with infrared data Aufsatz Infrared spectroscopy in the visible to near-infrared (vis–NIR) and mid-infrared (MIR) regions is a well-established approach for the prediction of soil properties. Different data fusion and training approaches exist, and the optimal procedures are yet undefined and may depend on the heterogeneity present in the set and on the considered scale. The objectives were to test the usefulness of partial least squares regressions (PLSRs) for soil organic carbon (SOC), total carbon (Ct), total nitrogen (Nt) and pH using vis–NIR and MIR spectroscopy for an independent validation after standard calibration (use of a general PLSR model) or using memory-based learning (MBL) with and without spiking for a national spectral database. Data fusion approaches were simple concatenation of spectra, outer product analysis (OPA) and model averaging. In total, 481 soils from an Austrian forest soil archive were measured in the vis–NIR and MIR regions, and regressions were calculated. Fivefold calibration-validation approaches were carried out with a region-related split of spectra to implement independent validations with n ranging from 47 to 99 soils in different folds. MIR predictions were generally superior over vis–NIR predictions. For all properties, optimal predictions were obtained with data fusion, with OPA and spectra concatenation outperforming model averaging. The greatest robustness of performance was found for OPA and MBL with spiking with R2 ≥ 0.77 (N), 0.85 (SOC), 0.86 (pH) and 0.88 (Ct) in the validations of all folds. Overall, the results indicate that the combination of OPA for vis–NIR and MIR spectra with MBL and spiking has a high potential to accurately estimate properties when using large-scale soil spectral libraries as reference data. However, the reduction of cost-effectiveness using two spectrometers needs to be weighed against the potential increase in accuracy compared to a single MIR spectroscopy approach. Highlights - Optimal procedures for data fusion and training for SSL are yet undefined - The combination of OPA with MBL and spiking has a high potential to estimate properties - Besides the training and fusion effects, the IQR also affects model accuracies open access Ludwig, Bernard Greenberg, Isabel Vohland, Michael Michel, Kerstin doi:10.1111/ejss.13394 Datenfusion Validierung Infrarotspektroskopie Maschinelles Lernen Stickstoff Wasserstoffionenkonzentration Kohlenstoff TOC Physikochemische Bodeneigenschaft publishedVersion eissn:1365-2389 Issue 4 European Journal of Soil Science Volume 74 false e13394
Die folgenden Lizenzbestimmungen sind mit dieser Ressource verbunden: