Optimized data-driven pipeline for digital mapping of quantitative and categorical properties of soils in Colombia

dc.contributor.authorCoca-Castro, Alejandro
dc.contributor.authorGutierrez-Díaz, Joan Sebastián
dc.contributor.authorCamacho, Victoria
dc.contributor.authorLópez, Andrés Felipe
dc.contributor.authorEscudero, Patricia
dc.contributor.authorSerrato, Pedro Karin
dc.contributor.authorVargas, Yesenia
dc.contributor.authorDevia, Ricardo
dc.contributor.authorGarcía, Juan Camilo
dc.contributor.authorFranco, Carlos
dc.contributor.authorGonzález, Janeth
dc.date.accessioned2022-09-15T14:33:01Z
dc.date.available2022-09-15T14:33:01Z
dc.date.issued2021-10-05
dc.description.abstractSoil maps provide a method for graphically communicating what is known about the spatial distribution of soil properties in nature. We proposed an optimized pipeline, named dino-soil toolbox, programmed in the R software for mapping quantitative and categorical properties of legacy soil data. The pipeline, composed of four main modules (data preprocessing, covariates selection, exploratory data analysis and modeling), was tested across a study area of 14,537 km 2 located between the departments of Cesar and Magdalena, Colombia. We assessed the feasibility of the toolbox to model three soil properties: pH at two depth intervals (0.00-0.30 and 0.30-1.00 m), soil taxonomy (great group) and taxonomic family by particle-size, according to a set of 25 environmental factors derived from auxiliary layers of climate, land cover and terrain. As a result, we successfully deployed the proposed semi-automatic and sequential pipeline, yielding rapid digital soil mapping (DSM) outputs across the study area. By providing multiple outputs such as tables, charts, maps, and geospatial data in four main modules, the pipeline offers considerable robustness to support outcomes and analysis of a DSM project. Future studies might be interesting to expand on further machine learning frameworks for predictive modeling of soil properties such as ensembles and deep learning models, which have shown a high performance for DSM.en
dc.identifier.citationCoca-Castro A, Gutierrez-Díaz JS, Camacho V, López AF, Escudero P, Serrato PK, Vargas Y, Devia R, García JC, Franco C, González J. Optimized data-driven pipeline for digital mapping of quantitative and categorical properties of soils in Colombia. Rev Bras Cienc Solo. 2021;45:e0210084.pt-BR
dc.identifier.issn1806-9657
dc.identifier.urihttps://locus.ufv.br//handle/123456789/29911
dc.language.isoengpt-BR
dc.publisherRevista Brasileira de Ciência do Solopt-BR
dc.relation.ispartofseriesVol. 45, 2021.pt-BR
dc.rightsCreative Commons Attribution Licensept-BR
dc.subjectsoil predictionen
dc.subjectsoil databasesen
dc.subjectmachine learningen
dc.subjectuncertaintyen
dc.subjecttoolboxen
dc.titleOptimized data-driven pipeline for digital mapping of quantitative and categorical properties of soils in Colombiaen
dc.typeArtigopt-BR

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
artigo.pdf
Size:
2.02 MB
Format:
Adobe Portable Document Format
Description:
Texto completo

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections