Download - Relations entre les caractéristiques pédologiques et les pratiques … · 2018-04-25 · iii Résumé La connaissance des sols est primordiale pour traiter des enjeux agronomiques

Relations entre les caractéristiques pédologiques et les pratiques de fertilisation et de conservation des sols

Thèse

Michaël Leblanc

Doctorat en sols et environnement

Philosophiae doctor (Ph.D.)

Québec, Canada

© Michaël Leblanc, 2016

Relations entre les caractéristiques pédologiques et les pratiques de fertilisation et de conservation des sols

Thèse

Michaël Leblanc

Sous la direction de :

Léon-Étienne Parent, directeur de recherche

iii

Résumé La connaissance des sols est primordiale pour traiter des enjeux agronomiques et environnementaux comme

ceux de la production de nourriture, de l’utilisation efficace des nutriments et de la conservation des sols. Ce

projet de doctorat a comme objectif de proposer des modèles reliant les caractéristiques pédologiques à la

gestion des pratiques de fertilisation et de conservation des sols, en particulier, des sols sous culture de

pommes de terre (Solanum tuberosum L.). Il comprend trois parties. D’abord, la méthode du pH Shoemaker-

McLean-Pratt (SMP), non respectueuse de l’environnement, s’est montré être inadéquat pour déterminer les

besoins en chaux (LR) des sols nécessitant de faibles pHH2O cibles (< 6.5) ou pour maintenir des pH adéquats

avec de faibles LR (< 1.5 g CaCO3 kg-1). La spectroscopie infrarouge a permis une estimation précise de la

capacité d’échange cationique déterminée à l’acétate d’ammonium (pH 7.0) (CEC-NH4OAc). La CEC-NH4OAc

inférée dans un intervalle de prédiction de ± 2.3 cmolc kg-1 permet d’élaborer un modèle d’estimation précis du

LR pour atteindre le pH cible. Afin d’orienter les pratiques bénéfiques à la qualité des sols, des critères

diagnostiques du degré d’agrégation des sols ont été élaborés en relation avec leur composition (texture, C

organique) en utilisant la prairie comme système cultural de référence. Ensuite, les cultures annuelles ont

montré un effet négatif plus important sur la stabilité des agrégats en sols argileux, alors que les sols loameux

et sableux se sont montrés plus susceptibles à la dégradation de la structure à de faibles teneurs en C

organique. Les indices courants d’agrégation basés sur le diamètre moyen des agrégats se sont montrés

moins adéquats au développement de seuils critiques comparativement aux indices fractal et compositionnel.

Afin d’intégrer les caractéristiques pédologiques aux modèles de réponse de la pomme de terre aux

fertilisants, un regroupement numérique de séries de sols a finalement été développé à partir des descriptions

morphologiques des profils modaux. Le gradient textural et les deux principaux processus pédogénétiques

(podzolisation, gleyification) ont été synthétisés en trois classes d’appartenance floue permettant une

expression graduelle des groupes de sols en appui aux modèles de réponse aux ajouts de fertilisants. Ce

projet présente des approches innovantes de synthèse et d’interprétation des connaissances pédologiques

appuyées par des techniques d’analyses numériques afin d’orienter les bonnes pratiques culturales

contribuant à la pérennité des agroécosystèmes.

iv

Abstract Soil knowledge base is fundamental to address environmental and agronomic issues such as food production,

efficient use of nutrients and soil conservation. This project aimed to provide models linking pedological

characteristics to management practices of crop fertilization and soil conservation, specifically to guide potato

(Solanum tuberosum L.) cropping. This Ph.D. project comprises three parts. First, the Shoemaker-McLean-

Pratt (SMP) pH buffer, that contains noxious substances, showed to be inappropriate to support lime

requirement (LR) determination of soils requiring low target pHH2O (< 6.5) or to maintain adequate pH levels at

low LRs (< 1.5 g CaCO3 kg-1). Cation exchange capacity determined by the ammonium acetate (pH 7.0)

method (CEC-NH4OAc) can be accurately estimated from infrared spectroscopy (IR). The CEC-NH4OAc

inferred by IR spectroscopy within a prediction interval of ± 2.3 cmolc kg-1 allowed supporting an accurate

estimation model of LR to achieve a target pH as alternative to the SMP buffer method. To guide beneficial

cultural practices that enhance soil quality, diagnostic criteria of soil aggregation degree were elaborated

thereafter in relation with soil composition (texture and organic C) using grassland as reference cropping

system. Annual cropping systems showed more deleterious effect on the aggregate stability of clayed soils

compared to other textural classes, while loamy and sandy soils were shown to be more susceptible to soil

structure degradation at lower organic C content. The current aggregation indices based on the weighted and

geometric mean diameter of aggregates was less appropriate to develop critical values compared to fractal

and compositional indices. Finally, to integrate pedological characteristics of subsoil into potato fertilization

response models, a numerical clustering of soil series has been developed using morphological descriptions of

modal profiles. The textural gradient and the two major pedogenic processes (podzolization, gleying), which

are primarily associated with water regime, were synthesized into three continuous soil classes allowing a

gradual expression of soil groups for integration into crop response nutrient models. This project presents

innovative approaches to synthesize and interpret the pedological knowledge as supported by digital analysis

techniques to guide good cultural practices that contribute to the sustainability of potato agro-ecosystems and

their ecosystem services.

v

Table des matières Résumé ............................................................................................................................................................. iii

Abstract ............................................................................................................................................................ iv

Table des matières............................................................................................................................................ v

Liste des tableaux ........................................................................................................................................... vii

Liste des figures ............................................................................................................................................ viii

Liste des abréviations ..................................................................................................................................... xi

Remerciements .............................................................................................................................................. xiii

Avant-propos ................................................................................................................................................. xiv

Chapitre 1: Introduction générale ................................................................................................................... 1

1.1 Introduction ............................................................................................................................................... 1

1.2 La pédologie et le concept de qualité des sols ......................................................................................... 2 1.2.1 Les classes pédologiques ................................................................................................................. 2 1.2.2 La qualité des sols ............................................................................................................................ 3 1.2.3 La relation entre les classes pédologiques et la qualité des sols ...................................................... 7 1.2.4 La résilience des sols ........................................................................................................................ 8

1.3 La classification des sols .......................................................................................................................... 9 1.3.1 Les systèmes de classification des sols .......................................................................................... 10 1.3.2 La classification numériques des sols ............................................................................................. 13

1.4 Mesure et analyse de données numériques de sols .............................................................................. 14 1.4.1 La spectroscopie infrarouge ............................................................................................................ 14 1.4.2 L’analyse de données compositionnelles ........................................................................................ 15

1.5 Hypothèses et objectifs .......................................................................................................................... 18

1.6 Bibliographie ........................................................................................................................................... 19

Chapitre 2: Lime requirement using Mehlich-III extraction and infrared-inferred cation exchange capacity ........................................................................................................................................................... 30

2.1 Résumé .................................................................................................................................................. 31

2.2 Abstract .................................................................................................................................................. 32

2.3 Introduction ............................................................................................................................................. 33

2.4 Material and methods ............................................................................................................................. 34 2.4.1 Soil properties ................................................................................................................................. 34 2.4.2 Soil-lime incubations ....................................................................................................................... 35 2.4.3 Near- and mid-infrared spectra ....................................................................................................... 35 2.4.4 Soil collection for pedotransfer functions......................................................................................... 35 2.4.5 The estimated exchangeable acidity index ..................................................................................... 35 2.4.6 Statistical analysis ........................................................................................................................... 36

2.5 Results and discussion ........................................................................................................................... 38 2.5.1 Soil properties in the lime incubation study ..................................................................................... 38 2.5.2 Conversion equations ..................................................................................................................... 38 2.5.3 Pedotransfer functions for CEC....................................................................................................... 38 2.5.4 Relationship between lime requirements and SMP and EEA indices ............................................. 39 2.5.5 Performance of the EEA index using IR-inferred CECs .................................................................. 40 2.5.6 Prediction of LR by infrared spectroscopy ....................................................................................... 40

vi

2.6 Conclusion .............................................................................................................................................. 41

2.7 Acknowledgements ................................................................................................................................ 41

2.8 References ............................................................................................................................................. 43

Chapitre 3: Critical values of wet aggregate-size distribution indices for soil quality assessment ....... 64

3.1 Résumé .................................................................................................................................................. 65

3.2 Abstract .................................................................................................................................................. 66

3.3 Introduction ............................................................................................................................................. 67

3.4 Material and methods ............................................................................................................................. 67 3.4.1 Soil collection and analysis ............................................................................................................. 67 3.4.2 Transformation of compositional data ............................................................................................. 68 3.4.3 Synthetic soil aggregate indices ...................................................................................................... 69 3.4.4 Statistical analysis ........................................................................................................................... 70

3.5 Results and discussion ........................................................................................................................... 71 3.5.1 Soil aggregation indices .................................................................................................................. 71 3.5.2 Soil composition effects on aggregation.......................................................................................... 72 3.5.3 Critical values according to soil composition ................................................................................... 72 3.5.4 Critical values by soil textural groups .............................................................................................. 73 3.5.5 Effects of annual cropping system on aggregation ......................................................................... 73 3.5.6 Calibration set size requirement ...................................................................................................... 73

3.6 Conclusion .............................................................................................................................................. 74

3.7 Acknowledgements ................................................................................................................................ 74

3.8 References ............................................................................................................................................. 75

Chapitre 4: Numerical clustering of soil series using morphological profile attributes for potato ......... 97

4.1 Résumé .................................................................................................................................................. 98

4.2 Abstract .................................................................................................................................................. 99

4.3 Introduction ........................................................................................................................................... 100

4.4 Materials and methods ......................................................................................................................... 100 4.4.1 Legacy database of profile descriptions of soil series ................................................................... 100 4.4.2 Transformation of morphological data ........................................................................................... 101 4.4.3 Dissimilarity measure between soil profiles ................................................................................... 102 4.4.4 Continuous classification of soil profiles ........................................................................................ 103 4.4.5 Statistics of the aggregated depth functions ................................................................................. 103

4.5 Results and discussion ......................................................................................................................... 104 4.5.1 Clustering of modal soil profiles .................................................................................................... 104 4.5.2 Cluster memberships of individual soil series ............................................................................... 105 4.5.3 Domain limit of the clusters ........................................................................................................... 106 4.5.4 Comparison of genetic horizons indexed with basic properties..................................................... 106

4.6 Conclusions .......................................................................................................................................... 106

4.7 Acknowledgements .............................................................................................................................. 107

4.8 References ........................................................................................................................................... 108

Chapitre 5: Conclusion générale ................................................................................................................. 118

5.1 Conclusions .......................................................................................................................................... 118

5.2 Perspectives ......................................................................................................................................... 119

5.3 Références ........................................................................................................................................... 121

vii

Liste des tableaux Tableau 1.1. Exemples d’indicateurs de la qualité des sols (USDA 2001). ........................................................ 4

Tableau 1.2. Exemples de guides d’inspection et d’évaluation de la qualité des sols (voir texte pour définition des catégories). .................................................................................................................................................. 5

Tableau 1.3. Exemples de critères physiques de qualité des sols. ..................................................................... 6

Tableau 1.4. Équivalence taxonomique des classes de sols minéraux des régions agricoles du Québec. ...... 11

Table 2.1. Spectral pretreatment and model calibrated to predict clay content, CEC-NH4OAc and lime requirement (LR) from near- (NIR) and mid-infrared (MIR) spectral bands. ..................................................... 48

Table 2.2. Chemical properties of the 50 soils used in the lime incubation study. ............................................ 49

Table 2.3. Relationships between total C (CCNS) and Walkley-Black organic C (OCWB) and between Mehlich III (MIII) and ammonium-acetate (pH 7.0) exchangeable (ex) K, Ca and Mg for soils with OCWB < 90 g kg-1. ...... 50

Table 2.4. Cross-validation performance of Shoemaker-McLean-Pratt (SMP) buffer and estimated exchangeable acidity (EEA) models using CEC-NH4OAc measured and CEC-NH4OAc inferred directly from near- (NIR) and mid-infrared (MIR) or from Walkley-Black organic C (OCWB) and NIR- and MIR-predicted clay contents to predict LR at four target pHH2O. ...................................................................................................... 51

Table 2.5. Regression models between lime requirement (g CaCO3 kg-1) (y) and Shoemaker-McLean-Pratt (SMP) buffer and estimated exchangeable acidity (EEA) using measured CEC-NH4OAc (x), respectively, across the soil dataset of the liming experiment. .............................................................................................. 52

Table 3.1. Sequential binary partitioning of basic soil composition components. ............................................. 79

Table 3.2. Sequential binary partitioning of soil aggregate-size components. .................................................. 80

Table 3.3. Performance of SVM models to estimated reference values of aggregate indices. ......................... 81

Table 3.4. Critical value and area under the curve (AUC) of binary partition of aggregation indices by textural groups. .............................................................................................................................................................. 82

Table 4.1. Indexing value of selected genetic horizons of the Canadian system of soil classification. ........... 110

Table 4.2. Sequential binary partitioning of soil textural components. ............................................................ 111

viii

Liste des figures Figure 1.1. Schéma relationnel entre les bases de données, les pratiques culturales et les services écosystémiques rendus par les sols associés à ce projet de doctorat. .............................................................. 2

Figure 1.2. Relation généralisée entre le rendement relatif d’une culture et la déviation à la masse volumique apparente (MVA) optimale (tirée de Kaufmann et al. 2010). ............................................................................... 7

Figure 1.3. Conceptualisation de la qualité inhérente de deux sols différents (a) et de la qualité dynamique des sols évoluant en fonction du temps (a et b) (adaptée de Karlen et al. 2001). ..................................................... 8

Figure 1.4. Principe de travailler en système de coordonnées (adapté de Pawlowsky-Glahn et al. 2011). Les données de base (points noirs) sont transformées en coordonnées ilr, les statistiques (ellipse bleue couvrant 95% de la distribution théorique autour du centroïde) sont calculées et exprimées dans l’espace compositionnel par transformation retour des ilrs. ............................................................................................ 17

Figure 2.1. Particle size distribution of lime study soils. Polygon limits represent the USDA textural classes. . 53

Figure 2.2. Distribution of clay and Walkley-Black organic C contents in the soil collection used to develop pedotransfer functions for CEC-NH4OAc and for the soils of the lime incubation experiment. ......................... 54

Figure 2.3. Relationship between measured CEC-NH4OAc and CEC-NH4OAc predicted from Walkley-Black organic C and clay contents for the cross-validation (empty dots) and validation (filled dots) data. ................. 55

Figure 2.4. Continuum-removed absorbance of the near- (a) and mid-infrared (b) spectra for the soil of the collection and the lime incubation experiment. ................................................................................................. 56

Figure 2.5. Relationships between laboratory-determined clay content and clay content predicted using near- (NIR) (a) and mid-infrared (MIR) (b) for cross-validation (empty dots) and validation (filled dots) data. ........... 57

Figure 2.6. Relationships between laboratory-determined CEC-NH4OAc and CEC-NH4OAc predicted from Walkley-Black organic C (OCWB) and clay contents predicted using near- (NIR) (a) and mid-infrared (MIR) (b) in Eq. 2.6 for cross-validation (empty dots) and validation (filled dots) data. .................................................... 58

Figure 2.7. Relationships between laboratory-determined CEC-NH4OAc and CEC-NH4OAc predicted using near- (NIR) (a) and mid-infrared (MIR) (b) for cross-validation (empty dots) and validation (filled dots) data. .. 59

Figure 2.8. Relationships between incubation lime requirement (LR) and Shoemaker-McLean-Pratt (SMP) buffer (on the left) and estimated exchangeable acidity (EEA) from measured CEC-NH4OAc (on the right) at four target pHH2O. Regression equations are presented in Table 2.5. ............................................................... 60

Figure 2.9. Relationships between incubation lime requirement (LR) and estimated exchangeable acidity (EEA) index using CEC-NH4OAc inferred directly from near- (NIR) and mid-infrared (MIR) or from Walkley-Black organic C (OCWB) and NIR- and MIR-predicted clay contents at four target pHH2O. Lines represent regressions between LR and EEA index using laboratory-determined CEC-NH4OAc as presented in Table 2.5. .......................................................................................................................................................................... 61

Figure 2.10. Binary partition of the relationship between the absolute prediction errors of infrared (IR)-inferred CEC-NH4OAc and lime requirement (LR) predicted from estimated exchangeable acidity (EEA) model using IR-inferred CEC-NH4OAc at target pHH2O of 7. The CEC-NH4OAc was inferred directly from near- (NIR) and mid-infrared (MIR) or from Walkley-Black organic C (OCWB) and NIR- and MIR-predicted clay contents. The threshold value of ±2.29 cmolc kg-1 for the IR-inferred CEC-NH4OAc was computed for a LR error of 1.29 g CaCO3 kg-1 with an area under the curve of 90.3% from the receiver operating characteristic curve. ... 62

Figure 2.11. Relationships between lime requirements (LR) derived of estimated exchangeable acidity (EEA) model and predicted by mid-infrared at four target pHH2O for the cross-validation (empty dots) and validation (filled dots) data. ............................................................................................................................................... 63

Figure 3.1. Soil particle size distribution for a) grassland, b) cereals, c) maize and d) potatoes systems. Polygon limits represent the USDA textural classes. ........................................................................................ 83

ix

Figure 3.2. Boxplot of soil organic carbon by textural group and cropping system. Diamonds represent averages. .......................................................................................................................................................... 84

Figure 3.3. Geometric representation of the balance aggregation index (BAI) computed into a coordinate system using isometric log-ratio (ilr) transformations of aggregate composition. The BAI index represents the orthogonal projection of vector a, defined by the ilr coordinates of the aggregate composition (x), onto vector b, defined by the ilr coordinates of the ultimate aggregation (N) and the ultimate disaggregation (M) compositions, expressing the linear gradient of the aggregation balance between ultimate structures. The figure is representative of a three-part composition but BAI can be computed for a D-part composition into a D-1 coordinate system. ..................................................................................................................................... 85

Figure 3.4. Bi-plot a) of aggregate-size balances along the first two principal components and b) correlation circle of synthetic indices, composition and aggregate-size balances (Tables 3.1-3.2) with the first two principal components of aggregate-size balances. Filled white circles represent centroid and elliptical areas scanning 95% of the theoretical distribution of grassland (G), cereals (C) maize (M) and potatoes (P) cropping systems. BAI = balance aggregation index; Df = scaling fractal coefficient; MWD = mean weight diameter; GMD = geometric mean diameter; ilr = isometric log ratio of basic components (C-) and aggregates (A-). ................. 86

Figure 3.5. Boxplot of soil aggregation indices. Diamonds represent averages. .............................................. 87

Figure 3.6. Relationships between aggregate balance A-ilr1 ([1-8 mm | ≤ 1 mm]) and composition balances a) C-ilr2 ([sand | silt, clay]) and b) C-ilr3 ([silt | clay]) by cropping system. ............................................................. 88

Figure 3.7. Relationships between aggregate balance A-ilr1 ([1-8 mm | ≤ 1 mm]) and composition balance [organic C | mineral] by cropping system and textural group. ........................................................................... 89

Figure 3.8. Boxplot of soil organic carbon in grassland cropping system by drainage class (1: very rapid; 2: rapid; 3: well; 4: moderately well; 5 imperfect, 6: poor; and 7: very poor (Day, 1982)) and textural group. Diamonds represent averages. ......................................................................................................................... 90

Figure 3.9. Relationships between the delta (∆) value and the predicted aggregate index and the density distribution of delta values for grassland testing dataset. ................................................................................. 91

Figure 3.10. Cumulative distribution function of delta (∆) values and relationship between optimal coverage probability (OCP) and the confidence level for soil aggregate indices Df and BAI. Critical values of 0.72 and 1.37 at 0.95 percentile, respectively for Df and BAI, are represented by empty dots. ....................................... 92

Figure 3.11. Soil aggregate-size distribution for a) clayey (G1), b) loamy (G2) and c) sandy (G3) textural group. Shaded area covers below-optimal aggregation states using the critical values of BAI (Table 3.4). ................ 93

Figure 3.12. Relationships between the delta (∆) value of BAI in reference to grassland and the composition balance [sand | silt, clay] by cropping system. Dashed lines represent the critical value of 1.37 (points above the dashed line indicate critical disaggregation). .............................................................................................. 94

Figure 3.13. Relationships between the delta (∆) value of BAI in reference to grassland and the composition balance [organic C | mineral components] by cropping system and textural group. Dashed lines represent the critical value of 1.37 (points above the dashed line indicate critical disaggregation). ....................................... 95

Figure 3.14. Size of training data sets to establish a) the critical value of ∆ BAI at a confidence level of 95% and b) the optimal coverage probability (OCP) on the testing data set. Dots represent averages and the shaded area covers the 5th to 95th percentile of 100 iterations for each size of data set randomly selected in the training data set (n = 1546). ........................................................................................................................ 96

Figure 4.1. Number of modal soil profiles (a) and fertilizer trial sites (b) by soil series. Soil series code refers to the updated file of soil names of Québec (Lamontagne and Nolin, 1997). For brevity, soil series variants were amalgamated into the corresponding soil series. ............................................................................................ 112

Figure 4.2. Summary of the numerical clustering process of soil profiles. ...................................................... 113

Figure 4.3. Distribution of the maximum fuzzy membership (max mc) to clusters A, B and C of modal soil profiles in fertilizer trials (a) and correlation circles of clr-transformed textural components and pedogenetic

x

features (see Table 4.1 for descriptions) of B and C horizons along the first two principal coordinates. Filled white circles represent exemplar positions and elliptical shaded areas cover 95% of the theoretical distribution of clusters A, B and C. .................................................................................................................................... 114

Figure 4.4. Aggregated depth functions of morphological variables (see Table 4.1 for descriptions) by soil class (A, B, and C) of trials-associated modal profiles. Black lines represent the mean and shaded area coverage ± standard deviation computed for each 1 cm layer. Dots represent observed values at midpoint of genetic horizons. ......................................................................................................................................................... 115

Figure 4.5. Distribution of the fuzzy membership to the extragrade cluster (m*) for all modal profiles of soil series along the first two principal coordinates. Filled white circles represent exemplar positions for clusters A, B and C. .......................................................................................................................................................... 116

Figure 4.6. Principal component analysis of genetic horizons of soil profiles of experimental sites using variables of Fig. 4.6b (a). Correlation circles of color and chemical variables (b) and pedogenetic features (c) (see Table 4.1 for descriptions) along the first two principal components. Filled white circles represent centroid and elliptical shaded areas cover 95% of the theoretical distribution of genetic horizon suffixes. .................. 117

xi

Liste des abréviations alr additive log-ratio (log-ratio additif)

AUC area under the receiver operating characteristic curve

BAI balance aggregation index

BRT boosted regression trees

CEC cation exchange capacity (capacité d’échange cationique)

clr centered log-ratio (log-ratio centré)

Df fractal scaling factor

EA exchange acidity

EEA estimated exchange acidity

FIR far-infrared (lointain infrarouge)

FKMe fuzzy k-means with extragrades

GMD geometric mean diameter

ilr isometric log-ratio (log-ratio isométrique)

IR infrared (infrarouge)

LR lime requirement

m* fuzzy membership to the extragrade cluster

max mc maximum fuzzy membership to clusters

MIR mid-infrared (moyen infrarouge)

MVA masse volumique apparente

MWD mean weight diameter

NIR near-infrared (proche infrarouge)

OC organic carbon

OCWB Walkley-Black organic carbon

OCP optimal coverage probability

pHCaCl2 pH in CaCl2

pHH2O pH in water

PLSR partial least squares regression

PTF pedotransfer function

R2 coefficient of determination

RMSE root mean square error

ROC receiver operating characteristic

SBP sequential binary partition

SMP Shoemaker-McLean-Pratt

SVM support vector machine

xii

L'or de l’éveil est dans le sol de notre esprit, mais si nous ne creusons pas, il reste caché.

Kalou Rinpoché

xiii

Remerciements Je tiens à remercier mon directeur de recherche, le professeur Léon-Étienne Parent, pour m’avoir guidé, aidé

et conseillé tout au long de ce projet de doctorat. Je vous suis grandement reconnaissant pour les

nombreuses opportunités et ouvertures interdisciplinaires dont vous m’avez fait bénéficier au cours de ma

formation. Mes sincères remerciements pour votre confiance, votre rigueur scientifique et votre inspiration.

Je voudrais remercier le Dr Dominique Arrouays, INRA Orléans, France, le professeur Michel Caillier,

Université Laval, également pré-lecteur de la thèse, et la professeure Josée Fortin, Université Laval, pour

avoir accepté d'être membres du jury d'évaluation. Vos commentaires et suggestions ont contribué à améliorer

cette thèse.

La réalisation de ce projet a été rendue possible grâce à la participation de nombreux collaborateurs. Je tiens

donc à remercier Gilles Gagné pour son soutien, son entière disponibilité et le partage de ses connaissances

en pédologie, le Dr Serge-Étienne Parent pour ses nombreux conseils et ses suggestions en programmation

et graphisme et les échanges sur les concepts et techniques d’analyse de données compositionnelles,

Élizabeth Parent et Marie-Hélène Lamontagne pour leur laborieux et efficient travail réalisé en laboratoire, leur

aide et leur dévouement à l’équipe, Nicolas Samson pour son aide, sa disponibilité et ses encouragements

quotidiens, le Dr Jérôme Fortin-Goulet pour ses conseils en modélisation, ainsi que le personnel du

Département des sols et de génie agroalimentaire de l’Université Laval pour les conseils et l’aide reçus. Mes

sincères remerciements à vous tous pour avoir contribué à ma formation et à la réalisation de ce projet.

I express my gratitude to Professor Alex McBratney and Professor Budiman Minasny, The University of

Sydney, Australia, for welcoming me in their research team for a training course in numerical soil classification.

I also thank Philip Hughes and Sebastian Campbell for their support and the wonderful experience.

Finalement, je tiens à remercier toute ma famille et mes amis pour leurs pensées, leur soutien, leurs

encouragements et les moments de qualité (parfois incongrus) partagés. À votre habituelle interrogation, je

peux enfin répondre d’un très grand sourire.

xiv

Avant-propos Cette thèse s’inscrit dans le cadre d’un projet intitulé « Mise en place de moyens pour améliorer les services

écosystémiques de la pomme de terre » financé par une subvention de recherche et développement

coopérative du Conseil de Recherches en Sciences Naturelles et en Génie du Canada (CRSNG-RDC) en

partenariat avec les entreprises Cultures Dolbec Inc., Groupe Gosselin FG Inc., Agriparmentier Inc.,

Prochamps Inc. et Ferme Daniel Bolduc et Fils Inc.

La thèse est composée de cinq chapitres. Le chapitre 1 comprend une introduction et une revue de littérature

des thématiques générales de la thèse. Le chapitre 2 est un article publié dans Soil Science Society of

America Journal portant sur un modèle de recommandation des besoins en chaux basé sur la capacité

d’échange cationique inférée par spectroscopie infrarouge et les cations non-acides extraits par la méthode

Mehlich-III. Le chapitre 3 est un article soumis à la revue Frontiers in Ecology and Evolution portant sur les

critères diagnostics des indices de distribution des agrégats en relation avec la composition granulométrique,

le C organique du sol et les pratiques culturales. Le chapitre 4 est un article de conférence publié dans le livre

Digital Soil Morphometrics édité par A.E. Hartemink et B. Minasny dans Progress in Soil Science Series de

Springer portant sur le regroupement numérique de séries de sols à partir des propriétés morphologiques des

profils en appui aux modèles de réponse de la culture de pomme de terre à l’ajout de fertilisants. Le dernier

chapitre comprend une conclusion générale et une liste de recommandations et de perspectives futures.

Les articles insérés dans la thèse ont été rédigés en anglais et je suis l’auteur principal de chacun d’eux. J’ai

participé à certaines activités de laboratoire, mais la plupart des analyses originales de cette thèse ont été

réalisées par Élizabeth Parent, Marie-Hélène Lamontagne, Nicolas Samson et Marie-Ève Tremblay, membres

de l’équipe de recherche du professeur Léon-Étienne Parent à l‘Université Laval. J’ai contribué à la cueillette

de données de terrain et j’ai réalisé la description des profils de sols des sites d’essais de fertilisation. J’ai été

le responsable des analyses statistiques, de l’interprétation des résultats et de la rédaction des chapitres. Les

co-auteurs des articles sont le Dr Léon-Étienne Parent, le Dr Serge-Étienne Parent et Élizabeth Parent de

l’Université Laval et Gilles Gagné du Centre d’Expertise et de Transfert en Agriculture Biologique et de

Proximité (CETAB+). Le chapitre 4 a bénéficié des suggestions des professeurs Alex McBratney et Budiman

Minasny, The University of Sydney, Australie, pour l’élaboration du processus de classification numérique des

profils de sols.

Les résultats de ces études ont été présentés au congrès de l’Association Québécoise de Spécialistes en

Sciences du Sol en 2014 à Victoriaville à titre de conférence liminaire et à l’Inaugural Global Workshop on

Digital Soil Morphometrics en 2015 à Madison, Wisconsin. Des travaux connexes aux chapitres de cette thèse

ont également été présentés lors du 13th International Symposium for Soil and Plant Analysis en 2013 à

http://link.springer.com/bookseries/8746

xv

Queenstown, Nouvelle-Zélande (géostatistiques de données compositionnelles) et lors d’un séminaire au

Department of Environmental Sciences, The University of Sydney, Australie, en 2013 (conceptualisation de

séries de sols par classification numérique) dans le cadre d’un séjour de 3 mois au sein de l’équipe de

recherche des professeurs Alex McBratney et Budiman Minasny. Enfin, j’ai participé au 4th International

Workshop on Compositional Data Analysis en 2011 à Sant Feliu de Guixols, Girona, Espagne.

1

Chapitre 1: Introduction générale

1.1 Introduction Le sol est une ressource fondamentale (Blanco-Canqui et Lal 2008). Il est à la base des services

écosystémiques de soutien (support pour les plantes et les activités humaines), de régulation (cycles

hydrologiques et biogéochimiques), d’approvisionnement (matières premières et production de biomasse) et

culturels (patrimoine archéologique et composante des paysages) (Millennium Ecosystem Assessment 2005;

Adhikari et Hartemink 2016). La connaissance des sols est primordiale pour traiter des enjeux agronomiques

et environnementaux comme ceux de la production de nourriture, de l’utilisation efficace des nutriments et de

la conservation des sols (Lal et Stewart 2013).

Par le passé, des bases de données ont été élaborées au Québec pour documenter la nature des sols, leur

utilisation et leur qualité ainsi que la réponse des cultures aux pratiques culturales. Trois principales bases de

données ont été constituées de façon indépendante : les données pédologiques provenant des inventaires

pédologiques (1943-2010), les données sur la qualité des sols provenant de l’inventaire des problèmes de

dégradation des sols agricoles (1990) et les données agronomiques sur la fertilisation des cultures et la

conservation des sols (1958-2014).

À ce jour, les sols du Québec ont été classifiés en 665 séries selon une approche naturaliste et orientée vers

un usage multidisciplinaire des sols. En regard d’objectifs agronomiques spécifiques, ces classes peuvent être

agrégées. Ces regroupements pédologiques facilitent l’interprétation et la mise en relation des séries de sols

avec d’autres paramètres relatifs à la fertilisation et à la conservation des sols. Les concepts de séries de sols

sont construits principalement à partir des propriétés permanentes des sols. Par contre, la productivité des

plantes est également influencée par des propriétés dynamiques. En lien avec les classes pédologiques, le

développement d’indices et de critères de qualité des sols est donc crucial pour orienter les pratiques

culturales.

Au Québec, le diagnostic de fertilité des sols pour les recommandations de fertilisation en éléments N, P et K

est réalisé en fonction du type de culture, de la biodisponibilité des éléments nutritifs et pour certaines

cultures, du groupe textural de la couche de surface de sols (Parent et Gagné 2010). Dans une approche

holistique, la détermination des besoins en fertilisants doit être basée sur le potentiel de croissance spécifique

qu’offrent aux cultures un site, une zone d’aménagement ou une région. Ce potentiel est lié à un ensemble

d’éléments comme les propriétés pédologiques, la qualité des sols, les pratiques culturales et les conditions

climatiques.

2

De nouvelles techniques de mesure comme la spectroscopie infrarouge sont désormais disponibles afin de quantifier les propriétés de sols et d’améliorer (coût, respect de l’environnement) l’évaluation de la ressource sols (Du et Zhou 2009; Soriano-Disla et al. 2014). Au cours des dernières années, des librairies spectrales dans la bande infrarouge ont été élaborées au Département des Sols et de Génie Agroalimentaire, Université Laval, pour des fins prédictives de diverses propriétés de sols.

Ce projet de doctorat vise à mettre en relation les caractéristiques pédologiques et de qualité des sols avec la gestion des pratiques culturales (Figure 1.1). Il fait appel à l’analyse de données compositionnelles et aux techniques de chimiométrie, de pédométrie et d’écologie numérique pour synthétiser et croiser les informations pédologiques, de qualité des sols, et agronomiques.

Figure 1.1. Schéma relationnel entre les bases de données, les pratiques culturales et les services écosystémiques rendus par les sols associés à ce projet de doctorat.

Ce premier chapitre introduit des concepts et des connaissances générales et énonce les hypothèses et les objectifs de ce projet de recherche visant à améliorer la prise de décision sur les services des agroécosystèmes en regard des pratiques de conservation et de fertilisation, et à faire avancer les connaissances agronomiques nécessaires à l’application des technologies d’agriculture de précision.

La pédologie est une discipline de la science du sol qui étudie la formation, la morphologie, la classification et la distribution des sols dans les pédopaysages (Gregorich et al. 2001). Le climat, les organismes, la

3

topographie, le matériau parental et le temps sont les cinq principaux facteurs de formation des sols qui

définissent les caractéristiques morphologiques de la pédosphère (Jenny 1941). D’autres modèles basés sur

des facteurs, des processus, des voies génétiques ou des concepts de thermodynamique ont été proposés

pour quantifier les mécanismes de pédogenèse (Minasny et al. 2008; Stockmann et al. 2011). La relation

facteurs → processus → propriétés a souvent été appelée le paradigme de la science du sol car plusieurs

relations, modèles, hypothèses et concepts sont dérivés de l’équation factorielle de Jenny (1941) et de son

lien entre les facteurs de formation et les propriétés des sols (Arnold 2006). De nombreux systèmes de

classification des sols, nationaux et internationaux, basés sur les caractéristiques morphogénétiques (Isbell

1996; GTCS 2002; IUSS Working Group WRB 2014; Soil Survey Staff 2014) reposent sur cette prémisse de

base de la science du sol (Bockheim et al. 2014). Ces systèmes sont principalement axés sur les propriétés

intrinsèques, aussi qualifiées de permanentes, inhérentes ou statiques, que les mécanismes pédogénétiques

font évoluer très lentement (Norfleet et al. 2003).

1.2.2 La qualité des sols

Bien que l’humain soit implicitement inclus parmi les organismes dans l’équation de Jenny (Amundson et

Jenny 1991), il est reconnu que ses interventions et ses pratiques de gestion ont un impact particulièrement

important sur l’évolution des sols (Dudal et al. 2002). Certaines propriétés des sols qualifiées de modifiables

ou de dynamiques sont sujettes à changement sur des périodes relativement courtes sous l’influence des

pratiques agronomiques (Nolin et al. 1993; Carter et al. 1997). Avec la reconnaissance des problèmes de

dégradation des sols et la nécessité de promouvoir une gestion pérenne des agroécosystèmes, le concept de

qualité des sols a été introduit (Carter et al. 1997). Diverses définitions de la qualité des sols ont été

proposées (Karlen et al. 1997; Singer et Ewind 1998; Karlen et al. 2001), mais elles relèvent principalement de

sa capacité à fonctionner à l’intérieur d’un écosystème afin de soutenir la productivité biologique, de maintenir

la qualité de l’environnement et de promouvoir la santé végétale et animale (Doran et Parkin 1994).

1.2.2.1 Les indicateurs de qualité des sols

Des indicateurs intégrant des propriétés physiques, chimiques et biologiques (Tableau 1.1), pouvant

également être dérivées d’équations de pédotransfert à partir des données pédologiques disponibles (Bouma

1989), permettent la comparaison et l’évaluation de la qualité des sols en regard de fonctions fondamentales

du sol (Doran et Parkin 1994; Larson et Pierce 1994; Doran et Parkin 1996). Il est toutefois reconnu qu’un sol

ayant une haute qualité pour une fonction peut ne pas être aussi bon pour d’autres fonctions (Nortcliff 2002).

Le concept de qualité des sols intègre simultanément des fonctions et des propriétés de sols diverses et

souvent conflictuelles (p. ex. les fortes teneurs en nitrate d’un sol peuvent être favorables à la croissance des

plantes mais défavorables à la qualité de l’eau) (Sojka et Upchurch 1999). Letey et al. (2003) ont donc

suggéré de relier les critères d’évaluation de la qualité des sols à une utilisation spécifique (c.-à-d. à une

4

situation réelle et envisageable de gestion et d’aménagement de l’agroécosystème) plutôt qu’à une capacité

fonctionnelle du sol. Malgré les désaccords terminologiques, le concept de qualité des sols demeure un outil

important pour la promotion et l’adoption de bonnes pratiques culturales contribuant à la pérennité des

agroécosystèmes et de leurs services pour les générations futures (Karlen et al. 2003).

Tableau 1.1. Exemples d’indicateurs de la qualité des sols (USDA 2001).

Indicateur Relation avec la qualité des sols

Matière organique Fertilité du sol; structure; stabilité; rétention des nutriments; réserve en eau utile; érosion

Physiques

Structure du sol Rétention et transport de l'eau et des nutriments; habitats des micro-organismes; érosion

Profondeur du sol et des racines Estimation du potentiel de production des cultures; compaction; semelle de labour

Infiltration et masse volumique apparente Mouvement de l'eau; porosité; maniabilité du sol

Capacité de rétention en eau Réserve et disponibilité de l'eau

Chimiques

pH Seuils d'activité biologique et chimique

Conductivité électrique Seuils d'activité des plantes et microbienne

N, P, K disponible Disponibilité des nutriments aux plantes; potentiel de perte du N et P

Biologiques

C et N de la biomasse microbienne Potentiel catalytique microbien et de réserve en C et N

Potentiel de minéralisation du N Productivité du sol et potentiel de fourniture en N

Respiration du sol Mesure de l'activité microbienne

Les indicateurs de qualité des sols peuvent s’exprimer le long d’un continuum allant de strictement analytique

à complètement descriptif. Liebig et Doran (1999) ont catégorisé les approches d’évaluation en quatre

groupes : [C1] analyses de laboratoire basées sur des protocoles bien établis et souvent considérées comme

les standards auxquels les autres formes d’évaluation sont comparées, [C2] analyses au champ permettant de

mesurer les indicateurs de qualité sur le site à l’étude, [C3] descriptions au champ reliant des observations

visuelles et tactiles mais suivant des méthodes de classification spécifiques (p. ex. évaluation texturale terrain

et description de la structure (Day 1982)) et [C4] perception des producteurs basée sur des évaluations

organoleptiques ou sensorielles au champ, c’est-à-dire des informations pouvant être appréciées par les sens

humains (vue, toucher, goût, ouïe et odorat). À travers ce continuum, des guides d’inspection et d’évaluation

intégrant différents ensembles d’indicateurs de qualité ont été élaborés afin d’orienter la gestion et l’utilisation

des sols (Tableau 1.2). Ces guides vont de la carte de pointage de qualité des sols afin d’améliorer la

sensibilisation en regard de la ressource sols et d’encourager les producteurs à regarder sous la surface du

sol au cadre d’évaluation quantitatif basé sur des critères diagnostiques.

5

Tableau 1.2. Exemples de guides d’inspection et d’évaluation de la qualité des sols (voir texte pour définition

des catégories).

Catégorie Titre Pays/État Référence

C1 The Soil Management Assessment Framework USA Andrew et al. 2004

C1 The Development and Use of Soil Quality Indicators for Assessing the Role of Soil in Environmental Interactions

UK Merrington et al. 2006

C1 - C2 Cornell Soil Health Assessment New York Gugino et al. 2009

C2 - C3 Visual soil examination techniques as part of a soil appraisal framework for farm evaluation

Australie McKenzie 2013

C2 - C3 The Muencheberg Soil Quality Rating Allemagne Müeller et al. 2007

C2 - C3 Soil quality test kit guide USA USDA 1999

C3† Soil profile description and evaluation Scotland Batey 2000

C3† Le profil cultural : une méthode d’étude in situ de la structure des sols cultivés

France Peigné et al. 2013

C3 L’évaluation visuelle de l’état du sol : la méthode du profil de sol agronomique

Québec Weill 2009a, Weill 2009b

C4 Farmer-based assessment of soil quality USA Romig et al. 1996

† Ces méthodes d’évaluation visuelle ont été comparées à plusieurs autres non-citées lors d’une rencontre terrain tenue en France par

l’Institut National de la Recherche Agronomique et l’International Soil Tillage Research Organization (Boizard et al. 2005).

1.2.2.2 L’agglomération des indicateurs de qualité des sols

Dans le but d’évaluer globalement la qualité des sols, différentes approches ont été proposées pour

agglomérer des ensembles d’indicateurs en indices synthétiques. L’analyse compensatoire combine, par

sommation, plusieurs attributs pondérés selon leur importance relative. Cette pondération est déterminée par

expérience, statistique ou modélisation (Romig et al. 1996; Wang et Gong 1998; Velasquez et al. 2007).

L’équation de pédotransfert est une variante de l’analyse compensatoire où les indices de qualité sont évalués

à partir de multiples propriétés disponibles (Bouma 1989). Avant d’être agglomérés en indices, les indicateurs

peuvent être reliés à des fonctions ou services écosystémiques spécifiques (p. ex. productivité, protection de

l’environnement) au moyen de fonctions de pointage (scoring functions) qui normalisent la valeur des

indicateurs entre 0 et 1 (Andrews et al. 2004; Idowu et al. 2008; Wienhold et al. 2009). Ces fonctions de

pointage, reflétant l’association entre les indicateurs et les fonctions du système, peuvent prendre la forme de

« plus c’est mieux », « moins c’est mieux » ou « optimum local ». Elles sont établis par des valeurs de la

littérature, par jugement d’experts ou par relations mathématiques (Karlen et Scott 1994; Wienhold et al.

2009). Pour une meilleure représentation des interactions entre les différents indicateurs, ces mêmes

fonctions peuvent être définies comme des fonctions d’appartenance (membership) à l’intérieur de systèmes

experts de logique floue. Ces systèmes d’inférence continue permettent d’agglomérer en une valeur unique

plusieurs indicateurs de qualité selon les principes d’appartenance relative à différentes classes (McBratney et

Odeh 1997; Kaufmann et al. 2009). L’indice de sensibilité moyen, établi par la moyenne des ratios

traitement/témoin de chacun des indicateurs, est une mesure de synthèse relative en relation avec une

6

référence (Bolinder et al. 1999). L’analyse factorielle et l’analyse en composantes principales sont des

techniques multidimensionnelles utilisées pour regrouper les indicateurs de qualité en un nombre inférieur de

facteurs ou de composantes (Schipper et Sparling 2000; Shukla et al. 2006). Lorsque les indicateurs sont de

nature compositionnelle, c’est-à-dire que la somme des indicateurs est contrainte à une constante (1 ou

100%), l’analyse compositionnelle permet de comparer, par une mesure unique de similarité (distance

d’Aitchison), l’information relative associée à l’état d’un sol donné à celle d’une référence tout en évitant les

biais inhérents à ce type de données associés à la redondance, à la dépendance d’échelle et à la distribution

non-normale (voir aussi section 1.4.2) (Bacon-Shone 2011).

1.2.2.3 Les critères diagnostics de qualité des sols

Les deux principales approches pour évaluer les systèmes de gestion et d’aménagement sont l’évaluation

comparative (système évalué en relation avec une solution alternative ou une référence) et l’évaluation

dynamique (système évalué sur une base temporelle) (Larson et Pierce 1994). La comparaison ou le suivi de

la qualité des sols permettent de qualifier ou de quantifier, souvent selon les concepts de « plus c’est mieux »

ou « moins c’est mieux », selon qu’un système cultural soutient, dégrade ou améliore la qualité du sol (Karlen

et al. 2001). Par contre, peu d’études ou d’inventaires rapportent des valeurs ou des intervalles de référence

permettant d’interpréter les indicateurs ou les indices de qualité des sols (Karlen et al. 2006; Kaufmann et al.

2009). Le tableau 1.3 présente quelques exemples de critères physiques de qualité des sols.

Tableau 1.3. Exemples de critères physiques de qualité des sols.

Indicateur Critique Optimum Critique Référence

inférieur Min Max supérieur

Matière organique (%) 2.3 3 5 6 Craul 1999; Greenland 1981; Sojka et Upchurch 1999

Masse volumique apparente 0.9 1.2 1.4-1.6 Reynolds et al. 2007

(g cm-3) 1.29 1.44 Carter 1990

1.75 Reinert et al. 2008

<1.55 1.65 Shukla et al. 2004

1.15 1.68 1.30-1.88 Kaufmann et al. 2010

Macroporosité (%) 6 17 Drewry et al. 2008

9 13 Reynolds et al. 2008

6-8 30 40 Sparling et al. 2008

Conductivité hydraulique saturée (cm s-1)

0.0001 0.005 0.0005 0.01 Reynolds et al. 2008

Par ailleurs, étant donné la variabilité des propriétés et la complexité des processus ayant lieu dans les sols

aux échelles spatiale et temporelle, l’évaluation de la qualité des sols à l’aide des valeurs critiques doit être

adaptée aux conditions spécifiques de chacun des milieux étudiés (Karlen et al. 2001). Les indices de qualité

7

et les valeurs optimales ne sont pas universellement applicables (Carter et al. 1997). Ainsi, les indices peuvent être développés sur la base de facteurs environnementaux inhérents comme la classe de matière organique (grade minéral vs organique), la texture, le climat, la période d’échantillonnage, la minéralogie, la région et la topographie (Andrews et al. 2004). Par exemple, puisque la masse volumique apparente (MVA) du sol est reliée à la texture, des équations de pédotransfert pour prédire la MVA optimale relativement à une fonction d’intérêt comme le rendement d’une culture peuvent être calibrées à partir de la composition texturale du sol. Des seuils critiques de MVA peuvent alors être établis selon le degré de déviation d’un spécimen par rapport à la MVA optimale (Figure 1.2) (Kaufmann et al. 2010). La MVA est un indicateur de compaction des sols qui est une forme de dégradation importante notamment dans les agroécosystèmes de la pomme de terre (Tabi et al. 1990; Stalham et al. 2006).

Figure 1.2. Relation généralisée entre le rendement relatif d’une culture et la déviation à la masse volumique apparente (MVA) optimale (tirée de Kaufmann et al. 2010).

Les difficultés liées au manque d'indicateurs communément convenus, leur fluctuation temporelle, le défi de recueillir certains de ces paramètres et le relativisme des valeurs centrées sur le jugement d’experts dans une variété d'environnements particuliers peuvent rendre l’évaluation de la qualité des sols inadéquate. C’est pourquoi une compréhension approfondie de la contextualisation du processus d’aide à la décision est nécessaire pour construire des évaluations de la qualité des sols utiles et fondées sur des données scientifiques (McBratney et al. 2012).

La pédologie considère essentiellement les propriétés d’ordre génétique alors que la qualité des sols réfère aux propriétés dépendantes de la gestion et de l’utilisation des sols affectant principalement la couche de

8

surface (Norfleet et al. 2003), bien que certains phénomènes induits par l’homme puissent aussi être observés dans les horizons du sous-sol (p. ex. accumulation de phosphore lessivé et fixé par les hydroxydes [Andersson et al., 1998]). D’après Norfleet et al. (2003), la qualité des sols est une extension de la science pédologique où les indices de qualité des sols varient selon l’influence du facteur humain, mais dont la portée de cette variabilité est liée aux propriétés d’ordre génétique. Les propriétés intrinsèques sont donc utiles pour définir la valeur potentielle des indices de qualité des sols et la susceptibilité de ces propriétés à changer selon la gestion et l’utilisation de sols (Figure 1.3) (Grossman et al. 2001). L’évaluation de la qualité des sols requiert des bases de données interprétatives des propriétés intrinsèques et des propriétés dynamiques des sols.

Figure 1.3. Conceptualisation de la qualité inhérente de deux sols différents (a) et de la qualité dynamique des sols évoluant en fonction du temps (a et b) (adaptée de Karlen et al. 2001).

La résilience a été définie comme la capacité d’un sol à récupérer son intégrité fonctionnelle et structurelle après avoir subi une perturbation (Seybold et al. 1999). La résistance, qui se différencie de la résilience, a été définie comme la capacité d’un sol à continuer de fonctionner sans changement suite à un bouleversement. Ainsi, pendant une perturbation, la qualité du sol dépend de sa résistance et, après cette perturbation, de sa résilience. Les perturbations étant omniprésentes dans les écosystèmes, les caractéristiques de résistance et de résilience deviennent des composantes fondamentales de la qualité des sols (Seybold et al. 1999).

Selon Schjønning et al. (2009), la stabilité du sol devrait intégrer les concepts de résistance et de résilience. Le terme vulnérabilité est étroitement mais inversement relié à la stabilité (Kay 1990). Jones et al. (2003) ont suggéré d’utiliser le terme « susceptibilité » pour désigner la vulnérabilité inhérente (p. ex. susceptibilité d’un sol à la compaction selon ses propriétés pédologiques), alors que la « vulnérabilité » devrait être associée aux facteurs extrinsèques comme les conditions climatiques (p. ex. vulnérabilité d’un sol à la compaction selon sa teneur en eau). La résilience et la résistance (ou la stabilité) peuvent être quantifiées, selon différents indices

9

de qualité, en comparant la capacité fonctionnelle d’un sol avant une perturbation à celle observée

immédiatement après (résistance) et suite à une période de récupération (résilience) (Seybold et al. 1999).

Par ailleurs, puisque l’utilisation et la gestion des sols font intégralement partie des agroécosystèmes, la

résilience devrait être reliée à l’état d’équilibre d’un système agricole et non pas à l’état qui prévalait dans

l’écosystème naturel d’origine (Blum 1997).

1.3 La classification des sols La nature du sol varie à la surface de la planète. Les variations latérale et verticale de la pédosphère sont le

résultat d’interactions complexes entre les processus de formation des sols dans leur contexte

environnemental durant des périodes de temps variables (Nortcliff 2006; Stockmann et al. 2011). Les sols sont

parmi les systèmes les plus complexes de la nature, possédant à la fois des propriétés de base résistant à

l’altération et des propriétés sujettes à des changements relativement rapides d’ordre saisonnier, périodique

ou induit par l’humain (Yaalon 2008). À travers cette variabilité et cette complexité, la classification est un outil

important dans l’étude des sols, permettant d’établir l’ordre et les possibles relations entre les pédons et

polypédons de localisations variées (Yaalon 2008).

La classification ou la catégorisation est un processus de tri ou d’arrangement d’objets en groupes sur la base

d'un ou de plusieurs objectifs conformément à un système ou un ensemble de principes (Buol et al. 2011). La

classification des sols est une structure abstraite permettant de mettre en ordre des informations multiples

(Yaalon 2008). Les fondements des systèmes de classification sont établis par les principes d’objectif, de

domaine et d’identité, et leur organisation est guidée par les principes de différenciation, de priorisation, de

diagnostic et d’appartenance (Arnold 2005). Le processus de catégorisation établit, selon des principes

logiques et de science cognitive, les différences et les similarités entre les sols pour assurer une meilleure

efficacité d’emmagasinage, de récupération et de communication de l’information (Buol et al. 2011).

La principale raison pour classifier les sols est d’organiser l’information pédologique à l’aide de classes qui

présentent des propriétés similaires et des comportements similaires à des interventions externes. Ainsi, il est

convenu que la classification des sols établisse des regroupements ou des subdivisions permettant de 1)

prédire leur comportement, 2) identifier les utilisations appropriées, 3) évaluer leur productivité, 4) identifier les

problèmes potentiels et les solutions potentielles pour entreprendre des actions correctives, 5) extrapoler les

connaissances acquises d’une localisation à une autre, 6) faciliter le transfert d’information et de technologie

et 7) fournir une base solide pour justifier de futures recherches (Nortcliff 2006; Buol et al. 2011). Il est

également reconnu qu’une classification ne peut être aussi bonne ou aussi mauvaise que l’objectif pour lequel

elle est désignée (Arnold 2006) et qu’elle peut rarement servir tout aussi bien deux objectifs à la fois (Cline

1949). Ainsi, la clef du succès d’une classification est de définir correctement l’objectif de classification et de

10

sélectionner les propriétés et les critères de différenciation permettant d’atteindre cet objectif (Nortcliff 2006).

La logique de classification reconnaît de nombreuses possibilités ou façons de regrouper les sources

majeures de variation dans les propriétés de sols (Yaalon 2008). Une classification basée sur autant de

caractéristiques que possible est la classification la plus naturelle qui est la mieux adaptée à des fins de

généralisation (Nortcliff 2006). Les systèmes de classification dits naturels ou universels sont axés sur les

concepts et les théories de la genèse du sol qui identifient les sols comme des entités naturelles reliées à

d’autres composantes de l’écosystème (Buol 2003). Elles peuvent être très bonnes à un niveau

d’interprétation générale mais moins valides à des fins spécifiques (Nortcliff 2006). Ainsi, d’autres systèmes

plus pragmatiques dits techniques, utilitaires, pratiques ou interprétatifs, organisent et classifient les sols selon

les propriétés mesurées à des fins spécifiques (Buol 2003).

À la base, toute définition crée deux classes : les entités qui sont incluses dans la définition et celles qui ne le

sont pas. Ainsi, lorsque le sol est défini, il y le sol et le non-sol (Arnold 2006). Une classe relève de deux

notions importantes, soit le concept central de la classe et les limites par rapport aux autres classes. Le

concept central est un regroupement défini par un intervalle étroit de propriétés de sol qui décrit l’entité

d’intérêt. Dans des schémas mutuellement exclusifs, les limites entre les classes sont rigidement fixées :

l’appartenance est oui ou non. Lorsqu’une gamme de propriétés multiples qui se chevauchent est acceptée,

les limites deviennent flexibles et l’appartenance conjointe peut être considérée. Une telle appartenance est

souvent exprimée comme un pourcentage ou degré d’appartenance à chacune des classes qui se

chevauchent. Ce concept est communément mentionné comme une classification basée sur la logique floue

ou continue (fuzzy logic) (Arnold 2006).

C’est le propre d’un système de classification de générer des situations de chevauchement peu importe le

niveau. Les critères établis pour un système national de classes mutuellement exclusives font qu'en certains

endroits la plupart des sols ont des propriétés qui chevauchent la limite établie entre deux taxons (GTCS

2002). Ces systèmes ne peuvent permettre des groupements adéquats pour tous les sols sur un même

territoire.

1.3.1 Les systèmes de classification des sols

1.3.1.1 La classification naturelle ou universelle des sols

L’objectif des systèmes nationaux et internationaux de classification des sols est d’abord de faciliter la

communication de la connaissance sur les sols (Buol et al. 2011). Le système canadien de classification des

sols comprend une organisation hiérarchique de plusieurs catégories qui permet de considérer les sols à

divers degrés de généralisation. Les différenciations entre les taxons sont basées sur des propriétés du sol qui

peuvent être observées et mesurées. Néanmoins, le système favorise les propriétés ou les combinaisons de

11

propriétés qui reflètent la genèse du sol. La philosophie générale du système canadien est pragmatique,

l'objectif étant d'organiser les connaissances sur les sols de façon raisonnable et utilisable. Les interprétations

sont une seconde étape afin de valoriser efficacement l'information pédologique (GTCS 2002). Au Québec, les

sols minéraux des régions agricoles sont classifiés en cinq principaux taxons (Ordre) du système canadien

(Tableau 1.4).

Tableau 1.4. Équivalence taxonomique des classes de sols minéraux des régions agricoles du Québec.

Système Canadien USA Soil Taxonomy Système International Chapitre

(GTCS 2002) (Soil Survey Staff 2014) (IUSS Working Group WRB 2014) de la thèse

Podzols Spodosols, quelques Inceptisols Podzols 2, 3 et 4

Gleysols Inceptisols, Entisols (Sous-ordres Aqu-) et quelques Alfisols

Gleysols 2, 3 et 4

Luvisols Alfisols (Udalfs) Luvisols 3

Brunisols Inceptisols, quelques Entisols (Psamments) Cambisols 2, 3 et 4

Régosols Entisols Fluvisols, Régosols 3

Les études de genèse et de classification des sols contribuent à la recherche et à l’acquisition de données

pour d’autres champs d’étude, comme la disponibilité et la distribution des nutriments essentiels aux plantes

dans différents types de sols (Buol et al. 2011; Leclerc et al. 2001; Leblanc et al. 2013). Une structure de

classification des sols peut fournir les bases pour prédire des propriétés et des comportements individuels à

partir des connaissances et des caractéristiques définies pour chaque classe (Nortcliff 2006). Néanmoins,

l’interprétation du comportement des sols nécessite des étapes ou des raisonnements additionnels associés à

des relations connues ou présumées entre le comportement et les propriétés mesurées (Ahrens et Arnold

2000). Ainsi, bien qu’il soit accepté que les classes taxonomiques puissent être regroupées, ou subdivisées et

regroupées, pour permettre des prédictions spécifiques, les classes d’un système peuvent ne pas être de

bons indicateurs, notamment du potentiel de séquestration du C (Simfukwe et al. 2011).

Les systèmes de classification sont basés sur les connaissances disponibles et ils sont susceptibles de

changer avec l’évolution des connaissances (Buol et al. 2011). La classification reflète ce qui est connu; les

bases de connaissances augmentent lorsque de nouveaux terrains sont explorés et de nouvelles techniques

analytiques sont utilisées pour décrire et analyser les sols (Buol et al. 2011), comme les techniques de

morphométrie numérique (spectroscopie infrarouge, spectrométrie par fluorescence de rayons X, microscopie

électronique à balayage, radar à pénétration de sol (géoradar), conductivité électrique) (Hartemink et Minasny,

2014).

12

1.3.1.2 La classification technique, pratique, interprétative ou utilitaire des sols

Depuis longtemps, les scientifiques reconnaissaient l'importance de catégoriser les types de sol et leurs

propriétés à l'égard de l'utilisation des sols, notamment à des fins agricoles (Carter et al. 1997). Les systèmes

de classification sont des outils de gestion potentiellement importants, mais sont, en général, mal ou

vaguement liés à la qualité du sol (MacEwan 1997). Les propriétés de la couche de surface ont une

importance primordiale pour la croissance des plantes. La couche de surface influence la germination, est le

siège de l’activité biologique, renferme une grande partie des réserves en nutriments, contient une proportion

majeure des racines et est un facteur déterminant pour les pratiques de travail du sol. Par contre, les

propriétés dynamiques de la couche de surface, lesquelles varient avec les pratiques de gestion, ne sont

généralement pas considérées dans les systèmes de classification. La priorisation des caractéristiques stables

du sous-sol et la sélection génétiquement inspirée des critères de différenciation des taxons ne relèvent pas

nécessairement de l’utilisation des sols. Des systèmes de classification technique comme le Fertility Capability

Soil Classification (Buol et al. 1975; Sanchez et al. 2003) ont été conçus afin de combiner une partie des

données contenues dans les études de sols avec des informations additionnelles relevant directement de la

croissance des plantes (Dudal 2003). L’Organisation des Nations Unies pour l'alimentation et l'agriculture

(FAO 1998) a proposé un complément au système de référence mondiale de la ressource sols afin de prendre

en considération les caractéristiques de la couche de sols de surface liées à la fertilité des sols et à leur

aménagement. À travers le monde, des regroupements de sols relatifs au potentiel de libération du potassium

basés sur le matériau (Heming 2004), la minéralogie (Srinivasa Rao et al. 2010; Britzke et al. 2012) ou les

réserves non échangeables de potassium (Andrist-Rangel et al. 2010) ont été proposés afin d’orienter les

recommandations en fertilisants. Au Québec, différents regroupements ont été proposés avec des objectifs

spécifiques comme la fertilité (Nolin et al. 1989), la gestion du phosphore (Leclerc et al. 2001; Vézina et al.

2000) ou le comportement hydrologique (Gagné et al. 2013).

Pour déterminer le potentiel de production des sols, il est nécessaire de porter une attention spéciale aux

conditions climatiques. Comme les sols avec des caractéristiques morphologiques et chimiques similaires

peuvent se retrouver sous différents climats, certains systèmes de classification ont introduit le facteur

climatique à différents niveaux de généralisation (p. ex. Soil Survey Staff 2014). Le nombre de subdivisions

climatiques doit nécessairement être limité pour que le nombre de taxons du système de classification soit

gérable. Or les données climatiques trop générales comme les moyennes annuelles d’humidité du sol et de

température ne fournissent pas assez d’information pour être associées à des sites spécifiques. Ainsi, la

superposition de données climatiques plus détaillées, indépendamment de la classification des sols, est à

privilégier (Dudal 2003).

13

1.3.2 La classification numériques des sols

La classification numérique des sols relève du domaine de la pédométrie qui est définie comme l’utilisation de

méthodes quantitatives (mathématiques, statistiques) pour étudier la distribution et la genèse des sols et

comme moyen de préserver la ressource sol (McBratney et Minasny 2010). La classification numérique a

émergé au cours des années 1950 et 1960. Le mouvement était alors essentiellement mené par les

biologistes (Sneath et Sokal 1973). À cette période, les algorithmes de classification de type hiérarchique n’ont

pas bien servi les pédologues puisque les populations de données de sols ne présentent pas ce mode

d’organisation (Henderson et al. 2008). Ces techniques étaient principalement appliquées à des études

locales de petite étendue (Moore et Russell 1966; Webster et Burrough 1972). Depuis, d’autres méthodes de

classification numérique non-hiérarchique comme les k-moyennes (k-means) ont été appliquées avec plus de

succès (De Gruijter et al. 1997; Triantafilis et al. 2001; Verheyen et al. 2001). Ces approches numériques

permettent le traitement simultané d’un grand nombre de propriétés de sol sans avoir à identifier des

propriétés à ordonner à divers niveaux d’une structure taxonomique (Powell et al. 1992). Avec l’avènement

d’ordinateurs très performants ayant de grandes capacités d’emmagasinage et le développement des

technologies d’observation et des techniques de traitement de données stimulant l’intérêt pour la quantification

des données de sol, la taxonomie numérique a un avenir prometteur (Buol et al. 2011; Hartemink et Minasny

2014; Brevik et al. 2015).

Selon Arkley (1976), le processus de classification numérique comprend quatre étapes principales : la

sélection et la standardisation des données, les mesures de similarité, les stratégies de tri et la présentation

des classes. Bien que la sélection et le codage des propriétés demeurent subjectifs (Simpson 1964), les

modèles de classification mathématique sont considérés moins intuitifs que les modèles mentaux du

pédologue (Grunwald 2006) puisque les classes sont établies sur de l’information mesurée et des critères

statistiques objectifs (McBratney et Minasny 2010). D’après Yaalon (2008), les méthodes d’assistance par

ordinateur seraient plus applicables à la cartographie des sols et elles ne seraient pas susceptibles de

remplacer les systèmes exhaustifs de classification des sols permettant les interprétations et les prédictions à

des fins générales. Néanmoins, des travaux récents témoignent du potentiel de la taxonomie numérique aux

échelles régionale et nationale (Odgers et al. 2011a; Odgers et al. 2011b; Láng et al. 2013, Hughes et al.

2014; Brevik et al. 2015) et un groupe de travail au sein de l’International Union of Soil Science est à l’œuvre

pour proposer un système universel de classification des sols basé sur les principes de classification

numérique (Hempel et al. 2013). Par ailleurs, la classification numérique pourrait être utilisée pour faire le pont

avec les systèmes taxonomiques nationaux ou pour des schémas de classification alternatifs de mêmes

données sous-jacentes, mais directement orientés vers des buts spécifiques (p. ex. générer un schéma de

classification appuyant la prise de décision pour la prévention de l’érosion) (Beaudette et al. 2013).

14

1.4 Mesure et analyse de données numériques de sols

1.4.1 La spectroscopie infrarouge

La spectroscopie infrarouge (IR) est une technique permettant l’observation quantitative in situ et ex situ des

sols, notamment associée au développement de la morphométrie et de la cartographie numérique (Hartemink

et Minasny 2014; Brevik et al. 2015), qui a connu un intérêt croissant au cours des deux dernières décennies

(Stenberg et al. 2010). La spectroscopie IR par réflectance diffuse est une technique analytique rapide et peu

coûteuse (la préparation d’échantillon implique uniquement un séchage et un broyage, la mesure ne prend

que quelques secondes et plusieurs propriétés peuvent être estimées à partir d’un même spectre),

respectueuse de l’environnement (aucun produit chimique potentiellement dangereux n’est requis), non

destructive (l'intégrité de l’échantillon n’est pas affectée par l'analyse), reproductible (les résultats sont

similaires dans des milieux différents - équipement, manipulateur) et répétable (les résultats sont similaires

pour un même milieu) (Viscarra Rossel et al. 2006; Soriano-Disla et al. 2014). Cette méthode permet la

caractérisation indirecte d’une multitude de propriétés chimiques, physiques et biologiques des sols incluant le

contenu en C organique, la distribution granulométrique, la capacité d’échange cationique, les macro et

microéléments, la capacité tampon et autres (Soriano-Disla et al. 2014). Les propriétés prédites par

spectroscopie IR peuvent ensuite appuyer le développement d’équations de pédotransfert (McBratney et al.

2006). La spectroscopie IR permet également la classification d’horizons et de classes de sols (Viscarra

Rossel et Webster 2011; Steffens et Buddenbaum 2013; Demattê et Terra 2014; Vasques et al. 2014).

La radiation IR couvre l’intervalle du spectre électromagnétique entre 780 nm et 1 mm et se subdivise en trois

principales régions : le proche (NIR; 780 à 2500 nm), le moyen (MIR; 2500 to 25 000 nm) et le lointain IR (FIR;

25 000 à 1 000 000 nm) (Nocita et al. 2015). Le principe de la spectroscopie IR est que le vecteur électrique

de la radiation IR interagit avec les liaisons interatomiques des molécules pour activer différentes vibrations

(étirement, flexion et torsion des atomes constituant les molécules) résultant d’une absorption de la radiation

IR. Cette absorption est due aux vibrations des liaisons moléculaires covalentes comme O-H, C-H, N-H, C=O,

C-N, N-O ou C=C, des groupes d’atomes nommés chromophores (Ben Dor et al. 1999; Stuart 2004). Dans les

régions MIR et FIR, l’absorption est due aux vibrations fondamentales (vibrations excitées lorsqu'un quantum

d'énergie est absorbé par la molécule dans son état fondamental) des composés organiques ou minéraux.

Dans la région NIR, l’absorption est due aux harmoniques (vibrations excitées lorsque deux quanta ou plus

sont absorbés) et aux combinaisons de vibrations fondamentales (lorsque plus de deux vibrations

fondamentales sont excités simultanément) des composés chimiques localisés dans l’eau libre, les inter-

couches des minéraux argileux et la matière organique (Janik et al. 1998). Dans la région MIR, à cause des

fortes vibrations fondamentales, la signature spectrale des sols est caractérisée par des pics clairement

identifiables (les faibles harmoniques et combinaisons de bandes dans le NIR peuvent se chevaucher) qui

15

fournissent plus d’information que la seule région NIR (la région NIR est insensible au quartz) et qui

conséquemment donnent des prédictions généralement plus précises et reproductibles (Soriano-Disla et al.

2014; Nocita et al. 2015; Terra et al. 2015). Au-delà de 10 000 nm, les différents pics sont difficiles à

interpréter puisque le spectre résulte d'un mélange de composés organiques et minéraux avec des

chevauchements de l’information spectrale (Calderon et al. 2011).

La prédiction des propriétés de sols qui sont directement reliées aux composés actifs soumis à la radiation IR

(p. ex. C organique, carbonates) est généralement meilleure que celle des autres propriétés pour lesquelles le

mécanisme de prédiction est relié à leur corrélation avec les chromophores (Nocita et al. 2015). La relation

complexe entre la signature spectrale et une propriété de sol est modélisée à l’aide de méthodes statistiques

(techniques de chimiométrie) comme la régression multiple, la régression des moindres carrés partiels

(méthode la plus communément utilisée) ou les techniques d’apprentissage automatique comme la machine à

vecteurs de support, les arbres de régression boostés ou les réseaux de neurones artificiels (Viscarra Rossel

et Behrens 2010; Soriano-Disla et al. 2014). Par ailleurs, un algorithme permet désormais de retirer l’effet de

la teneur en eau des lectures spectrales prises directement au champ sans séchage préalable (Minasny et al.

2011). Néanmoins, les calibrations spectroscopiques sont empiriques et elles ne peuvent donc pas produire

des prédictions précises pour les échantillons qui ne sont pas représentés dans les librairies spectrales de

calibration (Viscarra Rossel et al. 2008; Ramirez-Lopez et al. 2014). L’harmonisation des protocoles et la mise

en place de librairies sont des enjeux majeurs pour la démocratisation de la spectroscopie dans les

laboratoires et les champs aux échelles régionale, nationale et internationale (Soriano-Disla et al. 2014; Nocita

et al. 2015).

Une application directe de la spectroscopie IR serait la mesure du besoin en chaux afin de remplacer les

méthodes humides par solutions tampons qui contiennent souvent des contaminants (Sikora, 2006) et qui ont

montré avoir une faible sensibilité pour les sols sableux souvent faiblement tamponnés, comme ceux utilisés

en production de pommes de terre (McLean, 1982; Alabi et al. 1986; Godsey et al. 2007).

1.4.2 L’analyse de données compositionnelles

En science des sols comme dans les sciences sociales et naturelles, de nombreuses données sont de nature

compositionnelle c’est-à-dire qu’elles sont des fractions contraintes entre 0 et une unité de mesure (p. ex. 1,

100%, 1000 g kg-1 ou 106 mg kg-1). L’analyse de cette information relative par des méthodes statistiques

conventionnelles peut être biaisée par i) la redondance, car une composition peut être déduite par différence

entre l’unité de mesure et les autres proportions, ii) le biais négatif, car une covariance est forcée d’être

négative (fausse corrélation), iii) la dépendance d’échelle, car l’expression des proportions sur différentes

échelles de mesure (p. ex. base massique sèche ou humide) peut entraîner des résultats différents et des

16

interprétations contradictoires (incohérence de sous-composition) et iv) la distribution non-normale inhérente,

car l’occurrence des proportions est contrainte entre 0 et l’unité de mesure plutôt que dans l’espace des

nombres réels (±∞) (Aitchison 1986; Filzmoser et al. 2009; Filzmoser et Hron 2011). Les transformations log-

ratio additif (alr) et log-ratio centré (clr) développées par Aitchison (1986) permettent d’éviter les biais négatifs

et d’exprimer l’information compositionnelle dans l‘espace des nombres réels. La transformation log-ratio

isométrique (ilr) proposée par Egozcue et al. (2003) permet également d’éviter la redondance (singularité de la

matrice de corrélation) par l’expression de D composantes dans un espace Euclidien de D-1 log-contrastes

orthogonaux. Une partition binaire séquentielle qui consiste à une division séquentielle de la composition en

groupes de parties identifiées par (+) et (–) jusqu’à ce que tous les groupes soient constitués d’une seule

fraction donne la structure au calcul des ilrs selon l’équation suivante (Egozcue and Pawlowsky-Glahn, 2006) :

𝑖𝑙𝑟i = √risi

ri+si𝑙𝑛

g(x+)

g(x−) (1.1)

où ilri est la ie balance, où i Є [1, D-1], ri et si sont le nombre de composantes dans les groupes (+) et (–) et

g(x) la moyenne géométrique des composantes dans les groupes (+) et (–), respectivement.

Selon le principe de travailler en système de coordonnées (Mateu-Figueras et al. 2011), le simplex d’une

composition peut être projeté selon D-1 coordonnées ortho-normales par la transformation ilr, les statistiques

conventionnelles calculées à l’intérieur de l’espace des nombres réels (±∞) et les résultats exprimés dans le

domaine original par transformation inverse des coordonnées ilr (Figure 1.4).

En sciences du sol, l’analyse de données compositionnelles a notamment été appliquée à la classification et à

la cartographie (McBratney et al. 1992; Walvoort et De Gruijter 2001; Lark et Bishop 2007), au diagnostic

nutritif des plantes et des sols (Parent et Dafir 1992; Parent et al. 2012b; Parent et al. 2013), à la composition

de la solution du sol (Parent et al. 1997), à la distribution des agrégats (Parent et al. 2012a), à la dynamique

de l‘azote et du phosphore (Parent et al. 1992; Parent et al. 2000; Duguet et al. 2006; Leblanc et al. 2013) et

aux formes de phosphore (Parent et al. 2014; Abdi et al. 2015).

17

Figure 1.4. Principe de travailler en système de coordonnées (adapté de Pawlowsky-Glahn et al. 2011). Les données de base (points noirs) sont transformées en coordonnées ilr, les statistiques (ellipse bleue couvrant 95% de la distribution théorique autour du centroïde) sont calculées et exprimées dans l’espace compositionnel par transformation retour des ilrs.

18

1.5 Hypothèses et objectifs Il se dégage de cette revue de littérature que l’influence des pratiques culturales sur le rendement et la qualité

des cultures dépend de la classification pédologique et de la qualité des sols. Les hypothèses spécifiques

suivantes furent posées :

1. La spectroscopie infrarouge permet une prédiction précise de la capacité d’échange cationique (CEC)

en appui à un modèle de recommandation des besoins des sols en chaux. Cette hypothèse est vérifiée au

chapitre 2.

2. Des critères diagnostics du degré d’agrégation des sols de la couche de surface peuvent être

développés en fonction de la texture et du contenu en matière organique du sol et des pratiques culturales.

Cette hypothèse est vérifiée au chapitre 3.

3. La classification numérique permet le regroupement des séries de sols associées à la culture de la

pomme de terre à partir des propriétés morphologiques des profils de sols modaux. Cette hypothèse est

vérifiée au chapitre 4.

L’objectif général de la thèse est d’élaborer des modèles et des consignes, et de proposer des moyens de

relier les propriétés et les classes pédologiques aux pratiques de fertilisation et de conservation des sols,

notamment sous culture de pommes de terre. Les objectifs spécifiques sont de:

1. Élaborer des équations et des règles de pédotransfert et des critères permettant de mesurer la qualité

des sols de la couche de surface (acidité et structure). Il s’ensuit (1) un modèle de chaulage adapté aux

sols sableux acides sous culture de pommes de terre, et (2) la mesure non biaisée du degré d’agrégation

du sol par rapport à une composition de référence pour guider la gestion durable des sols sous culture de

pommes de terre.

2. Proposer un modèle de classification numérique des séries de sols basé sur le degré d’association entre

les caractéristiques morphologiques du profil. Il s’ensuit l’inclusion des profils pédologiques comme

variables synthétiques continues ou catégoriales dans les bases de données massives sur les essais de

fertilisation de la pomme de terre permettant la modélisation de la réponse de la culture à l‘ajout de

fertilisants (N, P, K) compte tenu de quelques propriétés du sous-sol en interaction avec les conditions

climatiques et les pratiques culturales.

19

1.6 Bibliographie Abdi, D., Cade-Menun, B.J., Ziadi, N. et Parent, L.E. 2015. Compositional statistical analysis of soil 31P-NMR forms. Geoderma, 257-258: 40-47.

Adhikari, K. et Hartemink, A.E. 2016. Linking soils to ecosystem services - A global review. Geoderma, 262: 101-111.

Ahrens, R.J. et Arnold, R.W. 2000. Soil taxonomy. Pages E117-E135 dans M.E. Sumner (éd.), Hanbook of soil science. CRC Press, Boca Raton, Florida, USA.

Aitchison, J. 1986. The statistical analysis of compositional data. Chapman and Hall, London. 416 p.

Alabi, K.E., Sorensen, R.C., Knudsen, D. et Rehm, G.W. 1986. Comparison of several lime requirement methods on coarse-textured soils of northeastern Nebraska. Soil Science Society of America Journal, 50: 937-941.

Amundson, R. et Jenny, H. 1991. The place of humans in the state factor theory of ecosystems and their soils. Soil Science, 151: 99-109.

Andersson, A., Eriksson, J. et Mattsson, L. 1998. Phosphorus accumulation in Swedish agricultural soils [en suèdois]. Institutionen för markvetenskap Sveriges Lantbruksuniversitet, Uppsala, Sverige. 34 p.

Andrew, S.S., Karlen, D.L. et Cambardella, C.A. 2004. The soil management assessment framework: A quantitative soil quality evaluation method. Soil Science Society America Journal, 68: 1945-1962.

Andrist-Rangel, Y., Hillier, S., Öborn, I., Lilly, A., Towers, W., Edwards, A.C. et Paterson, E. 2010. Assessing potassium reserves in northern temperate grassland soils: A perspective based on quantitative mineralogical analysis and aqua-regia extractable potassium. Geoderma, 158: 303-314.

Arkley, R.J. 1976. Statistical methods in soil classification research. Advances in Agronomy, 28: 37-70.

Arnold, R.W. 2005. Classification of soils. Pages 204-210 dans R. Lal (éd.), Encyclopedia of Soil Science, Second edition. Taylor and Francis, New York., USA.

Arnold R.W. 2006. Soil survey and soil classification. Pages 37-59 dans S. Grunwald (éd.), Environmental soil-landscape modeling: Geographic information technologies and pedometrics. CRC Press, Boca Raton, Florida.

Bacon-Shone, J. 2011. A short history of compositional data analysis. Pages 3-11 dans V. Pawlowsky-Glahn and A. Buccianti (éds.), Compositional Data Analysis: Theory and Applications, John Wiley and Sons, New-York.

Batey T. 2000. Soil profile description and evaluation. Pages 595-628 dans K.A. Smith et C.E. Mullins (éds.), Soil and environmental analysis: physical methods 2nd Edition. Marcel Dekker, Inc., New York.

Beaudette D.E., Roudier, P. et O’Geen, A.T. 2013. Algorithms for quantitative pedology: A toolkit for soil scientists. Computers and Geosciences, 52: 258-268.

Ben Dor, E., Irons, J.R. et Epema, J.F. 1999. Soil reflectance. Pages 111-188 dans R.A. Ryerson (éd.), Remote sensing for the earth sciences. John Wiley & Sons, New York.

20

Blanco-Canqui, H. et Lal, R. 2008. Principles of soil conservation and management. Springer, New York. 617 p.

Blum, W.E.H. 1997. Basic concepts: Degradation, resilience, and rehabilitation. Pages 1-16 dans R. Lal, W.E.H. Blum, C. Valentine et B.A. Stewart (éds.), Methods for assessment of soil degradation. CRC Press, Boca Raton, New York.

Bockheim, J.G., Gennadiyev, A.N., Hartemink, A.E. et Brevik, E.C. 2014. Soil-forming factors and Soil Taxonomy. Geoderma, 226-227: 231-237.

Boizard, H., Batey, T., McKenzie, D., Richard, G., Roger-Estrade, J., Ball, B., Bradley, I., Cattle, S., Hasinger, G., Munkholm, L., Murphy, B., Nievergelt, J., Peigné, J. et Shepherd, G. 2005. Field meeting “Visual soil structure assessment”. Detailed report. INRA et ISTRO.

Bolinder, M.A., Angers, D.A., Gregorich, E.G. et Carter, M.R. 1999. The response of soil quality indicators to conservation management. Canadian Journal of Soil Science, 79: 37-45.

Bouma, J. 1989. Using soil survey data for quantitative land evaluation. Advances in Soil Science, 9: 177-213.

Brevik, E.C., Calzolari, C., Miller, B.A., Pereira, P., Kabala, C., Baumgarten, A et Jordán, A. 2015. Soil mapping, classification, and pedologic modeling: History and future directions. Geoderma, (in press).

Britzke, D., Silva, L.S., Moterle, D.F., Rheinheimer, D.S. et Bortoluzzi, E.C. 2012. A study of potassium dynamics and mineralogy in soils from subtropical Brazilian lowlands. Journal of Soils and Sediments, 12:185-197.

Buol, S.W., Sanchez, P.A., Cate, R.B. et Granger, M.A. 1975. Soil fertility capability classification system: A technical soil classification system for fertility management. Pages 126-141 dans E. Bornemisza et A. Alvarado (éds.), Soil Management in Tropical America. North Carolina State University, Raleigh.

Buol, S.W. 2003. Philosophies of soil classifications: From is to does. Pages 3-10 dans H. Eswaran, T. Rice, R. Ahrens et B. Stewert (éds.) Soil Classification: A Global Desk Reference CRS Press LLC.

Buol, S.W., Southard, R.J., Graham, R.C. et McDaniel, P.A. 2011. Soil genesis and classification, Sixth Edition. John Wiley & Sons, Chichester, West Sussex, UK. 543 p.

Calderon, F.J., Reeves III, J.B., Collins, H.P. et Eldor, P.A. 2011. Chemical differences in soil organic matter fractions determined by diffuse-reflectance mid-infrared spectroscopy. Soil Science Society America Journal, 75: 568-579.

Carter, M.R. 1990. Relationship of strength properties to bulk density and macroporosity in cultivated loamy sand to loam soils. Soil and Tillage Research, 15: 257-268.

Carter, M.R., Gregorich, E.G., Anderson, D.W., Doran, J.W., Janzen, H.H. et Pierce, F.J. 1997. Concepts of soil quality and their significance. Pages 1-19 dans E.G. Gregorich et M.R. Carter (éds.), Soil quality for crop production and ecosystem health. Developments in soil science. Elsevier, Amsterdam.

Cline, M.G. 1949. Basic principles of soil classification. Soil Science, 67: 81–91.

Craul, P.J. 1999. Urban soils: Applications and practices. Wiley, Toronto. 384 p.

21

Day, G.H. (éd.) 1982. The Canada soil information system (CanSIS). Manual for describing soils in the field. Revised. Agriculture Canada, Ottawa, Ontario.

De Gruijter, J.J., Walvoort, D.J.J. et van Gaans, P.F.M. 1997. Continuous soil maps - a fuzzy set approach to bridge the gap between aggregation levels of process and distribution models. Geoderma, 77: 169-195.

Demattê, J.A.M. et Terra, F.S. 2014. Spectral pedology: A new perspective on evaluation of soils alongpedogenetic alterations. Geoderma, 217-218: 190-200.

Doran J.W. et Parkin, T.B. 1994. Defining and assessing soil quality. Pages 3-21 dans J.W. Doran, D.C. Coleman, D.F Bezdicek et B.A. Stewart (éds.), Defining soil quality for a sustainable environment. Special publication number 35, American Society of Agronomy, Madison, Wisconsin, USA.

Doran, J.W. et Parkin, T.B. 1996. Quantitative indicators of soil quality: A minimum data set. Pages 25-37 dans A.J. Jones et J.W. Doran (éds.), Methods for Assessing Soil Quality. SSSA Special Publication Number 49. Soil Science Society of America, Madison, USA.

Drewry, J.J., Cameron, K.C. et Buchan, G.D. 2008. Pasture yield and soil physical property responses to soil compaction from treading and grazing-a review. Australian Journal of Soil Research, 46: 237–256.

Du, C. et Zhou, J. 2009. Evaluation of soil fertility using infrared spectroscopy: A review. Environmental Chemistry Letters, 7: 97-113.

Dudal, R., Nachtergaele, F. et Purnell, M. 2002. The human factor of soil formation. Symposium 18, Vol. II, paper 93. Transactions 17th World Congress of Soil Science, Bangkok.

Dudal, R. 2003. How good is our soil classification. Pages 11-18 dans H. Eswaran, T.J. Rice, R. Ahrens, B.A. Stewart (éds.), Soil classification: A global desk reference. CRC Press, Boca Raton, Florida.

Duguet, F., Parent, L.E., Ndayegamiye, A. 2006. Compositional indices of net nitrification in cultivated organic soils. Soil Science 171, 886-901.

Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G. et Barceló-Vidal. C. 2003. Isometric logratio transformations for compositional data analysis. Mathematical Geology, 35: 279-300.

Egozcue, J.J. et Pawlowsky-Glahn, V. 2006. Simplicial geometry for compositional data. Geological Society. London, Special Publications, 264: 145-159.

FAO. 1998. Topsoil characterization for sustainable land management, draft. Land and Water Development Division, Soil Resources, Management and Conservation Service, Rome, 71 p.

Filzmoser, P., Hron, K. et Reimann, C. 2009. Univariate statistical analysis of environmental (compositional) data: problems and possibilities. Science of Total Environment, 407: 6100-6108.

Filzmoser, P et Hron, K. 2011. Robust statistical analysis, Pages 57-72 dans V. Pawlowsky-Glahn et A. Buccianti (éds.), Compositional data analysis: Theory and applications, John Wiley and Sons, New-York.

Gagné G., Beaudin, I., Leblanc, M., Drouin, A., Veilleux, G., Sylvain, J.D. et Michaud, A.R. 2013. Classement des séries de sols minéraux du Québec selon les groupes hydrologiques. Institut de recherche et de développement en agroenvironnement inc. (IRDA), Québec, Canada. 81 p.

22

Godsey, C.B., Pierzynski, G.M., Mengel, D.B. et Lamond, R.E. 2007. Evaluation of common lime requirement methods. Soil Sci. Soc. Am. J. 71: 843-850.

Greenland, D.J., 1981. Soil management and soil degradation. Journal of Soil Science, 32: 301-322.

Gregorich, E.G., Turchenek, L.W., Carter, M.R. et Angers, D.A. 2001. Soil and environmental science dictionary. CRC Press, Boca Raton, FL. 577 p.

Grossman, R.B., Harms, D.S., Seybold, C.A. et Herrick, J.E. 2001. Coupling use-dependant and use-invariant data for soil quality evaluation in the United States. Journal of soil and water conservation, 56: 63-68.

Groupe de travail sur la classification des sols (GTCS). 2002. Le système canadien de classification des sols, 3e édition. Direction générale de la recherche, Ministère de l’Agriculture et Agro-Alimentaire Canada. Publication 1646, Ottawa (ON). 196 p.

Grunwald, S. 2006. What do we really know about the space-time continuum of soil-landscape? Pages 3-36 dans S. Grunwald (éd.), Environmental soil-landscape modeling: Geographic information technologies and pedometrics. CRC Press, Boca Raton, Florida.

Gugino, B.K., Idowu, O.J., Schindelbeck, R.R., van Es, H.M., Wolfe, D.W., Moebius-Clune, B.N., Thies, J.E. and Abawi, G.S. 2009. Cornell soil health assessment training manual, Edition 2.0, Cornell University, Geneva, NY. 57 p.

Hartemink, A.E. et Minasny, B. 2014. Towards digital soil morphometrics. Geoderma, 230–231: 305–317.

Heming, S.D. 2004. Potassium balances for arable soils in southern England 1986-1999. Soil Use and Management, 20: 410-417.

Hempel, J., Micheli, E., Owens, P. et McBratney, A. 2013. Universal soil classification system report from the International Union of Soil Science working group. Soil Horizons, 54.

Henderson, B.L, Webster, R et McKenzie, N.J. 2008. Statistical analysis. Pages 327-347 dans N.J. McKenzie, M.J. Grundy, R. Webster, AJ. Ringrose-Voase (éds.), Guidelines for surveying soil and land resources (2nd Edition), Australian Soil and Land Survey Handbooks Series.

Hughes, P.A., McBratney, A.B., Minasny, B. et Campbell, S. 2014. End members, end points and extragrades in numerical soil classification. Geoderma 226-227, 365–375.

Idowu, O.J., van Es, H.M., Abawi, G.S., Wolfe, D.W., Ball, J.I., Gugino, B.K., Moebius, B.N., Schindelbeck, R.R. et Bilgili, A.V. 2008. Farmer-oriented assessment of soil quality using field, laboratory, and VNIR spectroscopy methods. Plant Soil, 307: 243-253.

Isbell, R.F. 1996. The Australian soil classification. CSIRO Publishing, Collingwood, Victoria, Australia.

IUSS Working Group WRB. 2014. World reference base for soil resources 2014. International soil classification system for naming soils and creating legends for soil maps. World Soil Resources Report No. 106. FAO, Rome. 182 p.

Janik, L.J., Merry, R.H. et Skjemstad, J.O. 1998. Can mid infrared diffuse reflectance analysis replace soil extractions? Australian Journal of Experimental Agriculture, 38: 681-696.

23

Jenny, H. 1941. Factors of soil formation. A system of quantitative pedology. McGraw-Hill Book Company, New York, USA. 281 p.

Jones, R.J.A., Spoor, G. et Thomasson, A.J., 2003. Vulnerability of subsoils in Europe to compaction: A preliminary analysis. Soil and Tillage Research, 73: 131-143.

Karlen, D.L. et Stott, D.E. 1994. A framework for evaluating physical and chemical indicators of soil quality. Pages 53–72 dans J.W. Doran, D.C. Coleman, D.F Bezdicek et B.A. Stewart (éds.), Defining soil quality for a sustainable environment. Special publication number 35, American Society of Agronomy, Madison, Wisconsin.

Karlen, D.L., Mausbach, M.J., Doran J.W., Cline R.G., Harris R.F et Schuman, G.E. 1997. Soil quality: a concept, definition, and framework for evaluation. Soil Science Society of America Journal, 6: 4-10.

Karlen, D.L., Andrews, S.S. et Doran, J.W. 2001. Soil quality: Current concepts and applications. Advances in Agronomy, 74: 1-40.

Karlen, D.L., Ditzler, C.A. et Andrews, S.S. 2003. Soil quality: Why and how? Geoderma, 114: 145-156.

Karlen, D.L., Hurley, E.G., Andrews, S.S., Cambardella, C.A., Meek, D.W., Duffy, M.D. et Mallarino, M.P. 2006. Crop rotation effects on soil quality at three northern corn/soybean belt locations. Agronomy Journal, 98: 484-495.

Kaufmann, M., Tobias, S. et Schulin, R. 2009. Quality evaluation of restored soils with a fuzzy logic expert system. Geoderma, 151: 209-302.

Kaufmann, M., Tobias, S. et Schulin, R. 2010. Comparison of critical limits for crop plant growth based on different indicators for the state of soil compaction. Journal of Plant Nutrition and Soil Science, 173: 573-583.

Kay, B.D. 1990. Rates of change of soil structure under different cropping systems. Advances in Soil Science, 12: 1-52.

Lal, R. et Stewart, B.A. 2013. Soil management for sustaining ecosystem services. Pages 521-536 dans R. Lal et B.A. Stewart (éds.), Principles of sustainable soil management in agroecosystems. Advances in Soil Science, CRC Press, Boca Raton, Florida, USA.

Láng, V., Fuchs, M., Waltner, I., et Michéli, E. 2013. Soil taxonomic distance, a tool for correlation: As exemplified by the hungarian brown forest soils and related WRB reference soil groups. Geoderma, 192, 269–276.

Lark, R.M. et Bishop, T.F.A. 2007. Cokriging particle size fractions of the soil. European Journal of Soil Science, 58: 763-774.

Larson, W.E. et Pierce, F.J. 1994. The dynamics of soil quality as a measure of sustainable management. Pages 37-51 dans J.W. Doran, D.C. Coleman, D.F Bezdicek et B.A. Stewart (éds.), Defining soil quality for a sustainable environment. Special publication number 35, American Society of Agronomy, Madison, Wisconsin.

Leblanc, M.A., Parent, L.E. et Gagné, G. 2013. Phosphate and nitrate release from mucky mineral soils. Open Journal of Soil Science, 3: 107-114.

Leclerc, M.L., Nolin, M.C., Cluis, D. et Simard, R.R. 2001. Grouping soils of the Montreal Lowlands (Québec) according to fertility and sorption and desorption characteristics. Canadian Journal of Soil Science, 81: 71-83.

24

Letey, J., Sojka, R.E., Upchurch, D.R., Cassel, D.K., Olson, K.R., Payne, W.A., Petrie, S.E., Price, G.H., Reginato, R.J., Scot, H.D., Smethurst, P.J., Triplett, G.B. 2003. Deficiencies in the soil quality concept and its application. Journal of Soil and Water Conservation, 58: 180-187.

Liebig, M.A., J.W. Doran, 1999. Evaluation of point-scale assessment of soil quality. Journal of Soil and Water Conservation, 54: 510-518.

MacEwan, R.J. 1997. Soil quality indicators: pedological aspects. Pages 143-166 dans E.G. Gregorich et M.R. Carter (éds.), Soil quality for crop production and ecosystem health. Developments in soil science. Elsevier, Amsterdam.

Mateu-Figueras, G., Pawlowsky-Glahn, V. et Egozcue, J.J. 2011. The principle of working on coordinates. Pages 31-42 dans V. Pawlowsky-Glahn et A. Buccianti (éds.), Compositional data analysis: Theory and applications. John Wiley and Sons, New York.

McBratney, A.B., De Gruijter, J.J. et Brus, D.J. 1992. Spacial prediction and mapping of continuous soil classes. Geoderma 54, 39-64.

McBratney, A.B. et Odeh, I.O.A., 1997. Application of fuzzy sets in soil science: Fuzzy logic, fuzzy measurements and fuzzy decisions. Geoderma, 77: 85–113.

McBratney, A.B., Minasny, B. et Viscarra Rossel, R. 2006. Spectral soil analysis and inference systems: A powerful combination for solving the soil data crisis. Geoderma, 136: 272-278.

McBratney, A.B. et Minasny, B. 2010. Pedometrics. Pages 1-5 dans R. Lal (éd.), Encyclopedia of Soil Science, Second edition. Taylor and Francis, New York., USA.

McBratney, A.B., Minasny, B., Wheeler, I., Malone, B.P. et van der Linden, D. 2012. Frameworks for digital soil assessment. Pages 9-14 dans B. Minasny, B.P. Malone et A.B. McBratney (éds.), Digital soil assessments and beyong, Taylor and Francis Group, London.

McKenzie, D.C. 2013. Visual soil examination techniques as part of a soil appraisal framework for farm evaluation in Australia. Soil and Tillage Research, 127: 26-33.

McLean, E.O. 1982. Soil pH and lime requirement. Pages 199-224 dans A.L. Page et al. (éds.), Methods of soils analysis, Part 2, Agronomy 9. Soil Science Society of America, Madison, WI.

Merrington, G., Fishwick, S., Barraclough, D., Morris, J., Preedy, N., Boucard, T., Reeve, M.J., Smith, P., Fang, C. 2006. The development and use of soil quality indicators for assessing the role of soil in environmental interactions. Environment Agency Science Report SC030265. 241 p.

Millennium Ecosystem Assessment. 2005. Ecosystem and human well-being - A Framework for assessment. Island Press, Washington, DC, USA.

Minasny, B., McBratney, A.B. et Salvador-Blanes, S. 2008. Quantitative models for pedogenesis: A review. Geoderma, 144: 140-157.

Minasny, B., McBratney, A.B., Bellon-Maurel, V., Roger, J.M., Gobrecht, A., Ferrand, L. et Joalland, S. 2011. Removing the effect of soil moisture from NIR diffuse reflectance spectra for the prediction of soil organic carbon. Geoderma 167-168: 118-124.

25

Moore, A.W. et Russell, J.S. 1966. Potential use of numerical analysis and Adansonian concepts in soil science. Australian Journal of Science, 29: 141–142.

Müeller, L., Schindler, U., Behrendt, A., Eulenstein, F., Dannowski, R., Schlindwein, S.L., Shepherd, T.G., Smolentseva, E., Rogasik, J., 2007. The Muencheberg soil quality rating (SQR). Field manual for detecting and assessing properties and limitations of soils for cropping and grazing. 102 p.

Nocita, M., Stevens, A., van Wesemael, B., Aitkenhead, M., Bachmann, M., Barthès, B., Ben Dor, E., Brown, D.J., Clairotte, M., Csorba, A., Dardenne, P., Demattê, J.A.M., Genot, V., Guerrero, C., Knadel, M., Montanarella, L., Noon, C., Ramirez-Lopez, L., Robertson, J., Sakai, H., Soriano-Disla, J.M., Shepherd, K.D., Stenberg, B., Towett, E.K., Vargas, R., et Wetterlind, J. 2015. Soil spectroscopy: An alternative to wet chemistry for soil monitoring. Advances in Agronomy, 132: 139-159.

Nolin, M.C., Wang, C. et Caillier, M.J. 1989. Fertility grouping of Montreal Lowlands soil mapping units based on selected soil characteristics of the plow layer. Canadian Journal of Soil Science, 69: 525-541.

Nolin, M.C., Wang, C. et Cossette, J.M. 1993. Évaluation et suivi de la qualité des sols en milieu agricole : approche canadienne. Agrosol, 5: 25-32.

Norfleet, M.L., Ditzler, C.A., Puckett, W.E. Grossman, R.B. et Shaw, J.N. 2003. Soil quality and its relationship to pedology. Soil Science, 168: 149-155.

Nortcliff, S. 2002. Standardisation of soil quality attributes. Agriculture, Ecosystems and Environment, 88:161–168.

Nortcliff, S. 2006. Classification: Need for systems. Pages 227-229 dans R. Lal (éd.), Encyclopedia of Soil Science, Second edition. Taylor and Francis, New York, USA.

Odgers, N.P., McBratney, A.B. et Minasny, B. 2011a. Bottom-up digital soil mapping. I. Soil layer classes. Geoderma, 163: 38-44.

Odgers, N.P., McBratney, A.B. et Minasny, B. 2011b. Bottom-up digital soil mapping. II. Soil series classes. Geoderma, 163: 30-37.

Parent, L.E. et Dafir, M. 1992. A theoretical concept of compositional nutrient diagnosis. Journal of American Society for Horticultural Science, 117: 239-242.

Parent, L.E., Sasseville, L., N’dayegamiye, A. et Karam, A. 1992. The P status of cultivated organic soils. Proceeding 8th International Peat Congress, Uppsala, Sweden, 2: 400-410.

Parent, L.E., Cissé, E.S., Tremblay N. et Bélair G. 1997. Row-centred logratios as nutrient indexes for saturated extracts of organic soils. Canadian Journal of Soil Science, 77: 571-578.

Parent, L.E., Viau, A.A. et Anctil, F. 2000. Nitrogen and phosphorus fractions as indicators of organic soil quality. Suo, 51: 71-81.

Parent, L.E. et Gagné G. (éds). 2010. Guide de référence en fertilisation. 2nd Ed., Centre de Référence en Agriculture et Agroalimentaire du Québec (CRAAQ), Québec.

Parent, L.E., de Almeida, C.X., Hernandes, A., Egozcue, J.J., Gülser, C., Bolinder, M.A., Kätterer, T., Andrén, O., Parent, S.É., Anctil, F., Centurion, J.F. et Natale, W. 2012a. Compositional analysis for an unbiased measure of soil aggregation. Geoderma, 179-180: 123-131.

26

Parent, S.É., Parent, L.E. Rozane, D.E. Hernandes, A. et Natale, W. 2012b. Nutrient balance as paradigm of plant and soil chemometrics. Pages 83-114 dans R.N. Issaka (éd.), Soil Fertility. InTech Publication, New-York. http://www.intechopen.com/books/soil-fertility.

Parent, S.É., Parent, L.E., Rozane D.E. et Natale, W. 2013. Nutrient balance ionomics: case study with mango (Mangifera indica). Frontiers Plant Science, 4: 449.

Parent, L.E., Parent, S.É. et Ziadi, N. 2014. Biogeochemistry of soil inorganic and organic phosphorus: A compositional analysis with balances. Journal of Geochemical Exploration, 141: 52-60.

Pawlowsky-Glahn, V., Egozcue, J.J. et Tolosana-Delgado, R. 2011. Lecture notes on compositional data analysis. University of Girona et Technical University of Catalonia, Espace.

Peigné, J, Vian, J.-F., Chrétien, O. et Gautronneau, Y. 2013. Le profil cultural : une méthode d’étude in situ de la structure des sols cultivés. Pages 103-118 dans D. Baize, O., Duval, et G. Richard (éds), Les sols et leurs structures. Observations à différentes échelles. Éditions Quae.

Powell, B., McBratney, A.B. et MacLeod, D.A. 1992. Fuzzy classification of soil profiles and horizons from the Lockyer Valley, Queensland, Australia. Geoderma, 52: 173-197.

Ramirez-Lopez, L., Schmidt, K., Behrens, T., van Wesemael, B., Demattê, J.A.M. et Scholten. T. 2014. Sampling optimal calibration sets in soil infrared spectroscopy. Geoderma, 226-227: 140-150.

Reinert, D.J., Albuquerque, J.A., Reichert, J.M., Aita, C. et Andrada, M.C.C. 2008. Bulk density critical limits for normal root growth cover crops. Revista Brasileira de Ciência do Solo, 32: 1805- 1816.

Reynolds, W.D., Drury, C.F., Yang, X.M., Fox, C.A., Tan, C.S. et Zhang, T.Q. 2007. Land management effects on the near-surface physical quality of a clay loam soil. Soil and Tillage Research 96: 316-330.

Reynolds, W.D., Drury, C.F., Yang, X.M. et Tan, C.S. 2008. Optimal soil physical quality inferred through structural regression and parameter interactions. Geoderma, 146: 466-474.

Romig, D.E., Garlynd, M.J. et Harris, R.F. 1996. Farmer-based assessment of soil quality: A soil health scorecard. Pages 39-60 dans J. Jones et J.W. Doran (éds.), Methods for assessing soil quality. SSSA Special Publication Number 49. Soil Science Society of America, Madison, USA.

Sanchez, P.A., Palm, C. A. et Buol S.W. 2003. Fertility capability soil classification: A tool to help assess soil quality in the tropics. Geoderma, 114: 157-185.

Schipper, L.A. et Sparling, G.P. 2000. Performance of soil condition indicators across taxonomic groups and land uses. Soil Science Society of America Journal, 64: 300-311.

Schjønning, P., Heckrath, G. et Christensen, B.T. 2009. Threats to soil quality in Denmark: A review of existing knowledge in the context of the EU soil thematic strategy. DJF Report plant science No. 143. Aarhus University, Tjele, Denmark.

Seybold, C.A., Herrick, J.E. et Brejda, J.J. 1999. Soil resilience: a fundamental component of soil quality. Soil Science, 164: 224-234.

Shukla, M.K, Lal, R et Ebinger, M. 2004. Soil quality indicators for the north Appalachian experimental watersheds in Coshocton, Ohio. Soil Science, 169: 195-205.

27

Shukla, M.K., Lal, R. et Ebinger, M. 2006. Determining soil quality indicators by factor analysis. Soil and Tillage Research, 87: 194-204.

Sikora, F.J. 2006. A buffer that mimics the SMP buffer for determining lime requirement of soil. Soil Science Society of America Journal, 70: 474-486.

Simfukwe, P., Hill, P.W., Emmett, B.A. et Jones, D.L. 2011. Soil classification provides a poor indicator of carbon turnover rates in soil. Soil Biology and Biochemistry, 43: 1688-1696.

Simpson, G.G. 1964. Numerical taxonomy and biological classification. Science, 144: 712-713.

Singer, M.J. et Ewind, S.A. 1998. Soil quality. Pages 271-278 dans M.E. Summer (éd.), Handbook of soil science. CRC Press, Boca Raton, Florida.

Sneath, P.H.A. et Sokal, R.R. 1973. Numerical taxonomy: The principles and practice of numerical classification. W.H. Freeman and Company, San Francisco. 573 p.

Soil Survey Staff. 2014. Keys to soil taxonomy. 12th edition. USDA-Natural Resources Conservation Service, Washington, DC. 360 p.

Sojka, R.E. et Upchurch, D.R. 1999. Reservations regarding the soil quality concept. Soil Science Society of America Journal, 63: 1039-1054.

Soriano-Disla, J.M., Janik, L.J., Viscarra Rossel, R.A., Macdonald, L.M. et McLaughlin, M.J. 2014. The performance of visible, near-, and mid-infrared reflectance spectroscopy for prediction of soil physical, chemical, and biological properties. Applied Spectroscopy Reviews, 49: 139-186.

Sparling, G., Lilburne, L. et Vojvodic-Vukovic, M. 2008. Provisional target for soil quality indicators in New Zealand. Landcare Research, Palmerston North, New-Zealand., 64 p.

Srinivasa Rao, C., Subba Rao, A., Rao, K.V., Venkataeswarulu, B. et Singh, A.K. 2010. Categorization of districts based on the non-exchangeable potassium: Implications in efficient K fertility management in Indian agriculture. Indian Journal of fertilisers, 6: 40-55.

Stalham, M.A., Allen, E.J. et Henry, F.X. 2005. Effects of soil compaction on potato growth and its removal by cultivation. Research Review, Ref. R261. British Potato Council, Oxford.

Steffens, M. et Buddenbaum, H. 2013. Laboratory imaging spectroscopy of a stagnic Luvisol profile - high resolution soil characterisation, classification and mapping of elemental concentrations. Geoderma, 195: 122-132.

Stenberg, B., Viscarra Rossel, R.A., Mouazen, A.M. et Wetterlind, J. 2010. Visible and near infrared spectroscopy in soil science. Advances in Agronomy, 107: 163-215.

Stockmann, U., Minasny, B. et McBratney, A.B. 2011. Quantifying processes of pedogenesis. Advances in Agronomy, 113: 1-71.

Stuart, B. 2004. Infrared spectroscopy: Fundamentals and applications. John Wiley & Sons, New York.

Tabi, M., Tardif, L., Carrier, D., Laflamme, G. et Rompré, M., 1990. Inventaire des problèmes de dégradation des sols agricoles du Québec. Rapport synthèse. MAPAQ, Québec, Canada.

28

Terra, F.S., Demattê, J.A.M. et Viscarra Rossel, R.A. 2015. Spectral libraries for quantitative analyses of tropical Brazilian soils: Comparing vis–NIR and mid-IR reflectance data. Geoderma, 255-256: 81-93.

Triantafilis, J., Ward, W.T., Odeh, I.O.A et McBratney, A.B. 2001. Creation and interpolation of continuous soil layer classes in the Lower Namoi Valley. Soil Science Society America Journal, 65: 403-413.

USDA. 1999. Soil quality test kit guide. United States Department of Agriculture (USDA), Natural Resources Conservation Service, Agricultural Research Service, Soil Quality Institute. 82 p.

USDA. 2001. Guidelines for soil quality assessment in conservation planning. United States Department of Agriculture (USDA), Natural Resources Conservation Service, Soil Quality Institute. 39 p.

Vasques, G.M., Demattê, J.A.M., Viscarra Rossel, R.A., Ramírez-López, L. et Terra, F.S 2014. Soil classification using visible/near-infrared diffuse reflectance spectra from multiple depths. Geoderma, 223-225: 73-78.

Velasquez, E., Lavelle, P. et Andrade, M. 2007. GISQ, a multifunctional indicator of soil quality. Soil Biology and Biochemistry, 39: 3066-3080.

Verheyen, K., Adriaens, D., Hermy, M. et Deckers, S. 2001. High-resolution continuous soil classification using morphological soil profile descriptions. Geoderma, 101: 31-48.

Vézina, L., Carrier, D., Giroux, M., Rompré, M., Laflamme, G. et Moreau, A. 2000. Proposition de regroupement des sols du Québec selon leur capacité de fixation du phosphore en relation avec leurs caractéristiques pédologiques. Agrosol, 11: 15-25.

Viscarra Rossel, R.A., Walvoort, D.J.J., McBratney, A.B., Janik, L.J. et Skjemstad, J.O. 2006. Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties. Geoderma, 131: 59-75.

Viscarra Rossel, R.A., Jeon, Y.S., Odeh, I.O.A. et McBratney, A.B. 2008. Using a legacy soil sample to develop a mid-IR spectral library. Australian Journal of Soil Research, 46: 1-16.

Viscarra Rossel, R.A. et Behrens, T. 2010. Using data mining to model and interpret soil diffuse reflectance spectra. Geoderma, 158: 46-54.

Viscarra Rossel, R.A. et Webster, R. 2011. Discrimination of Australian soil horizons and classes from their visible–near infrared spectra. European Journal of Soil Science, 62: 637-647.

Walvoort, D.J.J. et De Gruijter, J.J. 2001. Compositional kriging: A spatial interpolation method for compositional data. Mathematical Geology, 33: 951-966.

Wang, X. et Gong, Z. 1998. Assessment and analysis of soil quality changes after eleven years of reclamation in subtropical China. Geoderma, 81: 339-355.

Webster, R. et Burrough, P.A. 1972. Computer-based soil mapping of small areas from sample data. I. Multivariate classification and ordination. Journal of Soil Science, 23: 210-221.

Weill, A. 2009a. Les profils de sol agronomiques : Un outil de diagnostic sur l’état des sols. Centre de référence en agriculture et agroalimentaire du Québec. 136 p.

29

Weill, A. 2009b. L’évaluation visuelle de l’état du sol : La méthode du profil de sol agronomique. Centre de référence en agriculture et agroalimentaire du Québec. 38 p.

Wienhold, B.J., Karlen, D.L., Andrews, S.S. et Stott, D.E. 2009. Protocol for indicator scoring in the soil management assessment framework (SMAF). Renewable Agriculture and Food Systems, 24: 260-266.

Yaalon, D.H. 2008. Classification: Historical developments. Pages 1-3 dans R. Lal (éd.), Encyclopedia of Soil Science, Second edition. Taylor and Francis, New York., USA.

30

Chapitre 2: Lime requirement using Mehlich-III extraction and infrared-inferred cation exchange capacity Michaël A. Leblanc, Élizabeth Parent, and Léon E. Parent

Department of Soils and Agri-Food Engineering, Paul-Comtois Building, Université Laval, Québec,

QC G1V 0A6, Canada.

Publié dans Soil Science Society of America Journal, 2016

31

2.1 Résumé Le besoin en chaux (LR pour «lime requirement») pour atteindre et maintenir le pH du sol pourrait être

déterminé selon des méthodes respectueuses de l’environnement afin d’éviter l’utilisation de solutions

tampons contenant des produits toxiques. La spectroscopie infrarouge (IR) permet la prédiction de propriétés

reliées au pouvoir tampon du sol et ainsi potentiellement de supporter des modèles de LR. L’objectif de cette

étude était de développer des modèles de LR basés sur la spectroscopie IR pour une large gamme de sols du

Québec, Canada. L’indice de l’acidité échangeable estimée (EEA) a été calculé à partir de la capacité

d’échange cationique déterminée par la méthode à l’acétate d’ammonium (pH 7.0) (CEC-NH4OAc) moins la

somme des cations non-acides extraits au Mehlich-III. La performance était élevée pour les CEC-NH4OAc

inférées par IR, soient prédites par le proche IR (NIR) et moyen le IR (MIR) ou calculées à partir du contenu

en C organique mesuré et du contenu en argile prédit par NIR et MIR (0.82 ≤ R2 ≤ 0.91 en validation). La

qualité d’ajustement en validation croisée de l’EEA en utilisant la CEC-NH4OAc mesurée (0.81 ≤ R2 ≤ 0.89)

était supérieure à celle du pH tampon Shoemaker-McLean-Pratt (SMP) (0.15 ≤ R2 ≤ 0.77) pour les pH à l’eau

(pHH2O) cibles entre 5.5 et 6.5 et similaire au pH SMP à pHH2O cible de 7. Afin de quantifier le LR des systèmes

culturaux comme celui de la pomme de terre (Solanum tuberosum L.) qui nécessitent de faibles pHH2O (< 6.5)

ou de maintenir des pH adéquats avec de faibles LR (< 1.5 g CaCO3 kg-1), l’indice EEA, en utilisant une CEC-

NH4OAc inférée dans un intervalle de ±2.3 cmolc kg-1, a montré une précision supérieure à celle du pH SMP.

La prédiction directe du LR dérivé de l’EEA en utilisant la spectroscopie NIR et MIR a montré de faible

performance (R2 ≤ 0.48 en validation) comparativement à l’EEA calculé à partir des CEC-NH4OAc inférées par

IR. La spectroscopie IR peut améliorer la détermination en analyse de routine du LR à faible coût.

32

2.2 Abstract Lime requirement (LR) to reach and maintain desired soil pH levels could be determined using environmental-

friendly methods to avoid buffer solutions containing toxic chemicals. Infrared (IR) spectroscopy can predict

soil properties related to soil pH buffering capacity hence potentially supporting LR models. The aim of this

study was to develop IR-based LR models across a wide range of soils from Québec, Canada. Estimated

exchangeable acidity (EEA) index was computed from cation exchange capacity determined by ammonium

acetate (pH 7.0) method (CEC-NH4OAc) minus the sum of Mehlich-III extractable non-acidic cations. The

performance was high for IR-inferred CEC-NH4OAc predicted from mid-infrared (MIR) and near-infrared (NIR)

or computed from quantified organic C and NIR- and MIR-predicted clay content (0.82 ≤ R2 ≤ 0.91 in

validation). The goodness of fit in cross-validation of EEA using quantified CEC-NH4OAc (0.81 ≤ R2 ≤ 0.89)

was higher compared to Shoemaker-McLean-Pratt (SMP) buffer (0.15 ≤ R2 ≤ 0.77) for target water pH (pHH2O)

values between 5.5 and 6.5, and similar to SMP for target pHH2O of 7.0. To support LR determination of

cropping systems such as potato (Solanum tuberosum L.) that requires low pHH2O (< 6.5) or to maintain

adequate pH levels at low LRs (< 1.5 g CaCO3 kg-1), the EEA index using CEC-NH4OAc inferred within ±2.3

cmolc kg-1 was found to be more accurate than the SMP-buffer. Direct prediction of EEA-derived LR using NIR

and MIR spectroscopy showed low performance (validation R2 ≤ 0.48) compared to EEA computed from IR-

inferred CEC-NH4OAc. The IR spectroscopy can improve routine LR determination at low cost.

33

2.3 Introduction Soil acidity impacts on plant and microbial activities through the availability of nutrient ions and harmful acidic

cations (Robarge, 2008). Liming acid soils is an efficient practice to attain and maintain soil pH within

prescribed ranges. Lime requirement (LR) can be determined by soil-lime incubations, soil-base titrations, soil-

buffer equilibration and indirect determination methods (van Lierop, 1990; Viscarra Rossel et al., 2006; Ziadi

and Tran, 2007; Liu et al., 2008). Incubation and titration are standard methods to calibrate buffer and indirect

determination methods; however, titration methods result in less LR than incubation methods (Godsey et al.,

2007).

Buffer methods are commonly used to measure LR. Tran and van Lierop (1981a, 1982) investigated the

Shoemaker-McLean-Pratt (SMP) buffer solution (Shoemaker et al., 1961) that became the official Québec LR

soil testing procedure (Parent and Gagné, 2010). Designed originally for soils high in exchangeable Al, the

SMP-buffer method proved to be rapid and accurate across a wide range of soil conditions (van Lierop, 1990).

While the SMP buffer method has been designed for soils requiring > 4.5 Mg CaCO3 ha-1 (i.e., > 2.0 g CaCO3

kg-1 assuming a bulk density of 1.3 Mg m-3 and an incorporation depth of 0.17 m) (McLean, 1982), the SMP

buffer method provides a poor LR estimate for soils with low LR or low target pH such as soils used for potato

(Solanum tuberosum L.) production. In addition, the SMP-buffer solution contains toxic chemicals such as p-

nitrophenol and potassium chromate and alternative buffer methods have been successfully developed

(Sikora, 2006; Sikora and Moore, 2008; Wolf et al., 2008; Tunney et al., 2010). However, methods of green

chemistry received little attention, especially for soils where LR is low.

Although less accurate than buffer pH methods, indirect LR determination methods may be useful in some

circumstances (van Lierop, 1990). Soil pH buffering capacity is determined by permanent charges from partial

isomorphous substitution within the lattice of clay-size layer silicates, and by variable charges generated by the

pH-dependent dissociation of hydroxyl groups in phenols, carboxylic acids and silanol and aluminols groups

(Bache, 2008). The LR could thus be estimated from organic matter and clay contents, exchangeable Al,

cation exchange capacity (CEC) and the desired change of soil pH (Keeney and Corey, 1963; Pionke et al.,

1968; Rémy and Marin-Laflèche, 1974; Tran and van Lierop, 1981b; Quaggio et al., 1985; Goulding et al.,

1989; Weaver et al., 2004; Lemire et al., 2006; Wong et al., 2013).

Exchangeable acidity (EA) or base unsaturation computed as CEC minus the sum of non-acidic cations (basic

cations or bases) has been found to be closely related to LR (Keeney and Corey, 1963; Tran and van Lierop,

1981b). The EA could be estimated from laboratory-determined CEC-NH4OAc (ammonium acetate-NH4OAc-1

mol L-1, pH 7.0, method) minus the sum of exchangeable or Mehlich III extractable base cations (Tran and

Simard, 1993). However, the chemical determination of CEC is unsuitable for routine soil testing. The clay and

34

organic C contents that contribute to CEC could be predicted by infrared (IR) spectroscopy (Soriano-Disla et

al., 2014). Infrared spectroscopy could predict LR directly for Australian (Alfisols and Ultisols) and Brazilian

(Oxisols and Entisols) soils (Janik et al., 1998; Viscarra Rossel et al., 2006; Araújo et al., 2013). In general, the

mid-infrared (MIR) region shows higher performance compared to near-infrared (NIR) because the

fundamental vibrations occurring in the MIR region provides more spectral information (Soriano-Disla et al.,

2014; Nocita et al., 2015). While the LR reference from soil-lime incubations is time-consuming and costly, the

representative samples must be in sufficient number to support model calibrations of IR spectroscopy with high

generalization capacity (Ramirez-Lopez et al., 2014).

The objectives of this study were to: (i) evaluate the accuracy of the routine SMP buffer method to predict LR

for target soil pH values of 5.5 and 6.0 as required by the potato crop, (ii) calibrate a LR model using the CEC-

NH4OAc (pH 7.0) method and the Mehlich III extractable basic cations, (iii) compare the performance of the

model using IR-inferred CEC-NH4OAc predicted from NIR and MIR spectroscopy or computed from organic C

content and NIR- and MIR-predicted clay content and, (iv) evaluate the performance of NIR and MIR

spectroscopy to predict LR in selected soils from Québec, Canada.

2.4 Material and methods

2.4.1 Soil properties

Fifty soil samples collected in the arable layer (0 up to 0.30 m) of soils classified as Spodosols, Inceptisols and

Entisols from the Saguenay-Lac Saint-Jean Lowlands, the St. Lawrence Lowlands and the Appalachian region

of Québec, Canada, were selected for a lime incubation study. Soils were dried at 45°C and passed through a

2-mm sieve. Particle size distribution was determined by the sedimentation method (Sheldrick and Wang,

1993). Total C was quantified using a CNS 2000 Leco analyzer (Leco Corporation, St. Joseph, Michigan). Soil

pH was determined in 0.01 mol L-1 CaCl2 with a 1:2 (w/v) soil/solution ratio (pHCaCl2) (Hendershot et al., 1993a)

and converted to pH value in a 1:1 (w/v) soil/water ratio (pHH2O) as follows (Cescas, 1978):

𝑝𝐻𝐻2𝑂 = 0.27 + 1.03 ∙ 𝑝𝐻𝐶𝑎𝐶𝑙2 (2.1)

The SMP buffer pH was measured in a 1:1:2 (w/v/v) soil/water/buffer ratio (Tran and van Lierop, 1993). The K,

Ca and Mg were extracted according to Mehlich (1984) and quantified by inductively coupled argon plasma

optical emission spectrometry. The CEC was determined using the 1 M ammonium acetate method at pH 7.0

(CEC-NH4OAc) and ammonium was quantified colorimetrically at 410 nm (Hendershot et al., 1993b).

35

2.4.2 Soil-lime incubations

The lime incubation experiment was conducted according to Tran and van Lierop (1982). Duplicated 200-g soil

samples were placed into 500-mL plastic containers and given reagent-grade CaCO3 at rates of 0 to 7.7 g

CaCO3 kg-1 by 3 to 6 doses equally spaced but varied among soils to approach pHH2O of 7.0 at the highest rate

as estimated from SMP buffer pH. Treated soils were moistened to gravimetric water content near field

capacity as previously estimated by suction in a Buchner funnel (Klute, 1986) and incubated during three

months at room temperature (20 to 22°C), alternating between wetting and drying cycles every two weeks;

after the drying cycle, soils were remixed and remoistened. At the end of the incubation period, soil samples

were air dried and analyzed for pH in 0.01 M CaCl2. Statistical analyses were conducted on a mass basis (g

kg-1). The conversion of LR from laboratory to field scale (Mg ha-1) requires additional soil parameters provided

by the end user such as volume of coarse fragments, bulk density and working depth of surface layer.

2.4.3 Near- and mid-infrared spectra

Soil samples were scanned by a Nicolet Antaris FT-NIR analyzer (Thermo Electron Corp., Ann Arbor, Mich.) to

measure absorbance (log (1/R), where R is reflectance). The final spectrum was an average of 3 replicated

samples with 30 readings per sample, hence 90 scans, in the range of 9000 to 4000 cm-1 (1100 to 2500 nm) at

a resolution of 2 cm-1. Mid-infrared spectra of soil samples were collected using a Varian 1000 FT-IR Scimitar

series spectrometer (Varian Inc., Palo Alto, CA) to measure absorbance (log (1/T), where T is transmittance).

Before scanning, 2 mg of soil and 200 mg of KBr, ground to less than 74 μm and dried at 105 °C for 3 h, were

mixed and pressed under a load of 517 MPa for 3 min using a manual hydraulic pellet press (Carver, Model

4350.L, Carver, Inc., Wabash, Indiana) to obtain a clear disc of 13 mm of diameter. The final spectrum was an

average of two replicated pellets with 10 readings per pellet, hence 20 scans, corrected against the spectrum

with ambient air as background, in the range of 4000 to 400 cm-1 (2500 to 25 000 nm) at a resolution of 4 cm-1.

2.4.4 Soil collection for pedotransfer functions

A Québec soil library of proprieties and infrared spectra compiled at the Department of Soils and Agrifood

Engineering, Université Laval, Québec, Canada, were used to elaborate conversion equations for C, K, Ca

and Mg as well as pedotransfer functions (PTFs) for CEC-NH4OAc, clay content and LR. Soils with organic C

(OC) content less than 90 g OC kg-1 were selected, excluding organic soils and mucky mineral soils. The

organic C content was determined by the Walkley-Black method (Tiessen and Moir, 1993). Exchangeable

cations were extracted using the NH4OAc method at pH 7.0 (Hendershot et al., 1993b).

2.4.5 The estimated exchangeable acidity index

The relationship between LR and EA depends on the departure of soil pH from initial conditions. Some LR

predictive models multiply the EA measured in a buffer solution or following BaCl2-TEA titration by the fraction

36

of CEC occupied or unoccupied by exchangeable bases at original and desired soil pH values (Adams and

Evans, 1962; Ketterings et al., 2006). In soil titration studies with pHH2O values ranging from 4.5-5.0 to 6.5-7.5,

the EA is replaced proportionally by bases on exchange sites (McLean, 1978; Magdoff and Bartlett, 1985;

Curtin and Ukrainetz, 1997; Weaver et al., 2004; Liu et al., 2005). The estimated exchangeable acidity (EEA)

(cmolc kg-1) that must be neutralized to reach the target pH was computed as follows:

EEA = (CEC − bases) ∙ pHt − pH

7 − pH (2.2)

Where CEC is CEC-NH4OAc at pH 7.0 (cmolc kg-1), bases is the sum of exchangeable non-acid cations (K,

Ca, Mg) (cmolc kg-1), and pHt and pH are target and original water pH (1:1 (w/v) soil/water ratio), respectively.

2.4.6 Statistical analysis

Analysis of variance was performed on the change of soil pH after the 3-month incubation at increasing rates

of CaCO3. The LR for reaching pHH2O values of 5.5, 6.0, 6.5 and 7.0 was estimated by regression from

significant linear or quadratic effects (p < 0.05). The relationship between LR and SMP buffer pH was fitted to

curvilinear regression models (Tran and van Lierop, 1982) and that between LR and EEA by linear regression

models for four target pH values. The LR models using SMP buffer and EEA indices were evaluated using a 2-

fold cross-validation repeated five times. For each replication, the dataset was split randomly into two equal

subsets. The subsampling was realized in the sequence of target pH groups where initial pH was less than

target pH to maintain the same proportion of subset size (1:1) across target pH groups. The models were fitted

with subset A and tested with subset B, then trained on B and tested on A. For each combination (5x2=10

times), model performance was evaluated on the held out subset using the coefficient of determination (R2)

and the root mean square error (RMSE) computed as follows:

𝑅2 = 1 − (∑ (Pi − Oi)2N

i=1 / ∑ (Oi − O)2)Ni=1 (2.3)

RMSE = √1

N∑ (Pi − Oi)

2Ni=1 (2.4)

Where N is the number of data, Pi is the predicted value, Oi is the observed value, and O is the mean of

observed values. The EEA model calibrated using measured CEC-NH4OAc in cross-validation was validated

against IR-inferred CEC-NH4OAc. The EEA model was also used to derive LR of the soils of spectral collection

showing the predictor variables (CEC-NH4OAc, Mehlich III extracts and pH).

Calibration models to predict clay content, CEC-NH4OAc and LR from infrared spectra were developed using

spectral preprocessing and modelling techniques optimizing the predictive performance (Table 2.1) (Hastie et

37

al., 2009; Wehrens, 2011). The estimation of parameters for boosted regression trees (BRT) (number of

principal components as input predictors, number of trees, tree complexity, and learning rate) and partial least

squares regression (PLSR) (number of factors) was optimized using a 5-fold cross-validation on the calibration

dataset. Models were validated against the lime incubation experiments. A spectra removal continuum was

performed to illustrate the characteristic absorption of the spectral library (Clark and Roush, 1984).

The statistical analysis of compositional data (proportions constrained to 100% or 1000 g kg-1) is subject to

redundancy, scale dependency or sub-compositional incoherence and non-normal distribution (Filzmoser et

al., 2009). The log ratios can avoid such problems (Aitchison, 1986; Egozcue et al., 2003; Filzmoser and Hron,

2011). Clay content and the filling value to 1000 g kg-1 (1000 g kg-1 minus the sum of sand and silt) were thus

transformed into an isometric log ratio (ilr) as follows (Egozcue and Pawlowsky-Glahn, 2006):

ilri = √risi

ri+siln

g(x+)

g(x−) (2.5)

Where ilri is the ith balance between two subsets defined for a D-part composition where i Є [1, D-1], ri and si

are number of components in the + (numerator) and – (denominator) groups, respectively, and g(x+) and

g(x−) are geometric means of components in the + and – subsets, respectively. The spectroscopic model was

calibrated against ilr values and the output ilr was back-transformed into the familiar units of the compositional

space to predict CEC-NH4OAc from organic C and clay contents.

The relationship between the absolute prediction errors of IR-inferred CEC-NH4OAc and EEA-predicted LR

using IR-inferred CEC-NH4OAc was partitioned using a receiver operating characteristic (ROC) procedure

(Fawcett, 2006). The prediction interval of EEA-predicted LR using measured CEC-NH4OAc was defined as

reference cutoff. The test performance was measured using the area under the ROC curve relating sensitivity

to specificity and the optimal predictor threshold (prediction error of IR-inferred CEC-NH4OAc) was the point of

the ROC curve returning the highest Youden’s index (Youden, 1950), i.e. sensitivity + specificity -1.

Statistical analyses were conducted in the R statistical environment (R Development Core Team, 2011) with R

packages compositions (van den Boogaart et al., 2014) for compositional data transformation, hyperSpec

(Beleites, 2014), prospectr (Stevens and Ramirez-Lopez, 2013) and ptw (Gerretzen et al., 2015) for spectral

data manipulation, caret (Kuhn, 2014) and gbm (Ridgeway, 2013) for BRT modelling, pls (Mevik et al., 2013)

for PLSR modelling, plyr (Wickham, 2015) for data manipulation, pROC (Robin et al., 2014) for ROC partition

and ggplot2 (Wickham and Chang, 2015) and ggtern (Hamilton, 2014) for data visualization.

38

2.5 Results and discussion

2.5.1 Soil properties in the lime incubation study

The lime incubation study included a large spectrum of soil textures from sandy to loamy and clayey soils (Fig.

2.1) and of soil chemical properties as shown by CEC-NH4OAc values varying from 5 to 30 cmolc kg-1, a 5X

variation in C contents, a 100X variation in Ca content and even more variation in K and Mg contents (Table

2.2). As a result, our models scanned a wide range of mineral soil properties found in Québec.

2.5.2 Conversion equations

Because Québec soils generally show negligible contents of inorganic C (carbonate), total C (CNS 2000 Leco)

was linearly related to the Walkley-Black organic C (OCWB) (Tiessen and Moir, 1993) with a regression slope of

0.977 (Table 2.3). As result, the C determined as total C or Walkley-Black OC can be used in both calibration

studies. Mehlich-III K, Ca and Mg (Mehlich, 1984) were also closely related to exchangeable cations as

extracted using the NH4OAc method at pH 7.0 (Hendershot et al., 1993b). The K, Ca, and Mg from the

Mehlich-III method can thus be converted accurately to ammonium-acetate (pH 7.0) exchangeable cations

using linear regression (Table 2.3).

2.5.3 Pedotransfer functions for CEC

The PTF to predict CEC-NH4OAc from laboratory-determined OCWB and clay contents was calibrated using a

library of 526 soil samples showing CEC-NH4OAc values between 3.3 and 50.9 cmolc kg-1. The distribution of

PTF predictor variables in the lime incubation study was well represented by the calibration data set (Fig. 2.2).

The CEC-NH4OAc prediction model using OCWB (g kg-1) and clay (g kg-1) contents (Eq. 2.6) showed a R2 value

of 0.90 and RMSE of 2.03 in validation for the soils of the lime incubation study (Fig. 2.3).

𝐶𝐸𝐶𝑁𝐻4𝑂𝐴𝑐 = 0.434 + 0.285 (𝑂𝐶WB) + 0.034 (𝑐𝑙𝑎𝑦) (2.6)

The regression coefficients were within the range of those (-0.025 to 0.706 for intercept, 0.277 to 0.950 for

OCWB and 0.027 to 0.076 for clay) reported for Québec soils (Martel and Laverdière, 1976; Laverdière et al.,

1981; Parent and Gagné, 2010).

A library of 322 soil samples was used to develop prediction models using NIR and MIR spectroscopy for clay

contents between 12 and 738 g kg-1 and CEC-NH4OAc values between 3.7 and 37.7 cmolc kg-1. The spectral

variability for the soils of the lime incubation experiment was within the spectral variation of the soils of the

collection, although some samples were close to the limits of the spectral range (Fig. 2.4).

39

The prediction of clay content from NIR and MIR spectroscopy showed similar goodness of fit (Fig. 2.5).

Compared to the laboratory-determined clay content, clay content predicted by IR spectroscopy in Eq. 2.6

showed slightly lower performance to predict CEC-NH4OAc of soils of the lime incubation experiment (R2 =

0.88 and RMSE = 2.26 from NIR and R2 = 0.86 and RMSE = 2.38 from MIR) (Fig. 2.6), resulting from the

propagation of clay-predicted uncertainty in the models. Compared to NIR, the MIR model showed a higher

performance in predicting CEC-NH4OAc (Fig. 2.7). As reported by Soriano-Disla et al. (2014), the MIR region

is more sensitive to clay mineralogy and organic compounds that contribute to CEC. Among IR-inferred CEC-

NH4OAc, the CEC-NH4OAc predicted directly by MIR produced the best goodness of fit with a R2 of 0.91 and

RMSE of 1.97 for validation using the soils of the lime incubation experiment. The IR-inferred CEC-NH4OAc

estimates were further tested as input variables in the EEA models.

2.5.4 Relationship between lime requirements and SMP and EEA indices

The SMP buffer pH was loosely related to LR at target pHH2O values of 5.5 and 6.0 (0.15 ≤ R2 ≤ 0.63 in cross-

validation) (Table 2.4), particularly for soils having LR values below 1.5 g CaCO3 kg-1 (Fig. 2.8). The

performance of SMP was improved for target pH values approaching the pH of the buffer solution, as already

reported (Webber et al., 1977; Tran and van Lierop, 1982; Dietzel et al., 2009). The goodness of fit of the EEA

index using measured CEC-NH4OAc was higher compared to SMP buffer pH for the target pHH2O values of 5.5

to 6.5 and similar for target pHH2O of 7.0 (Table 2.4).

In contrast with SMP, the performance (R2) of the EEA index decreased as target pH increased, especially for

target pHH2O of 7.0 (Table 2.4), but this can be explained theoretically. First, the EEA index assumes that the

sum of exchangeable basic cations equals CEC-NH4OAc at pH 7.0, which may be a gross approximation;

depending on nature of permanent and variable charges related to soil genesis, exchangeable acidity may

reach zero at higher pH values (Rowell, 1994). In addition, because soils are highly buffered above pHH2O of

7.5 (Magdoff and Bartlett, 1985), the assumption of linearity between acidity on exchange sites and soil pH

may not hold near pHH2O of 7.0. Finally, for higher LR levels, the multiplicative nature of the EEA index (Eq.

2.2) may induce higher propagation of the uncertainty associated with input predictors (CEC, basic cations and

pH). For example, the pH measurement error enhances uncertainty in the prediction of LR where

exchangeable acidity increases. On the other hand, the use of pHCaCl2, a more stable pH measurement

compared to pHH2O (Hendershot et al., 1993a) is more appropriate than pHH2O to support the EEA index. Linear

regressions between indices and LR developed with the entire dataset of the lime incubation study are

presented in Table 2.5.

40

2.5.5 Performance of the EEA index using IR-inferred CECs

The relationship between the incubation LR and the EEA index was affected by the accuracy of IR-inferred

CEC-NH4OAc (Fig. 2.9). At a target pHH2O of 5.5 to 6.5, the performance of the EEA model to predict LR in

cross-validation was similar to measured CEC-NH4OAc (0.86 ≤ R2 ≤ 0.89) for the MIR-predicted CEC-NH4OAc

(0.77 ≤ R2 ≤ 0.82) and CEC-NH4OAc computed from OCWB and NIR- and MIR-predicted clay contents (0.76 ≤

R2 ≤ 0.89) (Table 2.4). The CEC-NH4OAc predicted directly from NIR was not accurate enough to support the

EEA model (0.28 ≤ R2 ≤ 0.41). The performance of EEA model across IR-inferred CEC-NH4OAc decreased at

target pHH2O of 7.0 while the CEC-NH4OAc predicted from MIR showed the highest performance (R2 = 0.61 in

cross-validation). The CEC-NH4OAc predicted directly from MIR has been found to be most suitable compared

to the other IR-inferred CEC-NH4OAc to support the EEA model across target pH values. The EEA index using

IR-inferred CEC-NH4OAc provides an alternative to the SMP buffer, especially at low target pHH2O (< 6.5).

The EEA model using measured CEC-NH4OAc (Table 2.5) showed 95% prediction intervals of ±0.35, ±0.51,

±0.80, and ±1.29 g CaCO3 kg-1 for target pHH2O of 5.5, 6.0, 6.5 and 7.0, respectively. Using CEC-NH4OAc

predicted directly from NIR or computed from OCWB and NIR- and MIR-predicted clay contents, some soils

showed predictor errors of LR greatly exceeding the prediction interval of the EEA model using measured

CEC-NH4OAc at target pHH2O of 7.0 (±1.29 g CaCO3 kg-1) (Fig. 2.10). The binary partition of the relationship

between prediction errors of LR and IR-inferred CEC-NH4OAc showed a threshold CEC-NH4OAc value of ±2.3

cmolc kg-1 value (Fig. 2.10) which was similar across target pH values. The threshold value of the IR-inferred

CEC-NH4OAc indicated that LR determination of soils could not be supported by the EEA model (Table 2.5)

due to a prediction error on IR-inferred CEC-NH4OAc exceeding ±2.3 cmolc kg-1. Although the CEC-NH4OAc

predicted directly from MIR could support the EEA model across target pH values, the spectral library should

be enhanced to gain more representability across a larger pedodiversity within the geographical area of

interest and to support the generalization capacity of the models (Viscarra Rossel et al., 2008; Ramirez-Lopez

et al., 2014).

2.5.6 Prediction of LR by infrared spectroscopy

The infrared models to predict LR from the EEA model (Table 2.5) was calibrated using 43, 130, 202, and 247

samples (from library with available variables) for target pHH2O of 5.5, 6.0, 6.5 and 7.0, respectively. The MIR

models showed R2 values between 0.03 and 0.50 in cross-validation and between 0.14 and 0.48 in validation

(Fig. 2.11). The NIR spectroscopy (results not presented) did not improve the performance to predict EEA-

derived LR. The goodness of fit for MIR models was too low to support LR determination. Such performance

was lower than that reported by Soriano-Disla et al. (2014) using MIR (median R2 value of 0.81). Because

EEA-derived LR in the spectral collection was skewed positively, the calibration model could benefit from the

addition of soil samples with higher LR values.

41

The LR derived from EEA proved to be a more appropriate reference than the SMP buffer pH for calibrated

spectroscopy model, especially for target pHH2O values < 6.5. Nevertheless, the LR cannot be directly

predicted from IR spectroscopy, although a direct prediction of LR would be interesting to avoid wet laboratory

analysis.

The EEA model using IR-inferred CEC-NH4OAc within ±2.3 cmolc kg-1 provided an accurate and

environmental friendly method to predict LR. The EEA approach does not eliminate wet analyses, but the input

variables (Mehlich III extracts and pH) are provided by Québec soil testing laboratories. Buffer solutions

without hazardous constituent such as Sikora and modified Mehlich buffers (Sikora, 2006; Hoskins and Erich,

2008) were found to be as good as or better than SMP in predicting LR (Wolf et al., 2008; Pagani and

Mallarino, 2012). However, as observed for the SMP, alternative buffers are more reliable LR predictors at

high than low LR (Viswakumar et al., 2010). Although single titration provides a quick measurement of LR with

very little (Thompson et al., 2010; Kissel et al., 2012) or slightly more chemical use (Wang et al., 2015), initial

pH analysis is also required. Compared to recent alternative methods for SMP, the EEA approach is not lesser

time-consuming while IR spectroscopy to predict CEC-NH4OAc provides additional soil properties (e.g. clay

content) to evaluate soil quality and facilitate soil fertility interpretation.

2.6 Conclusion While the SMP buffer pH proved to be inadequate to predict LR at low target pHH2O (< 6.5) and low LR value

(< 1.5 g CaCO3 kg-1), the EEA index using measured CEC-NH4OAc was found to be a valuable predictor of

LR, especially for potato soils that require low pH values. To maintain adequate pH levels at low LRs, an EEA

index using IR-inferred CEC-NH4OAc within ±2.3 cmolc kg-1 was found to be more accurate than the SMP

buffer pH. The MIR spectroscopy has been found to be more accurate than NIR to predict CEC-NH4OAc

directly and more suitable to support the EEA model.

Uncertainty analysis of predicted CEC will be useful to identify the predictions not accurate enough to support

EEA model and needing improved PTFs to estimate CEC (Minasny and Bishop, 2008; Tranter et al., 2010).

The standardization of spectroscopy measurement protocols and the development of spectral libraries (Nocita

et al., 2015) will facilitate the integration of the EEA model into routine LR determination procedures.

2.7 Acknowledgements This study is part of the soil quality module of the research project entitled “Implementing means to increase

potato ecosystem services” (CRDPJ 385199 – 09). We acknowledge the financial support of the Natural

Sciences and Engineering Research Council of Canada (NSERC), as well as farm partners as follows:

Cultures Dolbec Inc., St-Ubalde, Québec, Canada; Groupe Gosselin FG Inc., Pont Rouge, Québec, Canada;

42

Agriparmentier Inc. and Prochamps Inc., Notre-Dame-du-Bon-Conseil, Québec, Canada; Ferme Daniel Bolduc

et Fils Inc., Péribonka, Québec, Canada. We thank Nicolas Samson and Marie-Hélène Lamontagne for

technical help.

43

2.8 References Adams, F., and C.E. Evans. 1962. A rapid method for measuring lime requirement of red-yellow podzolic soils. Soil Sci. Soc. Am. Proc. 26: 355–357.

Aitchison, J. 1986. The statistical analysis of compositional data. Chapman and Hall, London.

Araújo, S.R., J.A.M. Demattê, and H. Bellinaso. 2013. Analysing the effects of applying agricultural lime to soils by VNIR spectral sensing: a quantitative and quick method. Int. J. Remote Sens. 34: 4570–4584.

Bache, B.W. 2008. Base saturation. Encycl. Soil Sci.: 52–54.

Beleites, C. 2014. hyperSpec: Interface for hyperspectral data, i.e. spectra + meta info (spatial, time, concentration, ...). R package version 0.98-20140523. R Foundation for Statistical Computing, Vienna, Austria.

Cescas, M.P. 1978. Table interprétative de la mesure du pH des sols du Québec par quatre méthodes différentes. Le Nat. Can. 105: 259–263.

Clark, R.N., and T.L. Roush. 1984. Reflectance spectroscopy: Quantitative analysis techniques for remote sensing applications. J. Geophys. Res. 89: 6329–6340.

Curtin, D., and H. Ukrainetz. 1997. Acidification rate of limed soil in a semiarid environment. Can. J. Plant Sci. 77: 415–420.

Dietzel, K.A., Q.M. Ketterings, and R. Rao. 2009. Predictors of lime needs for pH and aluminum management of New York agricultural soils. Soil Sci. Soc. Am. J. 73: 443–448.

Egozcue, J.J., and V. Pawlowsky-Glahn. 2006. Simplicial geometry for compositional data. Geol. Soc. London, Spec. Publ. 264: 145–159.

Egozcue, J.J., V. Pawlowsky-Glahn, G. Mateu-Figueras, and C. Barceló-Vidal. 2003. Isometric logratio transformations for compositional data analysis. Math. Geol. 35: 279–300.

Fawcett, T. 2006. An introduction to ROC analysis. Pattern Recognit. Lett. 27: 861–874.

Filzmoser, P., and K. Hron. 2011. Robust statistical analysis. In: V. Pawlowsky-Glahn et al., editors, Compositional data analysis: Theory and applications. John Wiley and Sons, New York. p. 57–72.

Filzmoser, P., K. Hron, and C. Reimann. 2009. Univariate statistical analysis of environmental (compositional) data: problems and possibilities. Sci. Total Environ. 407: 6100–6108.

Gerretzen, J., P. Eilers, H. Wouters, T. Bloemberg, and R. Wehrens. 2015. ptw: Parametric time warping. R package version 1.0-7. R Foundation for Statistical Computing, Vienna, Austria.

Godsey, C.B., G.M. Pierzynski, D.B. Mengel, and R.E. Lamond. 2007. Evaluation of common lime requirement methods. Soil Sci. Soc. Am. J. 71: 843–850.

Goulding, K.W.T., S.P. McGrath, and A.E. Johnston. 1989. Predicting the lime requirement of soils under permanent grassland and arable crops. Soil Use Manag. 5: 54–58.

Hamilton, N. 2014. ggtern: An extension to ggplot2, for the creation of ternary diagrams. R package version 1.0.3.2. R Foundation for Statistical Computing, Vienna, Austria.

44

Hastie, T., R.J. Tibshirani, and J. Friedman. 2009. The elements of statistical learning: Data mining, inference and prediction. 2nd Ed. Springer, New York.

Hendershot, W.H., H. Lalande, and M. Duquette. 1993a. Soil reaction and exchangeable acidity. In: M.R. Carter, editor, Soil sampling and methods of analysis. Lewis, Boca Raton, FL. p. 141–145.

Hendershot, W.H., H. Lalande, and M. Duquette. 1993b. Ion exchange and exchangeable cations. In: M.R. Carter, editor, Soil sampling and methods of analysis. Lewis, Boca Raton, FL. p. 167–175.

Hoskins, B.R., and M.S. Erich. 2008. Modification of the Mehlich lime buffer test. Commun. Soil Sci. Plant Anal. 39: 2270–2281.

Janik, L.J., R.H. Merry, and J.O. Skjemstad. 1998. Can mid infrared diffuse reflectance analysis replace soil extractions? Aust. J. Exp. Agric. 38: 681–696.

Keeney, D.R., and R.B. Corey. 1963. Factors affecting the lime requirements of Wisconsin soils. Soil Sci. Soc. Am. Proc. 27: 277–280.

Ketterings, Q.M., W.S. Reid, and K.J. Czymmek. 2006. Lime guidelines for field crops in New York. First Release. Department of Crop and Soil Sciences Extension, Series E06-2. Cornell University, Ithaca, New-York.

Kissel, D.E., L.S. Sonon, and M.L. Cabrera. 2012. Rapid measurement of soil pH buffering capacity. Soil Sci. Soc. Am. J. 76: 694–699.

Klute, A. 1986. Water retention: Laboratory methods. In: A. Klute, editor, Methods of soils analysis, Part 1, Agronomy 9. SSSA, Madison, WI. p. 635–662.

Kuhn, M. 2014. caret: Classification and regression training. R package version 6.0-37. R Foundation for Statistical Computing, Vienna, Austria.

Laverdière, M.R., C.R. De Kimpe, and A. D’Avignon. 1981. Caractéristiques minéralogiques et chimiques de quelques sols sableux du Québec en regard de leur évolution pédologique. Can. J. Plant Sci. 61: 273–283.

Lemire, E., K.M. Taillon, and W.H. Hendershot. 2006. Using pH-dependent CEC to determine lime requirement. Can. J. Soil Sci. 86: 133–139.

Liu, M., D.E. Kissel, M.L. Cabrera, and P.F. Vendrell. 2005. Soil lime requirement by direct titration with a single addition of calcium hydroxide. Soil Sci. Soc. Am. J. 69: 522–530.

Liu, M., D.E. Kissel, L.S. Sonon, M.L. Cabrera, and P.F. Vendrell. 2008. Effects of biological nitrogen reactions on soil lime requirement determined by incubation. Soil Sci. Soc. Am. J. 72: 720–726.

Magdoff, F.R., and R.J. Bartlett. 1985. Soil pH buffering revisited. Soil Sci. Soc. Am. J. 49: 145–148.

Martel, Y.A., and M.R. Laverdière. 1976. Facteurs qui influencent la teneur de la matière organique et les propriétés d’échange cationique des horizons Ap des sols de grandes culture du Québec. Can. J. Soil Sci. 56: 213–221.

McLean, E.O. 1978. Principles underlying the practice of determining lime requirements of acid soils by use of buffer methods. Commun. Soil Sci. Plant Anal. 9: 699–715.

45

McLean, E.O. 1982. Soil pH and lime requirement. In: A.L. Page et al., editors, Methods of soils analysis, Part 2, Agronomy 9. SSSA, Madison, WI. p. 199–224.

Mehlich, A. 1984. Mehlich 3 soil test extractant: A modification of Mehlich 2 extractant. Commun. Soil Sci. Plant Anal. 15: 1409–1416.

Mevik, B.-H., R. Wehrens, and K.H. Liland. 2013. pls: Partial least squares and principal component regression. R package version 2.4-3. R Foundation for Statistical Computing, Vienna, Austria.

Minasny, B., and T.F.A. Bishop. 2008. Analysing uncertainty. In: N. McKenzie et al., editors, Guidelines for surveying soil and resources. 2nd Ed. Australian Soil and Land Survey Handbooks Series 2. CSIRO Publishing. p. 383–393.

Nocita, M., A. Stevens, B. van Wesemael, M. Aitkenhead, M. Bachmann, B. Barthès, E. Ben Dor, D.J. Brown, M. Clairotte, A. Csorba, P. Dardenne, J.A.M. Demattê, V. Genot, C. Guerrero, M. Knadel, L. Montanarella, C. Noon, L. Ramirez-Lopez, J. Robertson, H. Sakai, J.M. Soriano-Disla, K.D. Shepherd, B. Stenberg, E.K. Towett, R. Vargas, and J. Wetterlind. 2015. Soil spectroscopy: An alternative to wet chemistry for soil monitoring. Adv. Agron. 132: 139–159.

Pagani, A., and A.P. Mallarino. 2012. Comparison of methods to determine crop lime requirement under field conditions. Soil Sci. Soc. Am. J. 76: 1855–1866.

Parent, L.E., and G. Gagné. 2010. Guide de référence en fertilisation. 2e Édition. Centre de référence en agriculture et agroalimentaire du Québec.

Pionke, H.B., R.B. Corey, and E.E. Schulte. 1968. Contributions of soil factors to lime requirement and lime requirement tests. Soil Sci. Soc. Am. Proc. 32: 113–117.

Quaggio, J.A., B. Van Raij, and E. Malavolta. 1985. Alternative use of the SMP-buffer solution to determine lime requirement of soils. Commun. Soil Sci. Plant Anal. 16: 245–260.

R Development Core Team, R. 2011. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.

Ramirez-Lopez, L., K. Schmidt, T. Behrens, B. van Wesemael, J.A.M. Demattê, and T. Scholten. 2014. Sampling optimal calibration sets in soil infrared spectroscopy. Geoderma 226-227: 140–150.

Rémy, J.C., and A. Marin-Laflèche. 1974. L’analyse de la terre: réalisation d'un programme d'interprétation automatique. Ann. Agron. 25: 607–632.

Ridgeway, G. 2013. gbm: Generalized boosted regression models. R package version 2.1. R Foundation for Statistical Computing, Vienna, Austria

Robarge, W.P. 2008. Soil acidity. Encycl. Soil Sci.: 10–21.

Robin, X., N. Turck, A. Hainard, N. Tiberti, F. Lisacek, J.-C. Sanchez, and M. Müller. 2014. pROC: Display and analyze ROC curves. R package version 1.7.3. R Foundation for Statistical Computing, Vienna, Austria.

Rowell, D.L. editor. 1994. Soil science: Methods and applications. Longman, Essex, UK.

Sheldrick, B.H., and C. Wang. 1993. Particle size distribution. In: M.R. Carter, editor, Soil sampling and methods of analysis. Lewis, Boca Raton, FL. p. 499–513.

46

Shoemaker, H.E., E.O. Mclean, and P.F. Peatt. 1961. Buffer methods for determining lime requirement of soils with appreciable amounts of extractable aluminum. Soil Sci. Soc. Am. Proc. 25: 274–277.

Sikora, F.J. 2006. A buffer that mimics the SMP buffer for determining lime requirement of soil. Soil Sci. Soc. Am. J. 70: 474–486.

Sikora, F.J., and K.P. Moore. 2008. The Moore–Sikora buffer for lime requirement determinations. Soil Sci. Soc. Am. J. 72: 1163–1173.

Soriano-Disla, J.M., L.J. Janik, R.A. Viscarra Rossel, L.M. Macdonald, and M.J. McLaughlin. 2014. The performance of visible, near-, and mid-infrared reflectance spectroscopy for prediction of soil physical, chemical, and biological properties. Appl. Spectrosc. Rev. 49: 139–186.

Stevens, A., and L. Ramirez-Lopez. 2013. prospectr: Miscellaneous functions for processing and sample selection of vis-NIR diffuse reflectance data. R package version 0.1.3. R Foundation for Statistical Computing, Vienna, Austria.

Thompson, J.S., D.E. Kissel, M.L. Cabrera, and L.S. Sonon. 2010. Equilibration reaction from single addition of base to determine soil lime requirement. Soil Sci. Soc. Am. J. 74: 663–669.

Tiessen, H., and J.O. Moir. 1993. Total and organic carbon. In: M.R. Carter, editor, Soil sampling and methods of analysis. Lewis, Boca Raton, FL. p. 187–199.

Tran, T.S., and W. van Lierop. 1981a. Evaluation and improvement of buffer-pH lime requirement methods. Soil Sci. 131: 178–188.

Tran, T.S., and W. van Lierop. 1981b. Evaluation des méthodes de détermination du besoin en chaux en relation avec les propriétés physiques et chimiques des sols acides. Sci. du sol 12: 253–267.

Tran, T.S., and W. van Lierop. 1982. Lime requirement determination for attaining pH 5.5 and 6.0 of coarse-textured soils using buffer-pH methods. Soil Sci. Soc. Am. J. 46: 1008–1014.

Tran, T.S., and W. van Lierop. 1993. Lime requirement. In: M.R. Carter, editor, Soil sampling and methods of analysis. Lewis, Boca Raton, FL. p. 109–113.

Tran, T.S., and R.R. Simard. 1993. Mehlich-III extractable elements. In: M.R. Carter, editor, Soil sampling and methods of analysis. Lewis, Boca Raton, FL. p. 43–49.

Tranter, G., B. Minasny, and A.B. McBratney. 2010. Estimating pedotransfer function prediction limits using fuzzy k-means with extragrades. Soil Sci. Soc. Am. J. 74: 1967–1975.

Tunney, H., F.J. Sikora, D. Kissel, A. Wolf, L. Sonon, and K. Goulding. 2010. A comparison of lime requirements by five methods on grassland mineral soils in Ireland. Soil Use Manag. 26: 126–132.

Van den Boogaart, K.G., R. Tolosana-Delgado, and M. Bren. 2014. compositions: Compositional data analysis. R package version 1.40-1. R Foundation for Statistical Computing, Vienna, Austria.

Van Lierop, W. 1990. Soil pH and lime requirement determination. In: R. L. Westerman, editor, Soil testing and plant analysis. 3rd Ed. SSSA, Madison, WI. p. 73–126.

Viscarra Rossel, R.A., Y.S. Jeon, I.O.A. Odeh, and A.B. McBratney. 2008. Using a legacy soil sample to develop a mid-IR spectral library. Aust. J. soil Res. 46: 1–16.

47

Viscarra Rossel, R.A., D.J.J. Walvoort, A.B. McBratney, L.J. Janik, and J.O. Skjemstad. 2006. Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties. Geoderma 131: 59–75.

Viswakumar, A., R.W. Mullen, K.A. Diedrick, W.A. Dick, and N.T. Basta. 2010. Evaluation of four buffer solutions for determining the lime requirement for Ohio soils. Commun. Soil Sci. Plant Anal. 41: 424–437.

Wang, X., C. Tang, S. Mahony, J.A. Baldock, and C.R. Butterly. 2015. Factors affecting the measurement of soil pH buffer capacity: approaches to optimize the methods. Eur. J. Soil Sci. 66: 53–64.

Weaver, A.R., D.E. Kissel, F. Chen, L.T. West, W. Adkins, D. Rickman, and J.C. Luvall. 2004. Mapping soil pH buffering capacity of selected fields in the coastal plain. 68: 662–668.

Webber, M.D., P.B. Hoyt, M. Nyborg, and D. Corneau. 1977. A comparison of lime requirement methods for acid canadian soils. Can. J. Soil Sci. 57: 361–370.

Wehrens, R. 2011. Chemometrics with R. Multivariate data analysis in the natural sciences and life sciences. Springer.

Wickham, H. 2015. plyr: Tools for splitting, applying and combining data. R package version 1.8.1. R Foundation for Statistical Computing, Vienna, Austria.

Wickham, H., and W. Chang. 2015. ggplot2: An implementation of the grammar of graphics. R package version 1.01. R Foundation for Statistical Computing, Vienna, Austria.

Wolf, A.M., D.B. Beegle, and B. Hoskins. 2008. Comparison of Shoemaker-McLean-Pratt and modified Mehlich buffer tests for lime requirement on Pennsylvania soils. Commun. Soil Sci. Plant Anal. 39: 1848–1857.

Wong, M.T.F., M.J. Webb, and K. Wittwer. 2013. Development of buffer methods and evaluation of pedotransfer functions to estimate pH buffer capacity of highly weathered soils. Soil Use Manag. 29: 30–38.

Youden, W.J. 1950. Index for rating diagnostic tests. Cancer 3: 32–35.

Ziadi, N., and T.S. Tran. 2007. Lime requirement. In: M.R. Carter et al., editors, Soil sampling and methods of analysis. 2nd Ed. CRC Press, Boca Raton, FL. p. 129–134.

48

Table 2.1. Spectral pretreatment and model calibrated to predict clay content, CEC-NH4OAc and lime

requirement (LR) from near- (NIR) and mid-infrared (MIR) spectral bands.

Attribute Band Pretreatment† Model†

Clay NIR 1st derivative BRT

MIR SNV BRT

CEC NIR 1st derivative BRT

MIR Asysm PLSR

LR MIR SNV PLSR

† SNV, Standard normal variate scaling; Asysm, baseline removal using an asymmetric least squares; BRT,

boosted regression trees; PLSR, partial least squares regression

49

Table 2.2. Chemical properties of the 50 soils used in the lime incubation study.

Soil property Range Median Mean SD

Total C, g kg-1 11.9-59.0 23.3 25.1 9.8

pH CaCl2 1:2 (w/v) 4.10-6.22 5.17 5.19 0.53

pH SMP 1:1:2 (w/v/v) 5.62-7.00 6.45 6.40 0.29

K, mg kg-1 24.7-423.2 111.1 147.5 99.5

Ca, mg kg-1 41.9-4280.8 1318.3 1594.8 1212.5

Mg, mg kg-1 0.0-852.0 97.4 190.6 215.4

CEC NH4OAc pH 7, cmolc kg-1 5.6-29.8 12.7 14.7 6.5

50

Table 2.3. Relationships between total C (CCNS) and Walkley-Black organic C (OCWB) and between Mehlich III

(MIII) and ammonium-acetate (pH 7.0) exchangeable (ex) K, Ca and Mg for soils with OCWB < 90 g kg-1.

Relationship Unit n Intercept (a) Slope (b) R2 RMSE

OCWB = a + b(CCNS) g kg-1 157 0.272 0.977 0.958 2.715

Kex = a + b(KMIII) cmolc kg-1 155 0.003 0.969 0.982 0.032

Caex = a + b(CaMIII) cmolc kg-1 155 -0.206 0.961 0.978 1.018

Mgex = a + b(MgMIII) cmolc kg-1 155 -0.112 0.961 0.982 0.221

51

Table 2.4. Cross-validation performance of Shoemaker-McLean-Pratt (SMP) buffer and estimated

exchangeable acidity (EEA) models using CEC-NH4OAc measured and CEC-NH4OAc inferred directly from

near- (NIR) and mid-infrared (MIR) or from Walkley-Black organic C (OCWB) and NIR- and MIR-predicted clay

contents to predict LR at four target pHH2O.

Index Average R2 Average RMSE

5.5 6 6.5 7 5.5 6 6.5 7

SMP buffer 0.146 0.628 0.723 0.766 0.497 0.545 0.646 0.765

EEA-CEC measured 0.884 0.893 0.867 0.807 0.187 0.291 0.433 0.696

EEA-CEC [OCWB, Clay (NIR)] 0.824 0.843 0.764 0.510 0.230 0.355 0.597 1.095

EEA-CEC [OCWB, Clay (MIR)] 0.859 0.888 0.782 0.520 0.189 0.297 0.569 1.072

EEA-CEC (NIR) 0.385 0.414 0.410 0.282 0.396 0.676 0.930 1.336

EEA-CEC (MIR) 0.769 0.824 0.774 0.611 0.238 0.378 0.577 0.969

52

Table 2.5. Regression models between lime requirement (g CaCO3 kg-1) (y) and Shoemaker-McLean-Pratt

(SMP) buffer and estimated exchangeable acidity (EEA) using measured CEC-NH4OAc (x), respectively,

across the soil dataset of the liming experiment.

Target pH n SMP model EEA model using measured CEC

Equation R2 RMSE Equation R2 RMSE

5.5 24 y = 116.0 - 36.0x + 2.80x2 0.537 0.405 y = -0.106 + 0.414x 0.915 0.173

6.0 36 y = 136.8 - 40.7x + 3.04x2 0.758 0.440 y = -0.030 + 0.413x 0.919 0.255

6.5 47 y = 138.2 - 39.3x + 2.80x2 0.811 0.550 y = 0.114 + 0.421x 0.897 0.406

7.0 50 y = 124.2 - 33.3x + 2.23x2 0.823 0.681 y = 0.430 + 0.413x 0.838 0.652

53

Figure 2.1. Particle size distribution of lime study soils. Polygon limits represent the USDA textural classes.

54

Figure 2.2. Distribution of clay and Walkley-Black organic C contents in the soil collection used to develop pedotransfer functions for CEC-NH4OAc and for the soils of the lime incubation experiment.

55

Figure 2.3. Relationship between measured CEC-NH4OAc and CEC-NH4OAc predicted from Walkley-Black organic C and clay contents for the cross-validation (empty dots) and validation (filled dots) data.

56

Figure 2.4. Continuum-removed absorbance of the near- (a) and mid-infrared (b) spectra for the soil of the collection and the lime incubation experiment.

57

Figure 2.5. Relationships between laboratory-determined clay content and clay content predicted using near- (NIR) (a) and mid-infrared (MIR) (b) for cross-validation (empty dots) and validation (filled dots) data.

58

Figure 2.6. Relationships between laboratory-determined CEC-NH4OAc and CEC-NH4OAc predicted from Walkley-Black organic C (OCWB) and clay contents predicted using near- (NIR) (a) and mid-infrared (MIR) (b) in Eq. 2.6 for cross-validation (empty dots) and validation (filled dots) data.

59

Figure 2.7. Relationships between laboratory-determined CEC-NH4OAc and CEC-NH4OAc predicted using near- (NIR) (a) and mid-infrared (MIR) (b) for cross-validation (empty dots) and validation (filled dots) data.

60

Figure 2.8. Relationships between incubation lime requirement (LR) and Shoemaker-McLean-Pratt (SMP) buffer (on the left) and estimated exchangeable acidity (EEA) from measured CEC-NH4OAc (on the right) at four target pHH2O. Regression equations are presented in Table 2.5.

61

Figure 2.9. Relationships between incubation lime requirement (LR) and estimated exchangeable acidity (EEA) index using CEC-NH4OAc inferred directly from near- (NIR) and mid-infrared (MIR) or from Walkley-Black organic C (OCWB) and NIR- and MIR-predicted clay contents at four target pHH2O. Lines represent regressions between LR and EEA index using laboratory-determined CEC-NH4OAc as presented in Table 2.5.

62

Figure 2.10. Binary partition of the relationship between the absolute prediction errors of infrared (IR)-inferred CEC-NH4OAc and lime requirement (LR) predicted from estimated exchangeable acidity (EEA) model using IR-inferred CEC-NH4OAc at target pHH2O of 7. The CEC-NH4OAc was inferred directly from near- (NIR) and mid-infrared (MIR) or from Walkley-Black organic C (OCWB) and NIR- and MIR-predicted clay contents. The threshold value of ±2.29 cmolc kg-1 for the IR-inferred CEC-NH4OAc was computed for a LR error of 1.29 g CaCO3 kg-1 with an area under the curve of 90.3% from the receiver operating characteristic curve.

63

Figure 2.11. Relationships between lime requirements (LR) derived of estimated exchangeable acidity (EEA) model and predicted by mid-infrared at four target pHH2O for the cross-validation (empty dots) and validation (filled dots) data.

64

Chapitre 3: Critical values of wet aggregate-size distribution indices for soil quality assessment Michaël A. Leblanc1, Serge-Étienne Parent1, Gilles Gagné2 and Léon-Étienne Parent1

1Department of Soils and Agri-Food Engineering, Paul-Comtois Building, Université Laval, Québec,

QC G1V 0A6, Canada.

2 Centre d’Expertise et de Transfert en Agriculture Biologique et de Proximité, 475, rue Notre-Dame

Est Victoriaville QC G6P 4B3, Canada.

Sera soumis à Frontiers in Ecology and Evolution

65

3.1 Résumé L’agrégation est un indicateur de la qualité physique des sols sensible aux pratiques culturales qui est relié

notamment à la structure, à la transmission de l’eau et des nutriments et à l’activité biologique du sol. La

synthèse de la distribution de la taille des agrégats permet l’expression du degré d’agrégation des sols en un

indice unique. L’objectif de cette étude était de développer des valeurs critiques pour l’interprétation des

indices synthétiques d’agrégation en relation avec le contenu en C organique et la granulométrie des sols en

utilisant la prairie comme système cultural de référence. Les cultures annuelles ont montré un effet négatif

plus important sur la stabilité des agrégats en sols argileux, alors que les sols loameux et sableux se sont

révélés plus susceptibles à la dégradation de l’agrégation à de faibles teneurs en C organique. Les indices

courants basés sur le diamètre moyen pondéré et géométrique des agrégats se sont montrés moins aptes au

développement de seuils critiques comparativement aux indices basés sur la géométrie fractale et

compositionnelle. L’efficacité des mesures correctives et des pratiques bénéfiques à la qualité des sols peut

être mesurée en fonction de la composition primaire (C organique, texture) des sols en utilisant le degré de

déviation de l’indice d’agrégation avec le système cultural de référence sous prairie et les seuils critiques

proposés.

66

3.2 Abstract Soil aggregation is a soil quality indicator sensitive to agricultural practices that influences soil structure, water

and nutrients transmission and soil biological activities. The synthesis of aggregate-size distribution allows

expressing the aggregation degree of soils by a single index. The aim of this paper was to develop critical

values of wet aggregate-size distribution indices in relation to basic soil composition (soil texture and organic C

content) using grassland as reference cropping system. The current indices based on the weighted and the

geometric mean diameter of aggregate were found to less appropriate than fractal and compositional indices

to derive critical values. Compared to grassland, annual cropping systems showed more deleterious effects on

the aggregate stability of clayed soils, while loamy and sandy soils showed higher susceptibility to soil

structure degradation at lower organic C contents. The efficiency of corrective measures and beneficial cultural

practices could be measured for corresponding basic soil composition (C content, texture) using the degree of

aggregation index from the reference cropping system and the proposed critical values.

67

3.3 Introduction Soil aggregates are soil structural units made of textural units (sand, silt, clay) and organic matter stuck

together (Cambardella, 2002). Soil aggregation is a soil quality index highly sensitive to agricultural practices

(Carter, 2002; Stott et al., 2013), that influences C sequestration (Six et al., 2004; Devine et al., 2014), water

and solute transport (Mueller et al., 2010), biological habitats, resistance to erosion (Eash et al., 1994),

seedbed quality, trafficability of machinery, ease of cultivation (National Soil Research Institute, 2001) and crop

productivity (Carter, 2002). Mean weight diameter (MWD) (van Bavel, 1949), geometric mean diameter (GMD)

(Mazurak, 1950), and the fractal scaling factor (Df) (Anderson et al., 1997) are the current synthetic indices to

describe aggregate-size distribution in soils. To support corrective measures and beneficial practices,

thresholds values have been reported for general interpretation of the current synthetic aggregation indices

(Lal, 1994; Mukherjee and Lal, 2014). However, no critical values have been proposed for the adequate

aggregation state of specific soil types.

The current synthetic indices listed above are computed from compositional data and could thus be distorted

by methodological flaws or weak assumptions (Parent et al., 2012). Aggregate-size proportions, which are

constrained to sum to 100%, are subjected to a negative bias, because one covariance is forced to be

negative, to sub-compositional incoherence, because results depend on measurement scale, to collinearity

(redundancy of information), because one component can be deduced by difference from 100%, and to non-

normal distribution, because proportions are restricted to lie between 0 and 100%. Hence, MWD and GMD,

that conduct mathematical operations directly on raw proportions, are systematically biased. On the other

hand, the value of Df is strongly dependent on particle counts and multifractal patterns (Anderson et al., 1997).

Recently, Parent et al. (2012) and Parent et al. (in preparation) proposed a balance aggregation index which is

fully compliant with the principles of compositional data analysis.

The aim of this paper is to develop critical values of indices synthesizing wet aggregate-size distribution in

relation to soil composition (soil texture and organic C content). We used a large Québec survey on soil

degradation to diagnose the soil aggregation state under annual cropping systems using grassland as

reference cropping system.

3.4 Material and methods

3.4.1 Soil collection and analysis

A survey of soil degradation (Tabi et al., 1990) conducted in Québec, Canada, across 164 soil series

(Spodosols, Alfisols, Inceptisols and Entisols) under conventional tillage management provided a large data

set to assess soil quality under four cropping systems: maize [Zea mays L.], potato [Solanum tuberosum L.],

68

spring cereals and cultivated grassland. Fourteen sites separated approximately 80 m from each other and

located in two different fields have been sampled in the plough layer (0 up to 30 cm) in each combination of

soil types and cropping systems. The whole data set contained 5431 soil observations across a wide range of

soil textures and organic carbon contents (Fig. 3.1 and 3.2).

Particle size distribution was determined using the sedimentation method (Bouyoucos, 1962), soil organic

carbon content using the Walkley-Black wet oxidation (Allison, 1965), water pH using a 1:1 soil-solution ratio

(w:v) (Peech, 1965), and Mehlich-3 extractable K, Ca, Mg, Fe, and Al (Mehlich, 1984) quantified by plasma

emission spectroscopy (ICAP, Jarrell-Ash 9000). Soil aggregation was determined using the wet sieving

method of Kemper and Chepil (1965) with correction for sand and fragments of size similar to that of

aggregates. Large air-dried aggregates (5-8 mm) were wet under vacuum and broken under a mechanical

stress imposed by water soaking. The soil macro-aggregation indices were computed across aggregate sizes

5-8 mm, 2-5 mm, 1-2 mm, and ≤1 mm. Taxonomic characteristics of soil series referred to the soil names file

of Québec (Lamontagne and Nolin, 1997).

3.4.2 Transformation of compositional data

The statistical analysis of compositional data (proportions constrained to 100% or 1000 g kg-1) are subject to

distortions caused by redundancy, scale dependency and inherent non-normal distribution (Filzmoser et al.,

2009). The log ratio approach can tackle such problems (Aitchison, 1986; Egozcue et al., 2003; Filzmoser and

Hron, 2011). Soil components (organic C, sand, silt and clay) and aggregate-size fractions were transformed

into isometric log ratios (ilr) as follows (Egozcue and Pawlowsky-Glahn, 2006) according to sequential binary

partitions (SBP) in Tables 3.1 and 3.2:

𝑖𝑙𝑟𝑖 = √𝑟𝑖𝑠𝑖

𝑟𝑖+𝑠𝑖𝑙𝑛

𝑔(x+)

𝑔(x−) (3.1)

Where ilri is the ith balance defined for a D-part composition, with i Є [1, D-1], ri and si are number of

components in the + (numerator) and – (denominator) groups, respectively, √𝑟𝑖𝑠𝑖

𝑟𝑖+𝑠𝑖 is a normalization

coefficient, and g(x+) and g(x-) are geometric means of components in the + and – groups, respectively. The

SBPs are D × D - 1 matrices structuring ilrs in balances by defining – and + groups in each ordered rows,

where components labelled with a zero are excluded from the balance. The SBPs used in this paper were

designed to address meaningful contrasts between classes. In the case of the basic composition-SBP (Table

3.1), the first partition (C-ilr1) is the balance between organic and mineral parts. The second partition (C-ilr2)

contrasts sand and small particles and the last one (C-ilr3) contrasts silt and clay. In the case of the

aggregation-SBP (Table 3.2), the partitions contrast in cascade the smallest aggregate fraction with the large

69

fractions. As proposed by Martín-Fernandez et al. (2011), the zeroes can be replaced by a fraction (65%) of

the detection limit to minimize the impact of zeroes on the covariance structure of the compositional data set.

3.4.3 Synthetic soil aggregate indices

The mean weight diameter (MWD) was computed as follows (van Bavel, 1949):

𝑀𝑊𝐷 = ∑ ��𝑖𝑤𝑖𝐷𝑖=1 (3.2)

Where xi (mm) is the average size between two consecutive sieves and wi (%) is the weight fraction retained

on each sieve.

The geometric mean diameter (GMD) is computed as follows (Mazurak, 1950):

𝐺𝑀𝐷 = exp( ∑ 𝑤𝑖 𝑙𝑜𝑔 ��𝑖 / ∑ 𝑤𝑖𝐷𝑖=1 )𝐷

𝑖=1 (3.3)

The fragmentation fractal scaling coefficient (Df) is computed as follows (Rieu and Sposito, 1991):

𝑁(𝑑𝑘) = ∑ 𝑁(𝑑𝑖)𝑘𝑖=0 = 𝛼𝑑𝑘

−𝐷𝑓 (3.4)

Where N(dk) is the number of particles with diameter ≤ dk, N(di) is the number of particles in the ith size

fraction, α is a proportionality parameter and Df is the fragmentation fractal scaling coefficient computed by

regression analysis of the log-log relationship between N(dk) and dk. The number of particles N(di) in the ith size

fraction is obtained by dividing total mass of aggregate M(di) of bulk density ρi and shape coefficient ci in the ith

size class by a the mass of single particle as follows (Kozak et al., 1996):

𝑁(𝑑𝑖) = 𝑀(𝑖) (𝑑𝑖3𝑐𝑖𝜌𝑖)⁄ (3.5)

Where di is the aggregate side for cubic shape ci (Anderson et al., 1997) estimated as average mesh size

between two consecutive sieves and ρi is aggregate bulk density; (𝑑𝑖3𝑐𝑖𝜌𝑖) is the mass of a single particle.

The balance aggregation index (BAI) was computed as follows (Parent et al., in preparation):

𝐵𝐴𝐼 =𝑎 ∙ 𝑏

|𝑏| (3.6)

Where 𝑎 ∙ 𝑏 is the scalar product of the vectors 𝑎 and 𝑏, 𝑎 is the vector defined by the ilr coordinates of the

aggregate composition (𝑥) and |𝑏| is the length of vector 𝑏 defined by the ilr coordinates of the ultimate

aggregation (𝑁) (soil with the larger aggregates only) and disaggregation (𝑀) (soil with the smaller aggregates

only). Vectors 𝑎 and 𝑏 are computed as follows:

70

𝑎 = 𝑖𝑙𝑟(𝑥) (3.7)

𝑏 = 𝑖𝑙𝑟(𝑁) − 𝑖𝑙𝑟(𝑀) (3.8)

The BAI index is defined by the orthogonal projection of vector 𝑎 onto vector 𝑏 expressing the linear gradient

of the aggregation balance between the ultimate aggregation and the ultimate disaggregation (Fig. 3.3). The

vector 𝑏 gives the direction and the sign of BAI: positive distance towards the aggregated structure and

negative distance towards the disaggregated structure. The ilr coordinates (Eq. 3.1) was computed according

to the Table 3.2 although the design of the SBP does not influence the BAI value.

3.4.4 Statistical analysis

Pedotransfer functions (PTFs) were developed to derive soil aggregate indices from ilr-transformed soil

compositions (Table 3.1). The grassland data set (n=2319) was randomly split into training (2/3) and testing

(1/3) data sets. Support vector machines (SVM), a modelling technique to optimize the predictive performance,

were used to account for non-linear relationships between variables in PTFs. A radial basis kernel was used

and the estimation of hyper-parameters, cost and kernel width, was optimized using a thorough grid-based

search approach with a 5-fold cross-validation on training data (Hastie et al., 2009; Karatzoglou et al., 2006).

Models were evaluated with training and testing data based on the coefficient of determination (R2) and the

root mean square error (RMSE) computed as follows:

𝑅2 = 1 − (∑ (𝑃𝑖 − 𝑂𝑖)2𝑁𝑖=1 / ∑ (𝑂𝑖 − ��)2)𝑁

𝑖=1 (3.9)

𝑅𝑀𝑆𝐸 = √1

𝑁∑ (𝑃𝑖 − 𝑂𝑖)2𝑁

𝑖=1 (3.10)

Where N is the number of data, Pi is the predicted value, Oi is the observed value, and O is the mean of

observed values. For the entire data set, reference values of aggregate indices were estimated through PTFs

according to specific soil compositions.

The effect of cropping systems on soil aggregation was quantified as follow:

Δ = 𝑃𝑖 − 𝑂𝑖 (3.11)

Where delta (Δ) is the difference between the index predicted by the PTFs (based on grassland) and the index

of aggregate observations. Hence, Δ is the departure of an observed index from the index that would have

been obtained if the soil, as characterized by its basic compositions, would have been subjected to grassland

conditions. For the Df index, the sign was reversed because ∆ increase relates to soil degradation.

71

On the training grassland data set, the critical value classifying soils as degraded was set as the upper limit of

the cumulative ∆ distribution where 95% of grassland population is included. The ∆ critical value was

evaluated on the testing grassland data set with the probability that a specimen be below the upper limit. The

optimum coverage probability (OCP) was computed as follows (Shrestha and Solomatine, 2006):

𝑂𝐶𝑃 = 1

𝑉𝑐𝑜𝑢𝑛𝑡 (𝑈𝐿 ≤ Δ𝑗) (3.12)

Where V is number of data in the testing data set, UL is upper limit, count is specimen count below the upper

limit, and Δj is the jth delta value, with j Є [1, V]. An OCP value close to one signifies an appropriate evaluation

of optimum coverage of soil aggregation as delineated by the critical value.

In Québec, soil textural classes are divided into three categories for managerial purposes: G1 (clayey soils:

heavy clay, clay, clay loam, silty clay loam, silty clay and sandy clay), G2 (loamy soils: sandy clay loam, loam,

silt loam and silt) and G3 (sandy soils: sand, loamy sand and sandy loam) (Parent and Gagné, 2010). Using

the ∆ critical value as reference cutoff, the relationship between ∆ and aggregation indices was partitioned by

textural group using a receiver operating characteristic (ROC) procedure (Fawcett, 2006; Swets, 1988). The

test performance was measured using the area under the ROC curve (AUC) relating sensitivity to specificity.

The optimum predictor threshold is the point of the ROC curve returning the highest Youden’s index (Youden,

1950), i.e. sensitivity + specificity -1.

Statistical computations were conducted in the R statistical environment (R Development Core Team, 2011)

using packages "compositions" (van den Boogaart et al., 2014) for compositional data transformation,

"mvoutlier" (Filzmoser and Gschwandtner, 2014) for outliers removal, "caret" (Kuhn, 2014) and "kernlab"

(Karatzoglou et al., 2014) for SVM modelling, "pROC" (Robin et al., 2014) for ROC partition and "ggplot2"

(Wickham and Chang, 2015) and “ggtern” (Hamilton, 2014) for data visualization.


3.5.1 Soil aggregation indices

Soil aggregate-size distribution under the four cropping systems presented in Fig. 3.4a shows that the largest

effect occurring in the first principal component was highly correlated with the synthetic indices. Cropping

systems influence soil aggregation significantly (p < 0.0001) across aggregation indices (Fig. 3.5). Boxplots

showed that Df and BAI values were more normally distributed compared to MWD and GMD. Because

grassland showed greatest macro-aggregation, grassland compositions were used as benchmarks to develop

critical values for aggregation indices. Moreover, overlapping of elliptical areas show that annual crop could

present an aggregation state similar to that of grassland.

72

3.5.2 Soil composition effects on aggregation

Among the aggregate-size balances, aggregate balance A-ilr1 was the most closely related to the three

balances of basic soil compositions (Fig. 3.4b). Except for the potato cropping systems where soil texture

varied within a narrow range, balances between particle-size proportions were fairly well correlated (r = -0.21

to -0.67) with A-ilr1 (Fig. 3.6). As reported in many studies (e.g. Kemper and Koch, 1966), the resistance of

macro-aggregates to breakdown is lesser in sandy compared to clayey soils. Organic matter is known to

stabilize aggregates by increasing the inter-particle cohesion and lowering their wettability (Abiven et al.,

2009). Chaney and Swift (1984) observed that relationship between organic matter and aggregate stability was

influenced by soil texture. Indeed, the organic C balance was not significantly correlated (r = -0.019 to 0.069)

with aggregate stability in the clayey soils only (Fig. 3.7), emphasizing the importance of organic matter in

loamy and sandy soils, especially under intensive cropping systems (Bronick and Lal, 2005). Organic C

content is a dynamic property sensitive to cultural practices (Fig. 3.2) as affected by soil texture and the

moisture regime represented by drainage classes (Fig. 3.8). Therefore organic C was retained as input

predictor to define reference aggregation state.

3.5.3 Critical values according to soil composition

Optimal hyper-parameters and the performance of PTFs to estimate reference values for synthetic soil

aggregate indices are listed in Table 3.3. Model performance was similar in the training and testing data sets,

suggesting optimal training for PTFs. The performance in prediction was relatively low across indices

(validation R2 ranging from 0.24 to 0.37). Soil composition balances explained 53% of the variance of

aggregate balance A-ilr1. Besides basic soil composition and the random measurement error, the variance of

aggregation indices can be associated with many edaphic (CEC, pH, inorganic binding agents, clay

mineralogy, biologic activity) and anthropic factors (organic amendment, fertilization, duration since crop

establishment, tillage practices) affecting the aggregation status (Bronick and Lal, 2005; Six et al., 2004). The

addition of properties such as soil pH in water, cationic bases (Ca and Mg) and oxides (Fe and Al) extracted by

Mehlich-III did not increase the performance of the models, most of these added components being correlated

with texture and organic C. The assumption of homoscedasticity was met for the testing data set with Df and

BAI, but not with MWD and GMD (Fig. 3.9). A deviation approximately equal across predicted values was

required to define a single critical value along the soil composition gradient. Thus MWD and GMD were

inappropriate indices for the proposed methodology.

At 0.95 percentile, critical ∆ values of Df and BAI were 0.72 and 1.37, respectively (Fig. 3.10). Beyond those

thresholds, beneficial cultural practices are expected to move soil aggregation state closer to that of grassland

soils. The probability of testing population being below critical ∆ value at various confidence levels was close

to 1 across the Df and BAI indices (Fig. 3.10) due to similar patterns of cumulative distribution functions in

73

testing and training data sets. Therefore the cumulative distribution of ∆ values could be used to define

subclass thresholds according a large range of desired confidence levels and to develop scoring or

membership functions to integrate aggregate-size indices into an overall soil quality index (Kaufmann et al.,

2009; Wienhold et al., 2009).

3.5.4 Critical values by soil textural groups

The critical values were computed by soil textural group for indices Df and BAI (Table 3.4). Critical values

showed high AUC values (96 to 97%) among indices by textural group for diagnostic purposes. Using critical

values of BAI as reference cut-off (Table 3.4), relationships between BAI and MWD and GMD indices were

binary partitioned to estimate critical values (4.20, 3.51 and 2.60 for MWD and 3.06, 2.24 and 1.52 for GMD,

for the clayey, loamy and sandy groups, respectively). Such critical values were similar to those reported by

Mukherjee and Lal (2014) to separate classes where aggregation could present moderate limitation and no

limitation for water infiltration and soil erosion problems. Amalgamating the 1-2 mm and 2-5 mm aggregate-

size fractions, soil aggregation distribution and critical BAI values were projected into ternary diagrams (Fig.

3.11) that facilitate classifying soil aggregate-size distributions in the structural diagram.

3.5.5 Effects of annual cropping system on aggregation

For the cereal, maize and potato cropping systems, respectively 33%, 30% and 65% of the sites exceeded the

critical value of ∆ BAI (against the grassland reference). Mitigation measures are thus critically required to

promote soil aggregate stability in most potato agro-ecosystems. With regard to the grassland reference state,

the deleterious effect of annual cropping on soil aggregation was slightly increased with clay content (Fig.

3.12). Elustondo et al. (1990) reported that the beneficial effect of grassland over annual cropping on soil

aggregation increased with soil clay content. The decrease of organic C in clayey soils, although higher under

maize than cereal (Fig. 3.2), did not change significantly the degree of soil aggregation within the present

range of organic C content (Fig. 3.13). On the other hand, in the loamy and sandy soil groups, the deleterious

effect of annual cropping decreased as organic C content increased. As soil aggregation and soil organic

matter content are closely related each other (Angers and Chenu, 1998; Chaney and Swift, 1984), our results

show that this relationship is even more critical for medium and coarse-textured soils.

3.5.6 Calibration set size requirement

The relationship between the size of the training data set and the critical value of ∆ BAI and the OCP (Fig.

3.14) shows that approximately 200 observations were needed to compute an accurate estimate of the critical

value. Particularly for small sizes of calibration data sets (<100), the deviations from expected values (dashed

line) were higher probably due to insufficient coverage of the predictor (soil composition) and a poor

74

representation of the validation population. The archived data set of the Québec soil survey was large enough

to calibrate the PTFs using the entire training reference data set (n=1546).

Depending on the purpose of the study, different methods (sample preparation, disruptive procedure, number

and size of aggregate classes) are required to establish the relationship between aggregate-size classes and

soil processes (Dıaz-Zorita, 2002). The derivation of critical values from soil compositions and PTFs well

calibrated for specific soil processes and functions (biological activities, crop yield or erosion susceptibility)

may consider other methodologies, pedological regions or reference populations. Figure 3.14 gives an

appreciation of sample size for further model developments.

3.6 Conclusion Soil aggregate stability was largely influenced by soil composition, making it possible to relate soil texture and

organic C to soil structure through PTFs and define critical values for Df and BAI indices. The effect of

beneficial practices must move soil aggregation indices closer to those of reference grassland soils. Critical

values were also derived by textural groups for MWD and GMD indices although those currently used indices

are less appropriate to describe aggregate-size distribution. Compared to grassland, annual cropping systems

showed to have more deleterious effect on aggregate stability of clayed (G1) soils, while loamy (G2) and

sandy (G3) soils were shown to be more susceptible to soil structure degradation at lower organic C content.

The efficiency of beneficial cultural practices (i.e. conservation tillage, organic matter inputs, crop rotations,

cover crops) could be measured for the corresponding soil texture and C content using the proposed ∆ BAI

index and the critical value in reference to grassland cropping system.

3.7 Acknowledgements This study is part of the soil quality module of the research project entitled ‘Implementing means to increase

potato ecosystem services’ (CRDPJ 385199 – 09). We acknowledge the financial support of the Natural

Sciences and Engineering Research Council of Canada (NSERC), as well as farm partners as follows:



et Fils Inc., Péribonka, Québec, Canada.

75

3.8 References Abiven, S., Menasseri, S., Chenu, C., 2009. The effects of organic inputs over time on soil aggregate stability – A literature analysis. Soil Biol. Biochem. 41, 1–12.

Aitchison, J., 1986. The statistical analysis of compositional data. Chapman and Hall, London.

Allison, L.E., 1965. Organic carbon, in: Black, C.A. (Ed.), Methods of Soils Analysis. American Society of Agronomy, Madison, Wisconsin, pp. 1367–1378.

Anderson, A.N., McBratney, A.B., Crawford, J.W., 1997. Applications of fractals to soil studies. Adv. Agron. 63, 1–76.

Angers, D.A., Chenu, C., 1998. Dynamics of soil aggregation and C sequestration, in: Lal, R., Kimble, J.M., Follett, R.F., Stewart, B.A. (Eds.), Soil Processes and the Carbon Cycle. Advances in Soil Science. CRC Press, Boca Raton, Florida, pp. 199–206.

Bouyoucos, G.J., 1962. Hydrometer method improved for making particle size analyses of soils. Agron. J. 54, 464–465.

Bronick, C.J., Lal, R., 2005. Soil structure and management: a review. Geoderma 124, 3–22.

Cambardella, C.A., 2002. Aggregation and organic matter, in: Encyclopedia of Soil Science. Marcel Dekker, Inc., New York, pp. 41–44.

Carter, M.R., 2002. Soil Quality for Sustainable Land Management. Agron. J. 94, 38–47.

Chaney, K., Swift, R.S., 1984. The influence of organic matter on aggregate stability in some British soils. J. Soil Sci. 35, 223–230.

Day, G.H. (Ed.), 1982. The Canada soil information system (CanSIS). Manual for describing soils in the field. Revised. Agriculture Canada, Ottawa, Ontario.

Devine, S., Markewitz, D., Hendrix, P., Coleman. D., 2014. Soil aggregates and associated organic matter under conventional tillage, no-tillage, and forest succession after three decades. PLoS One, 9: 1–12.

Dıaz-Zorita, M., 2002. Disruptive methods for assessing soil structure. Soil Tillage Res. 64, 3–22.

Eash, N.S., Karlen, D., Parkin, T., 1994. Fungal contributions to soil aggregation and soil quality, in: Doran, J.W., Coleman, D.C., Bezdicek, D.F., Stewart, B.A. (Eds.), Defining Soil Quality for a Sustainable Environment. SSSA Spec. Publ. 35, Madison WI, pp. 221–228.

Egozcue, J.J., Pawlowsky-Glahn, V., 2006. Simplicial geometry for compositional data. Geol. Soc. London, Spec. Publ. 264, 145–159.

Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G., Barceló-Vidal, C., 2003. Isometric logratio transformations for compositional data analysis. Math. Geol. 35, 279–300.

Elustondo, J., Angers, D.A., Laverdière, M.R., N’Dayegamiye, A., 1990. Étude comparative de l’agrégation et de la matière organique associée aux fractions granulométriques de sept sols sous culture de maïs ou en prairie. Can. J. Soil Sci. 70, 395–402.

76

Fawcett, T., 2006. An introduction to ROC analysis. Pattern Recognit. Lett. 27, 861–874.

Filzmoser, P., Gschwandtner, M., 2014. “mvoutlier”: Multivariate outlier detection based on robust methods. R package version 2.0.5. R Found. Stat. Comput., Vienna, Austria.

Filzmoser, P., Hron, K., 2011. Robust statistical analysis, in: Pawlowsky-Glahn, V., Buccianti, A. (Eds.), Compositional Data Analysis: Theory and Applications. John Wiley and Sons, New York, pp. 57–72.

Filzmoser, P., Hron, K., Reimann, C., 2009. Univariate statistical analysis of environmental (compositional) data: problems and possibilities. Sci. Total Environ. 407, 6100–6108.

Hamilton, N., 2014. “ggtern”: An extension to ggplot2, for the creation of ternary diagrams. R package version 1.0.3.2. R Found. Stat. Comput., Vienna, Austria.

Hastie, T., Tibshirani, R.J., Friedman, J., 2009. The elements of statistical learning: Data mining, inference and prediction. 2nd Ed. Springer, New York.

Karatzoglou, A., Meyer, D., Hornik, K., 2006. Support vector machines in R. J. Stat. Softw. 15, 1–28.

Karatzoglou, A., Smola, A., Hornik, K., 2014. “kernlab”: Kernel-based machine learning lab. R package version 0.9-19. R Found. Stat. Comput., Vienna, Austria.

Kaufmann, M., Tobias, S., Schulin, R., 2009. Quality evaluation of restored soils with a fuzzy logic expert system. Geoderma 151, 290–302.

Kemper, W.D., Koch, E.J., 1966. Aggregate stability of soils from western United States and Canada. Technical Bulletin # 1355. Agricultural Research Services. USDA in Cooperation with Colorado Agricultural Experiment Station.

Kemper, W.D., Chepil, W.S., 1965. Size distribution and aggregate, in: Black, C.A. (Ed.), Methods of Soils Analysis. American Society of Agronomy, Madison, Wisconsin, pp. 499–510.

Kozak, E., Pachepsky, Y., Sokolowski, S., Sokolowska, Z., Stepniewski, W., 1996. A Modified Number-based Method for Estimating Fragmentation Fractal Dimensions of Soils. Soil Sci. Soc. Am. J. 60, 1291–1297.

Kuhn, M., 2014. “caret”: Classification and regression training. R package version 6.0-37. R Found. Stat. Comput., Vienna, Austria.

Lal, R., 1994. Methods and guidelines for assessing sustainable use of soil and water resources in the tropics. USDA/SMSS Technical Monograph 21, Washington, D.C.

Lamontagne, L., Nolin, M.C., 1997. Dossier des noms de sols du Québec. SISCan. Agriculture et Agroalimentaire Canada, Sainte-Foy, Québec, Canada.

Martín-Fernandez, J, A., Palarea-Albaladejo, J., and Olea, R.A., 2011. Dealing with zeroes, in: Pawlowsky-Glahn V. and Buccianti, A. (Eds.), Compositional data analysis: Theory and applications. John Wiley and Sons, New York, pp. 43–58.

Mazurak, A.P., 1950. Effect of gaseous phase on water-stable synthetic aggregates. Soil Sci. 69, 135–148.

Mehlich, A., 1984. Mehlich 3 soil test extractant: A modification of Mehlich 2 extractant. Commun. Soil Sci. Plant Anal. 15, 1409–1416.

77

Mukherjee, A., Lal, R., 2014. Comparison of soil quality index using three methods. PLoS One 9, 1–15.

Mueller, L., Schindler, U., Mirschel, W., Shepperd, T.G., Ball, B.C., Helming, K., Rogasik, J., Eulenstein, F., Wiggering, H., 2010. Assessing the productivity function of soils, A review. Agronomy for Sustainable Development 30, 601–614.

National Soil Research Institute, 2001. A guide to better soil structure. Cranfield university, Silsoe, UK.

Parent, L.E., de Almeida, C.X., Hernandes, A., Egozcue, J.J., Gülser, C., Bolinder, M.A., Kätterer, T., Andrén, O., Parent, S.-É., Anctil, F., Centurion, J.F., Natale, W., 2012. Compositional analysis for an unbiased measure of soil aggregation. Geoderma 179-180, 123–131.

Parent, L.E., Gagné, G.. 2010. Guide de référence en fertilisation. 2e Édition. Centre de référence en agriculture et agroalimentaire du Québec.

Parent, S.E., Leblanc, M.A., Egozcue, J.J. and Parent, L.E. A robust soil aggregation index as displacement from a balanced soil structure. Article in preparation.

Peech, M., 1965. Hydrogen-Ion activity, in: Black, C.A. (Ed.), Methods of Soils Analysis. American Society of Agronomy, Madison, Wisconsin, pp. 914–932.

R Development Core Team, R., 2011. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.

Rieu, M., Sposito, G., 1991. Fractal Fragmentation, Soil Porosity, and Soil Water Properties: II. Applications. Soil Sci. Soc. Am. J. 55, 1239.

Robin, X., Turck, N., Hainard, A., Tiberti, N., Lisacek, F., Sanchez, J.-C., Müller, M., 2014. “pROC”: Display and analyze ROC curves. R package version 1.7.3. R Found. Stat. Comput., Vienna, Austria.

Shrestha, D.L., Solomatine, D.P., 2006. Machine learning approaches for estimation of prediction interval for the model output. Neural Networks 19, 225–35.

Six, J., Bossuyt, H., Degryze, S., Denef, K., 2004. A history of research on the link between (micro)aggregates, soil biota, and soil organic matter dynamics. Soil Tillage Res. 79, 7–31.

Stott, D.E., Karlen, D.L., Cambardella, C.A., Harmel, R.D., 2013. A Soil Quality and Metabolic Activity Assessment after Fifty-Seven Years of Agricultural Management. Soil Sci. Soc. Am. J. 77, 903.

Swets, J.A., 1988. Measuring the accuracy of diagnostic systems. Science 240, 1285–1293.

Tabi, M., Tardif, L., Carrier, D., Laflamme, G., Rompré, M., 1990. Inventaire des problèmes de dégradation des sols agricoles du Québec. Rapport synthèse. MAPAQ, Québec, Canada.

Van Bavel, C.H.M., 1949. Mean weight-diameter of soil aggregates as a statistical index of aggregation. Proceedings. Soil Sci. Soc. Am. 14, 20–23.

Van den Boogaart, K.G., Tolosana-Delgado, R., Bren, M., 2014. “compositions”: Compositional data analysis. R package version 1.40-1. R Found. Stat. Comput., Vienna, Austria.

Wickham, H., Chang, W., 2015. “ggplot2”: An implementation of the grammar of graphics. R package version 1.01. R Found. Stat. Comput., Vienna, Austria.

78

Wienhold, B.J., Karlen, D.L., Andrews, S.S., Stott, D.E., 2009. Protocol for indicator scoring in the soil management assessment framework (SMAF). Renew. Agric. Food Syst. 24, 260–266.

Youden, W.J., 1950. Index for rating diagnostic tests. Cancer 3, 32–35.

79

Table 3.1. Sequential binary partitioning of basic soil composition components.

C-ilr Organic C Sand Silt Clay

C-ilr1 +1 -1 -1 -1

C-ilr2 0 +1 -1 -1

C-ilr3 0 0 +1 -1

80

Table 3.2. Sequential binary partitioning of soil aggregate-size components.

A-ilr 5-8 mm 2-5 mm 1-2 mm ≤1 mm

A-ilr1 +1 +1 +1 -1

A-ilr2 +1 +1 -1 0

A-ilr3 +1 -1 0 0

81

Table 3.3. Performance of SVM models to estimated reference values of aggregate indices.

Aggregate index

Hyper-parameter R2 RMSE

Cost Kernel width Training Testing Training Testing

MWD 1.5 0.18 0.265 0.241 0.871 0.850

GMD 0.5 0.19 0.338 0.311 1.063 1.057

Df 0.9 0.18 0.376 0.358 0.388 0.380

BAI 1 0.17 0.387 0.366 0.741 0.728

82

Table 3.4. Critical value and area under the curve (AUC) of binary partition of aggregation indices by textural

groups.

Aggregate index

Critical value AUC (%)

G1 G2 G3 G1 G2 G3

Df 2.59 2.89 3.11 96.39 98.76 97.69

BAI 0.65 0.08 -0.53 96.44 98.85 97.35

83

Figure 3.1. Soil particle size distribution for a) grassland, b) cereals, c) maize and d) potatoes systems. Polygon limits represent the USDA textural classes.

84

Figure 3.2. Boxplot of soil organic carbon by textural group and cropping system. Diamonds represent averages.

85

Figure 3.3. Geometric representation of the balance aggregation index (BAI) computed into a coordinate system using isometric log-ratio (ilr) transformations of aggregate composition. The BAI index represents the

orthogonal projection of vector , defined by the ilr coordinates of the aggregate composition ( ), onto vector

, defined by the ilr coordinates of the ultimate aggregation (N) and the ultimate disaggregation (M) compositions, expressing the linear gradient of the aggregation balance between ultimate structures. The figure is representative of a three-part composition but BAI can be computed for a D-part composition into a D-1 coordinate system.

86

Figure 3.4. Bi-plot a) of aggregate-size balances along the first two principal components and b) correlation circle of synthetic indices, composition and aggregate-size balances (Tables 3.1-3.2) with the first two principal components of aggregate-size balances. Filled white circles represent centroid and elliptical areas scanning 95% of the theoretical distribution of grassland (G), cereals (C) maize (M) and potatoes (P) cropping systems. BAI = balance aggregation index; Df = scaling fractal coefficient; MWD = mean weight diameter; GMD = geometric mean diameter; ilr = isometric log ratio of basic components (C-) and aggregates (A-).

87

Figure 3.5. Boxplot of soil aggregation indices. Diamonds represent averages.

88

Figure 3.6. Relationships between aggregate balance A-ilr1 ([1-8 mm | ≤ 1 mm]) and composition balances a) C-ilr2 ([sand | silt, clay]) and b) C-ilr3 ([silt | clay]) by cropping system.

89

Figure 3.7. Relationships between aggregate balance A-ilr1 ([1-8 mm | ≤ 1 mm]) and composition balance [organic C | mineral] by cropping system and textural group.

90

Figure 3.8. Boxplot of soil organic carbon in grassland cropping system by drainage class (1: very rapid; 2: rapid; 3: well; 4: moderately well; 5 imperfect, 6: poor; and 7: very poor (Day, 1982)) and textural group. Diamonds represent averages.

91

Figure 3.9. Relationships between the delta (∆) value and the predicted aggregate index and the density distribution of delta values for grassland testing dataset.

92

Figure 3.10. Cumulative distribution function of delta (∆) values and relationship between optimal coverage probability (OCP) and the confidence level for soil aggregate indices Df and BAI. Critical values of 0.72 and 1.37 at 0.95 percentile, respectively for Df and BAI, are represented by empty dots.

93

Figure 3.11. Soil aggregate-size distribution for a) clayey (G1), b) loamy (G2) and c) sandy (G3) textural group. Shaded area covers below-optimal aggregation states using the critical values of BAI (Table 3.4).

94

Figure 3.12. Relationships between the delta (∆) value of BAI in reference to grassland and the composition balance [sand | silt, clay] by cropping system. Dashed lines represent the critical value of 1.37 (points above the dashed line indicate critical disaggregation).

95

Figure 3.13. Relationships between the delta (∆) value of BAI in reference to grassland and the composition balance [organic C | mineral components] by cropping system and textural group. Dashed lines represent the critical value of 1.37 (points above the dashed line indicate critical disaggregation).

96

Figure 3.14. Size of training data sets to establish a) the critical value of ∆ BAI at a confidence level of 95% and b) the optimal coverage probability (OCP) on the testing data set. Dots represent averages and the shaded area covers the 5th to 95th percentile of 100 iterations for each size of data set randomly selected in the training data set (n = 1546).

97

Chapitre 4: Numerical clustering of soil series using morphological profile attributes for potato Michaël A. Leblanc1, Gilles Gagné2 and Léon E. Parent1

1Department of Soils and Agri-Food Engineering, Paul-Comtois Building, Université Laval, Québec,

QC G1V 0A6, Canada.

2Centre de Référence en Agriculture et Agroalimentaire du Québec (CRAAQ), Delta 1 Building,

Québec, QC G1V 2M2, Canada.

Publié dans : Digital Soil Morphometrics, A.E. Hartemink et B. Minasny (éds),

Progress in Soil Science Series, Springer, 2016.

98

4.1 Résumé Des essais de fertilisation pour la culture de pomme de terre ont été conduits sur 46 séries de sols au Québec,

Canada. Cette étude vise à établir un ensemble de classes de sols représentatives à partir des données

morphologiques reflétant les propriétés de sols appropriées pour la production de pomme de terre. Les

données de profils modaux des séries de sols contiennent les attributs morphologiques des horizons (incluant

le roc) avec leurs suffixes indiquant l’absence (0), la faible expression (0.5) et la présence (1) de propriétés

spécifiques (caractéristiques pédogénétiques), et leur composition granulométrique. Une matrice de distances

a été calculée pour représenter la dissemblance entre les profils de sols. Suite à leur positionnement dans un

espace multidimensionnel, les profils de sols ont été regroupés à partir de l’algorithme de classification floue k-

moyennes avec extragrades permettant l’expression des groupes de sols en variables continues, facilitant

ainsi la modélisation. La mesure de dissemblance entre les profils de sols calculée en utilisant les descriptions

de sols (p. ex. couleur, pH, contenu en C) de sites expérimentaux a montré que les horizons génétiques

indexés sont cohérents avec les propriétés de base pour comparer des profils de sols et les allouer à des

classes existantes. En conclusion, le regroupement numérique fournit une base quantitative pour intégrer les

descriptions de profils de sols dans des modèles synthétiques de réponses des cultures à l’ajout de

fertilisants.

99

4.2 Abstract Potato fertilization trials have been conducted on 46 soil series in the province of Québec, Canada. This study

aims to create a set of representative soil classes based on morphological data that reflect soil properties

suitable for growing potato. Data of modal soil profiles of soil series contain morphological attributes from

master horizons (including bedrock) with diagnoses indicating the absence (0), weak expression (0.5) or

presence (1) of specific properties (pedogenetic features), and particle-size distribution. A distance matrix was

calculated to represent the dissimilarity between the soil profiles. Using a multidimensional scaling technique,

soil profiles distributed in a feature space were clustered using the fuzzy k-means with extragrades algorithm

to allow expressing soil groups as continuous variables, hence facilitating modeling. The dissimilarity between

soil profiles computed using soil descriptions (e.g. color, pH, C content) at experimental sites showed that

genetic horizon indices are consistent with basic properties to compare soil profiles and allocate them to

existing classes. In conclusion, numerical clustering provided a quantitative basis to integrate soil profile

descriptions into general crop response models to added nutrients across fertilization trials.

100

4.3 Introduction Soil classification systems have been traditionally oriented towards agricultural applications. Crop productivity

is influenced by the soil’s capacity to supply sufficient amounts of water, nitrogen and mineral nutrients in

interaction with climatic conditions and soil management. However, the morphological soil properties used as

taxonomic criteria at the soil series level such as texture, carbonates and lithic contact and the properties that

influence the agronomic soil potential cannot be generalized consistently at the highest hierarchical levels of

the Canadian system of soil classification such as orders and great groups (Soil Classification Working Group,

1998). An alternative numerical procedure must be developed to synthesize a large number of input variables

provided by soil surveys for use in agronomic models.

Techniques of morphometrics allow deriving depth functions from quantified soil profile attributes to support

numerical soil classification (Hartemink and Minasny, 2014). Usually, a soil series is represented by one or

more modal soil profiles, described by a sequence of horizons with different morphological characteristics.

After deriving depth functions, soil profiles can be clustered into continuous soil classes using the degree of

similarity between soil profiles and principles of fuzzy classification (Carré and Jacobson, 2009; Odgers et al.,

2011). Such groups with fuzzy memberships can be integrated as continuous variables into crop response

models allowing a more realistic expression of the gradual nature of soils (McBratney and De Gruijter, 1992).

The aim of this work is to aggregate soil series commonly used for potato cropping in Québec, Canada, into a

number of representative classes. New profile observations can be allocated to those classes to facilitate

running a potato fertilizer response models.

4.4 Materials and methods

4.4.1 Legacy database of profile descriptions of soil series

Soil surveys conducted in Québec between 1943 and 2005 described 649 series of mineral soils as shown in

the updated file of soil names (Lamontagne and Nolin, 1997). The genetic horizon code and the clay (0-0.002

mm), silt (0.002-0.05 mm) and sand (0.05-2 mm) contents determined by sedimentation techniques are some

of the properties recorded in the 1657 modal profiles from 547 soil series, representing 92% of the mapped

area. Horizon designations were harmonized to the current Canadian system (Soil Classification Working

Group, 1998) by expert knowledge and the clay and silt measurements separated at 0.005 mm in soil surveys

before 1957 have been transformed into the current system using regression equations (unpublished).

The potato fertilizer trials (N, P and K treatments) archived in Québec (1958-2014) have been conducted on 46

soil series associated with 310 modal profiles (Fig. 4.1). The profiles located in the Saguenay-Lac Saint-Jean

101

and the St. Lawrence Lowlands were mainly developed on fluviatile, lacustrine and marine deposits ranging

from very rapidly to poorly drained and classified as Podzols, Brunisols and Gleysols.

4.4.2 Transformation of morphological data

4.4.2.1 Genetic horizons

The pedometrics approach that compares soil taxa quantitatively using the key attributes of modal soil profiles

as central entities has been applied to derive metric distances between genetic soil horizons (Láng et al.,

2013; Mazaheri et al., 1995; Minasny and McBratney, 2007). The diagnostic horizons were transformed into

eight indices of 0, 0.5, and 1 indicating respectively the absence, weak expression and presence of specific

properties as pedogenetic features (Table 4.1). For example, we characterized a podzolized (Pof), weakly

gleyed (Gle) and cemented (Har) soil horizon showing the secondary distinction of fcgj in the Canadian system

by a vector [0, 0, 1, 0, 0, 0.5, 0, 1] in the order reported in Table 4.1.

4.4.2.2 Soil textural components

Sand, silt and clay contents are compositional data (proportions constrained between zero and 100% or 1000

g kg-1). Their conventional statistical analysis can create distortions caused by redundancy, scale dependency

and the inherent non-normal distribution (Filzmoser et al., 2009). The log ratio transformations avoid such

problems (Aitchison, 1986; Egozcue et al., 2003; Filzmoser and Hron, 2011). Soil textural components were

thus transformed into isometric log ratios (ilr) according to a sequential binary partition (SBP) (Table 4.2) as

follows (Egozcue and Pawlowsky-Glahn, 2006):

𝑖𝑙𝑟𝑖 = √𝑟𝑖𝑠𝑖

𝑟𝑖+𝑠𝑖𝑙𝑛

𝑔(x+)

𝑔(x−) , (4.1)

where ilri is the ith balance of a D-part composition, i Є [1, D-1], ri and si are the numbers of components in the

+ (numerator) and – (denominator) groups, respectively, and g(x+) and g(x–) are the geometric means across

components in the + and – groups, respectively. The SBP consists of dividing sequentially the composition into

two groups of parts which are indicated by +1 and −1 until all groups are made of a single part. Label 0

indicates that this part is not involved in the partition at this order (Egozcue and Pawlowsky-Glahn, 2006). To

support the interpretation of principal coordinate analysis, textural components were transformed into centered

log-ratio (clr) (Aitchison, 1986) making their interpretation possible in terms of the original compositional parts

in the principal coordinate analysis (Filzmoser and Hron, 2011):

𝑐𝑙𝑟𝑖 = 𝑙𝑛𝑥𝑖

𝑔(𝑥), (4.2)

102

where xi is the ith part of the composition and g(x) is the geometric mean. As proposed by Martín-Fernandez et

al. (2011), the zeroes can be replaced by a fraction of the detection limit (e.g. 0.65 ∙ 10 g kg-1 = 6.5 g kg-1) to

compute log ratios while minimizing the impact of zeroes on the covariance structure of compositional data

sets.

4.4.2.3 Data scaling

A classification should be performed using variables of equal weights (Sneath and Sokal, 1973). To assign a

common footing to the pedogenetic variables ranging between 0 and 1, the ilr-transformed textural

components were centered by subtracting the mean and scaling to two times the standard deviation (Gelman,

2008). We preferred this option instead of the standardization of all variables, i.e., mean centered and scaled

to their standard deviation, because variables showing low values of standard deviations (e.g. carbonated

index) were up-scaled, and of minimum-maximum transformation because textural variables scaled in the

range [0-1] were down-scaled compared to genetic horizon variables.

4.4.2.4 Fertilizer trials experimental sites

In addition to the soil series, soil profiles from 49 experimental sites located in the Saguenay-Lac Saint-Jean

and the St. Lawrence Lowlands were characterized by genetic horizon designations, moist colors of matrix and

mottles using the Munsell soil color chart, pH in CaCl2 (Hendershot et al., 1993), and C content (dry

combustion in the Leco-CNS instrument). Munsell color code (hue, value, and chroma) was projected into the

L-a-b colour space (Viscarra Rossel et al., 2006). Contrast of mottles was defined as an Euclidean distance

between the mottle and matrix L-a-b coordinates. In the absence of mottles, the distance was reported as zero

(Malone et al., 2014).

4.4.3 Dissimilarity measure between soil profiles

The soil horizon attributes of each soil profile were disaggregated (segmented) to a common depth to allow

pairwise comparisons of soil profiles by a 5 cm slice (Beaudette et al., 2013). The dissimilarity measure (dxy)

between soil profiles x and y, where dxy is the element of the distance matrix D of size p × p and p is the

number of soil profiles, was calculated as the mean Euclidean distance between the ith slice of each soil

profile, with i Є [1, N], as follows:

𝑑𝑥𝑦 = 1

𝑁∑ √∑ (𝑣𝑥𝑗 − 𝑣𝑦𝑗)2𝑀

𝑗=1𝑁𝑖=1 , (4.3)

where vj is the jth soil morphological property of the slice, with j Є [1, M]. Non-soil material (e.g. bedrock) that

has no soil morphological attribute was defined as the maximum between-slice dissimilarity to reflect the fact

103

that soil and non-soil materials are very different (Beaudette et al., 2013). A distance of zero was imputed

between corresponding slices of non-soil material.

Usually, soil series are conceptualized using soil properties of the sub-surface layers. The legacy data set

includes modal soil profile descriptions from cultivated and undisturbed areas. To avoid the influence of land

use on this classification, dissimilarity measures were computed for soil layers at depths ranging from 30 to

100 cm. Disregarding the upper 30 cm is justified by the fact that more detailed information of surface layer

(e.g. texture, Mehlich-III extracts) can be integrated into models crop response to added nutrients.

4.4.4 Continuous classification of soil profiles

The distance matrix of the soil profiles was projected into a Cartesian framework using the multidimensional

scaling technique. Briefly, after transforming the eigen-decomposition of the dissimilarity matrix D by Gower’s

centring, we computed principal coordinates by scaling the eigenvectors to lengths equal to the square roots of

their eigenvalues (Legendre and Legendre, 2012).

The fuzzy k-means with extragrades (FKMe) clustering algorithm (McBratney and De Gruijter, 1992) was

performed on the principal coordinates to partition the soil profile distribution into clusters (defined by their

centroids) that shared similar morphological descriptions. The FKMe is an unsupervised clustering procedure

that accommodates, in an extragrade cluster, the soil profiles with atypical properties located at the periphery

of the soil distribution to decrease their effect on the normal (intragrade) clusters. Briefly, the FKMe algorithm

finds the position of cluster centroids in the space of coordinates by minimizing its objective function using the

distance between individuals and centroids. The algorithm considers three parameters: the number of classes

(k) and the fuzziness exponent (ɸ) which were determined from relationships with the fuzziness performance

index and the derivative of the fuzzy k-mean objective function with respect to ɸ (Odeh et al., 1992) and the

parameter that defines the contribution of the extragrade class (α) which was set by iteration to allocate 5% of

the total population in the extragrade cluster (95% in the intragrade clusters) creating a confidence region

about the intragrade clusters.

After completing the classification, the closest modal profiles of each cluster centroid were defined as

exemplars (highest membership to a class). Using a real profile rather than a centroid, a new profile can be

classified without principal coordinates calculation. The fuzzy memberships of soil profiles were computed by

the dissimilarity measure to the exemplars using allocation equations (McBratney, 1994).

4.4.5 Statistics of the aggregated depth functions

According to the principle of working on coordinates (ilr) (Mateu-Figueras et al., 2011) for each 1 cm slice, the

average and standard deviation were calculated for the ilr-transformed textural components and back-

104

transformed to the original domain (i.e., 0 to 100%). Genetic horizon indices (index value of 0, 0.5 or 1) and

their counterparts (i.e. 1-index) were also transformed individually into isometric log-ratio before computing

descriptive statistics. As explained above, the zeroes were replaced by a fraction of the detection limit of 1%

(i.e. 0.65 ∙ 0.01 = 0.0065).

The numerical clustering process summarized in the Fig. 4.2 were conducted using the FuzME software

(Minasny and McBratney, 2002) for the FKMe classification and the R statistical environment (R Development

Core Team, 2011) using packages "aqp" (Beaudette and Roudier, 2015) for soil profile data manipulation,

"compositions" (van den Boogaart et al., 2014) for compositional data transformation, "ggplot2" (Wickham and

Chang, 2015) for data visualisation and "plyr" (Wickham, 2015) for summary statistics.


4.5.1 Clustering of modal soil profiles

FKMe clustering was carried out on modal soil profiles of soil series where fertilizer trials have been conducted

(Fig. 4.3). The algorithm was set with k = 3 and ɸ = 1.6 (Odeh et al., 1992) and with a α = 0.6 leaving 4.9% of

the modal profiles as extragrades. A small k value restricted the number of variables in the potato fertilization

models.

The soil profile distribution along the first two principal coordinates that explained 51% of the variance was

mainly correlated with the textural components and the pedogenetic variables Pof and Gle (Fig. 4.3). The

elliptical areas of class A, B and C were thus consistent with the textural gradient and the gleying and

podzolization processes. Those morphological characteristics were aggregated into depth functions by soil

classes (Fig. 4.4). The depth functions of the podzolized (Pof) horizons followed a peak function, while the

gleyed horizon followed a wetting front function (Minasny et al., 2016).

Class A is dominated by sandy soil profiles with podzolized (Pof) horizons down to 60 cm. Those soils are well

to very well-drained and classified as Podzols or Brunisols. They are susceptible to nitrate leaching during

intensive precipitation events and rain-fed crops are vulnerable to water stress during dry periods.

Soil profiles of class B are composed of sandy gleyed horizons. That group also includes soil profiles with

podzolized (Pof) horizons to a depth ranging from 20 to 40 cm, sometimes showing both Pof and Gle features

(i.e. Bfg) or both Poh and Pof (i.e. Bhf). Soil profiles in class B are imperfectly to poorly drained and classified

as Gleysols and gleyed Podzols. Potato crops grown on such soils are less affected by water stress compared

to class A soils.

105

Soil profiles of class C are fine-textured ranging from loam to silty clay loam that also include shallow sandy

soil overlying clay materials. Textural variability is slightly higher in class C compared to classes A and B.

Although some profiles show altered or weakly podzolized (Pof) horizons, gleying is the dominant pedogenic

feature of class C soils. Some profiles also have carbonates in deeper horizons. Soil profiles of class C are

moderately well to poorly drained and classified as Gleysols, gleyed Podzols or Brunisols. In general, water

storage and fertility are higher in soils of class C compared to those of classes A and B, but crops could be

affected by prolonged wet periods.

Partitioning the data into three soil classes, the soil profiles characterized by low-frequency morphological

features (e.g. Har and Alt) that are weakly correlated to the first two principal coordinates (Fig. 4.3) were not

discriminated. Soil profiles with hardened horizons (strongly cemented or high density pedogenetic) were

allocated to classes A and B while profiles with altered horizons were classified in classes A and C. However,

low-frequency features were kept rather than excluded by cluster analysis in order to classify new specimens

correctly. Hence, soil profiles far from the cluster centers could be located in the periphery of the soil

distribution (extragrades). On the other hand, with additional fertilizer trials increasing the degrees of freedom

to run the crop response models, the classification could be refined with a higher number of soil classes to

account more for soil heterogeneity. Uncertainty analysis of the models can also be used to identify soil

classes requiring more field trials.

4.5.2 Cluster memberships of individual soil series

Fertilizer trials sites are most often associated with a single soil series name without profile description. To

allocate a set of fuzzy memberships to each of the soil series, the soil profile closest to the centroid of

individual soil series was selected as the exemplar. In our opinion, the exemplar profile is more appropriate to

represent a soil series than a combination of average depth functions of variables, because a given

combination could represent a nonexistent profile. A soil profile can be allocated to this numerical class without

the need to designate the soil series.

On the other hand, the soil series concepts elaborated over a long period (1943-2005) evolved differently

across soil surveys. Although a soil series represented by many soil profiles from different soil surveys is more

indicative of the soil series concepts variability, their allocation to soil classes could also be performed

individually. Furthermore, as discussed by Láng et al. (2013), the taxonomic distance between modal soil

profiles is an important quantitative tool to correlate and harmonize soil series that arise from different soil

surveys.

106

4.5.3 Domain limit of the clusters

All soil profiles (n = 1657) of soil series were classified into clusters A, B, C and extragrade. Therefore, potato

fields with soil series not associated with fertilizer trials can be supported by crop response models. However,

prediction models are not appropriate when extrapolating to cases far from the calibration domain (Tranter et

al., 2010). The domain limit of the clustering was defined by the extragrade class that delineates a confidence

region about intragrade clusters (Fig. 4.5). Because soil profile distribution is not normally distributed in the

principal coordinate space, the extragrade region is more appropriate as a domain limit compared to an

arbitrary distance cut-off from a centroid. Indeed, 15.4% of the soil profiles (22.5% of soil series) were

classified as extragrades. Most extragrades were unfavorable to potato cropping (clayed, carbonated, shallow

soil profiles).

4.5.4 Comparison of genetic horizons indexed with basic properties

Principal component analysis was conducted on the basic soil attributes (color variables, pH, C content) of the

horizons (n = 160) of soil profiles from the surveyed fertilization experimental sites (Fig. 4.6). The distribution of

podzolized to gleyed horizons was correlated with colour coordinates and contrasts of mottles, which is

consistent with field identification of genetic horizons broadly based on colour, arrangement of particles

(structure and consistence) and the HCl-effervescence reaction. However, their genetic horizons overlapped.

Basic properties, such as colours that are sensitive to the intensity of pedological development and related to

the nature of the soil material (geological origin), can be more informative compared to genetic horizon

designations and can explain the overlapping distributions. A correlation coefficient of 0.556 (p < 0.001) was

obtained between profile distances computed from basic properties or genetic horizons indices. Indexing

genetic horizons is thus consistent with basic properties to compare soil profiles quantitatively.

4.6 Conclusions Numerical clustering allowed aggregating of soil series based on the degree of similarity between modal soil

profiles. Using both quantitative and qualitative information on soil horizons, the textural gradient and the two

major pedogenic processes of potato soil profiles were partitioned into three continuous soil classes. With the

upgrading of soil legacy databases, the grouping of soil series could be refined using more exhaustive

descriptions (e.g. sand fractions, coarse fragments, mineralogical composition). Using soil profile databases

from soil surveys, this study showed that tools of pedometrics can synthesize the information on soil profiles

into continuous indices to facilitate running site-specific crop response models.

107

4.7 Acknowledgements We wish to thank Professor Alex B. McBratney and Professor Budiman Minasny from the University of

Sydney, Australia, and their research team, especially Philip A. Hughes, for their support to develop the

numerical classification process of this work. This study is part of the soil quality module of the research

project entitled ‘Implementing means to increase potato ecosystem services’ (CRDPJ 385199 – 09) funded by

Natural Sciences and Engineering Research Council of Canada (NSERC) as well as farm partners as follows:



et Fils Inc., Péribonka, Québec, Canada.

108

4.8 References Aitchison, J. (1986). The statistical analysis of compositional data. Chapman and Hall, London.

Beaudette, D.E., and Roudier, P. (2015). “aqp”: Algorithms for quantitative pedology. R package version 1.8-6. R Found. Stat. Comput., Vienna, Austria.

Beaudette, D.E., Roudier, P., and O’Geen, A.T. (2013). Algorithms for quantitative pedology: A toolkit for soil scientists. Comput. Geosci. 52, 258–268.

Carré, F., and Jacobson, M. (2009). Numerical classification of soil profile data using distance metrics. Geoderma 148, 336–345.

Egozcue, J.J., and Pawlowsky-Glahn, V. (2006). Simplicial geometry for compositional data. Geol. Soc. London, Spec. Publ. 264, 145–159.

Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G., and Barceló-Vidal, C. (2003). Isometric logratio transformations for compositional data analysis. Math. Geol. 35, 279–300.

Filzmoser, P., and Hron, K. (2011). Robust statistical analysis. In: Compositional data analysis: Theory and applications. V. Pawlowsky-Glahn and A. Buccianti, eds. pp. 57–72. John Wiley and Sons, New York.

Filzmoser, P., Hron, K., and Reimann, C. (2009). Univariate statistical analysis of environmental (compositional) data: problems and possibilities. Sci. Total Environ. 407, 6100–6108.

Gelman, A. (2008). Scaling regression inputs by dividing by two standard deviations. Stat. Med. 27, 2865–2873.

Hartemink, A.E., and Minasny, B. (2014). Towards digital soil morphometrics. Geoderma 230-231, 305–317.

Hendershot, W.H., Lalande, H., and Duquette, M. (1993). Soil reaction and exchangeable acidity. In: Soil sampling and methods of analysis. M.R. Carter, ed. pp. 141–145. Lewis Publication, Boca Raton, Florida.

Lamontagne, L., and Nolin, M.C. (1997). Dossier des noms de sols du Québec. SISCan. Agriculture et Agroalimentaire Canada, Sainte-Foy, Québec, Canada.

Láng, V., Fuchs, M., Waltner, I., and Michéli, E. (2013). Soil taxonomic distance, a tool for correlation: As exemplified by the hungarian brown forest soils and related WRB reference soil groups. Geoderma 192, 269–276.

Legendre, P., and Legendre, L. (2012). Numerical ecology. Third edition. Elsevier, Amsterdam.

Malone, B.P., Hughes, P., McBratney, A.B., and Minasny, B. (2014). A model for the identification of terrons in the Lower Hunter Valley, Australia. Geoderma Regional 1, 31-47.

Martín-Fernandez, J, A., Palarea-Albaladejo, J., and Olea, R.A. (2011). Dealing with zeroes. In: Compositional data analysis: Theory and applications. V. Pawlowsky-Glahn and A. Buccianti, eds. pp. 43–58. John Wiley and Sons, New York.

Mateu-Figueras, G., Pawlowsky-Glahn, V., and Egozcue, J.J. (2011). The principle of working on coordinates. In: Compositional data analysis: Theory and applications. V. Pawlowsky-Glahn and A. Buccianti, eds. pp. 31–42. John Wiley and Sons, New York.

109

Mazaheri, S.A., Koppi, A.J., and McBratney, A.B. (1995). A fuzzy allocation scheme for the Australian Great Soil Groups Classification system. Eur. J. Soil Sci. 46, 601–612.

McBratney, A. (1994). Allocation of new individuals to continuous soil classes. Aust. J. Soil Res. 32, 623–633.

McBratney, A.B., and De Gruijter, J.J. (1992). A continuum approach to soil classification by modified fuzzy k-means with extragrades. J. Soil Sci. 43, 159–175.

Minasny, B., and McBratney, A.B. (2002). FuzME version 3.0. Australian Centre for Precision Agriculture, The University of Sydney, Australia.

Minasny, B., and McBratney, A.B. (2007). Incorporating taxonomic distance into spatial prediction and digital mapping of soil classes. Geoderma 142, 285–293.

Minasny, B., Stockmann, U., Hartemink, A.E., and McBratney, A.B. (2016). Measuring and modelling soil depth functions. In: Digital Soil Morphometrics. A.E. Hartemink and B. Minasny, eds. Springer.

Odeh, I.O.A., Mcbratney, A.B., and Chittleborough, D.J. (1992). Soil pattern recognition with fuzzy-c means: Applications to classification and soil-landform interrelationships. Soil Sci. Soc. Am. J. 56, 505–516.

Odgers, N.P., McBratney, A.B., and Minasny, B. (2011). Bottom-up digital soil mapping. II. Soil series classes. Geoderma 163, 30–37.

R Development Core Team, R. (2011). R: A language and environment for statistical computing. R Found. Stat. Comput., Vienna, Austria.

Sneath, P.H.A., and Sokal, R.R. (1973). Numerical taxonomy: The principles and practice of numerical classification. W.H. Freeman and Company, San Francisco.

Soil Classification Working Group (1998). The Canadian system of soil classification, 3rd Ed. Agriculture and Agri-Food Canada. Publication 1646.

Tranter, G., Minasny, B., and McBratney, A.B. (2010). Estimating pedotransfer function prediction limits using fuzzy k-means with extragrades. Soil Sci. Soc. Am. J. 74, 1967-1975.

Van den Boogaart, K.G., Tolosana-Delgado, R., and Bren, M. (2014). “compositions”: Compositional data analysis. R package version 1.40-1. R Found. Stat. Comput., Vienna, Austria.

Viscarra Rossel, R.A., Minasny, B., Roudier, P., and McBratney, A.B. (2006). Colour space models for soil science. Geoderma 133, 320–337.

Wickham, H. (2015). “plyr”: Tools for splitting, applying and combining data. R package version 1.8.1. R Found. Stat. Comput., Vienna, Austria.

Wickham, H., and Chang, W. (2015). “ggplot2”: An implementation of the grammar of graphics. R package version 1.01. R Found. Stat. Comput., Vienna, Austria.

110

Table 4.1. Indexing value of selected genetic horizons of the Canadian system of soil classification.

Horizon Morphologic features Suffix Index

Eluviated (Elu) Horizon characterized by the eluviation of clay, Fe, Al, or organic matter alone or in combination.

e 1

ej 0.5

- 0

Illuviated (Ili) Illuvial horizon enriched with silicate clay. t 1

tj 0.5

- 0

Podzolized (Pof) Horizon enriched with amorphous material, principally Al and Fe.

f 1

fj 0.5

- 0

Podzolized (Poh) Horizon (B) enriched with organic matter. h 1

- 0

Altered (Alt) Horizon slightly altered by hydrolysis, oxidation, or solution, or all three.

m 0.5

- 0

Gleyed (Gle) Horizon characterized by gray colors, or prominent mottling, or both, indicating permanent or periodic intense reduction.

g 1

gj 0.5

- 0

Carbonated (Car) Denotes the presence of carbonate as indicated by HCl-effervescence reaction.

k 1

kj 0.5

- 0

Hardened (Har) Strongly cemented or high density pedogenic horizon.

c, x 1

cj, cc, xj 0.5

- 0

111

Table 4.2. Sequential binary partitioning of soil textural components.

ilr Sand Silt Clay

ilr1 +1 -1 -1

ilr2 0 +1 -1

112

Figure 4.1. Number of modal soil profiles (a) and fertilizer trial sites (b) by soil series. Soil series code refers to the updated file of soil names of Québec (Lamontagne and Nolin, 1997). For brevity, soil series variants were amalgamated into the corresponding soil series.

113

Figure 4.2. Summary of the numerical clustering process of soil profiles.

114

Figure 4.3. Distribution of the maximum fuzzy membership (max mc) to clusters A, B and C of modal soil profiles in fertilizer trials (a) and correlation circles of clr-transformed textural components and pedogenetic features (see Table 4.1 for descriptions) of B and C horizons along the first two principal coordinates. Filled white circles represent exemplar positions and elliptical shaded areas cover 95% of the theoretical distribution of clusters A, B and C.

115

Figure 4.4. Aggregated depth functions of morphological variables (see Table 4.1 for descriptions) by soil class (A, B, and C) of trials-associated modal profiles. Black lines represent the mean and shaded area coverage ± standard deviation computed for each 1 cm layer. Dots represent observed values at midpoint of genetic horizons.

116

Figure 4.5. Distribution of the fuzzy membership to the extragrade cluster (m*) for all modal profiles of soil series along the first two principal coordinates. Filled white circles represent exemplar positions for clusters A, B and C.

117

Figure 4.6. Principal component analysis of genetic horizons of soil profiles of experimental sites using variables of Fig. 4.6b (a). Correlation circles of color and chemical variables (b) and pedogenetic features (c) (see Table 4.1 for descriptions) along the first two principal components. Filled white circles represent centroid and elliptical shaded areas cover 95% of the theoretical distribution of genetic horizon suffixes.

118

Chapitre 5: Conclusion générale

5.1 Conclusions Ce projet de doctorat avait comme objectif général de proposer des modèles reliant les caractéristiques

pédologiques à la gestion des pratiques de fertilisation et de conservation des sols. Les données pédologiques

étant de nature variée, diverses techniques d’analyses numériques (chimiométrie, pédométrie, morphométrie)

ont permis d’effectuer ces relations objectivement sur des bases mathématiques et statistiques. L’analyse de

données compositionnelles a également été appliquée afin d’éviter les biais associés aux propriétés

intrinsèques de données contraintes à une somme constante.

Les résultats du deuxième chapitre ont montré que la capacité d’échange cationique déterminée par la

méthode à l’acétate d’ammonium (pH 7.0) (CEC-NH4OAc), une méthode nécessitant une longue et coûteuse

procédure de laboratoire avec l’utilisation de plusieurs extractifs chimiques, peut être estimée avec précision

par spectroscopie infrarouge (IR). Lorsque prédite dans un intervalle de prédiction adéquat de

± 2.3 cmolc kg-1, la CEC-NH4OAc inférée par spectroscopie IR permet d’appuyer un modèle d’estimation du

besoin en chaux (LR) nécessaire à la correction de l’acidité échangeable pour atteindre le pH désiré. Le

modèle proposé est une solution alternative au pH tampon Shoemaker-McLean-Pratt (SMP), une méthode

non respectueuse de l’environnement qui s’est montrée inadéquate pour mesurer LR des systèmes culturaux

comme celui de la pomme de terre ou qui nécessitent de faibles LR (< 1.5 g CaCO3 kg-1).

L’agrégation est un indicateur important de la qualité physique des sols sensible aux pratiques culturales et

relié à la structure, à la transmission de l’eau et des nutriments, aux habitats des micro-organismes et à la

résistance à l’érosion. Le troisième chapitre a permis le développement de critères diagnostiques en utilisant

la prairie comme système cultural de référence pour l‘interprétation des indices synthétiques de la distribution

des agrégats stables à l’eau selon les propriétés primaires (texture, C organique) de la couche de sols de

surface. Les indices courants basés sur le diamètre moyen (moyenne pondérée ou géométrique) des agrégats

se sont montrés moins aptes au développement de seuils critiques comparativement aux indices fractal et

compositionnel. La synthèse de données d’agrégation par des statistiques conventionnelles (moyenne) ne

devrait pas être préconisée afin d’éviter les biais méthodologiques liés au traitement de données

compositionnelles (Parent et al. 2012; Parent et al. en préparation [a]). Comparativement à l’état d’agrégation

en système sous prairie, les cultures annuelles ont montré un effet négatif plus important sur la stabilité des

agrégats en sols argileux, alors que les sols loameux et sableux se sont montrés plus susceptibles à la

dégradation de la structure à de faibles teneurs en C organique. La culture de pomme de terre a un effet

négatif sur la stabilité des agrégats, d’où l’importance de réduire la fréquence de cette culture dans les

rotations et de favoriser les apports de matière organique lorsque le degré de déviation de l’indice d’agrégation

119

avec le système cultural de référence sous prairie dépasse le seuil critique. Les critères diagnostiques

développés dans cette thèse permettent d’appuyer, à l’aide des propriétés primaires du sol, la prise de

décision quant à l’implantation de mesures correctives et de pratiques bénéfiques à qualité des sols comme le

travail réduit, les apports de matière organique, la rotation des cultures et les cultures de couverture.

La réponse des cultures à la fertilisation est influencée par les conditions climatiques, les pratiques culturales

et les caractéristiques pédologiques de la couche de surface et du sous-sol. Les résultats du quatrième

chapitre ont montré que les techniques de pédométrie permettent le regroupement des séries de sols

communément utilisées en culture de pomme de terre à partir des données morphologiques des profils

modaux. Le gradient textural et les deux principaux processus pédogénétiques (podzolisation et gleyification),

des attributs morphologiques principalement associés au régime hydrique des sols, ont été synthétisés en

trois classes de sols représentatives des profils de sols cultivés en pomme de terre. L’appartenance floue

(fuzzy membership) des séries de sols à chacune des classes a permis une expression structurée de

l’information des profils de sols tout en réduisant la perte de détails liée au processus de généralisation de la

classification. Avec l’ajout d’essais de fertilisation et l’amélioration des bases de données analytiques des

séries de sols, cette classification pourra être raffinée en ajoutant d’autres caractéristiques comme les

fractions sableuses et le contenu en fragments grossiers. Le degré de similarité entre les profils de sols et la

classification numérique fournissent des bases quantitatives pour la corrélation des concepts de séries de sols

provenant de différentes études pédologiques et facilitent l’intégration des descriptions de profils de sols aux

modèles de réponse des cultures aux fertilisants.

5.2 Perspectives À travers le monde, de nombreuses équipes de recherche travaillent au développement d’outils d’acquisition

de données numériques de sols. Des techniques comme la spectroscopie infrarouge facilitent la

caractérisation quantitative de la variabilité latérale et verticale des sols. Des efforts sont actuellement requis

afin d’uniformiser les protocoles et d’implanter de grandes librairies spectrales nécessaires au développement

de modèles robustes et applicables à de grands territoires caractérisés par une grande pédodiversité (Nocita

et al. 2015). Le développement de ces techniques permettra d’appuyer les modèles de recommandation et

d’interprétation de la qualité de sols, comme ceux présentés au deuxième et troisième chapitre, qui

nécessitent la détermination de propriétés conventionnellement mesurées en laboratoire à coûts élevés et

avec des délais possibles dans la prise de décision. Ces technologies sont également de nouvelles avenues

pour intégrer des propriétés comme la composition minéralogique des sols (Mulder et al. 2013; Jones et

McBratney 2015) aux modèles de réponse des cultures aux ajouts de fertilisants dans un siècle où diminuent

les réserves en phosphore et en potassium facilement accessibles. De nouvelles techniques basées sur le

traitement d’images provenant d’appareils mobiles (Aitkenhead et al. 2015) devraient également être

120

investiguées afin de supporter l’évaluation visuelle des sols et d’orienter les bonnes pratiques culturales qui

sont à la base du maintien de la qualité des sols et de la pérennité des services écosystémiques rendus par

les sols.

Les regroupements de séries de sols présentés au quatrième chapitre facilitent l’intégration des

caractéristiques pédologiques du sous-sol à des modèles de réponse de la pomme de terre aux fertilisants (N,

P, K) en interaction avec les conditions climatiques et les pratiques culturales (Parent et al. en préparation [b]).

La connaissance des caractéristiques pédologiques permet d’ajuster les doses de fertilisants.

Conséquemment, ces résultats témoignent de l’importance de la mise à jour de la couverture pédologique du

Québec afin d’améliorer et de faciliter l’identification des classes de sols (séries et variantes de sols ou autres

caractéristiques morphologiques du profil de sols) sur le territoire agricole. Désormais, des techniques de

pédométrie permettent la décomposition de délimitations cartographiques à l’aide de variables auxiliaires

(p. ex. variables dérivées du modèle numérique de terrain) afin de raffiner les couvertures pédologiques

existantes (Odgers et al. 2014). Les techniques de pédométrie et de morphométrie numériques, appuyées par

un travail continu sur le terrain, offrent le potentiel d’actualiser, de détailler et de personnaliser la classification

et la cartographie des sols du Québec afin d’améliorer la communication des connaissances pédologiques et

leur intégration dans une variété de domaines dont notamment la gestion des pratiques de fertilisation des

cultures et de conservation des sols.

121

5.3 Références Aitkenhead, M.J., Donnelly, D., Sutherland, L., Miller, D.G., Coull, M.C. et Black, H.I.J. 2015. Predicting Scottish topsoil organic matter content from colour and environmental factors. European Journal of Soil Science, 66: 112-120.

Jones, E. et McBratney, A.B. 2015. In-situ analysis of soil mineralogy through conjoint use of visible, near-infrared and X-ray fluorescence spectroscopy. Résumé de l’Inaugural Global Workshop on Digital Soil Morphometrics, 2 au 4 juin 2015, Madison, Wisconsin, USA.

Mulder, V.L., Plotze, M., de Bruin, S., Schaepman, M.E., Mavris, C., Kokaly, R.F. et Egli, M. 2013. Quantifying mineral abundances of complex mixtures by coupling spectral deconvolution of SWIR spectra (2.1–2.4um) and regression tree analysis. Geoderma, 207-208: 279-290.

Nocita, M., Stevens, A., van Wesemael, B., Aitkenhead, M., Bachmann, M., Barthès, B., Ben Dor, E., Brown, D.J., Clairotte, M., Csorba, A., Dardenne, P., Demattê, J.A.M., Genot, V., Guerrero, C., Knadel, M., Montanarella, L., Noon, C., Ramirez-Lopez, L., Robertson, J., Sakai, H., Soriano-Disla, J.M., Shepherd, K.D., Stenberg, B., Towett, E.K., Vargas, R., et Wetterlind, J. 2015. Soil spectroscopy: An alternative to wet chemistry for soil monitoring. Advances in Agronomy, 132: 139-159.

Odgers, N.P., Sun, W., McBratney, A.B., Minasny, B. et Clifford, D. 2014. Disaggregating and harmonising soil map units through resampled classification trees. Geoderma, 214: 91-100.

Parent, L.E., de Almeida, C.X., Hernandes, A., Egozcue, J.J., Gülser, C., Bolinder, M.A., Kätterer, T., Andrén, O., Parent, S.É., Anctil, F., Centurion, J.F. et Natale, W. 2012. Compositional analysis for an unbiased measure of soil aggregation. Geoderma, 179-180: 123-131.

Parent, S.E., Leblanc, M.A., Egozcue, J.J. et Parent, L.E. (a) A robust soil aggregation index as displacement from a balanced soil structure. Article en préparation pour Frontiers in Ecology and evolution.

Parent, S.E., Leblanc, M.A., Parent, A.C. et Parent, L.E. Multilevel mixed-effects modeling of potato (MEM-Potato) response to nitrogen fertilization. Article en préparation pour Precision Agriculture.

Download - Relations entre les caractéristiques pédologiques et les pratiques … · 2018-04-25 · iii Résumé La connaissance des sols est primordiale pour traiter des enjeux agronomiques

Top Related